<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A study design template for identifying usability issues in graphical modeling tools</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jakob Pietron</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alexander Raschke</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Michael Stegmaier</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Matthias Tichy</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Enrico Rukzio</string-name>
          <email>enrico.rukziog@uni-ulm.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Institute of Media Informatics, Ulm University</institution>
          ,
          <addr-line>89081 Ulm</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Institute of Software Engineering and Programming Languages</institution>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Motivation &amp; Problem Statement</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>Model-driven engineering aims at increasing the productivity of software engineering and the quality of the software. These positive results have been con rmed in several empirical studies. However, those studies also report that usability of model-driven engineering tools is generally considered to be poor. This is also the prevalent opinion on usability expressed in discussions in academica as well as in our collaborations with practitioners. Unfortunately, there are scarcely any empirical studies on identifying usability issues nor papers reporting on systematically evaluated usability improvements in model-driven engineering. In this paper, we present a study design template for identifying usability issues speci cally in graphical editors. This template is grounded in usability research as well as empirical research methods. We illustrate the proposed study design on the example of identifying usability issues in a graphical editor for developing state machines.</p>
      </abstract>
      <kwd-group>
        <kwd>Usability Graphical Modeling Tools Model-driven Engineering Study Design</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        should support an analyst in developing a system [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]. Therefore, our research
focus is on analyzing and improving the usability of MDE tools.
      </p>
      <p>
        Before the usability of MDE tools can be improved, the actual usability
issues must be identi ed. In this paper, we propose a study design template to
systematically evaluate the usability of graphical modeling tools. The aim of this
study design template is to identify de cits in usability by generating qualitative
data instead of quantitative data. While quantitative data, as collected by e. g.
Condori-Fernandez et al. [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], can indicate problems, make them measurable and
enable the comparison of solutions, the actual problems are not identi ed.
      </p>
      <p>
        Since we are interested in the real causes of poor usability, we propose a
qualitative study design. In our study design template we focus on the creation
and modi cation of graphical representations of models. All graphical
modeling tools have this functionality in common whereas other functionality such as
debugging or model checking is editor and language speci c. We suggest
performing the evaluations in terms of e ciency, e ectiveness and satisfaction, the
main aspects of usability as proposed by ISO 9241-11 [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. Our study design
template is meant to form the basis for a family of experiments as advocated by
Basili [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] and as such should enable a simple replication of collected data.
      </p>
      <p>
        We suggest a think-aloud observation after which participants are interviewed
and asked to ll in questionnaires. The questionnaires serve to collect experience
level, demographic data, and contains questions for the calculation of the System
Usability Score (SUS) [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] and the Technology A nity (TA-EG) [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. To enable a
qualitative analysis of the collected data, coding techniques are used to annotate
captured screen recordings.
      </p>
      <p>After discussing the related work in Section 2, we describe our study design
template in Section 3 and nally conclude this paper in Section 4.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        There is a lot of literature discussing good practices in conducting usability
studies in general (see, e. g., [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] or the literature survey in [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]). In all this scienti c
work di erent methods for measuring usability or identifying usability problems
are introduced and discussed. However, the composition of these methods to a
result-oriented study design is often left to the reader. As proposed by Basili
[
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], especially for the repeatability of experiments guidelines or frameworks are
indispensable. In [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], for example, such a framework for measuring the usability
of websites is proposed. Similarly, [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] introduces a more general framework, but
based on very low-level interactions and therefore di cult to apply in a given
context. Moreover, both works aim at usability measurement, not at identifying
concrete usability problems.
      </p>
      <p>In the eld of modeling tools, which in this context also includes UML tools
or domain-speci c languages (DSL), the following three di erent areas can be
distinguished:</p>
      <p>
        The rst one includes work that focuses on the usability of modeling
languages themselves (for an overview, see [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]), but not the usability of the used
tools. In the study conducted by Cuenca et al. [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] similar methods are used
as we propose (SUS, observations, and questionnaires, see below), but only the
usability of (textual) DSLs is measured and not the usability of the used tools.
This also happens in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] where the authors present "a way to systematize the
evaluation process" of DSLs. They introduce a generic process for evaluating
DSLs as a user interface of a developer using a DSL. Poltronieri et al. go one
step further and de ne a more precise framework Usa-DSL [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] for this activity
in order to carry out replicated usability studies of DSLs. Both papers try to
evaluate the DSL itself, but not the (graphical) tools for working with (graphical)
DSLs as we do.
      </p>
      <p>
        In the second area, the usability of tools is considered, but with the focus on
tool selection. For example, the work of Safdar et al. [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ] is about comparing the
usability of di erent tools in several diagram types. Rouly et al. [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ] describe a
method for understanding usability of Visual Integrated Development
Environments (IDE) used by non-programmers. The considered tools are mostly used for
interactive editing of graphical models, yet not necessarily models in the sense
of MDE. The usability is measured by analyzing their interface characteristics
according to a proposed model. The main goal of this work is the comparison of
the tools' usability in an abstract way.
      </p>
      <p>
        The third area, which is also addressed by us, considers the usability of
modeling tools per se. In [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] the authors conduct an experiment to measure the
usability of six UML modeling tools. They captured the time each of the 58
participants needed to ful ll several tasks under the assumption, that "the time
of performing tasks is one of the usability indicators". The results were
conrmed by an analysis with GOMS [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. GOMS is a formal usability method that
tries to measure the usability of an interface by decomposing typical user tasks
into simple basic actions. For each task, the needed time based on estimations
per basic user action is calculated and valuated. Besides this simple time
comparison, the authors collected participants' comments and suggestions, but not
systematically. Interestingly, most of the reported problems obviously have not
been improved in the last 13 years.
      </p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] Condori-Fernandez et al. introduce an evaluation model for
identifying the usability of MDE tools by measuring the completeness of user performed
tasks in relation to the number of steps needed to nish them. Their framework
is de ned in a very abstract way by an evaluation model and a generic process.
When applying the framework to a tool, the authors propose exemplary methods
to be used. Concrete usability problems are discovered by classi cation of
observations against ergonomic criteria. In contrast, our proposed qualitative study
design template, which is explained in detail in the following sections, allows
usability problems to be discovered on a ne-grained level.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Study Design Template</title>
      <p>Our study design template can be seen as a generic guideline for how to
conduct a case study to identify fundamental usability-related problems of graphical
modeling tools. The study design template focuses on usability problems that
occur while users are creating and editing a graphical representation of a model
with the functionality provided by a speci c modeling tool and not usability
problems that are related to a particular DSL.</p>
      <p>
        Yin de nes a case study as \an empirical inquiry that investigates a
contemporary phenomenon within its real-life context " [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ]. Furthermore, we can de ne
the type of case study more precisely: it should be an exploratory and
explanatory single-case case study. Exploratory to identify the actual usability problems
in graphical modeling tools in general and the parts, features, and components
of an editor that are responsible for the usability problems in particular. By
investigating the reasons why these problems occur the case study additionally
becomes explanatory.
      </p>
      <p>
        As required by Yin [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ], in this section, we describe the requirements to the
objects of study (editor and graphical modeling language), how tasks should
be designed, how to nd the right participants, and how to analyze the
collected data. For each aspect we discuss the requirements, introduce the solution
suggested by our study design template and illustrate it by an example. The
example given is based on a study we conducted to evaluate the usability of the
graphical modeling tool Yakindu Statechart Tools.
3.1
      </p>
      <sec id="sec-3-1">
        <title>Objects of Study</title>
        <p>The choice of the right editor for a problem discovery study depends on the used
graphical language. The chosen graphical language should o er the possibility to
de ne real-world tasks with di erent levels of complexity. The chosen language
should be well known by participants to have no negative e ect on the internal
validity.</p>
        <p>If there is no speci c domain speci c language (DSL) prede ned, we suggest
state machines as the graphical language for our study design template; state
machines are well known to students as well as developers. Therefore, participants
can be recruited from a university as well as industrial context. Participants do
not have to learn a new graphical language. State machines support a wide range
of complexity. It is possible to create very simple state machines but also very
complex ones. This makes it easy to create di erent kinds of tasks for the study.
Furthermore, you can choose from a wide range of di erent graphical modeling
editors that support state machines.</p>
        <p>On the other hand, if the editor to be evaluated supports exactly one
graphical modeling language and no real users are available for participating in the
study, the participants, e.g., students, require a training for that DSL.
Regardless of whether real users or just test users participate, for later analysis, it is
important to distinguish between problems related to the tool and problems
related to the used language.</p>
        <p>Example The Eclipse community provides with Eclipse Modeling
Framework (EMF), Graphical Editing Framework (GEF), and Graphical Modeling
Framework (GMF) a rich toolbox to create graphical model-driven DSLs and
corresponding graphical tools. Many graphical tools such as Papyrus3, Graphiti4,
and Yakindu Statechart Tools5 are based on these frameworks. We choose Yakindu
as an editor based on GEF to be evaluated. We prefer Yakindu for the following
two reasons: rst, it supports state machines (see above). Second, it is the tool
which focuses the most on graphical editing. It hides a lot of complexity by
allowing the user to manipulate and create elements directly in the graphical view.
In contrast to, e.g., Papyrus, no dialog windows must be lled in. Most
interactions take place inside the graphical view. This in turn supports the internal
validity of a study.
3.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Tasks</title>
        <p>As mentioned before, our study design template focuses on the interaction with
the graphical representation of a model which is a diagram in most cases. We
suggest to de ne tasks that let participants recreate a given graphically modeled
system, e.g., handed out on the printed task description, or edit and refactor
a prepared diagram by using the tool to be evaluated. A participant's result
does not have to look exactly the same as in the task description, but it must
be functionally equivalent. Setting the task to just create a diagram similar to
the one shown on a screenshot ensures that all observed problems are related to
the editor and not in uenced by a problem in understanding the task or textual
system description. Overall, this supports the internal validity.</p>
        <p>
          The tasks to develop should cover a wide range of scenarios and
functionality of the chosen graphical DSL and modeling tool. Furthermore, the tasks
should become more and more complex in order to support the users' learning
curve [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ]. On the one hand, the complexity of a task depends on the number of
objects, connections and language concepts used; on the other hand, the more
objects that need to be managed, the more it becomes di cult to keep the
modeled diagram readable and understandable. Therefore, the layout of the diagram
created by a participant should be readable and comprehensible. We provide a
set of rules with the intention to make the participants change a default layouted
diagram to improve its readability and comprehensibility. Some of our rules are
adopted from the work of Purchase [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ]:
{ Objects do not overlap
{ Edges do not overlap and have a distinguishable margin in between [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ]
{ Prevent crossing edges [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ]
{ A label has a clear relation to its edge
{ Prevent bends in edge's path [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ]
{ All labels, names, titles, and descriptions are displayed completely without
abridging points
3 https://www.eclipse.org/papyrus/
4 https://www.eclipse.org/graphiti/
5 https://www.itemis.com/en/yakindu/state-machine/
The de ned rules are a minimum set of rules that should be ful lled by a
participant's diagram to complete a task. This enforces users to adapt the layout of
their diagram by using the tools provided by the evaluated editor. These rules
can be extended depending on the chosen graphical modeling language and
evaluated editor.
        </p>
        <p>Example A study can consist of di erent state machine tasks: state machines
should be created from scratch, but also existing ones should be edited, and
refactored. The tasks should cover a wide range of state machine functionality
such as hierarchy, nested states, parallel states, transition with guards in di erent
levels of complexity. Simple state machines consist of only a few connected states
without any hierarchy or parallelism. Complex state machines can consist of more
than 20 states with up to 50 transitions, hierarchy and parallelism.</p>
        <p>All participants receive a prepared Yakindu workspace (following the example
from the previous subsection), and for each task, a printed screenshot of a state
machine. This state machine has to be previously modeled by the researchers,
with the tool of which the usability is to be evaluated. All required events,
variables, and if necessary incomplete state machines are prepared a priori in
the workspace.
3.3</p>
      </sec>
      <sec id="sec-3-3">
        <title>Participants</title>
        <p>At best, real users of the tool can participate in a study. In most cases, however,
this will not be the case. Therefore, in this chapter we describe the requirements
that the participants must ful ll in order to meet the characteristics of the real
users as much as possible.</p>
        <p>
          We assume that users of graphical modeling tools are expert users. Expert
users (in contrast to casual users) use a tool regularly for a long period of time. In
order to achieve a su cient external validity, the recruited participants should
be at least well experienced in working with computers and software in
general, and with graphical modeling tools in particular. The graphical modeling
language must also be well known, if not, participants should complete a
training beforehand. It is not required that participants have prior experience with
the speci c editor to be evaluated but they should have experience with any
graphical modeling tool for at least six months to increase internal validity. To
check whether participants ful ll all requirements, the TA-EG questionnaire [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]
should be used. Additionally, in a questionnaire participants should be explicitly
asked for experience in graphical modeling tools, see following Subsection 3.4.
        </p>
        <p>
          The TA-EG questionnaire (original German title: Technika ntat {
Elektronische Gerate, translated: Technology A nity { Electronic Devices) can be used
to measure a nity to technical devices like computers, mobile phones, or
navigation systems [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]. TA-EG consists of 19 Likert-scale questions grouped into the
four categories enthusiasm, competence, negative attitudes, and positive attitudes.
By using TA-EG, we assess the characteristics of our participants regarding each
category. In this way, we can ensure that the participants have a positive
attitude towards technology, which we assume corresponds to that of expert users
in the context of modeling tools.
        </p>
        <p>
          Nielsen and Landauer [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ] describe the number of participants that are
required for problem discovery studies as depending on two factors. First, it
depends on the percentage of all usability problems that should at least be found.
Second, it should take into consideration the probability of a single problem
being found by a participant. Across eleven usability studies, Nielsen and
Landauer found the average probability of a problem being found ranges from 0.16
to 0.60 with an average of 0.31. Following the authors' formulation, to detect
90 % of all problems, we require at least nine participants with a problem
detection probability of 0.31 (average case). To discover 90 % of all problems with
a problem detection probability of only 0.16 (worst case), we require at least
15 participants. Additionally, we require at least one informal participant for a
pilot test to x possible errors in the concrete study design.
3.4
        </p>
      </sec>
      <sec id="sec-3-4">
        <title>Data Sources</title>
        <p>Our study design template bene ts from multiple measures that generate
qualitative as well as quantitative data: think-aloud observation, a questionnaire, and
a semi-structured interview. The three methodologies should be conducted one
after the other as listed above.</p>
        <p>
          Using several data sources limits the e ects of a possible wrong interpretation
of one single data source. Triangulation can be used to increase the validity of
observed problems [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ].
        </p>
        <p>
          During the think-aloud observation the participants have to use the system
while continuously thinking out loud, and being observed by an experimenter.
Thinking out loud means that participants have to verbalize their thoughts,
describe what they actually expect and what happens instead. During an
observation session, audio and video are recorded. It should be noted that users might
give false impressions or own theories of the cause of usability problems. The
experimenter should focus on what users actually do instead of what users say
they do. For example, a participant might criticize a missing button even though
the participant just has not seen it. The real problem is therefore the visibility of
the button and not its absence. Later analysis is needed to abstract the observed
problems from the participants and identify the underlying usability problems.
On the other hand, users' comments on user interface elements, what they like
and do not like, can be useful input for later improvements [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ].
        </p>
        <p>Directly after nishing the think-aloud observation, participants are asked
to ll in a questionnaire. The questionnaire consists of three parts: the System
Usability Score (SUS) for measuring the usability of the evaluated tool by a
quantitative score, TA-EG questionnaire, and questions about previous
experience and demographic data.</p>
        <p>
          SUS [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] is a well established tool to measure usability with a quantitative
scale. SUS is used to get a rst impression of the usability of the evaluated tool.
Furthermore, this score can be compared to possible improvements in the future
or other evaluated tools. Our study design contains SUS because it consists of
only ten Likert-scale questions. After a possible long lasting think-aloud
observation, participants do not want to ll in an extensive usability questionnaire.
Although SUS is a short questionnaire, the resulting score, a value between 0
and 100, is meaningful and valid [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. The numerical SUS score is not a
percentage, despite its appearance. To interpret the numerical score, we use adjective
and grade rating scales developed by Bangor et al. [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]. Their work is based on
analysis of 1,000 SUS surveys. The average score of the examined SUS-surveys
is 68. For example, this numerical SUS score corresponds to ok on the adjective
rating scale and to D on the grade scale.
        </p>
        <p>As introduced before in Subsection 3.2, the TA-EG questionnaire and asking
for prior experience are used to validate the sample against the previously de ned
requirements. Beside just asking for prior experience in general, we recommend
to ask explicitly for graphical modeling tools that participants are experienced
with and for how long they already use them. This data helps to identify a
possible bias that participants might have.</p>
        <p>
          Finally, a semi-structured interview should be used to explore what the users
have in mind when working with the editor and to get an understanding of
interesting observations during the think-aloud phase [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]. Each participant has to
answer pre-determined questions about the overall experience, what was good, and
what was bad when they worked with an editor. Aside from the pre-determined
questions, some of the interview questions should be based on the observations
during the think-aloud phase to get a deeper understanding of the users' behavior
in speci c situations, e.g., error situations.
        </p>
        <p>Example The researcher noted a participant had problems clicking at a
speci c node. In the course of the semi-structured interview, the researcher may
ask the participant to explain that situation in her or his own words, explain
the actual goal and what happened instead. The researcher could also ask for
possible improvements.
3.5</p>
      </sec>
      <sec id="sec-3-5">
        <title>Qualitative Analysis</title>
        <p>
          Our study design mainly generates qualitative data. Coding is our proposed way
to analyze and structure this data. A code is a short phrase or sentence. It is
assigned to a text phrase of the transcribed interviews or video snippet recorded
during the think-aloud observation. The code should be the essence or meaning
of the coded data [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ].
        </p>
        <p>
          As initially de ned, our study design addresses discovery of problems and the
context in which these problems occur. We suggest the open coding technique to
identify problems in the collected data. Open coding breaks down the data into
rst provisional, comparative codes [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ]. Afterwards, we suggest to build up a
category system with the focus on a ected elements and performed actions by
users. The category system should emerge from the coded data.
Example A problem is observed when a participant tries to click at a small
connection with the intention to change its position. Instead of clicking at the
connection, the participant clicks at the underlying box and moves the box
instead. One way to code the described problem might be connection (category
context) and click hit (performed action).
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Conclusion</title>
      <p>Although MDE has a vital research community, its ideas and results are still
not well established in the eld. One reason might be the poor usability of
provided tools and/or provided frameworks to build tools. While there are several
studies that quantitatively measure the usability of MDE tools, we try to discover
concrete problems in order to work on their improvement.</p>
      <p>In this paper, we introduce a study design template that allows for conducting
qualitative usability studies more easily. We describe the di erent aspects of
such a study in detail (objects, tasks, participants, data sources, and qualitative
analysis) and discuss our suggested methods.</p>
      <p>This more abstract description is supplemented by an ongoing example in
which the usability of the GEF-based tool Yakindu was examined during the
creation or modi cation of statecharts. We have actually conducted this study
and are currently working to fully evaluate the results.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Al-Wabil</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Al-Khalifa</surname>
          </string-name>
          , H.:
          <article-title>A framework for integrating usability evaluations methods: The Mawhiba web portal case study</article-title>
          .
          <source>In: 2009 International Conference on the Current Trends in Information Technology (CTIT)</source>
          . pp.
          <volume>1</volume>
          {
          <issue>6</issue>
          (Dec
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Andre</surname>
            ,
            <given-names>T.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hartson</surname>
            ,
            <given-names>H.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Belz</surname>
            ,
            <given-names>S.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McCreary</surname>
            ,
            <given-names>F.A.</given-names>
          </string-name>
          :
          <article-title>The user action framework: a reliable foundation for usability engineering support tools</article-title>
          .
          <source>International Journal of Human-Computer Studies</source>
          <volume>54</volume>
          (
          <issue>1</issue>
          ),
          <volume>107</volume>
          {
          <fpage>136</fpage>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Bangor</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kortum</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Miller</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <source>Determining What Individual SUS Scores Mean: Adding an Adjective Rating Scale</source>
          <volume>4</volume>
          (
          <issue>3</issue>
          ),
          <volume>114</volume>
          {
          <fpage>123</fpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Bangor</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kortum</surname>
            ,
            <given-names>P.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Miller</surname>
            ,
            <given-names>J.T.</given-names>
          </string-name>
          :
          <article-title>An Empirical Evaluation of the System Usability Scale 24(6</article-title>
          ),
          <volume>574</volume>
          {
          <fpage>594</fpage>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Barisic</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Amaral</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          , Goula~o,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Barroca</surname>
          </string-name>
          ,
          <string-name>
            <surname>B.</surname>
          </string-name>
          :
          <article-title>Evaluating the Usability of Domain-Speci c Languages</article-title>
          .
          <source>In: Software Design and Development: Concepts</source>
          , Methodologies, Tools, and Applications, pp.
          <volume>2120</volume>
          {
          <fpage>2141</fpage>
          . IGI Global, Hershey, PA, USA (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Basili</surname>
            ,
            <given-names>V.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shull</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lanubile</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Building knowledge through families of experiments</article-title>
          .
          <source>IEEE Transactions on Software Engineering</source>
          <volume>25</volume>
          (
          <issue>4</issue>
          ),
          <volume>456</volume>
          {473 (Jul
          <year>1999</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Bobkowska</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reszke</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Usability of UML Modeling Tools</article-title>
          .
          <source>In: Proceedings of the 2005 Conference on Software Engineering: Evolution and Emerging Technologies</source>
          . pp.
          <volume>75</volume>
          {
          <fpage>86</fpage>
          . IOS Press, Amsterdam, The Netherlands (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Bordeleau</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liebel</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Raschke</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stieglbauer</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tichy</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Challenges and research directions for successfully applying MBE tools in practice</article-title>
          .
          <source>In: Proceedings of MODELS</source>
          <year>2017</year>
          <article-title>Satellite Event: Workshops (ModComp, ME</article-title>
          , EXE, COMMitMDE, MRT,
          <string-name>
            <surname>MULTI</surname>
          </string-name>
          , GEMOC, MoDeVVa, MDETools, FlexMDE, MDEbug), Posters, Doctoral Symposium, Educator Symposium, ACM Student Research Competition, and
          <article-title>Tools and Demonstrations</article-title>
          .
          <source>CEUR Workshop Proceedings</source>
          , vol.
          <year>2019</year>
          , pp.
          <volume>338</volume>
          {
          <fpage>343</fpage>
          .
          <string-name>
            <surname>CEUR-WS.org</surname>
          </string-name>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Brooke</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>SUS: A "quick and dirty" usability scale</article-title>
          .
          <source>In: Usability Evaluation in Industry. Taylor and Francis</source>
          (
          <year>1986</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Condori-Fernandez</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Panach</surname>
            ,
            <given-names>J.I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baars</surname>
            ,
            <given-names>A.I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vos</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pastor</surname>
            ,
            <given-names>O.:</given-names>
          </string-name>
          <article-title>An empirical approach for evaluating the usability of model-driven tools</article-title>
          .
          <source>Science of Computer Programming</source>
          <volume>78</volume>
          (
          <issue>11</issue>
          ),
          <volume>2245</volume>
          {
          <fpage>2258</fpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Cuenca</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bergh</surname>
            ,
            <given-names>J.V.</given-names>
          </string-name>
          <year>d</year>
          .,
          <string-name>
            <surname>Luyten</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Coninx</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>A User Study for Comparing the Programming E ciency of Modifying Executable Multimodal Interaction Descriptions: A Domain-speci c Language Versus Equivalent Event-callback Code</article-title>
          .
          <source>In: Proceedings of the 6th Workshop on Evaluation and Usability of Programming Languages and Tools</source>
          . pp.
          <volume>31</volume>
          {
          <fpage>38</fpage>
          .
          <source>PLATEAU</source>
          <year>2015</year>
          , ACM, New York, NY, USA (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Hornb</surname>
          </string-name>
          k, K.:
          <article-title>Current practice in measuring usability: Challenges to usability studies and research</article-title>
          .
          <source>International Journal of Human-Computer Studies</source>
          <volume>64</volume>
          (
          <issue>2</issue>
          ),
          <volume>79</volume>
          {
          <fpage>102</fpage>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13. ISO: ISO 9241-
          <fpage>11</fpage>
          :
          <year>1998</year>
          <article-title>Ergonomic requirements for o ce work with visual display terminals (VDTs) { Part 11: Guidance on usability</article-title>
          .
          <source>Tech. rep</source>
          .,
          <source>International Organization for Standardization</source>
          (
          <year>1998</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>John</surname>
            ,
            <given-names>B.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kieras</surname>
            ,
            <given-names>D.E.</given-names>
          </string-name>
          :
          <article-title>The GOMS Family of User Interface Analysis Techniques: Comparison and Contrast</article-title>
          .
          <source>ACM Trans. Comput.-Hum. Interact</source>
          .
          <volume>3</volume>
          (
          <issue>4</issue>
          ),
          <volume>320</volume>
          {351 (Dec
          <year>1996</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Karrer</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Glaser</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Clemens</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bruder</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Technika nitat erfassen{der Fragebogen TA-EG</article-title>
          . pp.
          <volume>196</volume>
          {
          <fpage>201</fpage>
          . No.
          <article-title>8 in Der Mensch im Mittelpunkt technischer Systeme (</article-title>
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Lazar</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Feng</surname>
            ,
            <given-names>J.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hochheiser</surname>
          </string-name>
          , H.:
          <article-title>Research Methods in Human-Computer Interaction</article-title>
          . Wiley (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Nielsen</surname>
            ,
            <given-names>J.: Usability</given-names>
          </string-name>
          <string-name>
            <surname>Engineering</surname>
          </string-name>
          . Academic Press (
          <year>1993</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Nielsen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Landauer</surname>
            ,
            <given-names>T.K.</given-names>
          </string-name>
          :
          <article-title>A mathematical model of the nding of usability problems</article-title>
          . pp.
          <volume>206</volume>
          {
          <fpage>213</fpage>
          . ACM Press (
          <year>1993</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Poltronieri</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zorzo</surname>
            ,
            <given-names>A.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bernardino</surname>
          </string-name>
          , M.,
          <string-name>
            <surname>de Borba Campos</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Usa-dsl: Usability evaluation framework for domain-speci c languages</article-title>
          .
          <source>In: Proceedings of the 33rd Annual ACM Symposium on Applied Computing</source>
          . pp.
          <year>2013</year>
          {
          <year>2021</year>
          . SAC '18,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          , New York, NY, USA (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <given-names>Poltronieri</given-names>
            <surname>Rodrigues</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            ,
            <surname>Campos</surname>
          </string-name>
          , M.d.B.,
          <string-name>
            <surname>Zorzo</surname>
            ,
            <given-names>A.F.</given-names>
          </string-name>
          :
          <article-title>Usability Evaluation of Domain-Speci c Languages: A Systematic Literature Review</article-title>
          . In:
          <article-title>HumanComputer Interaction. User Interface Design, Development and Multimodality, LNCS</article-title>
          , vol.
          <volume>10271</volume>
          , pp.
          <volume>522</volume>
          {
          <fpage>534</fpage>
          . Springer (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Post</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kagan</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>User requirements for OO CASE tools</article-title>
          .
          <source>Information and Software Technology</source>
          <volume>43</volume>
          (
          <issue>8</issue>
          ),
          <volume>509</volume>
          {
          <fpage>517</fpage>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Purchase</surname>
          </string-name>
          , H.:
          <article-title>Which aesthetic has the greatest e ect on human understanding? In: Graph Drawing</article-title>
          . pp.
          <volume>248</volume>
          {
          <fpage>261</fpage>
          . Springer (
          <year>1997</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Rouly</surname>
            ,
            <given-names>J.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Orbeck</surname>
            ,
            <given-names>J.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Syriani</surname>
          </string-name>
          , E.:
          <article-title>Usability and Suitability Survey of Features in Visual Ides for Non-Programmers</article-title>
          .
          <source>In: Proceedings of the 5th Workshop on Evaluation and Usability of Programming Languages and Tools</source>
          . pp.
          <volume>31</volume>
          {
          <fpage>42</fpage>
          . PLATEAU '14,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          , New York, NY, USA (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Safdar</surname>
            ,
            <given-names>S.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Iqbal</surname>
            ,
            <given-names>M.Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Khan</surname>
            ,
            <given-names>M.U.</given-names>
          </string-name>
          :
          <article-title>Empirical Evaluation of UML Modeling Tools{A Controlled Experiment</article-title>
          .
          <source>In: Modelling foundations and applications, Lecture Notes in Computer Science</source>
          , vol.
          <volume>9153</volume>
          , pp.
          <volume>33</volume>
          {
          <fpage>44</fpage>
          . Springer, Cham (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Saldan</surname>
          </string-name>
          <article-title>~a, J.: The Coding Manual for Qualitative Researchers. SAGE, 3rd edn</article-title>
          . (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Yin</surname>
            ,
            <given-names>R.K.</given-names>
          </string-name>
          :
          <source>Case Study Research: Design and Methods</source>
          . SAGE, 5th edn. (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>