<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>On the Impact of Diagram Layout: How Are Models Actually Read?</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Harald Storrle</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nick Baltsen</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Henrik Christo ersen</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Anja M. Maier</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Dept. of Applied Mathematics and Computer Science</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Dept. of Management Engineering, Technical University of</institution>
          <country country="DK">Denmark</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This poster presents the latest results from a very large eye tracking study (n=29) that explores how modelers read UML diagrams. We nd that various factors like layout quality, modeler experience, and diagram type lead to signi cant di erences in diagram reading strategies. We derive elements of a theory of diagram reading behavior from our ndings. This paper presents only late breaking results: all ndings presented, theories constructed, and conclusions drawn are of a preliminary nature. This paper does not present the amount and degree of evidence that would allow us to consider the contents as being scienti cally validated, yet.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Practical experience suggests that usage and understanding of UML diagrams
is greatly a ected by the quality of their layout: it is a lot harder to spot errors
and weaknesses in a cluttered diagram, and it is a lot easier to understand the|
quite literally|overall picture of a nicely laid out diagram. Previous research
has identi ed what constitutes good and bad practices of layout (see e.g. [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]),
but has failed to provide conclusive evidence in support of this hypothesis. In
fact,
      </p>
      <p>
        \We set up an experiment to study the impact of [layout] rules on
understandability [...] operationalized in terms of [...] faults and [...] time. We could
not identify a statistically signi cant relation between [them]."[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]
      </p>
      <p>
        On the other hand, our own previous work provided substantial evidence to
this e ect [
        <xref ref-type="bibr" rid="ref4 ref5">4, 5</xref>
        ]. We could demonstrate, for instance, that error rate, time used,
and subjectively assessed cognitive load increased for UML diagrams with bad
layout, and that this e ect is likely independent of the diagram type (we studied
the ve most commonly used diagram types of UML).
      </p>
      <p>Since there is such a stark contrast between our results and the previous
literature, we deemed it necessary to add to the validity of our study by a
replication of the results. The replication varies the previous studies in three
aspects.</p>
      <p>{ First, obviously, di erent participants were recruited for the present study.</p>
      <p>They are of a comparable population, though, so this amounts to a minor
variation point only.
{ Second, the study is designed and conducted by two graduate students (the
second and third authors), so as to reduce impact and bias through the
person of the experimenter of the previous studies (the rst author). Again,
this was expected to be of secondary importance due to the previous study
setup that had reduced personal interactions almost entirely.
{ Third, we use a di erent method (eye tracking) with objective,
physiological measurements of cognitive load, thus addressing critique that subjective
assessments of cognitive load are not su ciently reliable. We also used the
measurements previously employed (score, duration, subjective assessment)
to establish a point of reference.</p>
      <p>
        Next we were curious whether or not we would be able to observe any
identi able behavior patterns, that amount to reading strategies. If such reading
strategies do exist, would there be di erences under the various treatments, e.g.,
would novices exhibit di erent reading strategies than experts, say? Ideally, this
should lead to elements of a theory of how UML models (and similar artifacts)
are processed by human modelers, and what we can do to support this process.
Since these are subtle questions, and since we have not used eye tracking as a
research method before, we have developed and re ned our study design and
experimental procedure in a pilot study, see [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] (n=4). Here, we report initial
results from the main study following on the pilot study (n=29, one excluded
due to technical errors in the measurement process).
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Research Objective 1: Replication of previous results</title>
      <p>By and large, the present study produced the same results as previous studies:
the impact of layout quality on modeler performance could be replicated, though
it showed more in the objective measures (score, time) than in subjective e ort
assessment { in our earlier studies, we found a stronger e ect in cognitive load
than in objective achievement. The di erences are not so big as to be
inexplicable, though, which leads us to attribute it to the di erent experimental setting,
and the di erent populations. As in earlier experiments, the e ect appears
independent of diagram type, but is in uenced by the expertise level, and the size
of diagrams.</p>
      <p>
        Previously, we had only used subjective assessments of cognitive load which
have been criticized as less reliable than physiological indicators like heart rate
or skin conductivity. So we instrumented our stimuli accordingly and measured
the pupil dilation, xation duration, and blink rate associated to diagram
elements and easily localizable layout problems (e.g., line crossings, line bends).
We could clearly show that subjective assessments correlate tightly to
physiological measures, which is in line with existing literature on the subject [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. We
also found, that there is small, yet statistically signi cant evidence that one
objective performance indicator (score) increases with decreasing layout quality.
Obviously, this is counter-intuitive and in contrast to all previous ndings. So,
while we lack an explanation for this observation so far, we tend to attribute
this to a mistake in the experimental procedure.
      </p>
    </sec>
    <sec id="sec-3">
      <title>Research Objective 2: Existence and identi cation of behavioral patterns</title>
      <p>We could clearly identify behavioral patterns in the subjects reading diagrams.
In order to study these patterns, we recorded and analyzed the starting point
and progress of participants' scan paths. We found three di erent behavioral
patterns in them.</p>
      <p>{ Branch Following From Anchor (BFFA): An anchor would be
established either at the top left corner, the largest element, or the center of the
diagram. Starting from there, the graph structure of the diagram would be
followed in a systematic way.
{ Left/Right Top/Bottom (LRTB): starting at the top left corner, proceed
downwards and to the right irrespective of the direction of arrows in the
diagram, i.e., as if reading a text in the normal reading direction for western
languages.
{ Random Walk (RW): Starting at any point, continue to any other point
and so on, at no discernible pattern.</p>
      <p>LRTB coincides with BFFA for diagrams of a diagram type with natural
direction (e.g., Activity Diagrams), when the actual diagram layout agrees with
the reading direction. In the literature, this has been described as \diagram
ow", a notion that could be formalized as the dominant direction of directed
elements in a diagram. This de nition does not cover other kinds of ow like
circular or radial, but our study did not consider examples of these layout types.</p>
      <p>Interestingly, all participants (i.e., both experts and novices) followed the
BFFA or LRTB strategies for diagrams with good layout, irrespective of diagram
type. For diagrams with bad layout, however, novices tended to use the RW
strategy while experts continued to use the BFFA/LRTB strategies.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Research Objective 3: Analysis of underlying cognitive processes</title>
      <p>The stability of behavioral patterns we have observed were relatively stable over
di erent individuals, and even more so for advanced modelers. This might be
interpreted to show that while behavior exhibits individual variation in less skilled
modelers, increasing experience results in a converge of behavior. If this is so, it
is likely that one and the same cognitive process is e ective universally, in all
modelers|but what can we know about this process?</p>
      <p>
        In a recent study [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], we could show that an important factor for decreasing
modeler performance (and increasing di culty) is the size of the diagram, as
measured by the number of model elements. Our present study showed, however,
that diagram aws like line crossings exhibit similar cognitive load pro les as
shapes (i.e., diagram elements representing classes, use cases, and so on), as
measured by pupil dilation, xation duration, and blink rate. That means, that
increasing (or decreasing) the number of diagram elements has the same e ect
on cognitive processing than increasing (or decreasing) the number of layout
aws. This seems to suggest that diagram size and layout quality are just two
sides of a coin, and the same cognitive process is a ected by both.
      </p>
      <p>The literature on layout quality, however, frequently discusses two distinct
layers: one concerned with low-level layout aws like line crossings, and one
concerned with higher-level aspects like ow (this latter level is sometimes called
diagram architecture). And indeed, as we have seen above, there are clearly
discernible reading patterns that obviously are independent of diagram size, but
do depend on the overall layout structure. So, there is some indication that
diagram reading might involve at least two di erent cognitive processes, but it
is not quite clear whether and how they are connected.</p>
      <p>For diagram types with a \natural" ow like Activity Diagrams, it is fairly
obvious to assume that following the reading direction (strategy BR) leads to
good layouts. So we might be tempted to consider \layout ow" as a layout
quality measure. But what does ow mean for diagrams with no natural
directionality? Also, our sample did not contain diagrams with radial or circular
layout where di erent ow might be observed. Finally, it might be that ow is
the wrong notion to begin with, and we should rather consider ow as a special
kind of symmetry, and explore this notion instead. This will require experiments
on speci cally created layouts that highlight one or the other aspect, and to see
whether these layout styles give rise to di erent behavior.
5</p>
    </sec>
    <sec id="sec-5">
      <title>Interpretation</title>
      <p>We conduct a study on the impact of layout quality, expertise level, and diagram
type on the understanding of UML diagrams. Like in earlier studies, we nd
that the diagram understanding outcome (measured in time, errors, and load)
is a ected by layout quality, even though in the present experiment, we used
eye tracking instead of the subjective assessments of cognitive load we had used
before. We conclude that our initial study and our replication corroborate each
other, underlining the validity of the evidence presented.</p>
      <p>But it is not just the outcome that is a ected by layout quality, we also
observe changes to the diagram reading behavior (measured as start location and
subsequent evolution of the scan path). Even though the results presented here
are of preliminary nature they clearly indicate that there are several distinct
reading strategies. Surprisingly, we nd the same reading strategies for di erent
diagram types: one would expect that diagram types with a natural
directionality might be read in a di erent way than diagrams without a \built-in" direction.
More research is needed to explore this question further, possibly the
experimental setup of our study (in particular, the respective layouts) limit the insight of
our study.</p>
      <p>
        We have replaced the global and undi erentiated assessment of cognitive load
used in earlier experiments by three concurrent, objective measurements with
high temporal and spatial resolution: pupil dilation, xation duration, and blink
rate. This allows us to identify the cognitive load of individual diagram elements,
and we observe that shapes (i.e., diagram elements representing classes, use
cases, actions, and so on) provoke a similar cognitive load pro le than layout
aw such as line crossings and line bends. In other words: increasing the number
of diagram elements has the same e ect on cognitive processing than increasing
the number of layout aws. This explains in a very elegant way our previous
nding that diagram size seems to be more important than layout quality: the
number of diagram aws can (often) be reduced, the diagram size (measured in
the number of elements, [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]) cannot.
6
      </p>
    </sec>
    <sec id="sec-6">
      <title>Summary and Future Work</title>
      <p>We replicate earlier experiments on the impact of layout quality (and other
factors) on UML diagram understanding by using eye tracking instead of subjective
assessment. We observe characteristic behavior patterns and study several in
uence factors. We observe a great uniformity in the reading strategies applied (at
least for experienced modelers) to the degree we are almost certain to observe a
universal cognitive process. If it is indeed universal, we might be able to nd a
physiological substrate to the observed behavior. Obviously, our present study
is not suited to explore this thread further, and we have to defer that to future
research employing di erent machinery, in particular brain imaging.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>Holger</given-names>
            <surname>Eichelberger</surname>
          </string-name>
          and
          <string-name>
            <given-names>K.</given-names>
            <surname>Schmid</surname>
          </string-name>
          .
          <article-title>Guidelines on the aesthetic quality of UML class diagrams</article-title>
          .
          <source>Information and Software Technology</source>
          ,
          <volume>51</volume>
          (
          <issue>12</issue>
          ):
          <volume>1686</volume>
          {
          <fpage>1698</fpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>Daniel</given-names>
            <surname>Gopher</surname>
          </string-name>
          and
          <string-name>
            <given-names>Rolf</given-names>
            <surname>Braune</surname>
          </string-name>
          .
          <article-title>On the Psychophysics of Workload: Why Bother with Subjective Measures? Human Factors</article-title>
          ,
          <volume>26</volume>
          (
          <issue>5</issue>
          ):
          <volume>519</volume>
          {
          <fpage>532</fpage>
          ,
          <year>1984</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Maier</surname>
          </string-name>
          ,
          <string-name>
            <surname>Anja</surname>
            <given-names>M.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Baltsen</surname>
          </string-name>
          , Nick and Christo ersen,
          <source>Henrik and Strrle</source>
          , Harald. Towards Diagram Understanding:
          <string-name>
            <given-names>A</given-names>
            <surname>Pilot-Study Measuring Cognitive Workload Through</surname>
          </string-name>
          Eye-Tracking.
          <source>In Proc. Intl. Conf. Human Behavior in Design</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>Harald</given-names>
            <surname>Sto</surname>
          </string-name>
          <article-title>rrle. On the Impact of Layout Quality to Unterstanding UML Diagrams</article-title>
          .
          <source>In Proc. IEEE Symp. Visual Languages and Human-Centric Computing (VL/HCC'11)</source>
          , pages
          <fpage>135</fpage>
          {
          <fpage>142</fpage>
          . IEEE Computer Society,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>Harald</given-names>
            <surname>St</surname>
          </string-name>
          <article-title>orrle. On the Impact of Layout Quality to Unterstanding UML Diagrams: Diagram Type and Expertise</article-title>
          . In Gennaro Costagliola, Andrew Ko, Allen Cypher,
          <article-title>Je rey Nichols, Christopher Sca di</article-title>
          , Caitlin Kelleher, and Brad Myers, editors,
          <source>Proc. IEEE Symp. Visual Languages and Human-Centric Computing (VL/HCC'12)</source>
          , pages
          <fpage>195</fpage>
          {
          <fpage>202</fpage>
          . IEEE Computer Society,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>Harald</given-names>
            <surname>Sto</surname>
          </string-name>
          <article-title>rrle. On the Impact of Layout Quality to Understanding UML Diagrams: Size Matters</article-title>
          . In Jurgen Dingel, Wolfram Schulte, Isidro Ramos, and Emilio Abrahao, Silviaand Insfran, editors,
          <source>Proc. 17th Intl. Conf. Model Driven Engineering Languages and Systems (MoDELS'14)</source>
          , number 8767 in LNCS, pages
          <volume>518</volume>
          {
          <fpage>534</fpage>
          . Springer Verlag,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>