<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Evaluating the Impact of Clutter in Linear Diagrams</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Mohanad Alqadah</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Gem Stapleton</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>John Howse</string-name>
          <email>john.howseg@brighton.ac.uk</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Peter Chapman</string-name>
          <email>p.chapman@napier.ac.uk</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Edinburgh Napier University</institution>
          ,
          <country country="UK">UK</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Brighton</institution>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Linear diagrams are an effective way of visualizing sets and their relationships. Sets are visualized by a collection of straight line segments and the ways in which the lines overlap indicate subset and disjointness relationships. As with many visualization methods, linear diagrams can become cluttered. In previous research, we established a clutter measure for linear diagrams that was empirically shown to correlate with perceived clutter. The aim of this paper is to determine the impact of linear diagram clutter on user task performance. An empirical study was conducted with three levels of clutter. Surprisingly, we found that diagrams with a medium level of clutter had signi cantly slower task performance than low and high cluttered diagrams. Moreover, we found no signi cant performance difference between the low and high clutter. We concluded that clutter affects the interpretation of linear diagrams. A future research goal is to establish methods for controlling the level of clutter in linear diagrams, such as using multiple diagrams instead of a single diagram, when visualizing sets.</p>
      </abstract>
      <kwd-group>
        <kwd>linear diagrams</kwd>
        <kwd>clutter</kwd>
        <kwd>diagram comprehension</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Presenting a lot of information in a diagram and the layout choices made when
drawing a diagram can result in visual clutter, known to sometimes be a barrier
to cognition. It is therefore essential to understand clutter and how to reduce
clutter when needed. Ellis and Dix state that it is \important to have a clear
understanding of clutter reduction techniques in order to design visualizations
that can effectively uncover patterns and trends" [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Bertini et al. have a similar
standpoint, devising an approach to analyze and reduce clutter in an information
visualization context [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>
        Visualizing information using diagrams can have huge bene ts over textual
notations, provided the diagrams are effective [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. In the context of visualizing
sets, there are several methods that can be used, including Euler, Venn, and
linear diagrams, with the rst two being well-known. In the context of Euler
and Venn diagrams, clutter was shown to have an impact on user
comprehension [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Linear diagrams were introduced by Leibniz in 1686 [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] and they are
      </p>
    </sec>
    <sec id="sec-2">
      <title>Thriller</title>
    </sec>
    <sec id="sec-3">
      <title>Comedy</title>
    </sec>
    <sec id="sec-4">
      <title>Period</title>
    </sec>
    <sec id="sec-5">
      <title>Drama</title>
    </sec>
    <sec id="sec-6">
      <title>Thriller</title>
    </sec>
    <sec id="sec-7">
      <title>Comedy</title>
    </sec>
    <sec id="sec-8">
      <title>Period</title>
    </sec>
    <sec id="sec-9">
      <title>Drama</title>
      <p>
        similar to parallel bargrams [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] and double decker plots [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Recently, Chapman
et al. [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] empirically established that linear diagrams more effectively support
task performance than Euler and Venn diagrams. Sato and Mineshima [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] have
also shown that linear diagrams are superior to the linguistic representations of
syllogisms in the context of logical reasoning. These results motivate the need
to understand the role of clutter in linear diagrams to better place us to exploit
their cognitive advantages.
      </p>
      <p>A linear diagram consists of parallel line segments drawn horizontally. Each
set presented in the diagram is represented by the line segments that share their
y-coordinate. For example, Fig. 1 represents four sets using eight line segments.
The set Thriller is represented by two line segments. Line segments for different
sets can occupy the same vertical space, known as an overlap, to represent the
intersection between the sets. For example, in Fig. 1 the left-most overlap
contains line segments for the sets Thriller, Comedy, and Drama but not Period. If
two sets do not share any overlaps then these two sets are disjoint. For instance,
in Fig. 1 set Period is disjoint from all the other sets in the diagram. Moreover,
if all of the line segments for a set occur in overlaps with line segments from
another set then the former is a subset of the latter.</p>
      <p>
        Research has suggested that to use linear diagrams successfully, we need to
understand when they can be effectively interpreted [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. In this paper we set
out to establish the impact of linear diagram clutter on user task performance, in
term of time and accuracy. The structure of the rest of this paper is as follows.
A theoretical measure of clutter in linear diagrams is given in section 2. We
present the experiment design in section 3. Section 4 describes how we executed
the experiment and section 5 presents the results of the study. In section 6, we
address the threats to validity. In section 7, we offer conclusions and discussion on
future work. The diagrams used in the study, along with the raw data collected,
can be found at https://sites.google.com/site/msapro/phdstudyfour.
2
      </p>
      <sec id="sec-9-1">
        <title>Clutter in Linear Diagrams</title>
        <p>
          In previous work we de ned a clutter measure, called the line score, for linear
diagrams: each set contributes n to the line score, where n is the number of line
segments that are used to represent that set [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]. We established that the line
score clutter measure meets user perceptions: diagrams with a higher line score
were generally perceived as more cluttered.
Rico Nina Wong Guss Nora Yeah Uriel Xanti Tina Becky Luce Bela Hetty Imani Kyja Zed Gerry Scot Karl Jade Vern Jere Fares Xara Tonya
Andy Kyah Nash Vila Pat Paul Ciara Ivy Dylan Ouida Fiona Logan Rafe Adam Seth Clara Wahid Xena Dino Yael Qays Syed
Hoke Adin Tobe Rynn Paul Beth Elea Gino Cora
Lana Flora Enzo
Omer
        </p>
        <p>For example, the diagram in Fig. 1 has a line score of 8; the contributions
of each set to the score are 2, 2, 1, and 3, from top to bottom. We can change
the order of the overlaps to get the semantically equivalent diagram in Fig. 2.
This change in the order of overlaps reduces the clutter score to 5. As a more
complex example, the diagrams in Figs 3 to 5 exhibit three different line scores,
with Fig. 3 being the least cluttered and Fig. 5 being most cluttered. In general,
we can permute the order of the overlaps without altering the information being
represented but leading to different clutter scores. Therefore, if we understand
the impact of clutter on task performance, we can chose between competing
overlap orders in order to best support task performance.
3</p>
      </sec>
      <sec id="sec-9-2">
        <title>Experiment Design</title>
        <p>
          To answer the research question `does linear diagram clutter affect user
comprehension?' we designed a within-group empirical study, building on an established
measure of clutter, the line score, for linear diagrams [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]. Each participant was
asked to answer questions about the information in a set of linear diagrams
which had varying levels of clutter. Consistent with other research
contributions towards understanding user comprehension, such as [
          <xref ref-type="bibr" rid="ref11 ref13 ref15 ref4">4, 11, 13, 15</xref>
          ], we used
        </p>
        <p>ACCOUNTING
MARKETING
FINANCE
PHYSICS
DESIGN
HISTORY
BIOLOGY</p>
        <p>Bambi Kris Neil Park Maxx Dixie Aila Amy Vicki Donia Frank Niko Hope Zaid Ura Xia Tyler Glen Kipp Wafi Waldo Lainy Yee Joey Esme
Maud Osie Orene Zelda Maya Perry Cathy Gia Jess Quint Bily Elza Irene Joni Fred Una Qabil Nick Fiona Rose Wafa Kyle
Tony Rona Pearl Rita Xenia Esta Owen Isam Luna
Zetta Yasin Trish
Qusai
research software that was designed to record user performance data while
performing the experiment. Two primary variables were recorded: the time taken to
answer the question and whether the question had been answered correctly. We
set a time limit of two minutes to answer each question and all timeouts were
recorded. If clutter level affects user comprehension of linear diagrams then we
would expect to see a signi cant difference between the means of the time taken
to answer the questions or the error rates over the different levels of clutter.
3.1</p>
        <sec id="sec-9-2-1">
          <title>Study Details and Question Types</title>
          <p>
            The participants were staff or students from the School of Computing,
Engineering and Mathematics at the University of Brighton; none of them were members
of the authors' research group. Consistent with Rodgers et al. [
            <xref ref-type="bibr" rid="ref16">16</xref>
            ], the diagrams
were drawn with pseudo-real data, conveying information about students'
interests in university subjects. For example in Figs 3, 4 and 5, the line segments for
each set represent a subject. The student names are written under the overlap
corresponding to the subjects in which they are interested. This design choice
was to ensure the context of the information presented was familiar to the
participants, removing any learning barrier that could arise through participants
learning a new context. The data presented in the diagrams was hypothetical to
ensure that prior knowledge cannot be used.
          </p>
          <p>
            Subject names consisted of one word, did not sound similar to each other and
did not share the same rst letter, because that could confuse the participants
as we found in our previous research [
            <xref ref-type="bibr" rid="ref1">1</xref>
            ]. Furthermore, each subject name was
placed on the left side of the diagram and aligned with, and took the same colour
as, the associated line segments. The order of the subject names was random
and no two diagrams had the same order. This was important to make sure
the participants had to read the question and the diagram each time, reducing
learning effect that could impact on performance.
          </p>
          <p>The student names that were presented in the diagrams were a randomized
mixture of both male and female names across a variety of ethnicities, to reduce
any bias that could occur if we chose the names manually. Moreover, the names
GEOLOGY
ARCHAEOLOGY
BUSINESS
NETWORKING
ECONOMICS
HISTORY
MUSIC
TECHNOLOGY
RESEARCH
PHYSICS</p>
          <p>Taryn Jim Itzel Lisa Zane Ravi Cesar Kenny Dylan Oisin Eva Cheri Betty Kylie Mona Gavin Cary Katy Lea Kent Colm Paris Kiera Jody Jake
Roni Seb Rhett Avery Nader Hazel Garth Mia Erik Asia Clark Zeb Axel Oran Lacey Kent Ervin
Tom Hugo Jorja Wyatt Aine Rylan Aden Annie Alton Isla Jalen Giles</p>
          <p>Uriel Kyrin Joyce Jess Glenn</p>
          <p>Marla
were rst name only and of a similar length (three, four or ve letters) to reduce
the risk of having a name that was more prominent than the others (e.g very long
names). We avoided using names more than once in the same diagram. Moreover,
the names that appear in the same overlap sounded different and started with
different rst letters. This design choice was made to limit any confusion and
make the names easier to remember.</p>
          <p>
            Three types of multiple choice questions were used in the study: `Who', `How
many', and `Which', consistent with previous research [
            <xref ref-type="bibr" rid="ref1 ref16 ref4">1, 4, 16</xref>
            ] 3. There were ve
choices of answer of which one was correct. Example questions are:
1. Who of the following is taking PHYSICS, HISTORY, GEOLOGY,
RE
          </p>
          <p>SEARCH and nothing else?
2. How many students are taking GEOLOGY, ARCHAEOLOGY, MUSIC,</p>
          <p>RESEARCH, PHYSICS and nothing else?
3. Which one of the following modules is being taken by 23 students?</p>
          <p>
            Questions 1, 2, and 3 were asked of the diagrams in Figs 3, 6, and 7
respectively. The set names that appear in the `Who' and `How many' questions, as
well as in the answers, match the same order in the relevant diagram from top
to bottom. This choice should make it easier for the participants to nd the
required sets to answer the questions. After choosing an answer for the `How
many' questions, the participants were asked to type the names of the students
they had counted. For their answer to be classed as correct, the names typed in
had to be correct. For `Which' questions, there was only one set in the diagram
that had the number of students' names speci ed in the question.
3 The empirical studies on linear diagrams in [
            <xref ref-type="bibr" rid="ref17">17</xref>
            ] used tasks that required people to
identify subset, disjointness, and non-empty intersection relationships between sets
and are, thus, different from the tasks we use in this study.
          </p>
          <p>GEOLOGY
NETWORKING
COMPUTING
MARKETING
TECHNOLOGY
RESEARCH
BUSINESS
FASHION
ACCOUNTING
HISTORY
ECONOMICS
DESIGN
PHYSICS</p>
          <p>Jude Yeah Isis Zeb Neal Tobe Logan Shon Umar Zoey Toby Josh Oneta Vera Jason Pink Elin Kyra Viola Wong Thea Gabe Xan Diana Qays
Brody Beth Lexi Aniya Rufus Kylie Riley Kent Kyah Kyler Amari Flora Clara Cass Paul Ouida Omar Matt Anna Nina
Qadir Mike Dora Lesly West Amil Xiao York Rick
Reine Gail Brook Dyan Evie
Jade
We used three different levels of clutter in the study, as measured by the line
score, to establish whether diagram clutter impacts user performance when
interpreting linear diagrams. It was felt important that the diagrams were sufficiently
complex to demand notable cognitive effort by the study's participants. Given
this, the diagrams were chosen so that their line scores (LSs) were: 10 20 (low
LS), 30 40 (medium LS), and 50 60 (high LS). For example, the three
diagrams in Figs 3, 6 and 7 have LSs of 18, 40, and 60 respectively. For the study, 27
diagrams were drawn (nine for each level of clutter). These 27 diagrams were
created from nine initial diagrams, which were then redrawn with different overlap
orders to create different levels of clutter.
3.3</p>
        </sec>
        <sec id="sec-9-2-2">
          <title>Clutter by Number of Sets</title>
          <p>To widen the generality of the results from our study, the number of sets
visualized in the diagrams varied: they had either 7, 10 or 13 sets. The distribution
of the number of sets to clutter scores is shown in table 1. For example, the
diagrams in Fig. 3, 6, and 7 represent 7, 10, and 13 sets respectively. The three
types of questions were asked from each set of diagrams. For instance, as can be
seen in table 1, the three types of questions were asked of the information in the
diagrams drawn with seven sets (d1, d2, and d3).</p>
          <p>By including a variety of numbers of sets represented, we will be able to
determine whether there is an interaction with the clutter score. Suppose, for
example, that we have two diagrams, d7 and d10, each with a clutter score of 20,
where d7 and d10 represent 7 and 10 sets respectively. Then d7 has, on average 2:9
lines per set, whereas d10 has just 2 lines per set. In general, when the diagram
clutter is high and the number of sets is small, the diagram has a high number
of line segments in a small (vertical) space, as measured by the number of sets
represented. When the number of sets increases there is more vertical space to
t the same number of line segments. Our study allows us to explore whether
the measure of clutter relative to the number of sets in uences cognition.
3.4</p>
        </sec>
        <sec id="sec-9-2-3">
          <title>Linear Diagram Layout and Characteristics</title>
          <p>As can be seen in table 1, the diagrams were divided into nine characteristic types
(d1 to d9); the number of sets represented appears under each characteristic
type. For each characteristic type there were three diagrams: a diagram for
each diagram clutter level. The three diagrams of each characteristic type were
allocated the same type of question. So, for example, d1 represented 7 sets, and
corresponded to a `Who' question, and was drawn in three different ways, to
give low, medium and high levels of clutter. The number of data items (student
names) was chosen to be large enough so that answering the question required
cognitive effort. There were one to ve students names within each overlap and
all the diagrams had 60 names. We randomly distributed the data items within
the overlaps in the diagrams. However, the number of items in each overlap was
the same in the three diagrams of each characteristic type. For example, the low
clutter diagram for d1 may have an overlap with, say, three student names. In
the medium clutter diagram for d1, this overlap need not occupying the same
vertical space but will still have three names written under it.</p>
          <p>
            We were careful to control the layout features for the diagrams drawn for the
study. Therefore, we adopted the following six layout guidelines to ensure
consistent layout features that re ected best-practice in linear diagram drawing [
            <xref ref-type="bibr" rid="ref17">17</xref>
            ]:
1. The diagrams were drawn in a horizontal direction.
2. All diagrams were drawn with coloured straight line segments and no two
sets appearing in the same diagram had the same colour.
3. The sets in the diagrams had the same distances between them.
4. The overlaps in the diagrams had the same length, xed at 50 pixels.
5. Vertical grid lines with grey colour' were used to represent the beginning
and the end of each overlap.
6. The line segments height was set to 6 pixels.
          </p>
          <p>
            Guidelines 1-6 are from [
            <xref ref-type="bibr" rid="ref17">17</xref>
            ], but they are not sufficient to control variability
across diagrams in our study. We stipulate ve more layout guidelines:
7. The order of the colour of sets is the same in all the diagrams.
8. All the diagrams had 25 overlaps.
9. The stroke width for the vertical grid lines was set to 2 pixels.
10. The set labels were written using upper case letters in Time New Roman,
14 point size, font in bold.
11. Data items were presented using lowercase letters, except that the rst letter
was capitalized, with black colour and Ariel 12 point size font.
          </p>
          <p>
            Consistent with [
            <xref ref-type="bibr" rid="ref5 ref8">5, 8</xref>
            ], a palette of thirteen colours was generated using
colorbrewer2.org (accessed January 2016). Colour generation using the Brewer colour
palette is recognized as a valid approach for empirical studies, such as in the
context of maps [
            <xref ref-type="bibr" rid="ref19">19</xref>
            ]. So that the colours were distinguishable, but not
sequential or suggestive (e.g. increasingly vivid shades of red used to denote heat), they
were generated using the `qualitative' option, based on work by Ihaka [
            <xref ref-type="bibr" rid="ref10">10</xref>
            ].
4
          </p>
        </sec>
      </sec>
      <sec id="sec-9-3">
        <title>Experiment Execution</title>
        <p>The initial pilot study was conducted with ve participants (4 M, 1 F, ages
18 38). The pilot study showed that the two minute time limit to answer
each question was enough for most participants, only one participant had 3
timeouts and 1 error, and another participant had 5 errors. The three remaining
participants answered all the questions correctly in the time limit. However, a
problem was revealed with the number of items that appear under the overlaps in
the diagrams: they were not equally distributed between the three diagrams for
each characteristic type. This unevenness may have caused bias across the clutter
levels, because the speed or accurately of answering questions could depend on
the distribution of items. We xed this issue by equalizing the number of items
in the same overlaps across the three diagrams for each characteristic type.
For example, in the left-most overlap in the diagram in Fig. 3 there are three
students names (Neil, Will, and Tina), the equal overlap in the middle part of
Fig. 4 contains three students names (Imani, Rafe, and Paul) as well, and in
same overlap in Fig. 5 also has three names (Glen, Fred, and Owen). The main
experiment was carried out with 30 participants (19 M, 11 F, ages 17 34). Each
participant was given $6, in the form of a canteen voucher, to take part.</p>
        <p>There were three phases for the experiment. First is the introduction and the
paper-based training phase, the experimental facilitator used the introduction
script to introduce the participant to linear diagrams and the type of questions
they will see during the experiment. The participants then started the second,
computer-based training, phase where the participants had the chance to use
the experiment software to answer three questions in the two minutes given for
each question without interruption from the experiment facilitator. If a training
question was answered incorrectly, the experimental facilitator went through
the question with the participant. If the participant wished to continue then we
proceeded to the third (main) study phase where we collected the quantitative
data for our analysis. Each participant signed a consent form to say they would
like to participate in the study. The participant was not interrupted or given any
help during this phase. After answering all the 27 questions, the participant was
given the debrief script, including the contact information of the experimental
facilitator so that they could access the results of the study.
5</p>
      </sec>
      <sec id="sec-9-4">
        <title>Experiment Results</title>
        <p>The results are based on the data collected from the 30 participants in the main
study phase, each answering 27 questions, yielding 27 30 = 810 observations.
5.1</p>
        <sec id="sec-9-4-1">
          <title>Overall Impact of Clutter</title>
          <p>We analyzed the time and error data to determine whether signi cant
differences exist, overall, between the three clutter levels. This analysis allowed us to
determine whether clutter level signi cantly impacts on user task performance.
Analysis of Time Data The observed means are 42.93 (sd: 27.65) seconds for
low diagram clutter, 46.35 (sd: 25.89) seconds for medium diagram clutter, and
42.23 (sd: 24.94) seconds for high diagram clutter. The data are not normally
distributed and have a high skew. To make the data suitable for conducting an
ANOVA, it was subjected to a log transformation (resulting skewness: -0.05).
Therefore, the ANOVA was conducted in this transformed data, yielding a
pvalue of 0.002 when testing for differences between the means for diagram clutter.
We therefore conclude that the mean time taken to interpret linear diagrams
alters signi cantly as the clutter score changes.</p>
          <p>To reveal how the signi cant differences arose, we conducted a Tukey test to
rank the clutter levels. Surprisingly, this test revealed that the difference between
the mean time taken to interpret linear diagrams with low diagram clutter versus
high diagram clutter is not signi cant. However, the difference between the mean
time taken to interpret linear diagrams with medium diagram clutter versus low
and high diagram clutter is signi cant. In conclusion, medium diagram clutter
has a signi cantly higher mean time as compared to low and high clutter. The
medium clutter level questions took roughly 3:5 to 4 seconds longer, on average,
to answer than the low and high clutter questions.</p>
          <p>Analysis of Errors Of the 810 observations (30 participants 27 questions)
there were a total of 31 errors and 18 timeouts which gives an error rate of 3.83%
and a timeout rate of 2.22%. Timeouts were counted as neither correct responses
nor errors. Table 2 shows the observed number of correct answers, errors and
timeouts for each clutter level. We performed a 2 test, giving a p-value of 0:287,
which is not signi cant 4. We conclude that diagram clutter does not affect the
comprehension of linear diagrams in terms of accuracy.
4 Treating timeouts as errors changes the p-value to 0:266.</p>
        </sec>
        <sec id="sec-9-4-2">
          <title>Interaction between Clutter and Number of Sets</title>
          <p>Analysis of Time As discussed earlier, by including a variety of numbers of
sets we are able to determine whether there is an interaction between the number
of sets and the clutter score. The following analysis seeks to address whether the
measure of clutter relative to the number of sets in uences cognition. The mean
times for each clutter level (along with the standard deviations) are shown in
Table 3. It can be seen that, for 10 and 13 sets, the medium level mean time is
higher than that for the low and high clutter levels, consistent with our overall
analysis. However, in the case of 7 sets, the mean times are all similar. This may
indicate that the level of cognitive difficult associated with the number of sets
represented in relation to the level of clutter is low in this case, and perhaps
suggests that clutter has more of an effect as the number of sets rises.</p>
          <p>Our ANOVA revealed a signi cant interaction, with p = 0:001. In table 4, the
statistical results of the pairwise comparisons of the interaction between diagram
clutter and number of sets. This reveals that:
1. Participants took signi cantly longer to answer questions of the 10 set
diagrams with medium clutter than the 10 and 13 set diagrams with high and
low clutter respectively.
2. Participants took signi cantly longer to answer questions of the 7 set
diagrams with low and medium clutter than the 10 and 13 set diagrams with
high and low clutter respectively.
3. Participants took signi cantly longer to answer questions of the 7 set
diagrams with high clutter than the 13 set diagrams with low clutter.</p>
          <p>The results suggest that the combination of diagram clutter and the number
of sets impacts on the time taken to interpret linear diagrams.
Analysis of Errors We will compare the error rates between the number of
sets represented. This will reveal whether increasing the number of sets, whilst
keeping the clutter level constant, impacts on cognition. However, we note that
the following analysis should be treated with caution, as the observed values
in some cases are very low. Firstly, in the case of low clutter, conducting a 2
test yields a p-value of 0:819, with the data presented in table 5. We conclude
that there is no signi cant difference between performance, in terms of errors,
as the number of sets varies when the clutter level is low. However, we saw in
our interaction analysis of the time data that low cluttered diagrams with 7 sets
took signi cantly longer for participants than the 10 and 13 set diagrams.</p>
          <p>In the case of medium clutter, conducting a 2 test yields a p-value of 0:068,
with the data presented in table 5. We again conclude that there is no signi
cant difference between performance, in terms of errors, as the number of sets
varies. This is consistent with our time analysis, where we found no signi cant
performance difference between the medium clutter diagrams as the number of
sets varied (see table 4).</p>
          <p>Lastly, for high clutter, conducting a 2 test yields a p-value of 0:823, with
the data presented in table 5. We again conclude that there is no signi cant
difference between performance, in terms of errors, as the number of sets varies.
This is again consistent with our time analysis, where we found no signi cant
performance difference between the high clutter diagrams as the number of sets
varied (see table 4).
5.3</p>
        </sec>
        <sec id="sec-9-4-3">
          <title>Discussion</title>
          <p>
            We now seek to explain the ndings of the study. We found no impact of clutter
on accuracy. The low error rates could be partially attributed to our design
choices: the diagrams were drawn based on guidelines that have been empirically
established to have positive impact on user comprehension [
            <xref ref-type="bibr" rid="ref17">17</xref>
            ].
          </p>
          <p>The only statistically signi cant differences found arose through our analysis
of the time data, which revealed, surprisingly, that a medium level of clutter
generally led to poor task performance. To understand why this might have
arisen, we begin by making some observations about the relationship between
clutter, the number of sets represented, and the lengths of the corresponding line
segments. Now, the average number of line segments per set is nuclmutbteerr osfcosreets .
Therefore, as the clutter score increases, the length of the line segments used
decreases. As we used a xed number (25) of overlaps this means that diagrams
with a high clutter score will exhibit more short lines, relatively speaking, than
diagrams with a low clutter score. Diagrams with a medium level of clutter
will fall somewhere in the middle. Consequently, in low clutter diagrams, many
sets are typically represented by just one or two line segments. Medium clutter
diagrams represent sets by around 2 to 3 line segments and High clutter diagrams
represent sets by around 3 to 5 line segments. Therefore, we might expect many
long line segments in the low clutter diagrams and short line segments in the
high clutter diagrams. However, the medium clutter diagrams having will have
more of a mix of the two, and therefore perhaps appear less uniform in layout.
This can be seen in Fig. 3 (low clutter: many long line segments), Fig. 5 (high
clutter: many short line segments) and Fig. 4 (medium clutter: more variability
in line segment length).</p>
          <p>Through our statistical analysis, we have demonstrated that extracting
information takes signi cantly longer from a medium cluttered diagram than from
a low or high cluttered diagram. We can infer, then, that understanding linear
diagrams with a mixture of short and long line segments is more difficult than
understanding diagrams which predominantly use only long, or short, line
segments. We also found that there is no signi cant difference between the high
cluttered diagrams and the low cluttered diagrams, overall. We hypothesize that
different reading strategies, which we call reading vertically and reading
horizontally, are employed for each type of diagram. The short line segments in highly
cluttered diagrams could make it easier to identify an overlap by reading
vertically, since the short line segments produce visually distinct blocks. We can
thus easily associate an overlap with its items. By contrast, when a diagram
is low cluttered it consists of long line segments which could make it easier to
read horizontally, allowing an easy overview of the entire set. It might be that a
mixture of these strategies best helps read medium cluttered diagrams, or that
neither are ideal, increasing cognitive load. Further work is needed to establish
whether people actually employ these reading strategies generally.
6</p>
        </sec>
      </sec>
      <sec id="sec-9-5">
        <title>Threats to Validity</title>
        <p>
          We now discuss the limitations of the study and how threats were managed to
minimize their impact on the results. We categorized the threats to validity as
internal, construct, and external as in [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]. In regard to the internal validity, the
following factors have been considered during the study design:
1. Motivation effect : this could a threat if the participants did not freely
volunteered to take part. To manage this effect, the participants were invited
to take part as volunteers and they were given $6 refreshment voucher for
their time.
2. Laboratory : it is important to expose all participants to the same
environment in this type of study. The experiment took place in an usability
laboratory where the environment was free from noise and interruption and all
participants were treated in the same way.
3. Fatigue effect : fatigue could be a threat if the participants where exposed to
repeated tasks for a long time. To control this, the study was designed to
nish in less than one hour.
4. Questions order : we presented the questions in a random order for each
participant to reduce any effect (e.g. tiredness, learning) owing to order.
5. Layout variation: many choices about diagram layout were made, such as
the colour and the thickness of the lines [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ]. To manage the threat of layout
variations, we followed drawing guidelines.
        </p>
        <p>
          Construct validity considers whether the identi ed dependent (primary) and
independent (secondary) variables are the correct choices. Consistent with other
researchers who studied user comprehension, for instance [
          <xref ref-type="bibr" rid="ref11 ref13 ref15 ref4">4, 11, 13, 15</xref>
          ],
comprehension is measured by the time taken to answer the questions and the number of
errors made. In addition, we also considered false negatives: these could arise if a
participant selected the wrong answer while reading it to be the correct answer.
To mange this, the similarity of university subject (i.e. set) names and student
names was controlled. Furthermore, when participants were asked a `How many'
question, they were required to type in the names of the students they had
counted to reduce false positives.
        </p>
        <p>A secondary variable is the choice of the diagrams. It could be a threat
if participants did not spend enough time understanding the diagrams before
answering the questions. To manage this threat we created diversity in the
diagrams and designed them to convey an appropriate level of information in order
to require cognitive effort when answering questions.</p>
        <p>With regard to external validity, the following factor indicate the limitations
of the results and to which extent we can generalize them. It is a common threat
to the external validity of user studies if the participant have not represent a
wider population. All participants were staff or students from the University
of Brighton with little or no previous experience of using linear diagrams. The
question styles could be a threat if there was no variety in the questions asked
of the linear diagrams. To allow more generalization to the study we used three
styles of questions.
7</p>
      </sec>
      <sec id="sec-9-6">
        <title>Conclusion</title>
        <p>In this paper, we determined whether, and how, clutter affects user
understanding when interpreting linear diagrams. The results show that clutter does affect
the comprehension of linear diagrams. Surprisingly, increasing clutter does not
always negatively affect comprehension: the medium cluttered diagrams required
signi cantly longer to interrogate than low or high cluttered diagrams. Moreover,
we found no signi cant difference between the time taken to interpret high or
low cluttered diagrams. Furthermore, the error data analysis showed that there
is no signi cant relationship between error rates and complexity of the diagram.
In summary, we found that diagram clutter affects time taken to interpret
information in linear diagrams, but it does not affect the accuracy with which
participants answered questions about the data visualized in linear diagrams.</p>
        <p>These results inform the direction of further research into the effect of the
complexity of linear diagrams on comprehension. For instance, it would be
interesting to establish strategies people employed for reading linear diagrams, as
hypothesized in our discussions above. A further fruitful step for this research
is to determine whether it is bene cial to use multiple linear diagrams to
convey information, rather than a single diagram, to control the amount of clutter
present in any individual diagram, informed by the results of this study.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Alqadah</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stapleton</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Howse</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chapman</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Evaluating the impact of clutter in Euler diagrams</article-title>
          .
          <source>In: Diagrammatic Representation and Inference</source>
          , pp.
          <volume>108</volume>
          {
          <fpage>122</fpage>
          . No. 8578
          <string-name>
            <surname>in</surname>
            <given-names>LNAI</given-names>
          </string-name>
          , Springer (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Alqadah</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stapleton</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Howse</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chapman</surname>
            ,
            <given-names>P.:</given-names>
          </string-name>
          <article-title>The perception of clutter in linear diagrams</article-title>
          . In:
          <article-title>Diagrammatic Representation and Inference (accepted for)</article-title>
          .
          <source>LNAI</source>
          , Springer (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Bertini</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dell'Aquila</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Santucci</surname>
          </string-name>
          , G.:
          <article-title>Reducing infovis cluttering through non uniform sampling, displacement, and user perception</article-title>
          .
          <source>In: Electronic Imaging</source>
          <year>2006</year>
          . pp.
          <source>60600L{60600L. International Society for Optics and Photonics</source>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Blake</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stapleton</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rodgers</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cheek</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Howse</surname>
          </string-name>
          , J.:
          <article-title>Does the orientation of an Euler diagram affect user comprehension</article-title>
          ?
          <source>In: The 18th International Conference on Distributed Multimedia Systems, International Workshop on Visual Languages and Computing</source>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Chapman</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stapleton</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rodgers</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Micallef</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Blake</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Visualizing sets: an empirical comparison of diagram types</article-title>
          .
          <source>In: Diagrammatic Representation and Inference</source>
          , pp.
          <volume>146</volume>
          {
          <fpage>160</fpage>
          . No. 8578
          <string-name>
            <surname>in</surname>
            <given-names>LNAI</given-names>
          </string-name>
          , Springer (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Couturat</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <string-name>
            <surname>Opuscules</surname>
          </string-name>
          et fragments inedits de Leibniz. Felix
          <string-name>
            <surname>Alcan</surname>
          </string-name>
          (
          <year>1903</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Ellis</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dix</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>A taxonomy of clutter reduction for information visualisation</article-title>
          .
          <source>IEEE Transactions on Visualization and Computer Graphics</source>
          <volume>13</volume>
          (
          <issue>6</issue>
          ),
          <volume>1216</volume>
          {
          <fpage>1223</fpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Harrower</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brewer</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Colorbrewer.org: An online tool for selecting colour schemes for maps</article-title>
          .
          <source>Cartographic Journal</source>
          <volume>40</volume>
          (
          <issue>1</issue>
          ),
          <volume>27</volume>
          {
          <fpage>37</fpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Hofmann</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Siebes</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wilhelm</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Visualizing Association Rules with Interactive Mosaic Plots</article-title>
          .
          <source>In: 6th ACM SIGKDD international conference on Knowledge discovery and data mining</source>
          . pp.
          <volume>227</volume>
          {
          <fpage>235</fpage>
          .
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2000</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Ihaka</surname>
          </string-name>
          , R.:
          <article-title>Colour for presentation graphics</article-title>
          .
          <source>In: In 3rd Int. Workshop on Distributed Statistical Computing</source>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Isenberg</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bezerianos</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dragicevic</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fekete</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>A study on dual-scale data charts</article-title>
          .
          <source>In: IEEE Transactions on Visualization and Computer Graphics</source>
          . pp.
          <volume>2469</volume>
          {
          <fpage>2478</fpage>
          .
          <string-name>
            <surname>IEEE</surname>
          </string-name>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Larkin</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Simon</surname>
          </string-name>
          , H.:
          <article-title>Why a diagram is (sometimes) worth ten thousand words</article-title>
          .
          <source>Journal of Cognitive Science</source>
          <volume>11</volume>
          ,
          <issue>65</issue>
          {
          <fpage>99</fpage>
          (
          <year>1987</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Puchase</surname>
          </string-name>
          , H.:
          <article-title>Which aesthetic has the greatest effect on human understanding?</article-title>
          <source>In: 5th International Symposium on Graph Drawing</source>
          . pp.
          <volume>248</volume>
          {
          <fpage>261</fpage>
          . Springer (
          <year>1997</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Purchase</surname>
            ,
            <given-names>H.C.</given-names>
          </string-name>
          :
          <article-title>Experimental human-computer interaction: a practical guide with visual examples</article-title>
          . Cambridge University Press (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Riche</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dwyer</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Untangling Euler diagrams</article-title>
          .
          <source>IEEE Transactions on Visualization and Computer Graphics</source>
          <volume>16</volume>
          (
          <issue>6</issue>
          ),
          <volume>1090</volume>
          {
          <fpage>1099</fpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Rodgers</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Purchase</surname>
          </string-name>
          , H.:
          <article-title>Wellformedness properties in Euler diagrams: Which should be used</article-title>
          ?
          <source>IEEE Transactions on Visualization and Computer Graphics</source>
          <volume>18</volume>
          (
          <issue>7</issue>
          ),
          <volume>1089</volume>
          {
          <fpage>1100</fpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Rodgers</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stapleton</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chapman</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Visualizing sets with linear diagrams</article-title>
          .
          <source>ACM Transactions on Computer-Human Interaction (TOCHI) 22(6)</source>
          ,
          <volume>27</volume>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Sato</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mineshima</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>How diagrams can support syllogistic reasoning: An experimental study</article-title>
          .
          <source>Journal of Logic, Language and Information</source>
          <volume>24</volume>
          (
          <issue>4</issue>
          ),
          <volume>409</volume>
          {
          <fpage>455</fpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Silva</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Madeira</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Santos</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>There is more to color scales than meets the eye: A review on the use of color in visualization</article-title>
          .
          <source>In: 11th International Conference on Information Visualization</source>
          . pp.
          <volume>943</volume>
          {
          <fpage>950</fpage>
          . IEEE Computer Society (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Wittenburg</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lanning</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Heinrichs</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stanton</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Parallel Bargrams for Consumer-based Information Exploration and Choice</article-title>
          . In:
          <article-title>14th annual ACM symposium on User interface software and technology</article-title>
          . pp.
          <volume>51</volume>
          {
          <fpage>60</fpage>
          .
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>