<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>High-Level Gaze Metrics From Map Viewing</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Charting Ambient/Focal Visual Attention</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Krzysztof Krejtz</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andrew T. Duchowski</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Arzu C¸ o¨ ltekin</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Geography, University of Zurich</institution>
          ,
          <addr-line>Zurich</addr-line>
          ,
          <country country="CH">Switzerland</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Introduction &amp; Background</institution>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>National Information Processing Institute and University of Social Sciences and Humanities</institution>
          ,
          <addr-line>Warsaw</addr-line>
          ,
          <country country="PL">Poland</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>School of Computing, Clemson University</institution>
          ,
          <addr-line>Clemson, SC</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2014</year>
      </pub-date>
      <fpage>37</fpage>
      <lpage>41</lpage>
      <abstract>
        <p>Distinguishing ambient and focal attention, we demonstrate the use of the K coe cient, which can serve as a cue for recommender systems in deciding when to o er information to the user, e.g., when focally attending during search. Gaze-based recommender systems are designed to respond with information contingent on the viewer's gaze, e.g., in geographic contexts, when directed to a particular location in physical or virtual space (such as on a map). These geographic gaze-based recommender systems have been referred to as location-aware (mobile) eye tracking systems [7]. For the system to provide an appropriate response, analysis of the viewer's gaze is required to infer the viewer's desire for information. Generally, this is accomplished via computation of an interest metric [11,10,4]. Recent approaches characterize interest or boredom via Support Vector Machines [6] or Area Of Interest (AOI) revisitation [5]. In this paper we use the K coe cient to distinguish ambient and focal attention. Ambient attention is characterized by relatively short fixations followed by high amplitude saccades. Conversely, focal attention is described by long fixations followed by saccades of low amplitude [12]. The K coe cient captures the temporal relation between standardized fixation duration and subsequent saccade amplitude. K &gt; 0 indicates focal viewing while K &lt; 0 suggests ambient viewing [9]. Attention becoming more focal over time, or oscillating between focal and ambient modes, could indicate changing cognitive load corresponding to stimulus complexity. The K coe cient can thus potentially act as a contextual cue which could be exploited by recommender systems, e.g., do not interrupt the user when in ambient search mode, or oscillating between ambient and focal search. We demonstrate the utility of K with an experiment comparing visual search behavior over di erent geographic representations of two cities: Amsterdam and Barcelona.</p>
      </abstract>
      <kwd-group>
        <kwd>maps</kwd>
        <kwd>eye tracking</kwd>
        <kwd>visual attention</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>(a) Apparatus and setting.</p>
      <p>(b) Satellite rendering.</p>
      <p>
        (c) Map rendering.
The experiment consisted of localizing map objects (features), arguably the most
fundamental map-reading task [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Localization, or visual search, i.e., finding an object of
interest among others, is a fundamental map use task, because regardless of the map
type or the final goal of the map reader, an object or area of interest must be found
before it can be studied further. Geographic task taxonomies widely acknowledge this task
among the most basic and common [
        <xref ref-type="bibr" rid="ref2 ref8">8,2</xref>
        ]. The localization tasks were carried out when
viewing a city representation in either (cartographic) map or satellite view (Google’s
roadmap or hybrid rendering; see Fig. 1 and text below for technical details). Unlike
previous work on route learning [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], we hypothesized there would be an e ect of map
type on visual search, with localization (search) taking longer in satellite view than in
map view.
      </p>
      <p>Participants. Sixty-three (63) university students took part in the study, with 7
excluded due to technical and procedural problems (e.g., poor calibration). The final
sample included 56 participants (19 M, 37 F, ages M = 24:62; SD = 4:01).</p>
      <p>Apparatus. All stimuli were presented on a computer monitor (1680 1050
resolution; 2200 LCD, 60 Hz refresh rate) connected to a standard PC computer. Eye
movements were recorded at 250 Hz with an SMI RED 250 eye tracking system.</p>
      <p>Stimulus. Map images were created using Google Mapstm JavaScript API v3. Maps
were rendered (see Fig. 1) by disabling visual user interface controls for navigation,
scale, rotate, pan, and zoom, and limiting the number of Points Of Interest (POIs)
to two. Specifically, the Barcelona maps (using Google’s latitude/longitude
coordinates: 41.375384, 2.141004) displayed only the “park” and “sports complex” POIs
at zoom level 17 with the transit layer turned on. The Amsterdam maps (lat./lon.:
52.3614777, 4.8837351) were also displayed at zoom level 17 with two POIs
(“attraction” and “business”) and transit layer turned on. All maps were rendered to
1280 1024 resolution, then screen-captured and cropped to the same dimensions.</p>
      <p>Procedure. Participants were randomly assigned to either experimental condition,
map (N = 29) or satellite view (N = 27). Prior to viewing of the map, the eye tracking
system was calibrated to each individual. Following calibration, participants carried out
the localization task (after having located a start point).</p>
      <p>For the city of Barcelona (see Fig. 1), participants were asked to find the intersection
of Carrer de Vilardell and Carrer d’Hostafrancs de Sio´, indicating its localization by
dwelling on it for 3 seconds. For Amsterdam, they were told to find the Rijksmuseum.
3</p>
    </sec>
    <sec id="sec-2">
      <title>Results &amp; Discussion</title>
      <p>Participants’ Familiarity with Presented Areas and Google Maps. Participants were
not very familiar with the areas of Barcelona or Amsterdam that were presented to them
during the experiment. We drew this conclusion after analyzing questionnaire answers.
The percentage of participants who had visited either city at least once in their lives was
39% for Amsterdam and 25% for Barcelona. Of these, they had visited the city at most
one time (32.1% for Amsterdam and 19.6% for Barcelona).</p>
      <p>Google maps was fairly popular among participants, with only 7.1% claiming they
had never used the service. Table 1 gives the detailed distribution of answers to the
question “How often do you use Google Maps?”</p>
      <p>Performance (time to completion) and process (visual attention) measures we report
from the experiment therefore speak to more or less typical use of Google Maps. The
importance of our findings lies in their potential indication of online map use when
encountering a new location (e.g., as one would do prior to travel to the destination) by
users already familiar with the service.</p>
      <p>Localization Task Performance. All subjects successfully completed the tasks. To
gauge localization performance, we first considered basic eye movement metrics, using
a linear mixed model (LMM) analysis with task duration as the dependent variable and
experimental condition (map vs. satellite) as the between-subjects fixed factor and city
(Amsterdam vs. Barcelona) as the within-subject fixed factor.</p>
      <p>As expected, analysis revealed a significant main e ect of experimental condition,
F(2; 54) = 26:26; p &lt; 0:01, indicating that task completion time was shorter for the map
view (M = 37:33 s; SE = 41:63) than for the satellite view (M = 47:36 s; SE = 55:27).</p>
      <p>Unexpectedly, however, the city appeared to be a moderating factor resulting in a
significant interaction term, F(1; 54) = 9:15; p &lt; 0:01. Investigation of contrasts related
to this interaction e ect revealed that the di erence between map and satellite views
was significant only for Barcelona. For Amsterdam, no significant di erence between
completion times for the two view types was found. Analysis also revealed a main e ect
of city, F(1; 54) = 31:68; p &lt; 0:01, meaning that, overall, time to localization on either of
Answer Percent
Everyday 7.1%
2-3 times a week 32.1%
Once a week 12.5%
2-3 times a month 19.6%
Once a month 21.4%</p>
      <p>Never 7.1%
t 0.2
n
e
iiff
c
e
o
C
K
la0.0
c
o
F
−
t
n
e
i
b
m
A
−0.2
t 0.2
n
e
iiff
c
e
o
C
K
la0.0
c
o
F
−
t
n
e
i
b
m
A
−0.2</p>
      <sec id="sec-2-1">
        <title>Exp. condition</title>
        <p>Roadmap
Satel ite</p>
      </sec>
      <sec id="sec-2-2">
        <title>Exp. condition</title>
        <p>Roadmap
Satel ite
1
2 Time sequence 4
3
5
1
2 Time sequence 4
3
5
(a) Amsterdam.</p>
        <p>(b) Barcelona.
the Barcelona maps (map or satellite views) took significantly longer (M = 61:7 s; SE =
1:9) than on the Amsterdam maps (M = 25:44 s; SE = 1:5).</p>
        <p>
          The studied sections of the two cities show strong di erences in spatial organization,
that is, the way the buildings and other features are organized in relation to each other.
In the section from the Barcelona map there is more regularity (more rows of buildings
similar to each other), making the visual search potentially harder. This is possibly one
of the reasons why we found a strong di erence between the cities and one that is also
supported by literature [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ].
        </p>
        <p>Temporal Dynamics of Visual Attention. To delve deeper into the visual search
process of viewers over each of the city maps, we performed a similar LMM analysis
but with K as the dependent variable, with the same independent variables of map view
and city as before. Because we believe plotting K over time depicts the dynamic dilation
and constriction of the viewer’s field of view during visual search, we used time as a
fixed within-subject factor, discretized into 5 equal periods (see Fig. 2).</p>
        <p>Considering the city as a moderating factor, analysis of K showed significant
interaction among city, map type, and time, F(4; 9910) = 2:44; p &lt; 0:05. Temporal
ambient/focal attention patterns are influenced by map type (satellite vs. cartographic) and city.</p>
        <p>When working with the Barcelona satellite view, attention generally becomes
increasingly more focal over time, especially over the satellite view, as expected (see
Fig. 2(b)). Over the cartographic map view, the pattern is more sinusoidal, eventually
becoming increasingly focal over the last two time segments. When viewing the
Amsterdam maps, however, the search pattern is less predictable (see Fig. 2(a)).
4</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Conclusion</title>
      <p>Analysis of K strongly suggests di erent cognitive strategies involved in visual search
depending on the stimulus and/or task di culty. Faster search over less complex
stimulus may a ord ambient search sooner, longer, or with greater ambient/focal oscillation,
possibly indicating boredom. More complex stimulus may require increasingly focal
attention later on into the search, possibly indicating greater interest.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1. Boe´r, A.,
          <string-name>
            <surname>C¸</surname>
          </string-name>
          <article-title>o¨ltekin,</article-title>
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Clarke</surname>
          </string-name>
          ,
          <string-name>
            <surname>K.C.</surname>
          </string-name>
          :
          <article-title>An Evaluation of Web-based Geovisualizations for Di erent Levels of Abstraction and Realism-What do users predict</article-title>
          ? In: International Cartographic Conference. pp.
          <fpage>209</fpage>
          -
          <lpage>220</lpage>
          . Dresden,
          <string-name>
            <surname>Germany</surname>
          </string-name>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Carter</surname>
            ,
            <given-names>J.R.:</given-names>
          </string-name>
          <article-title>The many dimensions of map use</article-title>
          .
          <source>In: Proceedings of the International Cartographic Conference</source>
          (
          <year>2005</year>
          ), http://www.cartesia.org/geodoc/icc2005/pdf/oral/ TEMA12/Session3/JAMESCARTER.pdf
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Francelet</surname>
          </string-name>
          , R.:
          <article-title>Realism and Individual Di erences in Route-Learning</article-title>
          .
          <source>Master's thesis</source>
          , University of Zurich (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Hammer</surname>
            ,
            <given-names>J.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Maurus</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Beyerer</surname>
          </string-name>
          , J.:
          <article-title>Real-time 3d gaze analysis in mobile applications</article-title>
          .
          <source>In: Proceedings of the 2013 Conference on Eye Tracking South Africa</source>
          . pp.
          <fpage>75</fpage>
          -
          <lpage>78</lpage>
          . ETSA '13,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          , New York, NY (
          <year>2013</year>
          ), http://doi.acm.
          <source>org/10</source>
          .1145/2509315.2509333
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Kiefer</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Giannopoulos</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kremer</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schlieder</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Raubal</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Starting to get bored: An outdoor eye tracking study of tourists exploring a city panorama</article-title>
          .
          <source>In: Proceedings of the Symposium on Eye Tracking Research and Applications</source>
          . pp.
          <fpage>315</fpage>
          -
          <lpage>318</lpage>
          . ETRA '14,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          , New York, NY (
          <year>2014</year>
          ), http://doi.acm.
          <source>org/10</source>
          .1145/2578153.2578216
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Kiefer</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Giannopoulos</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Raubal</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>Using eye movements to recognize activities on cartographic maps</article-title>
          .
          <source>In: Proceedings of the 21st ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems</source>
          . pp.
          <fpage>488</fpage>
          -
          <lpage>491</lpage>
          . SIGSPATIAL'13,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          , New York, NY, USA (
          <year>2013</year>
          ), http://doi.acm.
          <source>org/10</source>
          .1145/2525314.2525467
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Kiefer</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Straub</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Raubal</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Towards location-aware mobile eye tracking</article-title>
          .
          <source>In: Proceedings of the Symposium on Eye Tracking Research and Applications</source>
          . pp.
          <fpage>313</fpage>
          -
          <lpage>316</lpage>
          . ETRA '12,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          , New York, NY (
          <year>2012</year>
          ), http://doi.acm.
          <source>org/10</source>
          .1145/2168556.2168624
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Knapp</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>A Task Analysis Approach to the Visualization of Geographic Data</article-title>
          . In: Nygeres,
          <string-name>
            <given-names>T.L.</given-names>
            ,
            <surname>Mark</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.M.</given-names>
            ,
            <surname>Laurini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Egenhofer</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.J</surname>
          </string-name>
          . (eds.)
          <article-title>Cognitive Aspects of Human Computer Interaction for Geographic Information Systems</article-title>
          ,
          <source>chap. 7</source>
          , pp.
          <fpage>355</fpage>
          -
          <lpage>371</lpage>
          . Springer (
          <year>1995</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Krejtz</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Szarkowska</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krejtz</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Walczak</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Duchowski</surname>
            ,
            <given-names>A.T.</given-names>
          </string-name>
          :
          <article-title>Audio Description as an Aural Guide of Children's Visual Attention: Evidence from an Eye-Tracking Study</article-title>
          .
          <source>In: Proceedings of the 2012 Symposium on Eye-Tracking Research and Applications. ETRA '12</source>
          ,
          <string-name>
            <surname>ETRA</surname>
          </string-name>
          , ACM, New York,
          <source>NY (March</source>
          <volume>28</volume>
          -30
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Qvarfordt</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhai</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Conversing with the user based on eye-gaze patterns</article-title>
          .
          <source>In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems</source>
          . pp.
          <fpage>221</fpage>
          -
          <lpage>230</lpage>
          . CHI '05,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          , New York, NY, USA (
          <year>2005</year>
          ), http://doi.acm.
          <source>org/10</source>
          .1145/1054972.1055004
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Starker</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bolt</surname>
            ,
            <given-names>R.A.</given-names>
          </string-name>
          :
          <article-title>A gaze-responsive self-disclosing display</article-title>
          .
          <source>In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems</source>
          . pp.
          <fpage>3</fpage>
          -
          <lpage>10</lpage>
          . CHI '90,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          , New York, NY, USA (
          <year>1990</year>
          ), http://doi.acm.
          <source>org/10</source>
          .1145/97243.97245
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Velichkovsky</surname>
            ,
            <given-names>B.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Joos</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Helmert</surname>
            ,
            <given-names>J.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pannasch</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Two Visual Systems and Their Eye Movements: Evidence from Static and Dynamic Scene Perception</article-title>
          .
          <source>In: CogSci 2005: Proceedings of the XXVII Conference of the Cognitive Science Society</source>
          . pp.
          <fpage>2283</fpage>
          -
          <lpage>2288</lpage>
          . Stresa,
          <string-name>
            <surname>Italy</surname>
          </string-name>
          (
          <year>July 2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Ware</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Chapter 2: What We Can Easily See</article-title>
          .
          <source>In: Visual Thinking for Design</source>
          , pp.
          <fpage>23</fpage>
          -
          <lpage>42</lpage>
          . Morgan Kaufmann (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>