<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>The Use of Eye Tracking in Search of Indoor Landmarks</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>P. Viaene</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>K. Ooms</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>P. Vansteenkiste</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>M. Lenoir</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>P. De Maeyer</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Ghent University, Department of Movement and Sport Sciences Watersportlaan 2</institution>
          ,
          <addr-line>9000 Ghent</addr-line>
          ,
          <country country="BE">Belgium</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Ghent University, Geography Department Krijgslaan 281 (S8)</institution>
          ,
          <addr-line>9000 Ghent</addr-line>
          ,
          <country country="BE">Belgium</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Pepijn.Viaene; Kristien.Ooms; Pieter.Vansteenkiste; Matthieu.Lenoir</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2014</year>
      </pub-date>
      <fpage>52</fpage>
      <lpage>56</lpage>
      <abstract>
        <p>The detection of indoor landmarks remains a troublesome endeavour. The rise of more performant and user-friendly mobile eye tracking devices might offer a solution. A small-scale study was conducted in which a test population was given a navigational task and whereby eye movement measures and think aloud protocols were compared. The first results indicate that eye tracking has high potential for the specific task of identifying indoor landmarks, while thinking aloud offers minor additions to the information provided by eye tracking with respect to landmark identification.</p>
      </abstract>
      <kwd-group>
        <kwd>Think Aloud</kwd>
        <kwd>Cognitive Processes</kwd>
        <kwd>Wayfinding</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        In line with the growing interest in indoor navigation and its challenges, indoor
landmarks call for attention. These prominent elements in an environment enable an
observer to locate himself and to set objectives like reaching a destination or selecting
an optimal route [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Hence, indoor landmarks can serve as powerful wayfinding
tools. Specifically, as part of view-action-pairs, they specify the location where a
wayfinding action, which is needed to reach a certain destination, should take place
[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. In addition to this, landmarks are key elements in the construction of a spatial
representation, which is central in our ability to navigate, as they anchor zones and
form a hierarchical structure [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>
        However, both in- and outdoors, it is not clear how landmarks should be detected
and identified by researchers so that these objects can be studied and implemented in
route instructions, maps and other wayfinding tools. A broad range of methods have
been applied in the past with their specific (dis)advantages [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Some (e.g. [
        <xref ref-type="bibr" rid="ref5 ref6 ref7">5, 6, 7</xref>
        ])
tried to define landmarks by quantifying the features that contribute to the overall
saliency of a landmark. However, these features and the way of quantifying the
landmark’s saliency vary. Moreover, the datasets on which these methods are based are in
general not available for indoor environments.
      </p>
      <p>ET4S 2014, September 23, 2014, Vienna, Austria
Copyright © 2014 for the individual papers by the papers' authors. Copying permitted for private and
academic purposes. This volume is published and copyrighted by its editors.</p>
      <p>
        With the development of more accurate and mobile eye trackers, measuring eye
movements might be an adequate solution to identify indoor landmarks. First, the
eyemind hypothesis states that certain aspects of the gaze during a task may be analysed
in order to examine cognitive processes [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. While navigating, these processes are
associated with the cognitive model of the environment, which is based on landmarks.
The aspects that can be examined include the locus of the eye fixation and its
duration. The locus indicates the element that is being processed internally even if subjects
are not consciously aware of this and the duration is related, but not necessarily
identical, to the time needed to encode and to operate on that element [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Second,
landmarks are eye catching as they are highly distinguishable in their environment and
differ from other objects based on visual, semantic and structural features [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Study Design</title>
      <p>
        In order to assess the validity of the eye tracking method to detect indoor landmarks,
the results of the eye movement analysis will be compared with the think aloud
method, which is more commonly used to study cognitive processes related to (indoor)
wayfinding (e.g. [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]) and therefore considered to be a valid representation of the
cognitive processes related to the use of landmarks. A similar comparison was conducted
by Spiers and Maguire [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. However, they assessed to what extent the eye loci
corroborated with the verbalizations in order to validate the verbal protocols. In this
study, we wish to provide arguments for the validity of eye tracking itself.
      </p>
      <p>
        Concurrent think aloud (CTA) is based on the analysis of verbal protocols formed
by participants voicing their thoughts that come to mind while executing a
problemsolving task [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. In order to detect possible reactivity due to the extra workload of
verbalizing, cued retrospective think aloud (CRTA) will also be part of this study.
This method allows participants to execute the task silently – in this way not inducing
an additional workload – and to verbalize their thoughts afterwards while watching a
video recording of their performance on which their eye movements at the time are
also displayed. These should cue the participants in revealing more about their
thoughts at the time verbally [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. However, as CRTA requires participants to
remember information, it is possible that they forget important information [
        <xref ref-type="bibr" rid="ref11 ref12">11, 12</xref>
        ].
      </p>
      <p>Twelve participants completed a route in a complex building1 twice. The first time
they had to follow the experimenter. The second time, they were asked to complete
the same route independently. The experimenter only intervened if the participant was
lost or asked for help. All participants wore the eye tracker during both completions
of the route. Due to technical problems with the head mounted eye tracking device
(iViewX HED by SMI), the recordings of three participants were excluded from the
analysis. The remaining test population consisted of four subjects applying CTA
during both traversals of the route and five applying CRTA based on the recording of the
second traversal. The route itself had a total length of 440 meters and covered four
floor levels. The participants, who had never been in the building, were made
ac1 University building: S8, Krijgslaan 281, 9000 Ghent, Belgium
quainted with thinking aloud before the experiment. Furthermore, they were told to
verbalise everything, ranging from visual stimuli to feelings related to the
navigational task and the building itself. Finally, the participants were aware that the goal of the
study was related to indoor navigation and the use of landmarks.</p>
      <p>The transcriptions of the verbal protocols were analysed with the aid of Elan
EUDICO Linguistic Annotator (version 4.6.2). The protocols were split into
verbalisation segments (e.g. one landmark referral, one explanation, one silence) and each
segment was attributed with a time interval. The eye tracking data was analysed by
using BeGaze 3.4. All fixations were transferred to a reference image that displayed
25 landmark categories (attributed with areas of interest) by using the semantic gaze
mapping tool offered by SMI. Finally, verbalisations were compared with the eye
movements (i.e. fixation locus and duration) around the same point in time.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Results</title>
      <p>In total, 59 % (58 % (CTA), 61 % (CRTA)) of the verbalisation segments did not
refer to a structural or object landmark. This accounted for 68 % (68 % (CTA), 69 %
(CRTA)) of the observation time. A fourth of this 59 % consisted of segments
containing additional information (e.g. explanations). This quarter was equal to 288
verbal utterances from which 118 were considered to be completely irrelevant for this
study. This means that the information content of 170 relevant verbalisation segments,
which represented 16 % of all segments, was not part of the eye tracking data.
Following, the remaining three quarters of the 59 % represented silences and
corresponded to 57 % (66 % (CTA), 56 % (CRTA)) of the total observation time.</p>
      <p>With respect to the 41 % of segments that did refer to a potential landmark, the
following can be said. On average, 69 % (71 % (CTA), 66 % (CRTA)) of the mentioned
potential structural and object landmarks were clearly fixated on. On the other hand,
13 % (12 % (CTA), 14 % (CRTA)) of the described landmarks were not visible at the
moment they were verbalised. The remaining 18 % (16 % (CTA), 20 % (CRTA))
represented the number of indoor landmarks that could not be unambiguously
identified solely based on the eye tracking data and therefore verbalisations were needed for
a true determination.</p>
      <p>
        We now turn to the locations were landmarks were most needed, namely locations
were a change of direction took place and where multiple directional possibilities
were present. The most fixated on landmark categories at these decision points are
shown in Table 1. Often a single object caused the rise in fixations for a specific
category. These object landmarks, defined as elements that are independent of the
building’s structure [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], are listed in Table 1 as well. As it is not clear how people visually
perceive structural elements (i.e. staircases and corridors), these elements were
excluded from the eye tracking analysis. The fixated on objects at the seventeen
decision points were compared with the objects mentioned at these locations in the
thirteen verbal protocols. In 59 cases there was a match, while other objects were
mentioned 73 times. Often, these other objects were staircases (34 times). Finally, 89
times there were no referrals to objects.
object landmark
grey double door
exhibition display
sign (“Geography”)
brown double door
window and view
pair of sticks / car batteries
brown doors with windows
big plant
red elevator
wooden information board
grey double door
glass main entrance
sign (“Paleontology”)
brown double door
window and view
brown double door
single door
The general comparison between eye recordings and verbal protocols leads to two
findings. First, a considerable share of the information originating from the think
aloud method is not deductible by tracking eye movements, namely a quarter of all
verbalisations: 13 % non-visible landmarks and 16 % relevant verbalisations without
referral to landmarks. However, the latter is not considered to be a loss of information
since these do not contain references to potential indoor landmarks, given that the
goal of this study is to determine if eye tracking could be used specifically to identify
indoor landmarks. Second, although all participants stated that they did not experience
difficulties with respect to voicing their thoughts, the think aloud method did not
supply information during more than half of the observation time. Pointing out that the
quality of verbal protocols depends on the skills of the respondent. Respondents
sometimes only verbalize part of their thoughts or have difficulties translating their
thoughts into words [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. Moreover, subjects can only provide data on processes that
they are aware of [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. In contrast, eye tracking provided data continuously.
      </p>
      <p>
        With respect to the most fixated on objects, there is a poor resemblance. However,
when neglecting referrals to staircases, as fixations on staircases were not seen
reliable, one can conclude that there were only 39 mismatches. Consequently, there were
no referrals to objects in 123 of the cases, which is in line with the observation that
thinking aloud does not supply information in more than half of the observations.
Furthermore, the fact that staircases were often mentioned does not automatically
mean that these structural elements were remembered as wayfinding aids. An
explanation might be found in the physically perceivable interaction with these elements
[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Finally, there were no indications that CTA caused reactivity that had significant
effects on task performance or concentration.
      </p>
      <p>In conclusion, the results indicate that eye tracking can provide qualitative and
complete data which can be used to identify indoor landmarks. Although eye tracking
captures most information relevant for the identification of landmarks, it is advisable
to record verbal protocols which can be consulted to clarify specific fixations in order
to obtain a more complete outline of potential landmarks. However, having the
timeconsuming analysis of verbal protocols in mind, these should not be the subject of a
separate secondary analysis since the added value is limited.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Sorrows</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hirtle</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>The nature of landmarks for real and electronic spaces</article-title>
          . In: Freska,
          <string-name>
            <given-names>C.</given-names>
            and
            <surname>Mark</surname>
          </string-name>
          , D.M. (eds.)
          <article-title>Spatial information theory</article-title>
          .
          <source>Cognitive and Computional Foundations of GIS</source>
          . pp.
          <fpage>37</fpage>
          -
          <lpage>50</lpage>
          . Springer-Verlag, Berlin, Germany (
          <year>1999</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Lovelace</surname>
            ,
            <given-names>K.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hegarty</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Montello</surname>
            ,
            <given-names>D.R.</given-names>
          </string-name>
          :
          <article-title>Elements of Good Route Directions in Familiar and Unfamiliar Environments</article-title>
          .
          <article-title>Spatial information theory</article-title>
          .
          <source>Cognitive and Computional Foundations of GIS</source>
          . pp.
          <fpage>65</fpage>
          -
          <lpage>82</lpage>
          . Springer-Verlag, Berlin, Germany (
          <year>1999</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Stankiewicz</surname>
            ,
            <given-names>B.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kalia</surname>
            ,
            <given-names>A.A.</given-names>
          </string-name>
          :
          <article-title>Acquisition of structural versus object landmark knowledge</article-title>
          .
          <source>J. Exp. Psychol. Hum. Percept. Perform</source>
          .
          <volume>33</volume>
          ,
          <fpage>378</fpage>
          -
          <lpage>390</lpage>
          (
          <year>2007</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Sefelin</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bechinie</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Müller</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Seibert-Giller</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Messner</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tscheligi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Landmarks: yes; but which?: five methods to select optimal landmarks for a landmark-and speech-based guiding system. 7th international conference on Human computer interaction with mobile devices and services</article-title>
          . pp.
          <fpage>287</fpage>
          -
          <lpage>290</lpage>
          . ACM Press, Salzburg, Austria (
          <year>2005</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Raubal</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Winter</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Enriching Wayfinding Instructions with Local Landmarks</article-title>
          . In: Egenhofer,
          <string-name>
            <given-names>M.J.</given-names>
            and
            <surname>Mark</surname>
          </string-name>
          , D.M. (eds.) Geographic Information Science.
          <source>GIScience 2002</source>
          . pp.
          <fpage>243</fpage>
          -
          <lpage>259</lpage>
          . Springer-Verlag, Berlin, Germany (
          <year>2002</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Fang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shaw</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>A GIS data model for landmark-based pedestrian navigation</article-title>
          .
          <source>Int. J. Geogr. Inf. Sci. 26</source>
          ,
          <fpage>1</fpage>
          -
          <lpage>22</lpage>
          (
          <year>2011</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Elias</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Determination of Landmarks and Reliability Criteria for Landmarks</article-title>
          . Fifth workshop on progress in
          <source>Automated Map Generalization Paris</source>
          . pp.
          <fpage>1</fpage>
          -
          <lpage>12</lpage>
          . , Paris (
          <year>2003</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Just</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Carpenter</surname>
            ,
            <given-names>P.A.</given-names>
          </string-name>
          :
          <article-title>Eye fixations and cognitive processes</article-title>
          .
          <source>Cogn. Psychol</source>
          .
          <volume>8</volume>
          ,
          <fpage>441</fpage>
          -
          <lpage>480</lpage>
          (
          <year>1976</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Hölscher</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Meilinger</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vrachliotis</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brösamle</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Knauff</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Up the down staircase: Wayfinding strategies in multi-level buildings</article-title>
          .
          <source>J. Environ. Psychol</source>
          .
          <volume>26</volume>
          ,
          <fpage>284</fpage>
          -
          <lpage>299</lpage>
          (
          <year>2006</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Spiers</surname>
            ,
            <given-names>H.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Maguire</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          <article-title>a: The dynamic nature of cognition during wayfinding</article-title>
          .
          <source>J. Environ. Psychol</source>
          .
          <volume>28</volume>
          ,
          <fpage>232</fpage>
          -
          <lpage>249</lpage>
          (
          <year>2008</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Van Elzakker</surname>
            ,
            <given-names>C.P.J.M.:</given-names>
          </string-name>
          <article-title>The Use Of Maps In The Exploration Of Geographic Data. Koninklijk Nederlands aardrijkskundig genootschap, Faculteit geowetenschappen, Universiteit Utrecht / Internationaal Instituut voor Geo-Information Science and Earth observation</article-title>
          , Utrecht/Enschede (
          <year>2004</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12. Van Gog,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Paas</surname>
          </string-name>
          , F.,
          <string-name>
            <surname>van Merriënboer</surname>
            ,
            <given-names>J.J.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Witte</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Uncovering the problem-solving process: cued retrospective reporting versus concurrent and retrospective reporting</article-title>
          .
          <source>J. Exp. Psychol. Appl</source>
          .
          <volume>11</volume>
          ,
          <fpage>237</fpage>
          -
          <lpage>244</lpage>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>