<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Artificial Spatial Cognition for Robotics and Mobile Systems: Brief Survey and Current Open Challenges</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Paloma de la Puente</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>M. Guadalupe</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sánchez-Escribano Universidad Politécnica de</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Madrid (UPM) Madrid</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Spain</string-name>
        </contrib>
      </contrib-group>
      <pub-date>
        <year>2016</year>
      </pub-date>
      <fpage>52</fpage>
      <lpage>53</lpage>
      <abstract>
        <p>-Remarkable and impressive advancements in the areas of perception, mapping and navigation of artificial mobile systems have been witnessed in the last decades. However, it is clear that important limitations remain regarding the spatial cognition capabilities of existing available implementations and the current practical functionality of high level cognitive models [1, 2]. For enhanced robustness and flexibility in different kinds of real world scenarios, a deeper understanding of the environment, the system, and their interactions -in general termsis desired. This long abstract aims at outlining connections between recent contributions in the above mentioned areas and research in cognitive architectures and biological systems. We try to summarize, integrate and update previous reviews, highlighting the main open issues and aspects not yet unified or integrated in a common architectural framework.</p>
      </abstract>
      <kwd-group>
        <kwd>spatial cognition</kwd>
        <kwd>surveys</kwd>
        <kwd>perception</kwd>
        <kwd>navigation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>I. BRIEF SURVEY</title>
      <p>A. Initial models for spatial knowledge representation and
main missing elements</p>
      <p>
        Focusing on the spatial knowledge representation and
management, the first contributions inspired by the human
cognitive map combined metric local maps, as an Absolute
Space Representation (ASR), and topological graphs [3]. As a
related approach, the Spatial Semantic Hierarchy (SSH) [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]
was the first fundamental cognitive model for large-scale
space. It evolved into the Hybrid SSH [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], which also included
knowledge about small-scale space. This fundamental work
was undoubtedly groundbreaking, but it did not go beyond
basic levels of information abstraction and conceptualization
[
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Moreover, the well-motivated dependencies among
different types of knowledge (both declarative and procedural)
were not further considered for general problem solving [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
The SSH model was considered suitable for the popular
schema of a “three layer architecture”, without explicitly
dealing with processes such as attention or forgetting
mechanisms. This lack of principled forgetting mechanisms has
been identified by the Simultaneous Localization and Mapping
(SLAM) robotics community as a key missing feature of most
existing mapping approaches [
        <xref ref-type="bibr" rid="ref8 ref9">8, 9</xref>
        ].
      </p>
      <p>
        B. The role of cognitive architectures and their relation to
other works in the robotics community
Cognitive architectures provide a solid approach for
modeling general intelligent agents and their main
commitments support the ambitious requirements of high level
behavior in arbitrary situations for robotics [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. A more recent
model of spatial knowledge, the Spatial/Visual System (SVS)
[
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] designed as an extension of the Soar cognitive
architecture, proposed a different multiplicity of
representations, i.e. symbolic, quantitative spatial and visual
depictive. The spatial scene is a hierarchy tree of entities and
their constitutive parts, with intermediate nodes defining the
transformation relations between parts and objects. Other
works in robotics employ similar internal representation ideas
[
        <xref ref-type="bibr" rid="ref12 ref13 ref14">12-14</xref>
        ], and other ones included the possibility to hypothesize
geometric environment structure in order to build consistent
maps [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. While a complete implementation of this approach
for all kind of objects requires solving the corresponding
segmentation and recognition problems in a domain
independent manner (which is far beyond the state of the art),
keeping the perceptual level representations within the
architecture enhances functionality. A very active research
community address these difficult challenges.
      </p>
      <p>
        The recognition process should not only use visual, spatial
and motion data from the Perceptual LTM but also conceptual
context information [
        <xref ref-type="bibr" rid="ref16 ref7">7, 16</xref>
        ] and episodic memories of
remembered places [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], from Symbolic LTM. This should
also apply to the navigation techniques for different situations
[
        <xref ref-type="bibr" rid="ref18 ref19">18, 19</xref>
        ]. The existence of motion models for the objects can
improve navigation in dynamic environments, which is one of
the main problems in real world robotic applications [
        <xref ref-type="bibr" rid="ref20 ref21">20, 21</xref>
        ].
      </p>
      <p>
        A novel cognitive architecture specifically designed for
spatial knowledge processing is the Casimir architecture [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ],
which presents rich modeling capabilities pursuing human-like
behavior. Navigation, however, has not been addressed, and
this work has scarcely been discussed in the robotics domain.
      </p>
      <p>
        One of the latest spatial models is the NavModel [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ],
designed and implemented for the ACT-R architecture. Besides
considering multi-level representations, this model presents
three navigation strategies with varying cognitive cost. The
first developed implementation assumes known topological
localization at room level, while a subsequent implementation
incorporates a mental rotation model. This work focuses on the
cognitive load and does not deal with lower level issues.
      </p>
      <p>To point out how topics are addressed by the respective
communities, we compiled Table I as a comparison. The
contrast regarding memory management and uncertainty seems
to be relevant. The lack of approaches combining both
allocentric and egocentric representations is also remarkable.
To conclude, Table II shows a summary of surveys.</p>
      <p>COMPARISON OF TOPICS ADDRESSED BY THE COGNITIVE</p>
      <p>ARCHITECTURES AND ROBOTICS COMMUNITIES</p>
      <p>← Topic →
Egocentric spatial models</p>
      <p>Allocentric spatial models
Explicit motion models / dynamic
information about the environment
Memory management, forgetting</p>
      <p>mechanisms
Casimir, LIDA, SOAR-SVS</p>
      <p>
        Object based/ semantic representations
Extended LIDA [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ]
      </p>
      <p>Uncertainty considerations</p>
    </sec>
    <sec id="sec-2">
      <title>II. CURRENT OPEN CHALLENGES</title>
      <p>The big challenge is closing the gap between high level
models and actual implementations in artificial mobile systems.
To reduce this existing gap, we identify three main goals:


</p>
      <p>Combination of allocentric and egocentric models
using different levels of features/objects +
topology/semantics.</p>
      <p>Acquisition and integration of motion models and
dynamic information for the elements/objects.</p>
      <p>Integration of global mapping &amp; loop closure
capabilities with extensive declarative knowledge
about features relevance and forgetting
mechanisms with episodic memory.</p>
    </sec>
    <sec id="sec-3">
      <title>ACKNOWLEDGMENT The authors want to thank the EUCog community for fostering interdisciplinary research in Artificial Cognitive Systems and organizing inspiring meetings and events.</title>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>G. Eason M.</given-names>
            <surname>Jefferies</surname>
          </string-name>
          and
          <string-name>
            <given-names>W.K.</given-names>
            <surname>Yeap</surname>
          </string-name>
          .
          <article-title>Robotics and cognitive approaches to spatial mapping</article-title>
          . Springer,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>T.</given-names>
            <surname>Madl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Montaldi</surname>
          </string-name>
          and
          <string-name>
            <given-names>R.</given-names>
            <surname>Trappl</surname>
          </string-name>
          .
          <article-title>Computational cognitive models of spatial memory in navigation space: A review</article-title>
          .
          <source>Neural Networks</source>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <given-names>W.K.</given-names>
            <surname>Yeap</surname>
          </string-name>
          .
          <article-title>Towards a computational theory of cognitive maps</article-title>
          .
          <source>Journal of Artificial Intelligence</source>
          ,
          <year>1988</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>B.</given-names>
            <surname>Kuipers</surname>
          </string-name>
          .
          <article-title>The spatial semantic hierarchy</article-title>
          .
          <source>Artificial Intelligence</source>
          .
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Kuipers</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Modayil</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Beeson</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <article-title>MacMahon</article-title>
          and
          <string-name>
            <given-names>F.</given-names>
            <surname>Savelli</surname>
          </string-name>
          .
          <article-title>Local metrical and global topological maps in the hybrid Spatial Semantic Hierarchy</article-title>
          . ICRA,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>A.</given-names>
            <surname>Pronobis</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Jensfelt</surname>
          </string-name>
          .
          <article-title>Large-scale semantic mapping and reasoning with heterogeneous modalities</article-title>
          .
          <source>ICRA</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>S.D.</given-names>
            <surname>Lathrop</surname>
          </string-name>
          .
          <article-title>Extending cognitive architectures with spatial and visual imagery mechanisms</article-title>
          .
          <source>PhD Thesis</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>J.A.</given-names>
            <surname>Fernandez-Madrigal</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.L.</given-names>
            <surname>Blanco</surname>
          </string-name>
          .
          <article-title>Simultaneous localization and mapping for mobile robots: iIntroduction and methods</article-title>
          .
          <source>IGI</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>C.</given-names>
            <surname>Cadena</surname>
          </string-name>
          et al.
          <article-title>Past, present, and future of simultaneous localization and mapping: towards the robust-perception age</article-title>
          .
          <source>T-RO</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>U.</given-names>
            <surname>Kurup</surname>
          </string-name>
          and
          <string-name>
            <given-names>C.</given-names>
            <surname>Lebiere</surname>
          </string-name>
          .
          <article-title>What can cognitive architectures do for robotics?</article-title>
          <source>Biologically Inspired Cognitive Architectures</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>S.D.</given-names>
            <surname>Lathrop</surname>
          </string-name>
          .
          <article-title>Exploring the functional advantages of spatial and visual cognition from an architectural perspective</article-title>
          .
          <source>TopiCS</source>
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>R.F.</given-names>
            <surname>Salas-Moreno</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.A</given-names>
            :
            <surname>Newcombe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Strasdat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.H.J</given-names>
            <surname>Kelly</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.J.</given-names>
            <surname>Davison</surname>
          </string-name>
          . SLAM++
          <article-title>: Simultaneous localisation and mapping at the Level of objects</article-title>
          .
          <source>CVPR</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>S.</given-names>
            <surname>Eslami</surname>
          </string-name>
          and
          <string-name>
            <given-names>C.</given-names>
            <surname>Williams</surname>
          </string-name>
          .
          <article-title>A generative model for parts-based object segmentation</article-title>
          .
          <source>Advances Neural Information Processing Systems</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>A.</given-names>
            <surname>Uckermann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Eibrechter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Haschke</surname>
          </string-name>
          and
          <string-name>
            <given-names>H.</given-names>
            <surname>Ritter</surname>
          </string-name>
          .
          <article-title>Real time hierarchical scene segmentation and classification</article-title>
          .
          <source>Humanoids</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>P. de la Puente</surname>
          </string-name>
          and
          <string-name>
            <surname>D.</surname>
          </string-name>
          Rodriguez-Losada.
          <article-title>Feature based graph SLAM in structured environments</article-title>
          . Autonomous Robots,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>L.</given-names>
            <surname>Kunze</surname>
          </string-name>
          et al.
          <article-title>Combining top-down spatial reasoning and bottom-up object class recognition for scene understanding</article-title>
          .
          <source>IROS</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <surname>M.B Moser</surname>
            and
            <given-names>E.I. Moser.</given-names>
          </string-name>
          <article-title>The brain's GPS</article-title>
          .
          <source>Scientific American</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>G.</given-names>
            <surname>Gunzelmann</surname>
          </string-name>
          and
          <string-name>
            <given-names>D.</given-names>
            <surname>Lyon</surname>
          </string-name>
          (
          <year>2007</year>
          )
          <article-title>Mechanisms for human spatial competence</article-title>
          . Spatial
          <string-name>
            <surname>Cognition</surname>
            <given-names>V</given-names>
          </string-name>
          , LNAI-Springer,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>F.</given-names>
            <surname>Dayoub</surname>
          </string-name>
          , G. Cielniak and
          <string-name>
            <given-names>T.</given-names>
            <surname>Duckett</surname>
          </string-name>
          .
          <article-title>Eight weeks of episodic visual navigation inside a non-stationary environment using adaptive spherical views</article-title>
          .
          <source>FSR</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>N.</given-names>
            <surname>Hawes</surname>
          </string-name>
          et al.
          <article-title>The STRANDS project: long-term autonomy in everyday environments</article-title>
          .
          <source>Robotics and Automation Magazine</source>
          , in press.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <surname>P. de la Puente</surname>
          </string-name>
          et al.
          <article-title>Experiences with RGB-D navigation in real home robotic trials</article-title>
          .
          <source>ARW</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>H.</given-names>
            <surname>Schultheis</surname>
          </string-name>
          and
          <string-name>
            <given-names>T.</given-names>
            <surname>Barkowsky</surname>
          </string-name>
          .
          <article-title>Casimir: an architecture for mental spatial knowledge processing</article-title>
          .
          <source>TopiCS</source>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>C.</given-names>
            <surname>Zhao</surname>
          </string-name>
          .
          <article-title>Understanding human spatial navigation behaviors: A cognitive modeling</article-title>
          .
          <source>PhD Thesis</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>R.</given-names>
            <surname>Drouilly</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Rives</surname>
          </string-name>
          and
          <string-name>
            <given-names>B.</given-names>
            <surname>Morisset</surname>
          </string-name>
          .
          <article-title>Semantic representation for navigation in large-scale environments</article-title>
          .
          <source>ICRA</source>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>L.F.</given-names>
            <surname>Posada</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Hoffmann</surname>
          </string-name>
          and
          <string-name>
            <given-names>T.</given-names>
            <surname>Bertram</surname>
          </string-name>
          .
          <article-title>Visual semantic robot navigation in indoor environments</article-title>
          .
          <source>ISR</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>A.</given-names>
            <surname>Richardson</surname>
          </string-name>
          and
          <string-name>
            <given-names>E.</given-names>
            <surname>Olson</surname>
          </string-name>
          .
          <article-title>Iterative path optimization for practical robot planning</article-title>
          .
          <source>IROS</source>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>R.</given-names>
            <surname>Ambrus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Bore</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Folkesson</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Jensfelt</surname>
          </string-name>
          .
          <article-title>Meta-rooms: building and maintaining long term spatial models in a dynamic world</article-title>
          .
          <source>IROS</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <surname>D. M. Rosen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Mason</surname>
            and
            <given-names>J. J.</given-names>
          </string-name>
          <string-name>
            <surname>Leonard</surname>
          </string-name>
          .
          <article-title>Towards lifelong featurebased mapping in semi-static environments</article-title>
          .
          <source>ICRA</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>T.</given-names>
            <surname>Madl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Franklin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Montaldi</surname>
          </string-name>
          and
          <string-name>
            <given-names>R.</given-names>
            <surname>Trappl</surname>
          </string-name>
          .
          <article-title>Towards realworld capable spatial memory in the LIDA cognitive architecture</article-title>
          .
          <source>BICA</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <surname>J. J. DiCarlo</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Zoccolan</surname>
            and
            <given-names>N. C.</given-names>
          </string-name>
          <string-name>
            <surname>Rust</surname>
          </string-name>
          .
          <article-title>How does the brain solve visual object recognition?</article-title>
          <source>Neuron</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>