<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Toward a Case-Based Framework for Imitation Learning in Robotic Agents</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>School of Interactive Computing, Georgia Institute of Technology</institution>
          ,
          <addr-line>Atlanta, Georgia, 30332</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <fpage>265</fpage>
      <lpage>267</lpage>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Imitation learning is a skill essential to human development and cognition [
        <xref ref-type="bibr" rid="ref5 ref6">6,
5</xref>
        ]. Naturally, imitation learning has become a topic of focus for robotics
research as well, particularly in interactive robots [
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ]. In imitating the actions
of a teacher, a cognitive agent learns the demonstrated action such that it may
perform a similar action later and achieve a similar goal. Thus, we expect that
a cognitive robot that learns from imitation would reuse what it has learned
from one experience to reason about addressing related, but di↵erent, problem
scenarios.
      </p>
      <p>The eventual goal of this work is to use a case-based approach to enable
imitation learning in interactions such as the following. A human teacher guides
the robot to complete a task, such as scooping the contents of one container
into another. The robot records the demonstrated actions and observed objects,
saving the demonstration as a source case in its case memory. At a later time,
the robot is asked to repeat the scooping task, but in a new, target environment
containing a di↵erent set of object features to parameterize and execute the
task. Next, the robot would transfer its representation of the scooping task to
accommodate for the di↵erences between the source and target environments,
and then execute an action based on the transferred representation to achieve
the goal state in the target environment.</p>
      <p>
        Using a case-based framework to address this problem allows us to
represent demonstrations as individual experiences in the robot’s case memory, and
provides us with a framework for identifying, transferring, and executing a
relevant source case demonstration in an unfamiliar target environment. The main
research questions we plan to address are as follows:
– How should task demonstrations be represented in case memory?
– How do we determine which features of a robot’s environment are relevant
to completing a task, and thus should be stored in the source case?
– What features should be considered in retrieving a source case demonstration
for reuse in a target environment? How should these features be prioritized
during source case retrieval?
Copyright © 2015 for this paper by its authors. Copying permitted for private and
academic purposes. In Proceedings of the ICCBR 2015 Workshops. Frankfurt, Germany.
We have defined a case-based approach to transfer for enabling imitation in
robotic agents, consisting of two separate processes (as shown in Figure 1): the
Case Storage process in which the robot receives demonstrations of a task and
stores each demonstration as a case in source memory, and a Case Adaptation
process which is used at a later time when the robot is asked to repeat a task in
a target environment.
Demonstration and Learning We have implemented the first step in the
Case Storage process, where the robot records and stores each task
demonstration as a source case in memory. We define each case as the tuple C =
&lt;L, D, T, O, Si, Sf &gt;, where:
– L represents the label of the task which was demonstrated, e.g. ”scooping”.
– D represents the set of action models which encode the demonstrated motion,
represented as Dynamic Movement Primitives as defined in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
– T is the set of parameterization functions which relate the set of action
models to the locations of objects in the robot’s environment. For example,
a parameterization function may be used to represent how the robot’s hand
must be located above a bowl prior to completing a pouring action.
– O is the set of salient object IDs which are relevant to the task.
– Si and Sf are the initial and final states, respectively, which represent the
set of objects observed in an overhead view of the robot’s environment.
2.2
      </p>
      <p>Case Adaptation Process
At a later time, the robot may be asked to repeat a learned task in an unfamiliar
target environment. Using the framework shown in Figure 1, the robot may
address a target environment using the following steps.</p>
      <p>Observation The robot is given a target problem to address, under the
assumption that it has a relevant source case in memory which can be used to
address the target problem. The robot observes the target environment by
viewing the objects located in the table-top environment using an overhead camera,
providing it with the initial state Si of the target case.
Retrieval and Mapping The robot must then choose a source case from
memory containing the demonstration that is most relevant to the current target
problem. Once a relevant source case has been retrieved, a mapping must be
generated that encodes the di↵erences between the source and target environments.
This mapping is later used to transfer the source case such that di↵erences in the
target environment are addressed. We have not yet implemented the Retrieval
and Mapping steps, but will be addressing them in upcoming work.
Transfer and Execution We have implemented the last two steps of the Case
Adaptation process, the Transfer and Execution steps. Currently, we manually
provide the robot with the most relevant source case demonstration and a
mapping between objects in the source and target environments.</p>
      <p>
        We take a similarity-based approach to transfer, where we consider the
similarity between the source case and target environments when defining transfer
processes. As we encounter transfer problems in which the source and target
problems become less similar, the source case is transferred at a di↵erent level of
abstraction, such that only high-level features of that case are transferred. We
have implemented three transfer methods, each of which operates by
transferring the source case at a di↵erent level of abstraction (further described in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]).
Once the source case has been transferred, it is used to plan and execute a new
action trajectory to address the target problem. Preliminary experiments have
evaluated each method under the assumption that we select the approach, and
thus the level of abstraction at which transfer occurs, to be used for a given
transfer problem.
3
      </p>
    </sec>
    <sec id="sec-2">
      <title>Future Work</title>
      <p>Our current implementation assumes that we manually provide a mapping
between equivalent objects in the source and target environments. We plan to
identify (i) a method for autonomously determining this object mapping and (ii) a
process for identifying and retrieving an appropriate source case demonstration.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Argall</surname>
            ,
            <given-names>B.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chernova</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Veloso</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Browning</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>A survey of robot learning from demonstration</article-title>
          .
          <source>Robotics and Autonomous Systems</source>
          <volume>57</volume>
          (
          <issue>5</issue>
          ),
          <fpage>469</fpage>
          -
          <lpage>483</lpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Chernova</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thomaz</surname>
            ,
            <given-names>A.L.</given-names>
          </string-name>
          :
          <article-title>Robot learning from human teachers</article-title>
          .
          <source>Synthesis Lectures on Artificial Intelligence and Machine Learning</source>
          <volume>8</volume>
          (
          <issue>3</issue>
          ),
          <fpage>1</fpage>
          -
          <lpage>121</lpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Fitzgerald</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goel</surname>
            ,
            <given-names>A.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thomaz</surname>
            ,
            <given-names>A.L.:</given-names>
          </string-name>
          <article-title>A similarity-based approach to skill transfer</article-title>
          .
          <source>Women in Robotics Workshop at Robotics: Science and Systems</source>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Pastor</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          , Ho↵mann, H.,
          <string-name>
            <surname>Asfour</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schaal</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Learning and generalization of motor skills by learning from demonstration</article-title>
          .
          <source>In: Robotics and Automation</source>
          ,
          <year>2009</year>
          . ICRA'09. IEEE International Conference on. pp.
          <fpage>763</fpage>
          -
          <lpage>768</lpage>
          . IEEE (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Piaget</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cook</surname>
          </string-name>
          , M.T.:
          <article-title>The origins of intelligence in children</article-title>
          . (
          <year>1952</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Tomasello</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kruger</surname>
            ,
            <given-names>A.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ratner</surname>
            ,
            <given-names>H.H.</given-names>
          </string-name>
          :
          <article-title>Cultural learning</article-title>
          .
          <source>Behavioral and brain sciences</source>
          <volume>16</volume>
          (
          <issue>03</issue>
          ),
          <fpage>495</fpage>
          -
          <lpage>511</lpage>
          (
          <year>1993</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>