<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Toward a Case-Based Framework for Imitation Learning in Robotic Agents</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Tesca</forename><surname>Fitzgerald</surname></persName>
							<email>tesca.fitzgerald@cc.gatech.edu</email>
							<affiliation key="aff0">
								<orgName type="department">School of Interactive Computing</orgName>
								<orgName type="institution">Georgia Institute of Technology</orgName>
								<address>
									<postCode>30332</postCode>
									<settlement>Atlanta</settlement>
									<region>Georgia</region>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Toward a Case-Based Framework for Imitation Learning in Robotic Agents</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">E0BC56426D9A2D44600486C4A2F301C1</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T21:32+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract/>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Imitation learning is a skill essential to human development and cognition <ref type="bibr" target="#b5">[6,</ref><ref type="bibr" target="#b4">5]</ref>. Naturally, imitation learning has become a topic of focus for robotics research as well, particularly in interactive robots <ref type="bibr" target="#b0">[1,</ref><ref type="bibr" target="#b1">2]</ref>. In imitating the actions of a teacher, a cognitive agent learns the demonstrated action such that it may perform a similar action later and achieve a similar goal. Thus, we expect that a cognitive robot that learns from imitation would reuse what it has learned from one experience to reason about addressing related, but di↵erent, problem scenarios.</p><p>The eventual goal of this work is to use a case-based approach to enable imitation learning in interactions such as the following. A human teacher guides the robot to complete a task, such as scooping the contents of one container into another. The robot records the demonstrated actions and observed objects, saving the demonstration as a source case in its case memory. At a later time, the robot is asked to repeat the scooping task, but in a new, target environment containing a di↵erent set of object features to parameterize and execute the task. Next, the robot would transfer its representation of the scooping task to accommodate for the di↵erences between the source and target environments, and then execute an action based on the transferred representation to achieve the goal state in the target environment.</p><p>Using a case-based framework to address this problem allows us to represent demonstrations as individual experiences in the robot's case memory, and provides us with a framework for identifying, transferring, and executing a relevant source case demonstration in an unfamiliar target environment. The main research questions we plan to address are as follows:</p><p>-How should task demonstrations be represented in case memory? -How do we determine which features of a robot's environment are relevant to completing a task, and thus should be stored in the source case? -What features should be considered in retrieving a source case demonstration for reuse in a target environment? How should these features be prioritized during source case retrieval?</p><p>Copyright © 2015 for this paper by its authors. Copying permitted for private and academic purposes. In Proceedings of the ICCBR 2015 Workshops. Frankfurt, Germany. We have defined a case-based approach to transfer for enabling imitation in robotic agents, consisting of two separate processes (as shown in Figure <ref type="figure" target="#fig_0">1</ref>): the Case Storage process in which the robot receives demonstrations of a task and stores each demonstration as a case in source memory, and a Case Adaptation process which is used at a later time when the robot is asked to repeat a task in a target environment.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Case Storage Process</head><p>Demonstration and Learning We have implemented the first step in the Case Storage process, where the robot records and stores each task demonstration as a source case in memory. We define each case as the tuple C = &lt;L, D, T, O, S i , S f &gt;, where:</p><p>-L represents the label of the task which was demonstrated, e.g. "scooping".</p><p>-D represents the set of action models which encode the demonstrated motion, represented as Dynamic Movement Primitives as defined in <ref type="bibr" target="#b3">[4]</ref>. -T is the set of parameterization functions which relate the set of action models to the locations of objects in the robot's environment. For example, a parameterization function may be used to represent how the robot's hand must be located above a bowl prior to completing a pouring action. -O is the set of salient object IDs which are relevant to the task.</p><p>-S i and S f are the initial and final states, respectively, which represent the set of objects observed in an overhead view of the robot's environment.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Case Adaptation Process</head><p>At a later time, the robot may be asked to repeat a learned task in an unfamiliar target environment. Using the framework shown in Figure <ref type="figure" target="#fig_0">1</ref>, the robot may address a target environment using the following steps.</p><p>Observation The robot is given a target problem to address, under the assumption that it has a relevant source case in memory which can be used to address the target problem. The robot observes the target environment by viewing the objects located in the table-top environment using an overhead camera, providing it with the initial state S i of the target case.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Retrieval and Mapping</head><p>The robot must then choose a source case from memory containing the demonstration that is most relevant to the current target problem. Once a relevant source case has been retrieved, a mapping must be generated that encodes the di↵erences between the source and target environments. This mapping is later used to transfer the source case such that di↵erences in the target environment are addressed. We have not yet implemented the Retrieval and Mapping steps, but will be addressing them in upcoming work.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Transfer and Execution</head><p>We have implemented the last two steps of the Case Adaptation process, the Transfer and Execution steps. Currently, we manually provide the robot with the most relevant source case demonstration and a mapping between objects in the source and target environments.</p><p>We take a similarity-based approach to transfer, where we consider the similarity between the source case and target environments when defining transfer processes. As we encounter transfer problems in which the source and target problems become less similar, the source case is transferred at a di↵erent level of abstraction, such that only high-level features of that case are transferred. We have implemented three transfer methods, each of which operates by transferring the source case at a di↵erent level of abstraction (further described in <ref type="bibr" target="#b2">[3]</ref>). Once the source case has been transferred, it is used to plan and execute a new action trajectory to address the target problem. Preliminary experiments have evaluated each method under the assumption that we select the approach, and thus the level of abstraction at which transfer occurs, to be used for a given transfer problem.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Future Work</head><p>Our current implementation assumes that we manually provide a mapping between equivalent objects in the source and target environments. We plan to identify (i) a method for autonomously determining this object mapping and (ii) a process for identifying and retrieving an appropriate source case demonstration.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1 .</head><label>1</label><figDesc>Fig. 1. Case-Based Process for Task Demonstration Transfer</figDesc><graphic coords="2,134.77,116.83,345.81,71.04" type="bitmap" /></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">A survey of robot learning from demonstration</title>
		<author>
			<persName><forename type="first">B</forename><forename type="middle">D</forename><surname>Argall</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Chernova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Veloso</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Browning</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Robotics and Autonomous Systems</title>
		<imprint>
			<biblScope unit="volume">57</biblScope>
			<biblScope unit="issue">5</biblScope>
			<biblScope unit="page" from="469" to="483" />
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Robot learning from human teachers</title>
		<author>
			<persName><forename type="first">S</forename><surname>Chernova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">L</forename><surname>Thomaz</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Synthesis Lectures on Artificial Intelligence and Machine Learning</title>
		<imprint>
			<biblScope unit="volume">8</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="1" to="121" />
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">A similarity-based approach to skill transfer</title>
		<author>
			<persName><forename type="first">T</forename><surname>Fitzgerald</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">K</forename><surname>Goel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">L</forename><surname>Thomaz</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Women in Robotics Workshop at Robotics: Science and Systems</title>
				<imprint>
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Learning and generalization of motor skills by learning from demonstration</title>
		<author>
			<persName><forename type="first">P</forename><surname>Pastor</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Ho↵mann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Asfour</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Schaal</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ICRA&apos;09. IEEE International Conference on</title>
				<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2009">2009. 2009</date>
			<biblScope unit="page" from="763" to="768" />
		</imprint>
	</monogr>
	<note>Robotics and Automation</note>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<title level="m" type="main">The origins of intelligence in children</title>
		<author>
			<persName><forename type="first">J</forename><surname>Piaget</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">T</forename><surname>Cook</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1952">1952</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Cultural learning</title>
		<author>
			<persName><forename type="first">M</forename><surname>Tomasello</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">C</forename><surname>Kruger</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">H</forename><surname>Ratner</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Behavioral and brain sciences</title>
		<imprint>
			<biblScope unit="volume">16</biblScope>
			<biblScope unit="issue">03</biblScope>
			<biblScope unit="page" from="495" to="511" />
			<date type="published" when="1993">1993</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
