<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Cognitive Control and Adaptive Attentional Regulations for Robotic Task Execution</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Riccardo Caccavale</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alberto Finzi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>DIETI, Universita` degli Studi di Napoli Federico II</institution>
          ,
          <addr-line>via Claudio 21, 80125, Naples</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2016</year>
      </pub-date>
      <fpage>56</fpage>
      <lpage>57</lpage>
      <abstract>
        <p>-We propose a robotic cognitive control framework that exploits supervisory attention and contention scheduling for flexible and adaptive orchestration of structured tasks. Specifically, in the proposed system, top-down and bottom-up attentional processes are exploited to modulate the execution of hierarchical robotic behaviors conciliating goal-oriented and reactive behaviors. In this context, we propose a learning method that allows us to suitably adapt task-based attentional regulations during the execution of structured activities.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>I. INTRODUCTION</title>
      <p>
        In this paper, we present a robotic cognitive control
framework that permits flexible and adaptive orchestration of
multiple structured tasks. Following a supervisory attentional
system approach [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], we propose an executive system
that exploits top-down (task-based) and bottom-up
(stimulusbased) attentional mechanisms to conciliate reactive and
goaloriented behaviors [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. In this paper, we describe
adaptive mechanisms suitable for this framework. Specifically, we
propose a learning method that allows us to regulate the
topdown and bottom-up attentional influences according to the
environmental state and the tasks to be accomplished. In
contrast with typical task-learning approaches [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], our aim
here is to adapt and refine attentional parameters that affect
the competition among active tasks and reactive processes.
Learning methods for robotic supervisory attentional system
have been proposed to enhance action execution automaticity
and reduce the need of attentional control [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], instead here we
are interested in flexible orchestration of hierarchical tasks.
      </p>
      <p>In the following sections, we present the architecture of the
executive system and briefly introduce the associated adaptive
mechanisms.</p>
    </sec>
    <sec id="sec-2">
      <title>II. SYSTEM ARCHITECTURE</title>
      <p>
        The cognitive control framework presented in this paper is
based on a supervisory attentional system that regulates the
execution of hierarchical tasks and reactive behaviors. The
system architecture is illustrated in Fig. 1. The attentional
executive system is endowed with a long term memory (LTM)
that contains the behavioral repertoire available to the
system, including structured tasks and primitive actions; these
tasks/behaviors are to be allocated and instantiated in the
Working Memory (WM) for their actual execution. In
particular, the cognitive control cycle is managed by the alive process
that continuously updates the WM by allocating and
deallocating hierarchical tasks/behaviors according to their denotations
in the LTM. We assume a hierarchical organization for tasks
and activities [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] and this hierarchy is represented
in the WM as a tree data structure that collects all the tasks
currently executed or ready for the execution (see Fig. 2). More
specifically, each node of the tree stands for a behavior with
the edges representing parental relations among sub-behaviors.
In this context, abstract behaviors identify complex activities
to be hierarchically decomposed into different sub-activities,
instead concrete behaviors are for sensorimotor processes that
compete for the access to sensors and actuators. The allocated
concrete behaviors are collected into the attentional
behaviorbased system illustrated in Fig. 1.
      </p>
      <p>Fig. 1. System Architecture. The LTM provides the definitions of the available
tasks, which can be allocated/deallocated in the WM by the alive behavior.</p>
      <p>
        In this framework, each behavior is associated with an
activation value, which is regulated by an adaptive clock
[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. This clock represents a frequency-based attentional
mechanism: the higher is the frequency, the higher is the
resolution at which a process/behavior is monitored and controlled.
The clock period is bottom-up and top-down regulated by a
behavior-specific monitoring function f (s ; e) = l according to
the behavioral stimuli s and the overall state of the WM e. In
particular, the bottom-up frequency 1=l is directly affected by
behavior-specific stimuli (e.g. distance of a target), while the
top-down regulation is provided by a value m that summarizes
the overall top-down influences of the WM. In this context,
bottom-up stimuli emphasize actions that are more
accessible to the robot (e.g. object affordances), while top-down
influences are affected by the task structures and facilitate
the activations of goal-oriented behaviors. In this framework,
multiple tasks can be executed at the same time and
several behaviors can compete in the WM generating conflicts,
impasses, and crosstalk interferences [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Contentions
among alternative behaviors competing for mutually exclusive
state variables (representing resources, e.g. sensors, actuators,
etc.) are solved exploiting the attentional activations: following
a winner-takes-all approach, the behaviors associated with the
higher activations are selected with the exclusive access to
mutually exclusive resources.
      </p>
    </sec>
    <sec id="sec-3">
      <title>III. ADAPTIVE REGULATIONS</title>
      <p>In the proposed framework, action selection depends on
the combined effect of top-down and bottom-up attentional
regulations. In order to set these regulations, we associate each
edge of the WM with a weight w j;i that regulates the intensity
of the attentional influence from the behavior j to the
subbehavior i (bottom-up for i = j, top-down otherwise). This
way, the overall activation value associated with each node is
obtained as the weighted sum å j wi; jci; j of the contributions
from the top-down and bottom-up sources. These weights
are to be suitably adapted with respect to the tasks and the
environment. For this purpose, we propose to deploy a neural
network approach. Specifically, during the execution the WM
tree is associated with a multi-layered neural network, while
the weights associated with the nodes are refined exploiting
error backpropagation. In this setting, the system can be
trained by a user that takes the control of the robot to correct
the execution of a task. The training session is associated
with an adaptive process: the difference between the system
behavior and the human correction is interpreted as an error
to be backpropagated through the task hierarchy in order to
adapt the associated weights.</p>
      <p>As an exemplification, we consider the instance of the
WM illustrated in Fig. 2. In this case, a mobile robot has
to take a colored object (ob jRed) and return it to the home
position. Here, five concrete behaviors compete to acquire two
contended variables ( f orwardS peed and turnS peed) which
are used to control the robots movements. For instance, the
avoidObstacle behavior is affected by two top-down
influences (reach(ob jRed) and goto(home) subtasks), while the
bottom-up influence is inversely proportional to the distance
of the closest obstacle. During the execution of the task, the
system selects the most emphasized behavior and produces a
vector of values ~v representing motor patterns for the shared
variables. The robot navigation is monitored by the human,
which is ready to change the robot trajectory using a joypad
when a correction is needed. The user interventions generate
a new set of values for the shared variables ~v that dominate
and override the ones produced by the other behaviors. As
long as the user drives the robot, the difference between the
systems output ~v and the suggested values ~v is exploited
to estimate the total error of the task execution. This error
is backpropagated from the concrete behaviors to the rest of
the hierarchy, in so adjusting the weights associated with the
behavior and sub-behaviors which are active in the WM.</p>
    </sec>
    <sec id="sec-4">
      <title>IV. CONCLUSIONS</title>
      <p>We presented an adaptive mechanism suitable for a
cognitive control framework based on a supervisory attentional
system approach. The proposed method permits adaptive and
interactive adaptation of top-down and bottom-up attentional
regulations in order to execute structured hierarchical tasks.</p>
      <p>Acknowledgment: The research leading to these results
has been supported by the H2020-ICT-731590 REFILLs
project.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>M. M. Botvinick</surname>
            ,
            <given-names>T. S.</given-names>
          </string-name>
          <string-name>
            <surname>Braver</surname>
            ,
            <given-names>D. M.</given-names>
          </string-name>
          <string-name>
            <surname>Barch</surname>
            ,
            <given-names>C. S.</given-names>
          </string-name>
          <string-name>
            <surname>Carter</surname>
            , and
            <given-names>J. D.</given-names>
          </string-name>
          <string-name>
            <surname>Cohen</surname>
          </string-name>
          , “
          <article-title>Conflict monitoring and cognitive control.” Psychological review</article-title>
          , vol.
          <volume>108</volume>
          , no.
          <issue>3</issue>
          , p.
          <fpage>624</fpage>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>X.</given-names>
            <surname>Broque</surname>
          </string-name>
          <article-title>`re,</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Finzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Mainprice</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Rossi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Sidobre</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Staffa</surname>
          </string-name>
          , “
          <article-title>An attentional approach to human-robot interactive manipulation</article-title>
          ,
          <source>” I. J. Social Robotics</source>
          , vol.
          <volume>6</volume>
          , no.
          <issue>4</issue>
          , pp.
          <fpage>533</fpage>
          -
          <lpage>553</lpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>E.</given-names>
            <surname>Burattini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Rossi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Finzi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M. C.</given-names>
            <surname>Staffa</surname>
          </string-name>
          , “
          <article-title>Attentional modulation of mutually dependent behaviors,”</article-title>
          <source>in Proc. of SAB</source>
          <year>2010</year>
          ,
          <year>2010</year>
          , pp.
          <fpage>283</fpage>
          -
          <lpage>292</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>R.</given-names>
            <surname>Caccavale</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Finzi</surname>
          </string-name>
          , “
          <article-title>Plan execution and attentional regulations for flexible human-robot interaction,”</article-title>
          <source>in Proc. of SMC</source>
          <year>2015</year>
          ,
          <year>2015</year>
          , pp.
          <fpage>2453</fpage>
          -
          <lpage>2458</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5] --, “
          <article-title>Flexible task execution and attentional regulations in humanrobot interaction,”</article-title>
          <source>IEEE Trans. Cognitive and Developmental Systems</source>
          , vol.
          <volume>9</volume>
          , no.
          <issue>1</issue>
          , pp.
          <fpage>68</fpage>
          -
          <lpage>79</lpage>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>G.</given-names>
            <surname>Chang</surname>
          </string-name>
          and
          <string-name>
            <given-names>D.</given-names>
            <surname>Kulic</surname>
          </string-name>
          ´, “
          <article-title>Robot task learning from demonstration using petri nets,” in 2013 IEEE RO-MAN</article-title>
          . IEEE,
          <year>2013</year>
          , pp.
          <fpage>31</fpage>
          -
          <lpage>36</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>R. P.</given-names>
            <surname>Cooper</surname>
          </string-name>
          and
          <string-name>
            <given-names>T.</given-names>
            <surname>Shallice</surname>
          </string-name>
          , “
          <article-title>Hierarchical schemas and goals in the control of sequential behavior,” Psychological Review</article-title>
          , vol.
          <volume>113</volume>
          (
          <issue>4</issue>
          ), pp.
          <fpage>887</fpage>
          -
          <lpage>916</lpage>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>J.</given-names>
            <surname>Garforth</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. L.</given-names>
            <surname>McHale</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Meehan</surname>
          </string-name>
          , “
          <article-title>Executive attention, task selection and attention-based learning in a neurally controlled simulated robot</article-title>
          .
          <source>” Neurocomputing</source>
          , vol.
          <volume>69</volume>
          , no.
          <fpage>16</fpage>
          -
          <issue>18</issue>
          , pp.
          <fpage>1923</fpage>
          -
          <lpage>1945</lpage>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Lashley</surname>
          </string-name>
          , “
          <article-title>The problem of serial order in behavior,” in Cerebral Mechanisms in Behavior</article-title>
          , Wiley, New York, L. In Jeffress, Ed.,
          <year>1951</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>M. C.</given-names>
            <surname>Mozer</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Sitton</surname>
          </string-name>
          , “
          <article-title>Computational modeling of spatial attention</article-title>
          ,
          <source>” Attention</source>
          , vol.
          <volume>9</volume>
          , pp.
          <fpage>341</fpage>
          -
          <lpage>393</lpage>
          ,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>M. N.</given-names>
            <surname>Nicolescu</surname>
          </string-name>
          and
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Mataric</surname>
          </string-name>
          , “
          <article-title>Natural methods for robot task learning: Instructive demonstrations, generalization and practice</article-title>
          ,”
          <source>in Proc. of AAMAS 2003. ACM</source>
          ,
          <year>2003</year>
          , pp.
          <fpage>241</fpage>
          -
          <lpage>248</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>D. A.</given-names>
            <surname>Norman</surname>
          </string-name>
          and
          <string-name>
            <given-names>T.</given-names>
            <surname>Shallice</surname>
          </string-name>
          , “
          <article-title>Attention to action: Willed and automatic control of behavior,” in Consciousness and self-regulation: Advances in research</article-title>
          and theory,
          <year>1986</year>
          , vol.
          <volume>4</volume>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>18</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>