<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Mediating Joint Intentions with a Dialogue Management System</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Maitreyee Tewari</string-name>
          <email>maittewa@cs.umu.se</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Michele Persiani</string-name>
          <email>michelep@cs.umu.se</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Joint Intentions, Robotics, Goal Recognition, Reinforcement Learn-</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Umeå University</institution>
          ,
          <addr-line>Umeå</addr-line>
          ,
          <country country="SE">Sweden</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Umeå University</institution>
          ,
          <addr-line>Umeå</addr-line>
          ,
          <country country="SE">Sweden</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>ing</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>A necessary skill which enables machines to take part in decision making processes with their users is the ability to participate in the mediation of joint intentions. This paper presents a formalisation of an architecture to create and mediate joint intentions with an artificial agent. The proposed system is loosely based on the framework of we-intentions and embodied on a combination of Plan Recognition techniques to identify the user intention, and a Reinforcement Learning network which learns how to best interact with the inferred intention.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>No, you can set
the table instead
set
table
prepare
salad</p>
    </sec>
    <sec id="sec-2">
      <title>INTRODUCTION</title>
      <p>The socio-technological evolution of human society has motivated
the integration of robots in social and personal spaces. Hence, it
is becoming a pressuring requirement for social robotics to
understand human intentions and adapt to social values and needs.</p>
      <p>
        Among other reasons, humans interact to understand and
mediate intentions with other human participants [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. A successful
mediation of intention enable participants to decide profitable
collaboration, to manage expectations, or to decide whether to trust
the other participant. Natural language dialogues are among the
primitive modes [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] of human-human interaction, and are also
consistently used to mediate intentions. Dialogue management
strategies have exploited joint intention theory for building team
dialogues [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. However, this work views joint intentions with an
accent on joint task planning [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] for a human and a robot
participant, rather than on the communicative protocols being involved.
      </p>
      <p>
        The objective of this work is to model mediation of intentions
for Human-Robot Interaction (HRI) in a household scenario, and is
loosely based on the framework of we-intentions [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. Within the
scenario, we explore the cases where a person could need assistance
from a robot such as: in cooking, finding diferent objects in the
house, preparing for a visit to the supermarket, doctor or a friend.
For instance, the person might say “I want to prepare a salad” to
a robot, possibly having an intention for the robot to help her in
cooking the dinner.
      </p>
      <p>Hence, we explore the following research question: how to
create joint intention with machines? The motivation behind this
research question belongs to desired specifications of AI systems,
including the need of an integrated cognition and collaboration
mechanism, and a natural interaction between human and AI
systems. Ultimately, it is about investigating the boundaries between
the Eco-system of AI with that of human-beings.
prepare
salad</p>
      <p>I want to
prepare a salad</p>
      <p>
        To answer the research question, the formalisation of joint
intentions in the context of shared task planning, and defining dialogue
act functions [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] was done. This formalism ofered a turn-based
interaction scheme that allows two participant (human and a robot)
to mediate an intention regarding a shared task.
2
      </p>
    </sec>
    <sec id="sec-3">
      <title>METHODOLOGY</title>
      <p>
        Some of the previous work [
        <xref ref-type="bibr" rid="ref11 ref7">7, 11</xref>
        ] proposed team rationality for
building collaborative multi-agent systems, for example, in [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] the
authors used Shared Plan [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] and Propose Trees to model
collaboration as multi-agent planning problem, where a rational team will
perform an action only if the benefits from performing an action
is less than its cost. In [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] the authors formalised communication
protocols using joint intention theory. The authors used joint
persistent goals and persistent weak achievement goals to build joint
intentions, and speech acts such as request, ofer, inform, confirm,
refuse, acknowledge, and standing-ofer for their mediation.
Copyright ©2020 for this paper by its authors. Use permitted under Creative Commons
License Attribution 4.0 International (CC BY 4.0).
      </p>
      <p>Ok, I can fry
some eggs
set
table</p>
      <p>OK</p>
      <p>As later described, we propose certain assumptions to lift some
of the complexity that previous research utilizes in the—context
of joint intention theory. We believe that such complexities, while
theoretically sound, make implementations on real systems dificult
and brittle; for this reason, we utilize a simplification of previous
work’s formalization for our needs. The rest of the section provides
our simplified formalisation of mediating joint intention theory
and attempts to briefly reason about the constraints posed.</p>
      <p>
        Our proposed approach is based on predicate logic combined
with planning, and is influenced by logic based semantics proposed
in [
        <xref ref-type="bibr" rid="ref1 ref14 ref2">1, 2, 14</xref>
        ]. Agents are represented by x, y, . . . x1, x2, . . . y1, y2, . . .
and their actions by a1, a2, an . An intention of a single agent x is a
plan π = {a0, a1, ..., an } of actions together with a goal д the agent
is committed to [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] and the intention is partially observed through
O ⊆ π .
      </p>
      <p>Know(x, p) ≡ p ∧ Bel (x, p) represents the knowledge of agents
and Mut Bel (x, y, p) that x and y share a mutual belief about p—In
our formulation an agent’s intention is represented by the predicate
Intend(x, д, O) while a joint intention JointIntend(x, y, д, O). An
agent has an intention if following holds:</p>
      <p>Intend(x, д, O) ≡ Know(x, ∃π O ⊆ π ∧</p>
      <p>Goal (π ) = д∧</p>
      <sec id="sec-3-1">
        <title>Commit (x, π ))</title>
        <p>i.e. not only is true that the agent has an intention and is committed
to it, but the agent also has a belief about it. The set O is an explicit
subset of π for which it is known that the agent already committed
to it, and contains past observations or declarations about future
commitments about π .</p>
        <p>Eq. 2 expresses that to provide agents an intention doesn’t require
to explicit their full intention π , but only a part of it (see Figure 1),
with the full intention being instead inferred by grounding the
observed commitments in the task space.</p>
        <p>A joint intention is an intention shared by the agents x and y
with the same goal д. Therefore, a joint intention is a plan π =
{ax 0, ay0, ax 1, ..., ayn } together with a goal д where the actions
in π can be allocated to either participants x or y. Furthermore,
the involved agents have a mutual believe Mut Bel about each
others’ commitments. Hence, two agents hold a joint intention if the
following holds true:</p>
        <p>JointIntend(x, y, д, O) ≡ Intend(x, д, O) ∧ Intend(y, д, O)∧</p>
      </sec>
      <sec id="sec-3-2">
        <title>Mut Bel (x, y, JointIntend(x, y, д, O))</title>
        <p>
          By this formulation x and y are allowed to have separate beliefs and
inference mechanisms through which they find π ; but are bound
to have the same goal and observed commitments. Notice that this
is a simplification of how joint intentions have been previously
formalized in the literature, and to which we invite the reader.
Nevertheless, this formalization is suficient for our purpose of creating
a dialogue manager that allows mediation of joint intentions. In
this context of a dialogue between—two agents x and y we further
make the following assumption:
|= ∃O∃д JointIntend(x, y, д, O)
(1)
(2)
(3)
that translates as there is always a joint intention between x and
y. This assumption, while being quite strong, is quite reasonable
for our context as the proposed DM is specifically tailored to
mediate joint intentions. During every dialogue a joint intention is
always obtained, and when the user leaves the conversation there
is always an intention that was formed and is shared with the DM.
Furthermore, it is always the case that the user utilises the DM to
instantiate joint intentions. We do not take into consideration the
cases in which the joint intention is bootstrapped or terminated as
for example shown in [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ].
        </p>
        <p>Following the given definitions, we propose an interaction
mechanism that allows two participants to collaboratively build O, by
being able to add or remove actions from it. Currently, we have
the following assumptions: 1) for every trial two participants are
present, that is a human user and a Dialogue Manager (DM), that
can be integrated for example in a house robot. 2) the DM is
modelled to be user initiated, which always proposes the first action
that will enter the set O.</p>
        <p>
          Having an observed set O in a form of a partial plan, the DM can
infer the most likely full intention π by utilizing plan recognition
techniques as later described. This inference is based on the current
state of the world that we assume to be available to the DM in the
form of truth predicates. A possible architecture for maintaining
an updated world description is not provided by this paper but can
be for example implemented as in [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ].
2.1
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Goal Recognition and plan generation</title>
      <p>
        At every turn of the dialogue the agent is required to infer the joint
plan π to be able to participate in its mediation. For this purpose, we
utilize plan recognition techniques based on the Planning Domain
Definition Language (PDDL) [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. PDDL belongs to the group of
planning techniques known as classical planning, and allows to
easily create non-hierarchical task domains.
      </p>
      <p>
        For a given task domain we select the set of goals G as possible
goals a user can pursue. Example of possible goals for Figure 1
can be to prepare dinner, lunch or breakfast. Plan recognition is
achieved by a modified version of the method proposed in [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] with
the following diferences: 1) we allow the PDDL planner to plan
using partially instantiated actions1, and 2) the observations O are
treated as a set rather than a sequence. Given an eventually empty
set of observations O, goal recognition is performed as:
дˆ = arдmax C(∅, д)
д ∈G C(O, д)
(4)
where C(O, д) is the cost of a plan achieving д and constrained to
contain O, C(∅, д) is the cost of an optimal plan achieving д without
being constrained by O. Hence, 0 ≤ CC((O∅,,дд)) ≤ 1 gives indication
on how costly it is to deviate from an optimal plan achieving д for
compliance with O. Finally, for an inferred goal дˆ we obtain π as
the optimal plan achieving д while being constrained to contain
the observations O.
1We define a PDDL action as partially instantiated if not all of its arguments are
grounded in the task domain. An action is fully instantiated when all arguments are
grounded.
      </p>
    </sec>
    <sec id="sec-5">
      <title>2.2 Mediation of Joint Intention</title>
      <p>The agent and the user have to use a medium to communicate their
joint intention, and to negotiate goals д and commitments O. In
order to do that, we formalise a finite-state negotiation dialogue
strategy with following dialogue acts: (ofer, counter-ofer, accept,
and reject). The dialogue strategies will be implemented with a
spoken dialogue management system (SDS) 2.</p>
      <p>Traditionally, an SDS consists of speech synthesis that recognises
and generates speech, natural language understanding (NLU)
transforms the human generated natural language to knowledge for the
machine. A dialogue manager (DM) makes the decision based on
the NLU and other components such as previous history, database
etc, and natural language generation (NLG) receives the decision
from the DM, transforms it to human understandable format and
sends to speech synthesizer.</p>
      <p>When the user generates its first utterance, it is transformed
from speech to text and arrives at the NLU component. The NLU
transforms the text to knowledge (semantic roles) and assigns a
dialogue act ofer 2. An ofer from the user instantiates the plan
π by performing plan recognition, and creates a joint intention as
described by JointIntend.</p>
      <p>We define five dialogue acts Ofer a , Ofer д , Counter-ofer, Accept
and Reject with which both user and DM can mediate the intention’s
goal д and commitments O. Table 1 contains the efects of these
dialogue acts with respect to three sets: θ is a set of ofers, R and
O are respectively the sets of rejected and accepted commitments.
An Ofer a represents ofer about an action, Ofer g indicates ofer
about a goal, Counter-ofer is an action a1 is not accepted and an
alternative a2 is instead proposed. An Accept and Reject can be used
to accept or reject proposed commitments.
2At this stage we define a finite state SDS and only the user can initiate a dialogue
only using ofer together with a goal or an action.</p>
      <p>
        Since a dialogue policy based on Finite-State-Machines is not
realizable as it would need to consider all possible intentions, also
based on the state of the task, we propose to learn the DM dialogue
policy with Reinforcement Learning methods. This approach is not
new in the context of dialogue management, and by this method
the user is simulated by an Agenda [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
At every turn, a Q-Network [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] evaluates the current inferred π
together with the actions in the sets θ, R and O and the current
PDDL state, selecting which dialogue act to perform by an ϵ-greedy
policy computed on the dialogue acts expected return. In RL, agents
learn which policy to adopt by maximising the reward they receive
during each episode. The current version of the reward function is:
R = −αT + β
π ∩ π + γ C(π, д)
π ∪ π C(π, д)
(5)
where π and д form the user’s original desired joint intention (held
by the Agenda). The first terms penalises every turn that the
interaction takes, hence making interactions as short as possible. The
second term evaluates how the final resulting intention is similar
to the one the user had as objective for the interaction. The third
term evaluates instead the cost the resulting mediated intention
has, compared to the user’s original one. α, β and γ determine how
the three components of the reward function are weighted. Notice
that the system cannot access π and д, that are instead only used at
the end of every interaction for evaluation. Thus, the the DM learns
to mediate and improve the unobservable user intention π, д.
3
      </p>
    </sec>
    <sec id="sec-6">
      <title>FUTURE WORK</title>
      <p>The research is still in its early stages and we are currently
implementing the described system. We developed the goal recognition
and the reinforcement learning components together with a simple
user Agenda. The Agenda is based on PDDL and simulates how the
user would modify the joint intention during its turn, while having
as objective a randomly generated joint intention.</p>
      <p>Initial experiments gave positive results, in the sense that the RL
is able to learn the structure of the problem for simple scenarios, and
successfully maximises the possible rewards. Several investigations
are needed and are still open: what is the Q-Network learning?
Does our current setting allows any generalisation? The current
implementation requires hundreds of episodes to converge. Can
the process be made faster/simpler? How to facilitate the online
adaptation over real users?</p>
      <p>
        Encapsulation of the joint intention model with SDS is still to
be implemented. For early prototypes of the system we plan to
implement the dialogue manager as described in Section 2.2. Later
versions could see the implementation of a more complete SDS
through for example a POMDP model [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. This could allow to have
dialogues that are not strictly related to the mediation of the joint
intention, but rather more flexible and intuitive for the user. Finally,
investigation about the soundness of this approach in real scenarios
for example in user studies is still to be performed.
      </p>
    </sec>
    <sec id="sec-7">
      <title>ACKNOWLEDGMENTS</title>
      <p>This work has received funding from the European Union’s Horizon
2020 research and innovation program under the Marie
SkłodowskaCurie grant agreement No 721619 for the SOCRATES project.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Philip</surname>
            <given-names>R</given-names>
          </string-name>
          <string-name>
            <surname>Cohen</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Foundations of Collaborative Task-Oriented Dialogue: What's in a Slot?</article-title>
          .
          <source>In Proceedings of the 20th Annual SIGdial Meeting on Discourse and Dialogue</source>
          .
          <volume>198</volume>
          -
          <fpage>209</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Philip</surname>
            <given-names>R</given-names>
          </string-name>
          <string-name>
            <surname>Cohen and Hector J Levesque</surname>
          </string-name>
          .
          <year>1990</year>
          .
          <article-title>Intention is choice with commitment</article-title>
          .
          <source>Artificial intelligence 42</source>
          ,
          <fpage>2</fpage>
          -
          <lpage>3</lpage>
          (
          <year>1990</year>
          ),
          <fpage>213</fpage>
          -
          <lpage>261</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Sandra</given-names>
            <surname>Devin</surname>
          </string-name>
          and
          <string-name>
            <given-names>Rachid</given-names>
            <surname>Alami</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>An implemented theory of mind to improve human-robot shared plans execution</article-title>
          .
          <source>In</source>
          <year>2016</year>
          11th ACM/IEEE International Conference on
          <article-title>Human-Robot Interaction (HRI)</article-title>
          . IEEE,
          <fpage>319</fpage>
          -
          <lpage>326</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4] New World Encyclopedia.
          <year>2017</year>
          . Dialogue - New World Encyclopedia,. //www. newworldencyclopedia.org/p/index.php?title=Dialogue&amp;oldid=1007366 [Online; accessed 24-January-2020].
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Barbara</given-names>
            <surname>Grosz</surname>
          </string-name>
          and
          <string-name>
            <given-names>Sarit</given-names>
            <surname>Kraus</surname>
          </string-name>
          .
          <year>1996</year>
          .
          <article-title>Collaborative plans for complex group action</article-title>
          .
          <source>Artificial Intelligence</source>
          (
          <year>1996</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Bunt</given-names>
            <surname>Harry</surname>
          </string-name>
          , Petukhova Volha, Traum David, and
          <string-name>
            <given-names>Alexandersson</given-names>
            <surname>Jan</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Dialogue Act Annotation with the ISO 24617-</article-title>
          2 Standard. Springer International Publishing,
          <volume>109</volume>
          -
          <fpage>135</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Ece</given-names>
            <surname>Kamar</surname>
          </string-name>
          ,
          <string-name>
            <surname>Ya'akov Gal</surname>
          </string-name>
          , and
          <string-name>
            <surname>Barbara</surname>
          </string-name>
          J Grosz.
          <year>2009</year>
          .
          <article-title>Incorporating helpful behavior into collaborative planning</article-title>
          .
          <source>In Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems (AAMAS)</source>
          . Springer Verlag.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Drew</given-names>
            <surname>McDermott</surname>
          </string-name>
          .
          <year>2003</year>
          .
          <article-title>The Formal Semantics of Processes in PDDL</article-title>
          .
          <source>In Proc. ICAPS Workshop on PDDL. ICAPS Workshop</source>
          , Trento, Italy,
          <volume>8</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Michael</given-names>
            <surname>Frederick</surname>
          </string-name>
          <string-name>
            <surname>McTear</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Zoraida</given-names>
            <surname>Callejas</surname>
          </string-name>
          , and
          <string-name>
            <given-names>David</given-names>
            <surname>Griol</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>The conversational interface</article-title>
          .
          <source>Vol. 6</source>
          . Springer, Spain.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>Miguel</given-names>
            <surname>Ramírez</surname>
          </string-name>
          and
          <string-name>
            <given-names>Hector</given-names>
            <surname>Gefner</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>Probabilistic plan recognition using of-the-shelf classical planners</article-title>
          .
          <source>In Twenty-Fourth AAAI Conference on Artificial Intelligence</source>
          . MIT Press, USA, 6.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Timothy</surname>
            <given-names>W.</given-names>
          </string-name>
          <string-name>
            <surname>Rauenbusch</surname>
            and
            <given-names>Barbara J.</given-names>
          </string-name>
          <string-name>
            <surname>Grosz</surname>
          </string-name>
          .
          <year>2003</year>
          .
          <article-title>A Decision Making Procedure for Collaborative Planning</article-title>
          .
          <source>In Proceedings of the Second International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS '03)</source>
          .
          <source>Association for Computing Machinery</source>
          ,
          <fpage>1106</fpage>
          -
          <lpage>1107</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Jost</surname>
            <given-names>Schatzmann</given-names>
          </string-name>
          , Blaise Thomson, and
          <string-name>
            <given-names>Steve</given-names>
            <surname>Young</surname>
          </string-name>
          .
          <year>2007</year>
          .
          <article-title>Statistical user simulation with a hidden agenda</article-title>
          .
          <source>In Proceedings of the 8th SIGdial Workshop on Discourse and Dialogue</source>
          .
          <volume>273</volume>
          -
          <fpage>282</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>David</surname>
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Schweikard</surname>
          </string-name>
          and Hans Bernhard Schmid.
          <year>2013</year>
          .
          <article-title>Collective Intentionality</article-title>
          . In The Stanford Encyclopedia of Philosophy (summer
          <year>2013</year>
          ed.), Edward N.
          <string-name>
            <surname>Zalta</surname>
          </string-name>
          (Ed.). Metaphysics Research Lab, Stanford University, USA.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Ira</surname>
            <given-names>A Smith</given-names>
          </string-name>
          , Philip R Cohen, Jefrey M Bradshaw,
          <string-name>
            <given-names>Mark</given-names>
            <surname>Greaves</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Heather</given-names>
            <surname>Holmback</surname>
          </string-name>
          .
          <year>1998</year>
          .
          <article-title>Designing conversation policies using joint intention theory</article-title>
          .
          <source>In Proceedings International Conference on Multi Agent Systems (Cat. No. 98EX160)</source>
          . IEEE,
          <fpage>269</fpage>
          -
          <lpage>276</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Rajah</surname>
            <given-names>Subramanian</given-names>
          </string-name>
          , Sanjeev Kumar, and Philip Cohen.
          <year>2006</year>
          .
          <article-title>Integrating Joint Intention Theory, Belief Reasoning, and Communicative Action for Generating Team-Oriented Dialogue.</article-title>
          .
          <source>In Proceedings of the National Conference on Artificial Intelligence</source>
          , Vol.
          <volume>21</volume>
          . MIT Press, Boston, USA, 6.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>Michael</given-names>
            <surname>Tomasello</surname>
          </string-name>
          , Malinda Carpenter, Josep Call, Tanya Behne, and
          <string-name>
            <given-names>Henrike</given-names>
            <surname>Moll</surname>
          </string-name>
          .
          <year>2005</year>
          .
          <article-title>Understanding and sharing intentions: The origins of cultural cognition</article-title>
          .
          <source>Behavioral and Brain Sciences</source>
          <volume>28</volume>
          ,
          <issue>5</issue>
          (
          <year>2005</year>
          ),
          <fpage>675</fpage>
          -
          <lpage>691</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>Raimo</given-names>
            <surname>Tuomela</surname>
          </string-name>
          .
          <year>2005</year>
          .
          <article-title>We-intentions revisited</article-title>
          .
          <source>Philosophical Studies</source>
          <volume>125</volume>
          ,
          <issue>3</issue>
          (
          <year>2005</year>
          ),
          <fpage>327</fpage>
          -
          <lpage>369</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <surname>Hado</surname>
            <given-names>Van Hasselt</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Arthur Guez</surname>
            , and
            <given-names>David</given-names>
          </string-name>
          <string-name>
            <surname>Silver</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Deep reinforcement learning with double q-learning</article-title>
          .
          <source>In Thirtieth AAAI conference on artificial intelligence</source>
          . MIT Press, Arizona, USA, 7.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>