<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Artificial Trust for Decision-Making in Human-AI Teamwork: Steps and Challenges</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Carolina Centeio Jorge</string-name>
          <email>C.Jorge@tudelft.nl</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Catholijn M. Jonker</string-name>
          <email>C.M.Jonker@tudelft.nl</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Myrthe L. Tielman</string-name>
          <email>M.L.Tielman@tudelft.nl</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Interactive Intelligence, Delft University of Technology</institution>
          ,
          <addr-line>Delft</addr-line>
          ,
          <country country="NL">Netherlands</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>LIACS, Leiden University</institution>
          ,
          <addr-line>Leiden</addr-line>
          ,
          <country country="NL">Netherlands</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <fpage>26</fpage>
      <lpage>27</lpage>
      <abstract>
        <p>Human-AI teams count on both humans and artificial agents to work together collaboratively. In human-human teams, we use trust to make decisions. Similarly, our work explores how an AI can use trust (in human teammates) to make decisions while ensuring the team's goal and mitigating risks for the humans involved. We present the several steps and challenges towards the development of an artificial-trust-based decision-making model.</p>
      </abstract>
      <kwd-group>
        <kwd>Challenges</kwd>
        <kwd>artificial trust</kwd>
        <kwd>human trustworthiness</kwd>
        <kwd>human-AI teamwork</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Our work focuses on how an artificially intelligent agent (also referred to as AI) can understand
its human teammates in a wide range of teams, from search and rescue to healthcare, cooking,
etc. In particular, we explore how an AI can use artificial trust to predict and understand
whether a human will do a certain task and, if so, how well. With artificial trust, the agent
might be able to make informed decisions which will improve team eficiency and reduce risks.
The main body of research on trust in human-AI teams is growing, see e.g., [
        <xref ref-type="bibr" rid="ref1 ref2 ref3 ref4 ref5 ref6">1, 2, 3, 4, 5, 6</xref>
        ], but
does not address artificial trust in humans often. For research that touches upon artificial trust
      </p>
      <p>Artificial trust (a term described to the process of trust where the trustor is an AI [ 12]),
similarly to natural trust (when the trustor is human), can be seen as a construct of aspects
that are relevant when a human teammate is asked to collaborate on a certain task, such as
ability, benevolence, integrity, capacity, preference, etc, see e.g. [14, 15]. However, it is still an
open question how relevant these aspects are in human-AI teams. Most of these constructs
come from human-human studies and need to be tested in scenarios where humans and AIs
(M. L. Tielman)
(M. L. Tielman)
(C. M. Jonker); https://ii.tudelft.nl/Myrthe/ (M. L. Tielman)
are teammates. On the other hand, the multi-agent systems (MAS) community has since long
used beliefs of trust and trustworthiness for decision-making, see e.g., [16, 17, 18, 19]. Our work
aims at computing trust beliefs for the agent, as in MAS literature, while having a human as
trustee (the entity being trusted). This requires us to reach inspiration from social sciences and
run user studies to explore and validate our possibilities. In this short paper, we present the
steps and challenges towards our artificial-trust-based decision-making model.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Towards beliefs of Artificial Trust in Human Teammates</title>
      <p>The goal of this work is to enable an AI teammate to make decisions taking its human teammates’
trustworthiness into account. For this, the AI teammate needs to be able to form beliefs of
artificial trust regarding humans. In [ 20], the authors present an artificial agent’s trust in another
as a construct of Competence belief, Willingness belief and Dependence belief. Competence belief
deals with believing the trustee has the necessary abilities to perform a task, whereas willingness
translates into believing the trustee will do a task given the context, independently of their
abilities. Finally, dependence belief lies on the trustor’s side, and it is crucial for the
decisionmaking process as it tells how much the trustor depends on the trustee for the execution of a
certain task. When we consider a team and joint goals, not only the trustor’s (the one making
the decision to trust) dependence belief is important but also the trustee’s dependence on the
trustor (e.g., the AI may choose to help a human because it believes the human is in a risky
situation and depends on the AI for success). To the best of our knowledge, these beliefs are
yet to be implemented and tested in human-AI interaction. It is challenging to do so as we do
not know how to form such beliefs from humans, i.e., as said before, which aspects constitute
human trustworthiness and how an artificial agent can perceive them. However, we aim at
having such beliefs as a starting point for our artificial trust model, which will be used to make
decisions.</p>
      <p>Figure 1 presents the overview of the goal model. The human presents a manifesta (i.e.,
set of behavioural cues) which represent a krypta (i.e., set of characteristics of the human)
[21, 22]. The AI can use the manifesta to model beliefs of artificial trust in a hybrid way,
i.e., with both data-driven and knowledge-based techniques, from the manifesta along with
environmental factors (which give context). Beliefs of artificial trust can be, as mentioned before,
competence, willingness, and dependence beliefs. With some of these beliefs, such as competence
and willingness, the AI should be able to predict some human behaviour and then, keeping the
context in mind (and the (inter)dependencies), make a decision. Finally, given the outcome of
this action, the agent should update its model of artificial trust.</p>
      <p>In summary, the modelling steps towards a model of artificial trust for AI teammates’
decisionmaking are:
1. Investigating krypta and manifesta of human trustworthiness towards an artificial teammate.</p>
      <p>This included exploring which unobservable characteristics (the krypta) constitute human
trustworthiness in human-AI teams and how they can be observed (the manifesta) through
a user study in an online 2D grid-world supermarket environment. The task consisted
of helping two artificial agents by collecting the necessary products in the supermarket
(inspired by the increase of online supermarket orders during the COVID-19 pandemic).
The agents would ask for diferent products and the human could choose which agent
to help. After choosing to help one of the agents, the subject could either complete the
task, lie about it, or give up on that task, and then go to the next one. We included
metrics and conditions based on Mayer’s ABI model [14], which proposes that someone’s
trustworthiness depends on their ability, benevolence, and integrity. The results of the
experiment are currently in the publishing process, but it is possible to find a preliminary
report in [23].
2. Updating artificial trust based on interaction given a context. After studying which aspects
may play a role in a human’s trustworthiness towards an artificial teammate, and grasping
how it may be possible to perceive them, it is important to explore how this trust can be
updated. Trust is dynamic[24], and an artificial teammate should be able to constantly
update its trust values throughout the interaction. Here, we can integrate existing models
such as [11].
3. Using artificial trust to make decisions. The goal of this step is to propose a decision-making
model for an AI agent which takes into account values of artificial trust for each sub-task
of a joint goal and, at the same time, the interdependencies. We use principles of Coactive
Design [25], including Interdependence Analysis.
4. Evaluating artificial trust and decision-making models. It is, in general, hard to evaluate
artificial trust models, since we do not have ground truth. We work on defining metrics
to compare diferent artificial trust models with baselines. The values of artificial trust
can lead to decisions that may impact the environment and possibly the human teammate.
It is then easier to compare the outcome of the decisions, rather than the trust values
alone. The baselines include never-trusting models, always-trusting models, and random
models.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Challenges</title>
      <p>Although the steps towards having an artificial-trust-based decision-making model are
deifned, they present several challenges. Overall, designing and evaluating experiments presents
methodological and theoretical issues, such as trying to explore complex real-world scenarios
in controlled environments [26, 27]. Our research is no exception, and we have learned along
the way about the main obstacles.</p>
      <p>The first and main challenge is the lack of ground truth. Human trustworthiness has no
available ground truth, making it hard to evaluate our artificial trust models. When we propose
objective measures of trustworthiness, which may be manifestations of the krypta, we cannot
prove that these are correct, or good, as there is nothing to compare them with. There is even
the risk that, when we propose what human trustworthiness in a certain context may be, define
the measures and then design the study, and all of this without other models to compare with
or ground truth, we fall into a self-filling prophecy. We have tried to compare our measures
with humans’ perception of their own trustworthiness, but this is far from the ground truth, as
their perception can be far from reality. This challenge afects the first step. This being said,
we develop metrics and baselines (mentioned in step 4) that help us evaluate our model by
comparison.</p>
      <p>Another challenge every time we start designing an experiment within this topic is the task
design. The task is the platform in which we want to explore several aspects, but finding one
task that accommodates all aspects is hard. Furthermore, to study human-AI teamwork, we
need to ensure the human participant understands the collaborative aspect of the environment.
Otherwise, participants might focus on solving the task rather than caring about the interaction
(which, we believe, would not necessarily be the same in a real-world situation). For example,
one may present diferent narratives to diferent participants, such as the context or background
story of the AI system or their (human and AI) relationship, without integrating them into the
task. This can lead to the participants focusing mainly on completing the task and ignoring the
rest of the information they were given. For this reason, we invested in visual representations
of the teamwork and of the levels of interdependencies, with the aim that the participants
understand that the AI is collaborative and that they can team up.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Conclusion</title>
      <p>This paper presents an overview of the work that is being developed towards an
artificialtrust-based decision-making model. The goal of such a model is to enable an AI teammate to
make informed decisions within a team context, taking into account its human teammate’s
trustworthiness and the context. The model should be updated throughout the interactions.
We explain how we try to bridge multi-agent systems with social sciences and present the main
steps that we take to develop such a model. Finally, we conclude with some of the challenges
we have encountered throughout our research, mainly related to the user studies, and reflect on
tentative mitigation strategies. Although there is still a long gap to fill, we believe this model
will be important for human-AI teams, improving their eficiency and mitigating possible risks.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments</title>
      <p>This research was supported by the Delft AI Initiative and by EU Horizon 2020 research and
innovation programme under GA Numbers 952215 (TAILOR), and 820437 (Humane AI Net),
and supported by the National Science Foundation (NWO) under Grant Numbers 024.004.022
(Hybrid Intelligence), 024.004.031 (ESDiT), and 024.005.017 (ALGOSOC). The support is gratefully
acknowledged. Any opinions, findings, conclusions, or recommendations expressed in this
material are those of the author(s) and do not necessarily reflect the views of the supporting
organisations.
conceptual framework based on risk, ACM Transactions on Interactive Intelligent Systems
8 (2018). doi:10.1145/3152890.
[9] S. Vinanzi, M. Patacchiola, A. Chella, A. Cangelosi, Would a robot trust you? developmental
robotics model of trust and theory of mind, Philosophical Transactions of the Royal Society
B: Biological Sciences 374 (2019). doi:10.1098/rstb.2018.0032.
[10] V. Surendran, A. Wagner, Your robot is watching: Using surface cues to evaluate the
trustworthiness of human actions, 2019 28th IEEE International Conference on Robot and
Human Interactive Communication (RO-MAN) (2019) 1–8.
[11] A. Ali, H. Azevedo-Sa, D. M. Tilbury, L. P. Robert, Heterogeneous human–robot task
allocation based on artificial trust, Scientific Reports 12 (2022) 1–15.
[12] H. Azevedo-Sa, X. J. Yang, L. P. Robert, D. M. Tilbury, A unified bi-directional model for
natural and artificial trust in human-robot collaboration, IEEE Robotics Autom. Lett. 6
(2021) 5913–5920. doi:10.1109/LRA.2021.3088082.
[13] S. Vinanzi, A. Cangelosi, C. Goerick, The collaborative mind: intention reading and trust in
human-robot interaction, iScience 24 (2021) 102130. doi:10.1016/j.isci.2021.102130.
[14] R. C. Mayer, J. H. Davis, F. D. Schoorman, An integrative model of organizational trust,</p>
      <p>Source: The Academy of Management Review 20 (1995) 709–734.
[15] M. Johnson, J. M. Bradshaw, The role of interdependence in trust, in: Trust in Human-Robot</p>
      <p>Interaction, Elsevier, 2021, pp. 379–403.
[16] J. Sabater-Mir, L. Vercouter, Trust and reputation in multiagent systems, Multiagent
systems (2013) 381.
[17] J. Urbano, A. P. Rocha, E. Oliveira, Computing confidence values: Does trust dynamics
matter?, in: Portuguese Conference on Artificial Intelligence, Springer, 2009, pp. 520–531.
[18] C. Castelfranchi, R. Falcone, Trust is much more than subjective probability: Mental
components and sources of trust, in: Proceedings of the 33rd annual Hawaii international
conference on system sciences, IEEE, 2000, pp. 10–pp.
[19] R. Falcone, G. Pezzulo, C. Castelfranchi, A fuzzy approach to a belief-based trust
computation, in: Trust, Reputation, and Security: Theories and Practice, volume 2631, Springer
Verlag, 2003, pp. 73–86. doi:10.1007/3-540-36609-1_7.
[20] R. Falcone, C. Castelfranchi, Trust dynamics: How trust is influenced by direct experiences
and by trust itself, in: AAMAS, IEEE Computer Society, 2004, pp. 740–747. doi:10.1109/
AAMAS.2004.10084.
[21] R. Falcone, M. Piunti, M. Venanzi, C. Castelfranchi, From manifesta to krypta: The
relevance of categories for trusting others, ACM Transactions on Intelligent Systems and
Technology 4 (2013). doi:10.1145/2438653.2438662.
[22] M. Bacharach, D. Gambetta, Trust as type detection, in: Trust and deception in virtual
societies, Springer, 2001, pp. 1–26.
[23] C. Centeio Jorge, M. L. Tielman, C. M. Jonker, Assessing artificial trust in human-agent
teams: a conceptual model, in: C. Martinho, J. Dias, J. Campos, D. Heylen (Eds.), IVA ’22:
ACM International Conference on Intelligent Virtual Agents, Faro, Portugal, September 6
9, 2022, ACM, 2022, pp. 24:1–24:3. doi:10.1145/3514197.3549696.
[24] L. Huang, N. J. Cooke, R. S. Gutzwiller, S. Berman, E. K. Chiou, M. Demir, W. Zhang,
Distributed dynamic team trust in human, artificial intelligence, and robot teaming, in:
Trust in human-robot interaction, Elsevier, 2021, pp. 301–319.
[25] M. Johnson, J. M. Bradshaw, P. J. Feltovich, C. M. Jonker, M. B. Van Riemsdijk, M. Sierhuis,
Coactive design: Designing support for interdependence in joint activity, Journal of
Human-Robot Interaction 3 (2014) 43–69.
[26] A. L. Brown, Design experiments: Theoretical and methodological challenges in creating
complex interventions in classroom settings, The journal of the learning sciences 2 (1992)
141–178.
[27] A. Collins, D. Joseph, K. Bielaczyc, Design research: Theoretical and methodological issues,
Journal of the Learning Sciences 13 (2004) 15 – 42. doi:10.1207/s15327809jls1301_2,
cited by: 1090.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A.-S.</given-names>
            <surname>Ulfert</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Georganta</surname>
          </string-name>
          ,
          <article-title>A model of team trust in human-agent teams</article-title>
          ,
          <source>in: Companion Publication of the 2020 International Conference on Multimodal Interaction, ICMI '20 Companion</source>
          , Association for Computing Machinery, New York, NY, USA,
          <year>2020</year>
          , p.
          <fpage>171</fpage>
          -
          <lpage>176</lpage>
          . doi:
          <volume>10</volume>
          .1145/3395035.3425959.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M.</given-names>
            <surname>Lewis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Sycara</surname>
          </string-name>
          ,
          <article-title>Deep learning, transparency, and trust in human robot teamwork, in: Trust in Human-Robot Interaction</article-title>
          , Elsevier,
          <year>2020</year>
          , pp.
          <fpage>321</fpage>
          -
          <lpage>352</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>K. E.</given-names>
            <surname>Schaefer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. S.</given-names>
            <surname>Perelman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. M.</given-names>
            <surname>Gremillion</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. R.</given-names>
            <surname>Marathe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. S.</given-names>
            <surname>Metcalfe</surname>
          </string-name>
          ,
          <article-title>A roadmap for developing team trust metrics for human-autonomy teams</article-title>
          , in: Trust in HumanRobot Interaction, Academic Press,
          <year>2021</year>
          , pp.
          <fpage>261</fpage>
          -
          <lpage>300</lpage>
          . doi:https://doi.org/10.1016/ B978- 0
          <source>- 12- 819472- 0</source>
          .
          <fpage>00012</fpage>
          -
          <lpage>5</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>E. J. D.</given-names>
            <surname>Visser</surname>
          </string-name>
          , Marieke,
          <string-name>
            <given-names>M. M.</given-names>
            <surname>Peeters</surname>
          </string-name>
          ,
          <string-name>
            <surname>Malte</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Jung</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Kohn</surname>
            , Tyler,
            <given-names>H.</given-names>
          </string-name>
          <string-name>
            <surname>Shaw</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Pak</surname>
            ,
            <given-names>M. A.</given-names>
          </string-name>
          <string-name>
            <surname>Neerincx</surname>
          </string-name>
          ,
          <article-title>Towards a theory of longitudinal trust calibration in human-robot teams</article-title>
          ,
          <source>International Journal of Social Robotics</source>
          <volume>12</volume>
          (
          <year>2020</year>
          )
          <fpage>459</fpage>
          -
          <lpage>478</lpage>
          . doi:
          <volume>10</volume>
          .1007/s12369- 019- 00596- x.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>B. G.</given-names>
            <surname>Schelble</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Lancaster</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Duan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Mallick</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. J.</given-names>
            <surname>McNeese</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. Lopez,</surname>
          </string-name>
          <article-title>The efect of AI teammate ethicality on trust outcomes and individual performance in human-ai teams</article-title>
          , in: T. X.
          <string-name>
            <surname>Bui</surname>
          </string-name>
          (Ed.),
          <source>56th Hawaii International Conference on System Sciences, HICSS</source>
          <year>2023</year>
          , Maui, Hawaii, USA, January 3-
          <issue>6</issue>
          ,
          <year>2023</year>
          , ScholarSpace,
          <year>2023</year>
          , pp.
          <fpage>322</fpage>
          -
          <lpage>331</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>N. J.</given-names>
            <surname>McNeese</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Demir</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. K.</given-names>
            <surname>Chiou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. J.</given-names>
            <surname>Cooke</surname>
          </string-name>
          ,
          <article-title>Trust and team performance in humanautonomy teaming</article-title>
          ,
          <source>Int. J. Electron. Commer</source>
          .
          <volume>25</volume>
          (
          <year>2021</year>
          )
          <fpage>51</fpage>
          -
          <lpage>72</lpage>
          . doi:
          <volume>10</volume>
          .1080/10864415.
          <year>2021</year>
          .
          <volume>1846854</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>A. R.</given-names>
            <surname>Wagner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. C.</given-names>
            <surname>Arkin</surname>
          </string-name>
          ,
          <article-title>Recognizing situations that demand trust</article-title>
          ,
          <source>in: 2011 RO-MAN, IEEE</source>
          ,
          <year>2011</year>
          , pp.
          <fpage>7</fpage>
          -
          <lpage>14</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>A. R.</given-names>
            <surname>Wagner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Robinette</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Howard</surname>
          </string-name>
          ,
          <article-title>Modeling the human-robot trust phenomenon: A</article-title>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>