<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>T. Miller, Explanation in artificial intelligence: Insights from the social sciences, Artifi-
cial Intelligence</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Study on Criteria for Explainable AI for Laypeople</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Thorsten Zylowski</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Karlsruhe University of Applied Sciences (HKA)</institution>
          ,
          <addr-line>Moltkestraße 30, 76133 Karlsruhe</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Hohenheim</institution>
          ,
          <addr-line>Schloss Hohenheim 1, 70599 Stuttgart</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <volume>267</volume>
      <issue>2019</issue>
      <fpage>41</fpage>
      <lpage>58</lpage>
      <abstract>
        <p>Artificial intelligence (AI) is increasingly influencing everyday situations. Humans are becoming more and more dependent on the decisions made by AI. Due to the black-box nature of AI models, the decisions of these models can hardly be understood by AI experts and certainly not by laypeople. This results in a potential trust problem in systems that use AI. Methods from the field of Explainable AI are being used to try to counteract these trust problems. Unfortunately, most of the methods are designed by AI experts for AI experts and do not take into account the specific requirements of laypeople. This paper presents a study on criteria for Explainable AI for laypeople to help design AI systems that meet the needs of laypeople and aim to increase trust. A survey was conducted with 103 participants, exploring diferent areas for the design of Explainable AI for laypeople. These include the importance of explanations, the extensiveness of an explanation, which interactive elements are useful, and the role of AI and human certainty when making decisions with explanations.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;User Study</kwd>
        <kwd>Explainable AI</kwd>
        <kwd>Laypeople</kwd>
        <kwd>Trust</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>inputs to the model on outputs to understand what internal processes have been learned. The set
of these methods is summarized by the term Explainable AI (XAI). There are methods designed
for explanations of specific architectures (e.g. [ 1]), as well as model-agnostic approaches that
can be applied to a variety of models (e.g. [2]). In addition, methods can be categorized into
local methods that explain individual AI decisions or recommendations (e.g. counterfactuals
[3]) and global methods that explain the model as a whole (e.g. model class reliance [4]).
However, the field is still in its infancy, which means that the overwhelming majority of the
methods have been created by AI experts for AI experts. However, if AI experts understand
the system better, it doesn’t mean that laypeople do as well. It follows that there is a great
need for Explainable AI methods that address the requirements of laypeople, helping them to
understand AI decision processes, increase transparency and ideally work as a basis for trust.
In order to design Explainable AI methods that take into account requirements of laypeople, it
is important to systematically collect these requirements. Therefore, a study was conducted to
capture the essential criteria of Explainable AI for laypeople. A key focus was on examining
trust-building aspects of the methods. With the results from this study, Explainable AI methods
can be designed and tested in future work for their suitability for laypersons and hopefully help
to increase trust in AI systems.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        A frequently cited work on what are good explanations for humans is [5]. He presents insights
from many years of social science research for the use in Explainable AI. For him, a good
explanation is, first, contrastive. Humans ask not only why an event occurred, but why it
occurred rather than another event. Second, good explanations are selected. It is impossible to
state the complete causal chain of events, because it is potentially infinite, since there is always
a preceding cause. Humans select explanations from a set of explanations and take them as the
explanation. Irrelevant and already known causes should not be provided. Third, he goes into
further detail, that “probabilities probably don’t matter”. This implies that people may choose
the best explanation based not on probability but on other metrics, such as simplicity and
relevance. Fourth, he states, that explanations are social. Explanations should be tailored to the
person receiving them and take into account the social context. Often explanations are given in
dialogues, iteratively providing more detailed information as needed. As [5] states: “explainer
and explainee may interact and argue about this explanation”. [6] made an extensive literature
review of Explainable AI and categorized it into the five groups (
        <xref ref-type="bibr" rid="ref1">1</xref>
        ) Definition of explanation,
(
        <xref ref-type="bibr" rid="ref2">2</xref>
        ) Goals of explanations (WHY), (
        <xref ref-type="bibr" rid="ref3">3</xref>
        ) Content to include in the explanation (WHAT), (
        <xref ref-type="bibr" rid="ref4">4</xref>
        ) Types
of explanations (HOW) and (5) Evaluation of explanations.
      </p>
      <p>
        They “critically review the previous sections and give insights on new directions to create better
explanations” [6]. They argue that AI systems should provide more than one explanation,
targeting diferent groups of people: (
        <xref ref-type="bibr" rid="ref1">1</xref>
        ) Developers and AI researchers, (
        <xref ref-type="bibr" rid="ref2">2</xref>
        ) Domain experts and
(
        <xref ref-type="bibr" rid="ref3">3</xref>
        ) Lay users.
      </p>
      <p>They analyze how the needs for explainable system proposed by [7] can be brought together
with these groups. While verification and improvement of the system through explainability are
needs of the first two groups, laypersons are mainly concerned with the right to an explanation.
Especially in situations that can afect their lives, people want an AI system to explain its
decision. In addition, they argue, like [5], that explanations should follow cooperative principles
of human conversation. For laypeople, they specifically recommend, based on existing
technical capabilities, to ofer explanations with multiple counterfactuals from which people can
interactively choose the appropriate explanation. They argue that “this explanation is parallel
to human modes and it is very likely to generate trust”. Both [5] and [6] refer to work by [8] and
argue that explanations should follow his maxims of good human conversation, i.e. quality
(only say what you believe), quantity (only say what is required), relation (only say what is
relevant) and manner (say it nicely).</p>
      <p>In addition to these very general findings, concrete opinions of laypeople must be obtained
for a potential implementation. This raises the questions in which situations, to what extent and
with which interaction possibilities explanations must be designed in order to have a benefit for
laypeople. Since the uncertainty of humans and AIs also seems to be an important factor, the
aim is to clarify under which conditions users believe they are following a decision by an AI. The
development of Explainable AI should aim to increase trust in the systems, so it is also important
to ask about factors that promote trust in the opinion of laypeople. On the psychological level,
it is interesting to see how an explanation afects people, which psychological characteristics
of people are addressed and how they feel about a decision. Especially the last aspect could
lead to the realization that explanations are useful even if they do not influence decisions at all.
Namely, when a person, through an explanation, has a positive feeling after a decision. This
study seeks to explore these issues in order to gain insights from a layperson’s perspective and
to provide further directions for research.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Methods</title>
      <sec id="sec-3-1">
        <title>3.1. Study Design</title>
        <p>In this section the study design, the recruiting process and the distribution of participants are
described.</p>
        <p>The study was conducted in the form of an online survey. The selection of questions is based on
questions already posed in literature and findings obtained by others as mentioned in Related
Work. Additional questions are integrated for further exploratory investigation. The full set of
questions can be found on GitHub1. The questions can be divided into six areas.
• Importance of explanations: the focus of this area is on the questions of whether and under
what conditions an explanation is important (e.g. “It is important to me to know how an
AI arrived at a decision or recommendation”).
• Extensiveness of explanations: here, the focus is on the level of detail of the explanations,
e.g. how is the interplay between coarse and detailed explanations as well as whether
global or local explanations are required (e.g. “It is important to me to be able to understand
the explanation of how an AI came to a decision, at least in principle, down to the smallest
detail”).
1https://github.com/ThorstenFooBar/xai-criteria-survey
• Interactivity: are interactive explanations desired and how could they be designed (e.g.</p>
        <p>“For a good explanation, it is important for me to be able to try out what if cases”).
• Certainty of human and AI : what is the role of uncertainty or certainty of both AI and
humans interacting with explanations in decision-making (e.g. “For me, it is particularly
important to recognize how certain an AI is in making a decision”).
• Trust: what factors need to be considered to design trustworthy AI (e.g. “For me, it is
important for trust in an AI that it discloses its internal workings, even if I would not
understand everything”).
• Self-determination: what human characteristics, such as the need for control, are
addressed by Explainable AI (e.g. “When using an AI system that provides comprehensible
explanations for decisions, I would feel more competent in using the AI system.”).
Each area consists of a set of questions measured on a five-item Likert scale, ranging from 1 (I
do not agree at all) to 5 (I fully agree), to systematically capture the participants’ opinions on
the diferent areas. Four textual questions were added to explore in which specific situations
explainability plays a role, which interactive elements are desired, and how an explanation must
be designed to promote trust or harm it.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Participants</title>
        <p>Participants were recruited in German-speaking countries, reaching  = 103 individuals (60.6%
male, 39.4% female). The median of the age of the participants is in the group 35-49 years. 67.3
% of the participants are between 25 and 49 years old. 101 participants are employed. Two are
students. 51% work in the IT sector, 6.7% in education. The remaining participants are spread
across a wide variety of industries (e.g. financial industry, research, culture, medical sector etc.).
At least three-quarters of the participants regularly use apps with AI-supported functions (5.0
median and 4.37 mean value on a five-item Likert scale).</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Results</title>
      <p>In this section the analysis results of the  = 103 responses are presented. For the analysis
of the questions on the five-item Likert scale, descriptive statistics (mean, quartiles, standard
deviation etc.) were determined. Text responses are analyzed with manual clustering and
frequency analysis to extract important characteristics of trustworthy explanations. The impact
of certainty and uncertainty in combination with explanations is tested for significance with
the Mann-Whitney U-test.</p>
      <sec id="sec-4-1">
        <title>4.1. Important Criteria for Explainable AI</title>
        <p>In order to extract the most important criteria for Explainable AI to laypersons, the responses
were sorted by first and second quartile as follows: For strong positive statements, items were
selected whose first quartile was greater than or equal to 4.0. For strong negative statements,
the items were selected whose first quartile is equal to 1.0 and whose second quartile is less
than or equal to 3.0. The statements have been translated from German.
process of an AI.</p>
        <sec id="sec-4-1-1">
          <title>4.1.1. Positive Statements</title>
          <p>The 20 statements shown in Table 1 to Table 6 emerged as strong positive responses, grouped
by the categorization described in Study Design and sorted in descending order by their mean.</p>
          <p>As can be seen in Table 1, explanations in critical and non-critical situations are of great
importance for participants, with critical situations being rated higher. More important than
explanations in non-critical situations, however, is the comprehensibility of the decision-making
Positive statements about importance of explanations with mean value and standard deviation on a</p>
          <p>With regard to the extensiveness of the explanations (Table 2), roughly granular explanations
are desired, which can be examined in more detail if necessary.</p>
          <p>Positive statements about extensiveness of explanations with mean value and standard deviation on a
When using AI in critical situations, it is important to me that the AI can
explain how its recommendations or decisions were arrived at.</p>
          <p>It is important for me to know how an AI has come to a decision or 4.35
4.53
recommendation.</p>
          <p>I would like to know how recommendations on the internet (Netflix, 4.08
Amazon, Spotify, Google search results etc.) came about.</p>
          <p>An explanation shall be given very coarsely at the beginning and pre- 4.21
0.99
sented in more detail when needed.</p>
          <p>On the level of interactivity (Table 3), participants would like to have the opportunity to try
out what-if cases in order to understand an explanation, especially in critical situations. In
addition, there is a desire for the explanation to learn adaptively from user needs and become
increasingly individualized.</p>
          <p>At the level of certainty (Table 4), it is important to the participants that an AI indicates how
certain it is about the decision. Even if the AI is uncertain about the decision, it should provide
an explanation.</p>
          <p>The most important point for trust in explanations and AI (Table 5), according to participants,
is the need for a decision to ultimately always be made by the human. Furthermore, the
explanations should be plausible and the internal working methods transparent in order to
increase trust. The aforementioned what-ifs are also considered very important for trust.</p>
          <p>Participants also assume that explanations will have a positive efect on various areas of
self-determination (Table 6). They believe that they would feel more competent in dealing with
an AI, that they would be motivated to use an AI that provides explanations, and that their
Positive statements about interactivity of explanations with mean value and standard deviation on a
You have to make a dificult decision (you could lose a lot of money, for
4.27
example). You are supported in this decision by an AI. How strongly does
the following statement apply to you: “I would try out a lot of “what if”
cases to come to a decision”.</p>
          <p>For a good explanation, it is important for me to be able to try out “what
if” cases.</p>
          <p>An AI system whose explanations I can adapt to my own needs through
continuous interaction with the system (e.g. through feedback or
natural language dialogue) would be optimal for me.</p>
          <p>Positive statements about certainty and explanations with mean value and standard deviation on a</p>
          <p>Positive statements about trust and explanations with mean value and standard deviation on a five-item
For me, it is important for trust in an AI that, despite a plausible expla- 4.78
nation, I can make the decision myself.</p>
          <p>For me, it is important for trust in an AI that it provides a plausible
explanation for a decision.</p>
          <p>For me, it is important for trust in an AI that it discloses its internal
workings, even if I would not understand everything.</p>
          <p>For trust in an AI, it is important to me to be able to ask the AI “what-if”
questions.
sense of security would increase. They assume that explanations would appeal to their curiosity
and for this reason they would also use explanations in non-critical situations.</p>
        </sec>
        <sec id="sec-4-1-2">
          <title>4.1.2. Negative Statements</title>
          <p>The three statements in Table 7 and Table 8 emerged as strong negative responses, sorted in
ascending order by their mean. Low mean values mean that many people disagree with the
statement.
deviation on a five-item Likert scale.
Positive statements about self-determination theory and explanations with mean value and standard
When using an AI system that provides comprehensible explanations for
decisions, I would feel more competent in using the AI system.</p>
          <p>An AI system that provides comprehensible explanations for decisions
would motivate me to use the AI system.</p>
          <p>An AI system that provides comprehensible explanations for decisions
would give me a feeling of security.</p>
          <p>An AI system that provides comprehensible explanations for decisions
would appeal to my curiosity, so that I would also refer to explanations
in non-critical situations.
4.22
4.20
4.10
4.08</p>
          <p>At the level of importance (Table 7), there is a denial by the participants that an AI should
not provide an explanation if it is uncertain. This result is congruent with the above finding
that an AI should provide an explanation even if it is uncertain. It is also denied that an AI does
not have to provide explanations as long as the results are good. Conversely, this means that an
AI should provide explanations even if it produces good results.
cause I will reject the decision in any case.</p>
          <p>As long as the AI delivers good results, I don’t care how they came about.
2.27</p>
          <p>At the interaction level (Table 8), participants would not be willing to try out very many
what-ifs in non-critical situations.
Negative statements about explanations selected by first quartile equals 1.0 and second quartile lower
than or equal to 3.0 grouped by categories introduced in Study Design</p>
        </sec>
        <sec id="sec-4-1-3">
          <title>Importance</title>
          <p>Negative statements about explanations selected by first quartile equals 1.0 and second quartile lower
than or equal to 3.0 grouped by categories introduced in Study Design</p>
        </sec>
        <sec id="sec-4-1-4">
          <title>Interactivity</title>
          <p>You have to make an easy decision (e.g. choosing a film for the evening’s
1.99
0.97
television). You are supported in this decision by an AI. How strongly
does the following statement apply to you: “I would try a lot of “what
if” cases to come to a decision”.</p>
        </sec>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Textual Feedback on Trustworthy Explanations</title>
        <p>Participants were able to express how explanations for AI systems would have to be designed
in order to be able to trust them. In addition, they were asked how explanations would have to
be designed that could not be trusted. The answers were combined into categories based on the
same content. The categories were defined manually based on the frequency of their mention.
The results were then sorted by frequency of occurrence to get important characteristics. The
most frequent characteristics for trustworthy explanations (with the number of mentions on
y-axis) are shown in Figure 1.</p>
        <p>Participants want transparent, comprehensible and short explanations so that they can trust
them. The decision-making process should be transparent, as well as the data used for the
decision. Logical plausible explanations, to be given in more detail if required, are desired
for trust. Examples should also help, as well as the AI certainty already shown. In addition,
several participants would like to see an indication of sources, although it was not specified
which sources were meant. These could be sources of the data, the algorithms used, the training
pipeline of the models and many more.</p>
        <p>s 20
n
o
i
t
n
em 15
f
o
r
e
b 10
m
u
N
5
21
20
19
11
8
8
7
7
6
6
Traceability</p>
        <p>I confidence
A</p>
        <p>The most frequent characteristics of an explanation that is not trustworthy are shown in
Figure 2. Not surprisingly, reversals of previously positive statements are strongly represented.
These include incomprehensibility, non-traceability and the length of explanations as well as
complicated, implausible and illogical implementations. Furthermore, statements are made here
that attempts at manipulation through explanations would lead to a loss of trust.</p>
        <p>In addition to these quantitative results, it can be seen on a qualitative level that explanations
must be created in a human-centric way. Participants demand that explanations are truthful
and honest and that assertions not simply be made. Information must not be omitted. The AI’s
explanations must be controllable. They should be based on scientific and verified principles.</p>
        <p>oexplanation
N</p>
        <p>plicated
om
C</p>
        <p>Technical term
t plausible
o
N</p>
        <p>s
Longexplanation
They should be neutral and a contact person should be indicated. Manipulation by the
explanations must be excluded. The explanation must not serve any marketing purpose, should not
promote any purchases, must not be augustly nice and must not be promoted by any sponsor.
The participants would like to be able to influence the AI decision and the explanations. It
should be recognizable how the user behavior influences the AI decision. User preferences must
be taken into account and the deletion of data must be possible.</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.3. Textual Feedback on Interactivity</title>
        <p>Participants were asked about the design of interactive elements for explanations. The answers
were combined into categories based on the same content. The categories were defined manually
based on the frequency of their mention. The results were then sorted by frequency of occurrence
to get important characteristics. There were 41 answers given, as this question was not a
mandatory question. The most frequent answers are shown in Figure 3.</p>
        <p>What-if explanations followed by interactive diagrams in general are considered important
interactive components for promoting trust. More specifically, explanations should be able to
be compared with each other and the AI’s decisions should be changeable through interaction.
Context sensitivity includes statements on the adaptivity of the explanations, so that they can
adapt to the user’s needs. There was feedback on the interaction with a specific explanation in
general, i.e. comparing, exploring and rating explanations. In addition, feedback plays a major
role. The AI should be able to be improved through feedback, alternative explanations should
be able to be weighted. It can also be seen that access to historical explanations and decisions is
desired.</p>
        <p>s
n
o
i
t
n
e
m
f
o
r
e
b
m
u
N
6
4
2
6
4
3
3</p>
        <p>2</p>
      </sec>
      <sec id="sec-4-4">
        <title>4.4. Explanations under Uncertainty</title>
        <p>It is reasonable to assume that uncertainty in the context of AI contributes to trust. Either in a
positive or negative way. To understand the impact of uncertainty, in both human and AI, on
decisions in combination with explanations, participants were confronted with several decision
situations and had to indicate to whom they would leave the decision: the human, the AI or
no decision at all. All the decision-making situations were in the form of You have to make a
decision that you are not very sure about. However, you have a tendency. An AI has a diferent
tendency and is 90% sure that this decision will produce a positive result. What do you do?. After
each decision, the participants were asked how they feel about their decision on a five-item
Likert scale with the range 1 (I have a very bad feeling) to 5 (I have a very good feeling). The
cases where both human and AI are uncertain and have the same tendency for the decisions
can be found in Table 9. Cases where the AI is certain and the human is either uncertain or
certain and both parties have diferent tendencies for the decisions are shown in Table 10. The
ifrst three rows of the tables are the case number and the certainty levels of human and AI. In
the fourth row is stated if an explanation for the AI decision is available or not. The next three
rows show the distribution of the participants’ decisions. The last row holds the mean values
and standard deviations of the feelings about the decision made.</p>
        <sec id="sec-4-4-1">
          <title>4.4.1. Human and AI with the same decision tendencies</title>
          <p>In the case where the human and AI have the same tendencies, only the case where both parties
are uncertain is of particular interest, as it is the worst case. As soon as one of the two parties
is more confident in the decision, given the same tendency, the decision itself will only be
strengthened. Table 9 shows these cases, without and with explanation.</p>
          <p>The percentage values are the same for human decision and AI decision as both have the same
tendency. It can be seen that as long as human and AI are both uncertain about the decision,
the human decision is preferred by roughly three-quarters of the participants, regardless of
whether an explanation is given or not. Explanations do not seem to matter in these situations.</p>
        </sec>
        <sec id="sec-4-4-2">
          <title>4.4.2. Human and AI with diferent decision tendencies</title>
          <p>Table 10 presents cases where the AI is certain about its decision. Human certainty varies, as
does the presence of an explanation. The AI and the human have diferent tendencies regarding
the decision.</p>
          <p>It can be seen that the decision to rely on the AI is influenced by the presence of an explanation
(transitions from case 3 to 4 and from case 5 to 6). It is worth noting that 54.4% of the participants
would rely on the AI even if they are confident about their own decision preference and have a
diferent tendency compared to the AI if an explanation is present.</p>
        </sec>
        <sec id="sec-4-4-3">
          <title>4.4.3. Feelings about the decisions</title>
          <p>
            A Mann-Whitney U-test was used to examine whether participants felt significantly better
about their decision with an explanation in place. The U-test was performed on the mean values
between case pairs (
            <xref ref-type="bibr" rid="ref1 ref2">1,2</xref>
            ), (
            <xref ref-type="bibr" rid="ref3 ref4">3,4</xref>
            ), and (5,6) with a significance level  = 0.05. Table 4 shows the
1, 2 and  values of the U-test.
          </p>
          <p>
            It can be seen that the null hypothesis can be rejected for case pairs (
            <xref ref-type="bibr" rid="ref1 ref2">1,2</xref>
            ) and (
            <xref ref-type="bibr" rid="ref3 ref4">3,4</xref>
            ) but not for
(5,6). In the case where the human and AI are both uncertain and have the same tendencies for
the decision, having the AI provide an explanation very likely leads to a more positive feeling
about the decision made. Although, the choice is still the same. The case where the human is
uncertain, but the AI is confident in the decision, and they have diferent tendencies for the
decision, very likely leads to a more positive feeling to choose the AI decision if the AI provides
an explanation. If the human is also confident in his/her decision, choosing the AI decision does
not lead to a more positive feeling. Although, it is becoming more likely that the human will
choose the AI decision.
          </p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Discussion</title>
      <p>The results of the study show that participants demand explanations for AI decisions. This
applies to both critical and non-critical situations. Even when the AI delivers very good results,
explanations are desired. This is also true when the AI is uncertain. It is an interesting question
which human factors play a decisive role in the respective situations. In non-critical situations,
it can be assumed that curiosity is one of the driving factors. In critical situations, it may be the
need for competence as well as security. However, these assumptions still need to be confirmed
empirically in further experiments. If explanations have such a high importance for AI decisions,
it is necessary to explore the exact impact of explanations in critical and non-critical situations
on trust.</p>
      <p>It has been shown that explanations on a coarse-granular level are useful at the beginning
and can be explored in detail only when needed. This raises the question of how much of an
impact this approach has on trust. The exact relationship between coarse-granular and detailed
representation with regard to trust must also be investigated empirically in experiments in the
future. It could be determined that participants imagine that explanations should be examined
intensively in critical situations. However, many critical situations are also time-critical (e.g.,
autonomous driving). In these situations, extensive analyses cannot be performed. It follows
that it is necessary to explore how granular the explanations have to be chosen depending
on the specific situation. Global explanations describing the internal workings of AI may be
important in initial trust building. Continuous explanation of individual decisions probably
contributes to building and maintaining trust. However, these relationships also need to be
confirmed empirically first. One challenge with global explanations and the related expressed
need of participants for a transparent decision-making process is that even if the inner workings
of the AI are shown, humans may still not be able to understand them because AI models work
diferently than human decision-making processes. The question here is how global explanations
can be presented in an appropriately simple way and which of them have a positive impact on
trust.</p>
      <p>Interactive elements are considered an important factor of a good explanation, which may
also be related to the social nature of explanations expressed by [5]. What-if exploration
and interactive diagrams are considered important implementations. Participants would be
willing to try many alternative cases in critical situations. In non-critical situations, a what-if
exploration would not be conducted as intensively because those explorations are more driven
by the curiosity of the participants. This raises the question of how much time people would
actually put into exploration and under what conditions it would be interrupted. In addition, it
remains open how much impact what-if explorations and interactive diagrams have on trust.</p>
      <p>It could be shown that certainty/uncertainty of humans and AI together with explanations
play a role in the choice of a decision. There are situations of certainty and uncertainty in
which explanations positively influence the feeling about a decision, and there are situations
in which explanations lead to a change in the decision toward reliance on AI. If humans rely
on the decision of an AI, this can be seen as an expression of trust. In further experiments,
the insights gained in this work need to be explored in concrete situations. The question is
how much of an impact does the communication of certainty/uncertainty along with providing
an explanation have on trust. In addition to uncertainty considerations, it is important that
people can make the decision themselves. The AI must provide a plausible explanation and
even ofer the possibility to explore explanations and compare them with other, also historical
explanations.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A.</given-names>
            <surname>Shrikumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Greenside</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kundaje</surname>
          </string-name>
          ,
          <article-title>Learning important features through propagating activation diferences</article-title>
          ,
          <source>in: Proceedings of the 34th International Conference on Machine Learning - Volume 70, ICML'17</source>
          , JMLR.org,
          <year>2017</year>
          , p.
          <fpage>3145</fpage>
          -
          <lpage>3153</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M. T.</given-names>
            <surname>Ribeiro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Guestrin</surname>
          </string-name>
          ,
          <article-title>"why should i trust you?" explaining the predictions of any classifier</article-title>
          ,
          <source>in: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining</source>
          ,
          <year>2016</year>
          , pp.
          <fpage>1135</fpage>
          -
          <lpage>1144</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>S.</given-names>
            <surname>Dandl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Molnar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Binder</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Bischl</surname>
          </string-name>
          ,
          <article-title>Multi-objective counterfactual explanations</article-title>
          ,
          <source>in: International Conference on Parallel Problem Solving from Nature</source>
          , Springer,
          <year>2020</year>
          , pp.
          <fpage>448</fpage>
          -
          <lpage>469</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>A.</given-names>
            <surname>Fisher</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Rudin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Dominici</surname>
          </string-name>
          ,
          <article-title>All models are wrong, but many are useful: Learning a variable's importance by studying an entire class of prediction models simultaneously</article-title>
          .,
          <source>J. Mach. Learn. Res</source>
          .
          <volume>20</volume>
          (
          <year>2019</year>
          )
          <fpage>1</fpage>
          -
          <lpage>81</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>