<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>European Workshop on Algorithmic Fairness, June</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>The Explanation Dialogues: Understanding How Legal Experts Reason About XAI Methods</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Laura State</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alejandra Bringas Colmenarejo</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andrea Beretta</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Salvatore Ruggieri</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Franco Turini</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Stephanie Law</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science, University of Pisa</institution>
          ,
          <addr-line>Largo B. Pontecorvo, 3, 56127 Pisa</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>ISTI-CNR</institution>
          ,
          <addr-line>Via G. Moruzzi, 1, 56124 Pisa</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>School of Law, University of Southampton</institution>
          ,
          <addr-line>4 University Rd, Southampton SO17 1BJ</addr-line>
          ,
          <country country="UK">United Kingdom</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Scuola Normale Superiore</institution>
          ,
          <addr-line>Piazza dei Cavalieri, 7, 56126 Pisa</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <volume>0</volume>
      <fpage>7</fpage>
      <lpage>09</lpage>
      <abstract>
        <p>The Explanation Dialogues project is an expert focus study that aims to uncover expectations, reasoning, and rules of legal experts and practitioners towards explainable artificial intelligence (XAI). We examine legal perceptions and disputes that arise in a fictional scenario that resembles a daily life situation - a bank's use of an automated decision-making (ADM) system to decide on credit allocation to individuals. Through this simulation, the study aims to provide insights into the legal value and validity of explanations of ADMs, identify potential gaps and issues that may arise in the context of compliance with European legislation, and provide guidance on how to address these shortcomings.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Explainability</kwd>
        <kwd>AI</kwd>
        <kwd>Automated Decision-Making</kwd>
        <kwd>General Data Protection Regulation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>legal compliance of existing methods? ) Do legal experts understand and trust explanations
for ADM systems, and what are the steps identified to go forward?</p>
      <p>In concrete, we have developed a focus study where legal experts will be questioned about
a constructed, real-case scenario involving a private bank, an ADM system and an internal
consultant. The bank provides explanations about the ADM process used to assess its customers’
creditworthiness and, acting as internal consultants of the bank, the legal experts are expected
to evaluate compliance with the information and explanations concerning the interest and
duties of the bank and the interest and rights of the data subjects.</p>
    </sec>
    <sec id="sec-2">
      <title>1. Legal Background</title>
      <p>
        The project has been developed under the legal basis of the right to not be subject to an automated
decision-making system, as referred to in Article 22 of the GDPR, and the safeguards and
information duties established upon it in the same European Regulation. For the project, we
hold the view that the rights to information and an explanation about automated decision-making
arise from diferent Articles of the GDPR, Article 22(
        <xref ref-type="bibr" rid="ref3">3</xref>
        ), and Articles 13(
        <xref ref-type="bibr" rid="ref2">2</xref>
        )(h), 14(
        <xref ref-type="bibr" rid="ref2">2</xref>
        )(g) and 15(
        <xref ref-type="bibr" rid="ref1">1</xref>
        )(h)
respectively. Firstly, Articles 13(
        <xref ref-type="bibr" rid="ref2">2</xref>
        )(h) and 14(
        <xref ref-type="bibr" rid="ref2">2</xref>
        )(g) of the GDPR establish a duty for the data
controller to provide information to the data subject regarding the existence of automated
decision-making, meaningful information about the logic involved, and the significance and
the envisaged consequences. As a result of this duty, the data subject has an ex-ante right to
information which does not need to be actively exercised. Additionally, Article 15(
        <xref ref-type="bibr" rid="ref1">1</xref>
        )(h) assesses
the rights of the data subject to the access and requirement of information, including information
regarding the existence of automated decision-making, meaningful information about the logic
involved, and the significance and the envisaged consequences. As the exercise of this right
resides in the active exercise of it by the data subject, it is arguable that it can entail ex-ante or
ex-post information about the decision. Given that it is the subject who has to exercise that
right, it is apparent that Article 15(
        <xref ref-type="bibr" rid="ref1">1</xref>
        )(h) covers situations where either no automated decision
afecting the subject has yet been taken or a decision has been taken and the subject sought to
corroborate whether it is an automated or not. In the first case, the information received by
the data subject shall be the same as for Articles 13(
        <xref ref-type="bibr" rid="ref2">2</xref>
        )(h) and 14(
        <xref ref-type="bibr" rid="ref2">2</xref>
        )(g). In the second case, on
the contrary, the information shall be as set out in Article 22 (
        <xref ref-type="bibr" rid="ref3">3</xref>
        ) and its Recital 71. Thus, this
ex-post information about the particular decision shall explain the decision reached after such
assessment to contest the decision and express the point of view of the data subject. Indeed, the
ifrst case will involve a right to information, while the second will entail a right to explanation.
Finally, Article 22 establishes both a right to information and a right to an explanation. The first
will arise from the ex-ante necessity of the data subject to consent to and enter into a contract
with automated decision-making (systems) as referred to in the exceptions of Article 22(
        <xref ref-type="bibr" rid="ref2">2</xref>
        ). The
second will arise from the ex-post efective exercise of the safeguards set out in Article 22(
        <xref ref-type="bibr" rid="ref3">3</xref>
        ),
namely, to express his or her point of view and to contest the decision.
      </p>
      <p>This assessment of the rights to information and an explanation presents the legal motivation
behind our simulation. However, we will refrain from transmitting this view to our project
participants in order to avoid influencing their responses and their approach to the explanations
and information provided. We expect participants to evaluate whether the provided examples
comply with the provisions of the GDPR and to identify the possible gaps or problems, wherefore,
respecting their own rights’ assessment and judgment is of special relevance.</p>
    </sec>
    <sec id="sec-3">
      <title>2. Technical Background</title>
      <p>In the experiment, we present five diferent global and local explanation methods. While global
explanation methods focus on the full model, local methods describe the model behavior only
close to the data instance in focus (here: an application instance). Also, all presented methods
are model-agnostic, i.e. can generally be used to explain any type of underlying ML model [1].</p>
      <p>We present global and local explanations in the form of SHAP values [2], global explanations
as PDPs [3], local explanations in the form of counterfactual data instances computed by DICE
[4], and (counter-)factual rules computed by LORE [5].</p>
      <p>SHapley Additive exPlanations [2]: SHAP values rely on a game-theoretic approach. Local
SHAP values represent contributions of features towards the ADM risk score value for a single
application instance. Global SHAP values are calculated as averages over local SHAP values.
Partial Dependence Plot [3]: PDPs show the marginal efect of a feature towards the ADM
risk score. PDPs are calculated by varying the range of feature values and then computing the
average ADM risk score by keeping that value fixed over a set of application instances.
LOcal Rule-based Explanations [5]: LORE provides if-then rules as explanations. Both
a factual rule (“Why did the model decide that way?”) and a contrastive rule (“What has to
change in which way such that the decision will change?”) is presented.</p>
      <p>DIverse Counterfactual Explanations [4]: DICE provides single contrastive examples in
the form of data points. Such points are computed based on a minimum distance to the original
data instance (the application instance), however, they must receive the opposite decision
(diferent risk score) by the ADM system.</p>
    </sec>
    <sec id="sec-4">
      <title>3. Experiment and Interview Details</title>
      <p>The Explanation Dialogues project is realized via expert interviews where participants play the
role of internal legal consultants requested by a bank to analyse a set of explanations about
automated decisions on the granting and refusal of credit. For the project, participants are
presented with two diferent cases: explanations for customers that applied for credit and were
correctly rejected from the credit (“true positive”), and explanations for customers that applied
for credit and were falsely rejected, i.e. customers that were respectively considered correctly or
incorrectly as "high risk" creditors by the ADM system (“false positive”). For each case scenario,
ifve diferent types of explanations are developed. Questions focus on the analysis of single
explanations, and to compare between diferent methods and cases.</p>
      <p>We decided to rely on an hypothetical case, rather than observing and analyzing the process
followed by a real company to provide information and explanation to its customers, as due to
time and resources constrains the later would require to take under consideration too many
variants and explanations, whether the later allow us to set a quasi-controlled scenario.
Randomization and Design The explanations are randomly sampled per participant - we
sample three out of five (one global, two local). This is to cover diferent explanation methods
between diferent participants (subjects) but avoid an overload per participant. In each of the
two cases, therefore, the same explanation method is presented to the same participant, i.e., if
for a participant PDP explanations were sampled, they are presented in both cases. This is to
compare answers within the same participant. Also, the order of the presentation of the two
cases is random, i.e., whether a participant first learns about the “true positive” or the “false
positive” case. Thus, we use a mix of within- and between-subject design.</p>
      <p>Between the two cases, participants are asked to answer questions to facilitate the comparison
of answers within each case. After the online (written) interview, we will ofer the opportunity
to all participants to engage in follow-up interviews, and aim to clarify and gain more insights
about the answers provided.</p>
      <p>Participants Selection Since Explanation Dialogues seeks to gather the expertise and
knowledge of academics and professionals with reputable and renowned careers in legal matters
and compliance with AI systems, we will contact around thirty participants –including
academics, researchers, and professors- selected through purposive sampling, on the criteria that
they are legal experts on the GDPR of the European Union, particularly on explainability and
interpretability of ADM systems.</p>
      <p>Evaluation The analysis of the responses will be carried out through qualitative and
quantitative methods. We will analyze the open-ended questions of the (written) interview through a
qualitative analysis of the experts’ judgments about the explanations provided, with a focus
on understanding whether they consider the explanations to comply with the law and which
improvements should be made. We will also ask some questions about the general understanding
of the method - but this is not the main focus point of this study. We will conduct a thematic
analysis based on grounded theory (following Glaser and Strauss), from which we intend to
obtain knowledge that cannot be explored with closed questions. On the other hand, we will
use computational tools for summarization, exploration, and visualization in the quantitative
analysis of response data.</p>
    </sec>
    <sec id="sec-5">
      <title>4. Outlook</title>
      <p>It is critical to understand that while our work is closely related to those concerning users’
interactions and experiences towards XAI [6, 7], as well as their perceptions of justice on automated
decisions [8], our main purpose is distinct. While we also care about the understandability of
explanations, our central concern is the assessment of explanations by legal experts. In other
words, the study is set up to investigate how explanations about ADM systems as provided
by XAI tools are perceived and disputed by legal experts and scholars in a fictional scenario
resembling a daily life situation.</p>
      <p>This work is a highly interdisciplinary contribution between the law, the computer sciences,
the social sciences, and the cognitive sciences. Further - by putting forward an expert focus
study based on a fictions use-case in the credit domain and establishing links towards the
European legislative framework - we strongly acknowledge the need for contextual work in the
domain of XAI.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>Work supported by the European Union’s Horizon 2020 research and innovation programme
under Marie Sklodowska-Curie Actions for the project NoBIAS (g.a. No. 860630), and under
the Excellent Science European Research Council (ERC) programme for the XAI project (g.a.
No. 834756). This work reflects only the authors’ views and the European Research Executive
Agency (REA) is not responsible for any use that may be made of the information it contains.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>R.</given-names>
            <surname>Guidotti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Monreale</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ruggieri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Turini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Giannotti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Pedreschi</surname>
          </string-name>
          ,
          <article-title>A survey of methods for explaining black box models</article-title>
          ,
          <source>ACM Comput. Surv</source>
          .
          <volume>51</volume>
          (
          <year>2019</year>
          )
          <volume>93</volume>
          :
          <fpage>1</fpage>
          -
          <lpage>93</lpage>
          :
          <fpage>42</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>S. M.</given-names>
            <surname>Lundberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <article-title>A unified approach to interpreting model predictions</article-title>
          ,
          <source>in: NIPS</source>
          ,
          <year>2017</year>
          , pp.
          <fpage>4765</fpage>
          -
          <lpage>4774</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>C.</given-names>
            <surname>Molnar</surname>
          </string-name>
          ,
          <source>Interpretable Machine Learning</source>
          ,
          <year>2019</year>
          . https://christophm.github.io/ interpretable-ml-book/.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>R. K.</given-names>
            <surname>Mothilal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sharma</surname>
          </string-name>
          ,
          <string-name>
            <surname>C. Tan,</surname>
          </string-name>
          <article-title>Explaining machine learning classifiers through diverse counterfactual explanations</article-title>
          , in: FAT*, ACM,
          <year>2020</year>
          , pp.
          <fpage>607</fpage>
          -
          <lpage>617</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>R.</given-names>
            <surname>Guidotti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Monreale</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ruggieri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Pedreschi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Turini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Giannotti</surname>
          </string-name>
          ,
          <article-title>Local rule-based explanations of black box decision systems</article-title>
          , CoRR abs/
          <year>1805</year>
          .10820 (
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Q. V.</given-names>
            <surname>Liao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. R.</given-names>
            <surname>Varshney</surname>
          </string-name>
          ,
          <article-title>Human-centered explainable AI (XAI): from algorithms to user experiences</article-title>
          ,
          <source>CoRR abs/2110</source>
          .10790 (
          <year>2021</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Rong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Leemann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Nguyen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Fiedler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Seidel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Kasneci</surname>
          </string-name>
          , E. Kasneci,
          <article-title>Towards human-centered explainable AI: user studies for model explanations</article-title>
          ,
          <source>CoRR abs/2210</source>
          .11584 (
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>R.</given-names>
            <surname>Binns</surname>
          </string-name>
          ,
          <string-name>
            <surname>M. Van Kleek</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Veale</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          <string-name>
            <surname>Lyngs</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Zhao</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Shadbolt</surname>
          </string-name>
          , '
          <article-title>it's reducing a human being to a percentage' perceptions of justice in algorithmic decisions</article-title>
          ,
          <source>in: Proceedings of the 2018 Chi conference on human factors in computing systems</source>
          ,
          <year>2018</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>14</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>