<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>The Philosophy of X in XAI</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Neil McDonnell</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Glasgow</institution>
          ,
          <addr-line>University Avenue, Glasgow</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Explanation is a topic in its own right in philosophy, and a topic of newfound interest in AI research given the need in some domains for explainable AI (XAI). This paper traces some of the progress in the philosophical discourse and applies to a realistic application of AI where explanation is required. The aim is to show that philosophy may be of use in the search for the X in XAI.</p>
      </abstract>
      <kwd-group>
        <kwd>1 Explanation</kwd>
        <kwd>AI</kwd>
        <kwd>XAI</kwd>
        <kwd>Philosophy</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Explanation is a central topic in the philosophy
of science and it retains its status in part because
there is, as yet, no consensus on what the
necessary and sufficient conditions are for
counting as an explanation. This makes the
challenge of producing explainable AI (XAI) a
philosophically interesting one. In this short paper
I introduce some toy cases from the philosophy
literature that illustrate the challenge of
accounting for explanation, and draw on recent
philosophical progress to sketch a potential path
forward.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Causal Dependence</title>
      <p>
        In classes around the world the phrase
“correlation is not causation” is drummed into
students, but it remains contentious what the
missing ingredient is that you need to add to
correlation to get genuine causation. David Lewis
[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] offered an answer: dependence. Whereas two
common effects of a cause (say, the bang and the
muzzle flare of a fired gun) correlate with one
another, we know they are not mutual causes. As
Lewis pointed out the flare does not depend on the
bang or vice versa – they both depend on the firing
of the gun. This is what causation has that
correlation does not: dependence.
      </p>
      <p>When we ask “why was there a bang?” one
obvious answer is “because a gun was fired” and
thus we cite a cause in giving an explanation, and
we find the cause by examining the dependency
in the situation. There is a problem though:</p>
      <p>Case 1 – Late Pre-emption: Billy and Suzy are
throwing rocks at a window. Both are accurate,
but Suzy throws harder and her rock reaches the
window first. The window breaks, then Billy’s
rock passes through the empty space.</p>
      <p>This case is a famous counterexample to
Lewis’ dependence thesis. In this case it is
obvious that Suzy is the cause of the window
breaking, but because Billy is there as backup the
window breaking does not depend on Suzy. So,
Suzy is the cause (by common sense) but Suzy is
not a cause (by Lewis’ theory). So much the worse
for Lewis’ theory.</p>
      <p>But there is an obvious comeback that shows a
problem for explanation. The window breaking
rather than not breaking at all did not depend on
Suzy (hence the problem) but the window
breaking exactly like that rather than a fraction of
a second later did depend on Suzy’s throw. This
shows that we can talk about the same event
(window breaking) two different ways and come
to two different conclusions about what it
depended on and hence what causes or explains it.
So explanation is sensitive to the description or
categorization of the consequent event.</p>
      <p>
        A parallel issue afflicts the antecedent (cause)
event. Here is another famous case [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]:
      </p>
      <p>Case 2 – Sophie: Sophie the pigeon is trained
to peck all and only red patches. A scarlet patch is
placed in front of Sophie and she pecks.</p>
      <p>There are three candidate explanations we can
offer for why Sophie pecked:
1.
2.
3.</p>
      <p>Because a scarlet patch was placed.</p>
      <p>Because a red patch was placed.</p>
      <p>Because a coloured patch was placed.</p>
      <p>
        Stephen Yablo [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] introduced this case to show
that we have a strong preference for explanation
2. Explanation 3 is not specific enough because it
might lead someone to think a blue patch could
have worked instead. Explanation 1 is too specific
because it might make you think your crimson
patch would not have triggered a peck. These
misleading implications make 1 and 3 less good
than 2 as an explanation. If you agree with Yablo
on this then it shows that explanation is also
sensitive to how you describe or characterize the
antecedent event, and that it is easy to give a
misleading explanation.
      </p>
      <p>Complicating the case a little more, we can
specify that the lab in which Sophie is being
experimented on only has two types of patches:
scarlet and transparent (colourless). If we know
this additional information it seems that
explanations 1 and 3 are no longer misleading
about crimson or blue patches since they are
already ruled out as viable alternatives. This
external fact about the context seems to change
the quality of an explanation without changing
anything about the specific interaction between
Sophie and the scarlet patch that we are seeking
the explanation about. This shows that the quality
of an explanation can vary with contextual
information about viable alternatives.</p>
    </sec>
    <sec id="sec-3">
      <title>3. A Realistic Problem</title>
      <p>There are a host of other problem cases from
the causation literature that are relevant to the
wider issue of XAI, but these examples illustrate
one strand where recent progress has offered a
potential solution. I will illustrate with the
realistic case of loan viability as assessed by AI.</p>
      <p>
        Loans cannot legally be denied on the basis of
a protected characteristic in (at least) Germany
and the United States [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] and to protect against
abuse of this the candidate loanee is entitled to an
explanation if they are rejected. If an AI is used to
reach that determination in a more efficient way,
then it must be XAI so that the legal requirement
for an explanation is met.
      </p>
      <p>Our Billy and Suzy case shows us one type of
structure that could be a problem. Suppose the
system rejects candidate P and an explanation is
sought. The explanation offered is that P’s
employment contract is too short, but what is not
made clear is that the system would have rejected
P in any case based on P’s ethnicity (due to a
biased historical dataset, let us suppose). The
system is clearly flawed but this explanation
disguises the fact.</p>
      <p>This Sophie scenario also showed us a problem
that emerges in this scenario. The explanation
offered (that P’s contract is too short) implies that
extending the contract will change the verdict. It
won’t in the case as described, and so whilst it
does seem to qualify as an explanation, it is an
incomplete or misleading one that obscures the
problematic reasoning that is waiting in the wings.
It is analogous in a way to the first explanation in
the Sophie case (that a scarlet patch was placed)
since the explanation masks the presence of an
alternative cause, crimson in the Sophie case,
ethnicity in the loan case.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Lessons from Philosophy</title>
      <p>The lesson from these examples is that we
need a better form of explanation. This is where
some recent work in philosophy can help. The
three main insights are that good explanations
often have a contrastive structure, that we care
about what we can intervene upon, and that
robust/stable dependence relations make for better
explanations. I will unpack each briefly.</p>
      <p>
        Contrastive explanations do not seek to
explain just the outcome in isolation (e.g. the
broken window) but to explain the difference
between two states: the window breaking at that
moment rather than slightly later [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Suzy
made that difference but did not make the
difference between it breaking and not breaking at
all. In our other example, ‘placing a scarlet patch
rather than no patch at all’ does explain Sophie’s
peck, but ‘placing a scarlet patch rather than a
crimson one’ does not. Thus, making our
explanation query contrastive in the form “Why X
rather than Y?” is likely to yield a better
explanation. Applied to the loan case, if we ask
“Why was P rejected rather than accepted for the
loan?” the answer cannot just be that the contract
was too short, since a longer contract would not
have brought about the second contrast
(acceptance). This may then flag up the
problematic ethnic profiling that was previously
disguised. It also helps avoid the ambiguity about
what relevant alternatives are viable in the context
(blue?, transparent?) since the alternative is made
explicit in the contrastive target.
      </p>
      <p>
        A related view of explanation from Woodward
[
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] holds that what we care about is what we need
to intervene upon to get the outcome that we
want. To stop the window breaking we need to
intervene on both Suzy and Billy. To get Sophie
to peck we need to ensure some kind of red patch
is placed, but we need not intervene to make it
some specific shade. And in the case of P’s loan
they need to change both their contract and
(absurdly) their ethnicity to get the loan. Thus
making explicit the interventions required to
change the outcome from one outcome to an
explicitly stated contrasting outcome gives a
richer explanation.
      </p>
      <p>
        Finally, it is worth noticing that both the output
of our target process can be graded, and so can the
inputs that yield that output. For example, it might
be the case that an applicant could be offered a
larger or smaller loan, based on better or worse
rates, depending on how risky a prospect the
system takes them to be. Suppose Q applies for a
loan and is accepted at a lower amount and poorer
rates than hoped. A good explanation of this
outcome – why the loan offered was low and
expensive rather than higher and cheaper – will
show Q what variables to intervene on for a better
outcome (reduce outgoings, clear existing debt,
extend contract etc.). It makes a difference
whether Q needs to remove just one of these
barriers, two of them, or all three before getting
the desired outcome and so an even better
explanation will additionally express how robust
the connection between these explainers and the
outcome is. Robustness, or sensitivity as it is
sometimes known [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], is a measure of the range of
counterfactual scenarios where the putative cause
is present and the effect still occurs. A small range
indicates that the relationship is sensitive, a
broader range indicates that it is robust. In general,
citing more robust causes provides a better
explanation as it extends into more scenarios.
      </p>
    </sec>
    <sec id="sec-5">
      <title>5. Lessons from Philosophy</title>
      <p>
        I have here briefly shown the benefits of
Contrastive [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], and Interventionist [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]
approaches to explanation, and introduced the
recent insights about causal robustness [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] from
the philosophical literature. The aim has been to
show the potential for philosophical reasoning
around causation and explanation to inform the
desiderata for what counts as explainable in XAI.
A highly influential figure in these recent debates,
both in philosophy and computer science, is Judea
Pearl [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], and it is to his formalism for
capturing the sorts of counterfactual dependence
relations at the heart of this discussion that I direct
interested practitioners.
      </p>
    </sec>
    <sec id="sec-6">
      <title>6. References</title>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>D.</given-names>
            <surname>Lewis</surname>
          </string-name>
          , Causation,
          <source>Journal of Philosophy</source>
          <volume>70</volume>
          (
          <issue>17</issue>
          ) (
          <year>1973</year>
          )
          <fpage>556</fpage>
          -
          <lpage>567</lpage>
          . doi:
          <volume>10</volume>
          .2307/2025310
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>S.</given-names>
            <surname>Yablo</surname>
          </string-name>
          , Mental Causation,
          <source>Philosophical Review</source>
          <volume>101</volume>
          (
          <issue>2</issue>
          ), (
          <year>1992</year>
          )
          <fpage>245</fpage>
          -
          <lpage>280</lpage>
          . doi:
          <volume>10</volume>
          .2307/2185535
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>N.</given-names>
            <surname>Savage</surname>
          </string-name>
          ,
          <source>Breaking into the black box of artificial intelligence</source>
          ,
          <year>2022</year>
          . URL: https://www.nature.com/articles/d41586- 022-00858-1
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>B. van Frassen</surname>
          </string-name>
          ,
          <source>The Scientific Image</source>
          , Oxford University Press, Oxford,
          <year>1980</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>J.</given-names>
            <surname>Schaffer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Contrastive</given-names>
            <surname>Causation</surname>
          </string-name>
          .
          <source>Philosophical Review</source>
          <volume>114</volume>
          (
          <issue>3</issue>
          ) (
          <year>2005</year>
          )
          <fpage>327</fpage>
          -
          <lpage>358</lpage>
          . doi:
          <volume>10</volume>
          .1215/
          <fpage>00318108</fpage>
          -114-3-327
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>J.</given-names>
            <surname>Woodward</surname>
          </string-name>
          ,
          <source>Making Things Happen: A Theory of Causal Explanation</source>
          , Oxford University Press, Oxford,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>L.</given-names>
            <surname>Fearnley</surname>
          </string-name>
          ,
          <article-title>Moral worth, right reasons and counterfactual motives</article-title>
          ,
          <source>Philosophical Studies</source>
          <volume>179</volume>
          (
          <issue>9</issue>
          ), (
          <year>2022</year>
          )
          <fpage>2869</fpage>
          -
          <lpage>2890</lpage>
          . doi:
          <volume>10</volume>
          .1007/s11098-022-01805-6
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>J.</given-names>
            <surname>Pearl</surname>
          </string-name>
          , Causality, Cambridge University Press,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>J.</given-names>
            <surname>Pearl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Mackenzie</surname>
          </string-name>
          ,
          <source>The Book of Why: The New Science of Cause and Effect</source>
          , Allen Lane,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>