<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Using Ontologies to Enhance Human Understandability of Global Post-hoc Explanations of Black-box Models (Extended Abstract)</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Roberto Confalonieri</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tillman Weyde</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tarek R.Besold</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fermín Moscoso del Prado Martín</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Dept. of Computer Science</institution>
          ,
          <addr-line>City</addr-line>
          ,
          <institution>University of London</institution>
          ,
          <addr-line>GB-EC1V 0HB London</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Faculty of Computer Science, Free University of Bozen-Bolzano</institution>
          ,
          <addr-line>Domenikanerplatz 3, Bolzano-Bozen</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Lingvist Technologies OÜ</institution>
          ,
          <addr-line>Tallinn</addr-line>
          ,
          <country country="EE">Estonia</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Philosophy &amp; Ethics, Faculty of IE/IS, Eindhoven University of Technology</institution>
          ,
          <addr-line>5600 MB Eindhoven</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>This extended abstract overviews the work presented in [1] where an extension of Trepan is proposed. Trepan is a seminal global explanation approach that extracts surrogate decision trees from black-box models. Trepan was extended to take into account explicit knowledge, modeled by means of ontologies, to extract human-understadable explanations. Trepan is a tree induction algorithm that recursively extracts decision trees from black-box classifiers [ 2]. The algorithm is model-agnostic, and it can be applied to explain any black-box classifier (e.g., Multi-Layer Perceptron, Random Forest). Trepan combines the learning of the decision tree with a trained machine learning classifier (the oracle). The proposed extension of the Trepan algorithm, called Trepan Reloaded, uses a modified information gain that, in the creation of split nodes, gives priority to features associated with more general concepts defined in a domain ontology. This was achieved by means of an information content measure defined using the refinement operators [3]. The perceived understandability of the extracted explanations by humans was tested by means of a user study with four diferent tasks. Results were evaluated in terms of response times and correctness, subjective ease of understanding and confidence, and similarity of free text responses. The results showed that decision trees generated with Trepan Reloaded, taking into account domain knowledge, were significantly more understandable than those generated by standard Trepan. The enhanced understandability of post-hoc explanations was achieved with little compromise on the accuracy with which the surrogate decision trees replicate the behaviour of the original neural network models.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>1. Extended Abstract</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>R.</given-names>
            <surname>Confalonieri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Weyde</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. R.</given-names>
            <surname>Besold</surname>
          </string-name>
          ,
          <string-name>
            <surname>F.</surname>
          </string-name>
          <article-title>Moscoso del Prado Martín, Using ontologies to enhance human understandability of global post-hoc explanations of black-box models</article-title>
          ,
          <source>Artificial Intelligence</source>
          <volume>296</volume>
          (
          <year>2021</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M. W.</given-names>
            <surname>Craven</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. W.</given-names>
            <surname>Shavlik</surname>
          </string-name>
          ,
          <article-title>Extracting tree-structured representations of trained networks</article-title>
          ,
          <source>in: NIPS</source>
          <year>1995</year>
          , MIT Press,
          <year>1995</year>
          , pp.
          <fpage>24</fpage>
          -
          <lpage>30</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>N.</given-names>
            <surname>Troquard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Confalonieri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Galliani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Peñaloza</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Porello</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Kutz</surname>
          </string-name>
          , Repairing Ontologies via Axiom Weakening, in: S. A.
          <string-name>
            <surname>McIlraith</surname>
            ,
            <given-names>K. Q.</given-names>
          </string-name>
          <string-name>
            <surname>Weinberger</surname>
          </string-name>
          (Eds.),
          <source>Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence</source>
          ,
          <source>(AAAI-18)</source>
          , New Orleans, Louisiana, USA, February 2-
          <issue>7</issue>
          ,
          <year>2018</year>
          , AAAI Press,
          <year>2018</year>
          , pp.
          <fpage>1981</fpage>
          -
          <lpage>1988</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>