-

Using Ontologies to Enhance Human Understandability of Global Post-hoc Explanations of Black-box Models (Extended Abstract)

Roberto Confalonieri

Tillman Weyde

Tarek R.Besold

Fermín Moscoso del Prado Martín

2 0 Dept. of Computer Science , City , University of London , GB-EC1V 0HB London 1 Faculty of Computer Science, Free University of Bozen-Bolzano , Domenikanerplatz 3, Bolzano-Bozen , Italy 2 Lingvist Technologies OÜ , Tallinn , Estonia 3 Philosophy & Ethics, Faculty of IE/IS, Eindhoven University of Technology , 5600 MB Eindhoven

This extended abstract overviews the work presented in [1] where an extension of Trepan is proposed. Trepan is a seminal global explanation approach that extracts surrogate decision trees from black-box models. Trepan was extended to take into account explicit knowledge, modeled by means of ontologies, to extract human-understadable explanations. Trepan is a tree induction algorithm that recursively extracts decision trees from black-box classifiers [ 2]. The algorithm is model-agnostic, and it can be applied to explain any black-box classifier (e.g., Multi-Layer Perceptron, Random Forest). Trepan combines the learning of the decision tree with a trained machine learning classifier (the oracle). The proposed extension of the Trepan algorithm, called Trepan Reloaded, uses a modified information gain that, in the creation of split nodes, gives priority to features associated with more general concepts defined in a domain ontology. This was achieved by means of an information content measure defined using the refinement operators [3]. The perceived understandability of the extracted explanations by humans was tested by means of a user study with four diferent tasks. Results were evaluated in terms of response times and correctness, subjective ease of understanding and confidence, and similarity of free text responses. The results showed that decision trees generated with Trepan Reloaded, taking into account domain knowledge, were significantly more understandable than those generated by standard Trepan. The enhanced understandability of post-hoc explanations was achieved with little compromise on the accuracy with which the surrogate decision trees replicate the behaviour of the original neural network models.

1. Extended Abstract

[1]

Confalonieri ,

Weyde ,

T. R.

Besold , F. Moscoso del Prado Martín, Using ontologies to enhance human understandability of global post-hoc explanations of black-box models , Artificial Intelligence 296 ( 2021 ).

[2]

M. W.

Craven ,

J. W.

Shavlik , Extracting tree-structured representations of trained networks , in: NIPS 1995 , MIT Press, 1995 , pp. 24 - 30 .

[3]

Troquard ,

Confalonieri ,

Galliani ,

Peñaloza ,

Porello ,

Kutz , Repairing Ontologies via Axiom Weakening, in: S. A. McIlraith , K. Q. Weinberger (Eds.), Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence , (AAAI-18) , New Orleans, Louisiana, USA, February 2- 7 , 2018 , AAAI Press, 2018 , pp. 1981 - 1988 .