<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>From Common Sense Reasoning to Neural Network Models: a Conditional and Multi-preferential Approach for Explainability and Neuro-Symbolic Integration (an Overview)</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Francesco Bartoli</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marco Botta</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Roberto Esposito</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Laura Giordano</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Valentina Gliozzi</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Daniele Theseider Dupré</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>DISIT - Università del Piemonte Orientale</institution>
          ,
          <addr-line>Alessandria</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Dipartimento di Informatica, Università di Torino</institution>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <fpage>66</fpage>
      <lpage>78</lpage>
      <abstract>
        <p>This short paper reports about a line of research exploiting a conditional logic of commonsense reasoning to provide a semantic interpretation to neural network models. A “concept-wise" multi-preferential semantics for conditionals is exploited to build a preferential interpretation of a trained neural network starting from its input-output behavior. The approach is general (model agnostic): it is based on a notion of metric distance to de ne preferences and has been rst proposed for Self-Organising Maps (SOMs). For MultiLayer Perceptrons (MLPs), a deep network can as well be regarded as a (fuzzy) conditional knowledge base (KB), in which the synaptic connections correspond to weighted conditionals. This opens to the possibility of adopting conditional description logics as a basis for neuro-symbolic integration. Proof methods for many-valued weighted conditional KBs have been developed, based on Answer Set Programming and Datalog encodings to deal with the entailment and model-checking problems.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Preferential Description Logics</kwd>
        <kwd>Typicality</kwd>
        <kwd>Neural Networks</kwd>
        <kwd>Explainability</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Preferential approaches to common sense reasoning (e.g., [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]) have their roots in conditional
logics [
        <xref ref-type="bibr" rid="ref2 ref3">2, 3</xref>
        ], and have been recently extended to Description Logics (DLs), to deal with
inheritance with exceptions in ontologies, by allowing non-strict form of inclusions, called defeasible
or typicality inclusions.
      </p>
      <p>
        Di erent preferential semantics [
        <xref ref-type="bibr" rid="ref4 ref5 ref6 ref7">4, 5, 6, 7</xref>
        ] and closure constructions (e.g., [
        <xref ref-type="bibr" rid="ref10 ref8 ref9">8, 9, 10</xref>
        ]) have
been proposed for such defeasible DLs. In this paper, we report about a concept-wise
multipreferential semantics [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], rst introduced as a semantics of ranked knowledge bases in a
description logic (DL) to account for preferences with respect to di erent concepts, and later
extended to weighted conditional knowledge bases and proposed as a semantics for some neural
network models [
        <xref ref-type="bibr" rid="ref12 ref13 ref14">12, 13, 14</xref>
        ].
      </p>
      <p>
        We deal with both an unsupervised model, Self-organising maps (SOMs) [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], which is
considered as a psychologically and biologically plausible neural network model, and a supervised one,
MultiLayer Perceptrons (MLPs) [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. Learning algorithms in the two cases are quite di erent
but our project is to capture in a semantic interpretation the behavior of the network after
training and not to provide a logical characterization of the learning process.
      </p>
      <p>
        In both cases, considering a domain of input stimuli presented to the network e.g., during
training or generalization), a semantic interpretation describing the input-output behavior of the
network can be provided as a multi-preferential interpretation, where preferences are associated
to concepts. For SOMs, the learned categories C1, . . . , Cn are regarded as concepts so that a
preference relation over the domain of input stimuli is associated with each category [
        <xref ref-type="bibr" rid="ref12 ref14">12, 14</xref>
        ].
For MLPs, each unit of interest in the deep network (including hidden units) can be associated
with a concept and with a preference relation on the domain [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ].
      </p>
      <p>The idea is that, for two input stimuli x and y and two categories/concepts, e.g., Horse and
Zebra, the neural model can, for example, assign to x a degree of membership in Horse higher
than the degree of membership of y, so that x can be regarded as more typical than y as a
horse (x &lt;Horse y ), while x could be less typical than y as a zebra (y &lt;Zebra x ). A preferential
interpretation can be built over the domain of input stimuli, and used for checking properties
such as: are the instances of category C1 also instances of C2? Are typical instances of C1
also instances of C2? This veri cation can be done by model-checking on the preferential
interpretation.</p>
      <p>
        For MLPs, the relationship between the logic of commonsense reasoning and deep neural
networks is even stronger, as a deep neural network can itself be regarded as a conditional
knowledge base, i.e., as a set weighted conditionals. This has been achieved by developing a
concept-wise fuzzy multi-preferential semantics for DLs with weighted defeasible inclusions.
Some di erent preferential closure constructions have been considered for weighted knowledge
bases (the coherent [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], faithful [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] and ϕ-coherent [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] multi-preferential semantics), and
their relationships with MLPs have been investigated (see [
        <xref ref-type="bibr" rid="ref13 ref18">13, 18</xref>
        ]).
      </p>
      <p>
        Undecidability results for fuzzy DLs with general inclusion axioms [
        <xref ref-type="bibr" rid="ref19 ref20">19, 20</xref>
        ] have motivated
the investigation of many-valued approximations of fuzzy multi-preferential entailment. The
semantics above have been reconsidered in the nitely many-valued case. In [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] an ASP-based
approach has been exploited for reasoning with weighted conditional KBs under ϕ-coherent
entailment. Datalog with weakly strati ed negation has been used for developing a
modelchecking approach for MLPs, still in the many-valued case [
        <xref ref-type="bibr" rid="ref22 ref23">22, 23</xref>
        ]. Both the entailment and the
model-checking approaches have been experimented in the veri cation of properties of some
trained multilayer feedforward networks and, speci cally, in the veri cation of properties of
neural networks for the recognition of basic emotions.
      </p>
      <p>
        The strong relationships between neural networks and conditional logics of commonsense
reasoning suggest that conditional logics can be used for the veri cation of properties of neural
networks to explain their behavior. The possibility of combining symbolic knowledge with
elicited knowledge in the same formalism is a step towards neuro-symbolic integration, in the
direction of a trustworthy and explainable AI [
        <xref ref-type="bibr" rid="ref24 ref25 ref26">24, 25, 26</xref>
        ].
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. The concept-wise multi-preferential semantics</title>
      <p>
        The concept-wise multi-preferential semantics (cwm-semantics) has been introduced as a
semantics for ranked E L knowledge bases [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], and later extended to weighted knowledge bases
[
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. In both cases the knowledge base contains strict (i.e., standard) inclusions and defeasible
or typicality inclusions T(C) ³ D (meaning “the typical Cs are Ds" or “normally Cs are Ds")
with a rank (resp. a weight). They correspond to KLM conditionals C |∼ D [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Ranks (weights)
of defeasible inclusions represent their strength (plausibility/implausibility). The
preferential semantics of ranked and weighted knowledge bases are de ned in terms of concept-wise
multi-preferential interpretations, based on di erent constructions.
      </p>
      <p>
        Concept-wise multi-preferential interpretations (cwm-interpretations) are de ned by adding
to standard DL interpretations (which are pairs I = ïΔ, ·I ð, where Δ is a domain, and ·I
an interpretation function) the preference relations &lt;C1 , . . . , &lt;Cn associated with a set of
distinguished concepts C1, . . . , Cn, representing the relative typicality of domain individuals
with respect to these concepts. Each preference relation &lt;Ci is a modular and well-founded
strict partial order on Δ. Preferences with respect to di erent concepts do not need to agree,
as we have seen. In the two-valued case, a global preference relation &lt; can be de ned from
the &lt;Ci ’s, and concept T(C) is interpreted as the set of all &lt;-minimal C elements. A simple
notion of global preference &lt; exploits Pareto combination of the preference relations &lt;Ci , but
a more sophisticated global preference, taking into account speci city, has also been considered
[
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. It has been proven therein that global preference in a cwm-interpretation determines a
KLM-style preferential interpretation, and cwm-entailment satis es the KLM postulates of a
preferential consequence relation [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>In the Sections 3 and 4 we will see that, both for SOMs and for MLPs, a multi-preferential
interpretation can be constructed from the input-output behavior of the network over a set of
input stimuli, and can be used for model checking.</p>
    </sec>
    <sec id="sec-3">
      <title>3. A preferential interpretation of Self-Organising Maps</title>
      <p>
        Once a SOM has learned to categorize, the result of the categorization can be seen as a
conceptwise multi-preferential interpretation over a domain of input stimuli, in which a preference
relation is associated with each concept (learned category). The combination of preferences
into a global one (following the approach described above) de nes a KLM-style preferential
model of the SOM. More precisely, once the SOM has learned to categorize, to assess category
generalization, Gliozzi and Plunkett [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ] de ne the map’s disposition to consider a new stimulus
y as a member of a known category C as a function of the distance of y from the map’s
representation of C. The distance d(x, Ci) of a stimulus x from a category Ci can be used to
build a binary preference relation &lt;Ci among the stimuli in Δ with respect to category Ci
[
        <xref ref-type="bibr" rid="ref12 ref14">14, 12</xref>
        ], by letting x &lt;Ci y if and only if d(x, Ci) &gt; d(y, Ci) (x is more typical than y with
respect to category Ci if its distance from category Ci is lower than the relative distance of
y). Based on the assumption that the abstraction process in the SOM is able to identify the
most typical exemplars for a given category, in the semantic representation of a category, some
speci c stimuli (corresponding to the best matching units) are identi ed as the typical exemplars
of the category.
      </p>
      <p>
        A notion of relative distance, introduced by Gliozzi and Plunkett in their similarity-based
account of category generalization based on self-organising maps [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ], is also used for developing
another semantic interpretation of SOMs based on fuzzy DL interpretations. This is done by
interpreting each category (concept) as a function mapping each input stimulus to a value in
[
        <xref ref-type="bibr" rid="ref1">0, 1</xref>
        ], based on the map’s generalization degree of category membership to the stimulus [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ].
      </p>
      <p>
        In both the two-valued and fuzzy case, the preferential model can be exploited to learn or
validate conditional knowledge from empirical data, by verifying conditional formulas over the
preferential interpretation constructed from the SOM. In both cases, model checking can be used
for the veri cation of inclusions (either defeasible inclusions or fuzzy inclusion axioms) over the
respective models of the SOM (for instance, do the most typical penguins belong to the category
Bird with at least a degree of membership 0.8?). Starting from the fuzzy interpretation of the
SOM, a probabilistic interpretation of this neural network model is also provided [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], based on
Zadeh’s probability of fuzzy events [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ], and on Montes et al. [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ] recent characterization of
the continuous t-norms compatible with Zadeh’s probability of fuzzy events.
      </p>
    </sec>
    <sec id="sec-4">
      <title>4. A preferential interpretation of MultiLayer Perceptrons</title>
      <p>
        The input-output behaviour of MLPs can be captured in a similar way as for SOMs by
constructing a preferential interpretation over a domain Δ of input stimuli, e.g., those stimuli considered
during training or generalization [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. Each neuron k of interest for property veri cation can
be associated to a distinguished concept Ck. For each concept Ck, a preference relation &lt;Ck
is de ned over the domain Δ based on the activity values, yk(v), of neuron k for each input
v ∈ Δ. In a similar way, a fuzzy interpretation of the network can be constructed over the
domain Δ, as well as a fuzzy multi-preferential semantics.
      </p>
      <p>
        All the three semantics allow the input-output behavior of the network to be captured by
interpretations built over a set of input stimuli through simple constructions, which exploit the
activity level of neurons for the stimuli. In the fuzzy semantics, the interpretation of a concept
Ck is a mapping CkI : Δ → [
        <xref ref-type="bibr" rid="ref1">0, 1</xref>
        ], associating to each x ∈ Δ the degree of membership of x
in Ck. The activation value yk(x) of neuron k for a stimulus x in the network (assumed to be
in the interval [
        <xref ref-type="bibr" rid="ref1">0, 1</xref>
        ]) is taken to be the degree of membership of x in concept Ch. The fuzzy
interpretation also induces a preference &lt;Ch on Δ.
      </p>
      <p>
        The interpretation of boolean concepts is de ned by fuzzy combination functions, as usual in
fuzzy DLs [
        <xref ref-type="bibr" rid="ref30 ref31">30, 31</xref>
        ]. This also allows a preference relation &lt;C to be associated to any concept
C, and the typical C-elements to be identi ed, provided the interpretation is well-founded (an
assumption which clearly holds when the domain Δ is nite, as in this case). Let us call MfN,Δ
the fuzzy multi-preferential interpretation built from network N over a domain Δ.
      </p>
      <p>As for SOMs, logical properties of the network (including fuzzy typicality inclusions) can
then be veri ed by model checking over such an interpretation. Evaluating properties involving
hidden units might be as well of interest. We refer to the typicality properties considered in the
veri cation examples in Sections 6.1 and 6.2.</p>
      <p>The fuzzy multi-preferential interpretation MfN,Δ described above can be proven to be a
model of the neural network N in a logical sense, by mapping the multilayer network into a
weighted conditional knowledge base. Let us introduce a notion of weighted conditional KB.
5. Weighted conditional knowledge bases and MultiLayer Perceptrons
We introduce the de nition of weighted conditional knowledge bases through an example, and
give some hints about the two-valued and fuzzy multi-preferential semantics.</p>
      <p>
        A weighted ALC knowledge base contains, besides standard inclusion axioms (Tbox T ) and
assertions (Abox A), a set C = {C1, . . . , Ck} of distinguished ALC concepts and, for each Ci,
a set of weighted typicality inclusions of the form T(Ci) ³ Dj , with a positive or negative
weight wi,j (a real number). In the fuzzy case, T and A contain fuzzy axioms [
        <xref ref-type="bibr" rid="ref31">31</xref>
        ].
      </p>
      <p>As an example, a knowledge base with T containing the inclusion Black ⊓ Grey ³ § may
also include the following weighted defeasible inclusions:
(d1) T(Bird ) ³ Fly , +20 (d2) T(Bird ) ³ ∃has_Wings.¦, +50
(d3) T(Bird ) ³ ∃has_Feathers.¦, +50 (d4) T(Penguin) ³ Fly , - 70
(d5) T(Penguin) ³ Black , +50 (d6) T(Penguin) ³ Grey , +10
The meaning is that a bird normally has wings, has feathers and ies, but having wings and
feathers is more plausible than ying, although ying is regarded as being plausible. For a
penguin, ying is not plausible (d4 has a negative weight), and being black is more plausible
than being grey.</p>
      <p>
        In the two-valued case, a semantics for weighted ALC knowledge bases can be de ned with
a semantic closure construction in the spirit of Lehmann’s lexicographic closure [
        <xref ref-type="bibr" rid="ref32">32</xref>
        ], but more
similar to Kern-Isberner’s semantics of c-representations [
        <xref ref-type="bibr" rid="ref33 ref34">33, 34</xref>
        ], in which the world ranks
are generated as a sum of impacts of falsi ed conditionals. Here, the (positive or negative)
weights of the satis ed defaults are summed, but in a concept-wise manner, so to determine
the plausibility of a domain elements with respect to certain concepts. For a domain element x
in Δ, and a distinguished concept Ci, the weight Wi(x) of x wrt Ci is de ned as the sum of
the weights whi of the typicality inclusions T(Ci) ³ Di,h veri ed by x (and is −∞ when x is
not an instance of Ci). From the weights Wi(x) the preference relation f Ci can be de ned by
letting x f Ci y i Wi(x) g Wi(y). The higher the weight of x wrt Ci the higher its typicality
relative to Ci. This closure construction de nes preferences &lt;Ci (strict modular partial orders)
and allows for the de nition of concept-wise multi-preferential interpretations as in Section 2.
      </p>
      <p>In the fuzzy case, the fuzzy logic combination functions are used for complex concepts to
compute the Wi(x)’s and to determine the associated preference relations. Speci cally, let
TCi = {(dih, whi)} be the set of all weighted typicality inclusions dih = T(Ci) ³ Di,h for the
distinguished concept Ci, for each domain element x ∈ Δ, the weight Wi(x) of x wrt Ci in a
fuzzy interpretation I = ïΔ, ·I ð is the sum: Wi(x) = Ph whi DiI,h(x).</p>
      <p>
        To guarantee that the preferences determined from the knowledge base are coherent with
the fuzzy interpretation of concepts, some di erent semantic constructions have been
considered, namely the notions of coherent [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], faithful [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] and ϕ-coherent [
        <xref ref-type="bibr" rid="ref18 ref35">18, 35</xref>
        ] (fuzzy)
multipreferential semantics. Speci cally, for coherent interpretations we require that:
CiI (x) &lt; CiI (y) ⇐⇒ Wi(x) &lt; Wi(y)
      </p>
      <p>
        The notion of ϕ-coherence of a fuzzy interpretation I wrt a KB exploits a function ϕ from R
to the interval [
        <xref ref-type="bibr" rid="ref1">0, 1</xref>
        ], i.e., ϕ : R → [
        <xref ref-type="bibr" rid="ref1">0, 1</xref>
        ]. By slightly generalizing the fuzzy multi-preferential
semantics introduced as a gradual argumentation semantics in [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ], we assume that di erent
functions ϕi are associated to the distinguished concepts Ci.
      </p>
      <p>An interpretation I = ïΔ, ·I ð is ϕ-coherent if, for all concepts Ci ∈ C and x ∈ Δ,
CiI (x) = ϕi(X whi DiI,h(x))
h
(1)
where TCi = {(T(Ci) ³ Di,h, whi)} is the set of weighted conditionals for Ci. A ϕ-coherent
model of knowledge base K, is de ned as a fuzzy interpretation I satisfying TBox T , ABox A
and the ϕ-coherence condition (1).</p>
      <p>
        As usual in preferential semantics, we restrict to canonical models, which are large enough
to contain a domain element for any possible valuation of concepts which is present in some
ϕcoherent model of K. Based on a notion of ϕ-coherent canonical model of a weighted knowledge
base [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ], a notion of ϕ-coherent entailment can be de ned as expected.
      </p>
      <p>
        A mapping of a neural network N to a conditional KB KN can be de ned in a simple way
[
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], associating a concept name Ci with each unit i in the network and by introducing, for each
synaptic connection from neuron h to neuron i with weight wih, a conditional T(Ci) ³ Ch
with weight whi = wih in KN . If we assume that the activation functions ϕi of the units in the
network N return values in the interval [
        <xref ref-type="bibr" rid="ref1">0, 1</xref>
        ], then the solutions of equations (1) characterize the
stationary states of MLPs, where CiI (x) corresponds to the activation of neuron i for some input
stimulus x, each DiI,h(x) corresponds to an input signal xh to neuron i, and Ph whi DiI,h(x)
corresponds to the induced local eld of neuron i [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ].
      </p>
      <p>
        Let us consider the fuzzy multi-preferential interpretation MfN,Δ built from N over a domain
Δ of input stimuli, as described in Section 4, and assume that a concept Ck is introduced
in the language for each unit k. It has been proven [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] that the interpretation MfN,Δ is a
coherent fuzzy multi-preferential model of the knowledge base KN , under some condition on
the activation functions in N . The properties that are entailed from KN are then satis ed by
MfN,Δ, for any choice of the domain Δ.
6. ASP and Datalog for reasoning about neural networks in the
many-valued case: from entailment to model-checking
While a neural network, once trained, is able and fast in classifying the new stimuli (that is,
it is able to do instance checking), other reasoning services such as satis ability, entailment
and model-checking are missing. Such reasoning tasks are useful for validating knowledge that
has been learned, including proving whether the network satis es some (strict or conditional)
properties.
      </p>
      <p>
        Undecidability results for fuzzy DLs with general inclusion axioms [
        <xref ref-type="bibr" rid="ref19 ref20">19, 20</xref>
        ] have motivated
the investigation of many-valued approximations of fuzzy multi-preferential entailment. The
semantics above have been reconsidered in the nitely many-valued case. In [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] an ASP-based
approach has been exploited for reasoning with weighted conditional KBs under ϕ-coherent
entailment. Datalog with weakly strati ed negation has been used for developing a
modelchecking approach for MLPs, still in the many-valued case [
        <xref ref-type="bibr" rid="ref22 ref23">22, 23</xref>
        ]. Both the entailment and the
model-checking approaches have been experimented in the veri cation of properties of some
trained multilayer feedforward networks.
      </p>
      <sec id="sec-4-1">
        <title>6.1. The entailment approach</title>
        <p>Reasoning on weighted KBs associated to neural networks, based on a multi-valued truth
space Cn = {0, n1 , . . . , n−n1 , nn }, for an integer n g 1, requires introducing, for each activation
function ϕ, a function ϕn which approximates ϕ(x) to the nearest value in Cn. A notion of
ϕn-coherence is de ned (the analog of ϕ-coherence in Sec. 5), and the corresponding ϕn-coherent
entailment, i.e., satisfaction in all ϕn-coherent models.</p>
        <p>
          In particular, we consider the entailment of a typicality inclusion such as T(C) ³ D g α from
a weighted knowledge base K in the nitely many-valued Gödel description logic with typicality
GnLCT, introduced in [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ] for the boolean fragment LC of ALC. Such a veri cation can be
formulated as a problem of computing preferred answer sets of an ASP program, considering
a single distinguished domain element auxC , intended to represent a typical C-element, and
selecting, as preferred answer sets, the ones maximizing the membership of auxC in concept C.
Answer sets maximizing the membership of auxC in concept C can be selected with an asprin
preference program, and represent those inputs stimuli associated with typical C-elements. For
all typical C-elements it is veri ed that membership in concept D is greater than α.
        </p>
        <p>
          As a proof of concept, in [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ] the entailment approach has been experimented for the weighted
GnLCT KBs corresponding to two of the trained multilayer feedforward network for the
MONK’s problems ([
          <xref ref-type="bibr" rid="ref36">36</xref>
          ]). The networks have 17 non-independent binary inputs, corresponding
to values of 6 inputs having 2 to 4 possible values; such inputs are features of a robot, e.g., head
shape and body shape being round, square or octagon, and jacket color being red, yellow, green
or blue. The network for problem 1 has 3 hidden units (h1, h2, h3) and an output unit (o); the
one for problem 3 has 2 hidden units.
        </p>
        <p>For example, in the rst problem, the trained network learned to classify inputs
satisfying a formula F 1 ≡ jacket _color _red or head _shape = body _shape which, in terms
of the classes i1 , . . . , i17 corresponding to the binary inputs, is: F1 ≡ i12 ⊔ (i1 ⊓ i4 )
⊔(i2 ⊓ i5 ) ⊔ (i3 ⊓ i6 ).</p>
        <p>For instance, the formula T(o) ³ F 1 g 1 can be veri ed for e.g. n = 5, where o is the
concept name associated with the output unit. That is, the G5LCT knowledge base entails that
the typical o-elements satisfy F 1. Stronger variants of F 1 have also been considered, to check
that the network learned F 1 but not such variants. The following formulae have been veri ed
for hidden nodes h1 , h2 , h3 : T(h1) ³ i12 ⊔ (¬i1 ⊓ ¬i4) g 1, T(h2) ³ i12 ⊔ (¬i3 ⊓ ¬i6) g 1,
T(h3) ³ ¬i12 ⊔ (i2 ⊔ i5) g 1.</p>
      </sec>
      <sec id="sec-4-2">
        <title>6.2. The model-checking approach</title>
        <p>
          Based on the general idea of using model-checking for verifying the properties of a neural
network, as described in Section 4 for MLPs, in [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ] we have developed a Datalog-based approach
which builds a multi-valued preferential interpretation of a trained feedforward network N
and, then, veri es the properties of the network for post-hoc explanation.
        </p>
        <p>
          The Datalog encoding contains a component Π(N , Δ, n) which is intended to build a (single)
many-valued, preferential interpretation with truth degrees in Cn, and a component associated
to the formulae to be checked. We exploited Datalog with weakly strati ed negation. The model
checking approach has been experimented in the veri cation of properties of neural networks
for the recognition of basic emotions using the Facial Action Coding System (FACS) [
          <xref ref-type="bibr" rid="ref37">37</xref>
          ].
        </p>
        <p>
          The RAF-DB [
          <xref ref-type="bibr" rid="ref38">38</xref>
          ] data set contains almost 30000 images labeled with basic emotions or
combinations of two emotions. It was used as input to OpenFace 2.0 [
          <xref ref-type="bibr" rid="ref39">39</xref>
          ], which detects a subset
of the Action Units (AUs) in [
          <xref ref-type="bibr" rid="ref37">37</xref>
          ], i.e., facial muscle contractions. The relations between such
AUs and emotions, studied by psychologists [
          <xref ref-type="bibr" rid="ref40">40</xref>
          ], can be used as a reference for formulae to be
veri ed on neural networks trained to learn such relations.
        </p>
        <p>
          From the original dataset, we selected the subset of the images that were labelled using only
one emotion in the set {suprise, f ear, happiness, anger}. The dataset was highly unbalanced
and we preprocessed the data by subsampling the larger classes and augmenting the minority
ones using standard data-augmentation techniques. The processed dataset contains 5 975 images
(the number of images was 4 283 before augmentation). The images were input to OpenFace
2.0; the output intensities were rescaled in order to make their distribution conformant to the
expected one in case AUs were recognized by humans [
          <xref ref-type="bibr" rid="ref37">37</xref>
          ]. The resulting AUs were used as
input to a neural network trained to classify its input as an instance of the four emotions. The
neural network model we used is a fully connected feed forward neural network with three
hidden layers having 1 800, 1 200, and 600 nodes (all hidden layers use RELU activation functions,
while the softmax function is used in the output layer).
        </p>
        <p>
          The model checking approach was applied, using the Clingo ASP solver as Datalog engine,
taking as set of input stimuli Δ the test set, containing 1194 images, and n = 5 (given that
AU intensities, when assigned by humans, are on a scale of ve values). Table 1 reports some
results for the veri cation of typicality inclusions T(E) ³ F g k/n, with the number of
typical individuals for the emotion E, the number of counterexamples for di erent values of
k, as well as the value of the conditional probabilities p(F/T(E)) of concept F given concept
T(E), based on Zadeh’s probability of fuzzy events [
          <xref ref-type="bibr" rid="ref28">28</xref>
          ]. The approach is the one adopted to
develop a probabilistic interpretation of SOMs after training, starting from a fuzzy interpretation
[
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]. It exploits a recent characterization of the continuous t-norms compatible with Zadeh’s
probability of fuzzy events (PZ -compatible t-norms) by Montes et al. [
          <xref ref-type="bibr" rid="ref29">29</xref>
          ]. To compute the
conditional probabilities, we have assumed a uniform probability distribution over Δ. Note that
also typicality concepts can occur in conditional probabilities.
        </p>
        <p>For example, T(happiness) ³ au12 g 3/5 (where au12 is the activation of the lip corner
puller muscle, that is, smiling) does not hold as it has 1 counterexample out of 255 instances
of T(happiness). The value of P (au12/T(happiness)) is larger than 4/5, even though there
are 35 counterexamples for T(happiness) ³ au12 g 4/5.</p>
      </sec>
      <sec id="sec-4-3">
        <title>6.3. Further considerations</title>
        <p>For the emotion recognition problem in Section 6.2, in the entailment approach the grounding
size of the ASP program only allowed to deal with a network using boolean inputs for the
17 AUs considered, a layer of 8 hidden units, and a single output for deciding membership to
a single emotion. For happiness, in particular, with n=9 (i.e. 10 discrete values) the formula
T (happiness) ³ au6 ⊔ au12 g 1 was found to have 4 counterexamples among the 217
combinations of boolean inputs, 1446 being instances of T (happiness). Interestingly enough,
such 4 combinations do not occur in the data set (indeed, only a small fraction of 217, i.e., 131072
combinations may occur in a few thousand images).</p>
        <p>
          As expected, the model-checking approach outperforms the entailment approach. In fact, the
model checking approach considers a subset of all the possible inputs to the network, and the
veri cation problem is polynomial in time in the size of the domain Δ and in the size of the
formula to be veri ed [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ]. On the other hand, all the possible combinations of the values of all
units (including hidden ones) need to be considered in the entailment-based approach. This
was the reason for limiting the size of the network (and, speci cally, the number of units in
the hidden layers). Note, the entailment approach has been developed for general weighted
conditional knowledge bases, which are not required to be acyclic, while in the experimentation
we have considered feedforward networks.
        </p>
        <p>
          A multilayer network can be seen as a set of weighted defeasible inclusions in a simple
description logic (only including boolean concepts). However, a weighted conditional knowledge
base can be more general. It can be de ned for several DLs including roles (as it has been done,
for instance, for E L [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ] and for ALC [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ]), and it allows for general inclusions axioms and
assertions. The combination of defeasible inclusions with strict (or fuzzy) inclusions and
assertions in a weighted KB allows for the combination of the knowledge acquired from the
network and symbolic knowledge in the same formalism. In the entailment based approach this
has been exploited in several ways, by adding constraints on the possible inputs through ABox
and TBox axioms (e.g., to exclude combinations of input values).
        </p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>7. Conclusions</title>
      <p>Conditional logics of commonsense reasoning can be used for interpreting and verifying the
knowledge learned by a neural network for post-hoc explanation and, for MLPs, a trained
network can itself be seen as a conditional knowledge base.</p>
      <p>
        Much work has been devoted to the combination of neural networks and symbolic reasoning
(e.g., the work by d’Avila Garcez et al. [
        <xref ref-type="bibr" rid="ref41 ref42 ref43">41, 42, 43</xref>
        ] and Setzu et al. [
        <xref ref-type="bibr" rid="ref44">44</xref>
        ]), as well as to the de nition
of new computational models [
        <xref ref-type="bibr" rid="ref45 ref46 ref47 ref48">45, 46, 47, 48</xref>
        ]. The work summarized in this paper opens to
the possibility of adopting conditional logics as a basis for neuro-symbolic integration, e.g.,
learning the weights of a conditional knowledge base from empirical data, and combining the
defeasible inclusions extracted from a neural network with other defeasible or strict inclusions
for inference.
      </p>
      <p>
        Using a multi-preferential logic for the veri cation of typicality properties of a neural network
by model-checking is a general (model agnostic) approach. It can be used for SOMs, as in [
        <xref ref-type="bibr" rid="ref12 ref14">12, 14</xref>
        ],
by exploiting a notion of distance of a stimulus from a category to de ne a preferential structure,
as well as for MLPs, by exploiting units activity to build a fuzzy preferential interpretation.
Given the simplicity of the approach, a similar construction can be adapted to other network
models and learning approaches, and used in applications combining di erent network models
(as in the mentioned experiment to the recognition of basic emotions).
      </p>
      <p>
        Both the model-checking approach and the entailment-based approach are global approaches
(see, e.g., [
        <xref ref-type="bibr" rid="ref44">44</xref>
        ] for the notions of local and global approaches), as they consider the the behavior
of the network over a set Δ of input stimuli. Indeed, the evaluation of typicality inclusions
considers all the individuals in the domain to establish preference relations among them, with
respect to di erent aspects. However, properties of single individuals can as well be veri ed (by
instance checking, in DL terminology).
      </p>
      <p>The model-checking approach does not require to consider the activity of all units, but only
of the units involved in the property to be veri ed. In the entailment-based approach, on the
other hand, all units are considered. This limits its range of applicability to simple networks.</p>
      <p>
        The entailment-based approach is based on the idea of regarding a multilayer network as
weighted conditional knowledge base, and is speci c for this network model. For MLPs, it has
been proven that, in the fuzzy case, the interpretation built for model-checking is indeed a
model of the weighted conditional KB corresponding to the network [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. Whether it is possible
to extend the logical encoding of MLPs as weighted KBs to other neural network models is a
subject for future investigation. The development of a temporal extension of this formalism to
capture the transient behavior of MLPs is also an interesting direction to extend this work.
      </p>
      <p>Acknowledgement: This research is partially supported by Università del Piemonte
Orientale and by INDAM-GNCS Project 2022 “Logiche non-classiche per tool intelligenti ed
explainable".</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>S.</given-names>
            <surname>Kraus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Lehmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Magidor</surname>
          </string-name>
          ,
          <article-title>Nonmonotonic reasoning, preferential models and cumulative logics</article-title>
          ,
          <source>Arti cial Intelligence</source>
          <volume>44</volume>
          (
          <year>1990</year>
          )
          <fpage>167</fpage>
          -
          <lpage>207</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>D.</given-names>
            <surname>Lewis</surname>
          </string-name>
          , Counterfactuals, Basil Blackwell Ltd,
          <year>1973</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>D.</given-names>
            <surname>Nute</surname>
          </string-name>
          , Topics in conditional logic, Reidel, Dordrecht (
          <year>1980</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>L.</given-names>
            <surname>Giordano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Gliozzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Olivetti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. L.</given-names>
            <surname>Pozzato</surname>
          </string-name>
          , Preferential Description Logics,
          <source>in: LPAR</source>
          <year>2007</year>
          , volume
          <volume>4790</volume>
          <source>of LNAI</source>
          , Springer, Yerevan, Armenia,
          <year>2007</year>
          , pp.
          <fpage>257</fpage>
          -
          <lpage>272</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>K.</given-names>
            <surname>Britz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Heidema</surname>
          </string-name>
          , T. Meyer, Semantic preferential subsumption, in: G. Brewka, J. Lang (Eds.),
          <source>KR</source>
          <year>2008</year>
          , AAAI Press, Sidney, Australia,
          <year>2008</year>
          , pp.
          <fpage>476</fpage>
          -
          <lpage>484</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>G.</given-names>
            <surname>Casini</surname>
          </string-name>
          , T. A. Meyer, I. Varzinczak,
          <article-title>Contextual conditional reasoning</article-title>
          ,
          <source>in: AAAI-21</source>
          ,
          <string-name>
            <surname>Virtual</surname>
            <given-names>Event</given-names>
          </string-name>
          ,
          <source>February 2-9</source>
          ,
          <year>2021</year>
          , AAAI Press,
          <year>2021</year>
          , pp.
          <fpage>6254</fpage>
          -
          <lpage>6261</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>L.</given-names>
            <surname>Giordano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Gliozzi</surname>
          </string-name>
          ,
          <article-title>A reconstruction of multipreference closure</article-title>
          ,
          <source>Artif. Intell</source>
          .
          <volume>290</volume>
          (
          <year>2021</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>G.</given-names>
            <surname>Casini</surname>
          </string-name>
          , U. Straccia,
          <article-title>Rational Closure for Defeasible Description Logics</article-title>
          , in: T. Janhunen, I. Niemelä (Eds.),
          <source>JELIA</source>
          <year>2010</year>
          , volume
          <volume>6341</volume>
          <source>of LNCS</source>
          , Springer, Helsinki,
          <year>2010</year>
          , pp.
          <fpage>77</fpage>
          -
          <lpage>90</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>G.</given-names>
            <surname>Casini</surname>
          </string-name>
          , T. Meyer, K. Moodley,
          <string-name>
            <given-names>R.</given-names>
            <surname>Nortje</surname>
          </string-name>
          ,
          <article-title>Relevant closure: A new form of defeasible reasoning for description logics</article-title>
          ,
          <source>in: JELIA</source>
          <year>2014</year>
          , LNCS 8761, Springer,
          <year>2014</year>
          , pp.
          <fpage>92</fpage>
          -
          <lpage>106</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>L.</given-names>
            <surname>Giordano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Gliozzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Olivetti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. L.</given-names>
            <surname>Pozzato</surname>
          </string-name>
          ,
          <article-title>Semantic characterization of rational closure: From propositional logic to description logics</article-title>
          ,
          <source>Art. Int</source>
          .
          <volume>226</volume>
          (
          <year>2015</year>
          )
          <fpage>1</fpage>
          -
          <lpage>33</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>L.</given-names>
            <surname>Giordano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. Theseider</given-names>
            <surname>Dupré</surname>
          </string-name>
          ,
          <article-title>An ASP approach for reasoning in a concept-aware multipreferential lightweight DL</article-title>
          , TPLP
          <volume>10</volume>
          (
          <issue>5</issue>
          ) (
          <year>2020</year>
          )
          <fpage>751</fpage>
          -
          <lpage>766</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>L.</given-names>
            <surname>Giordano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Gliozzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. Theseider</given-names>
            <surname>Dupré</surname>
          </string-name>
          ,
          <article-title>On a plausible concept-wise multipreference semantics and its relations with self-organising maps</article-title>
          , in: F.
          <string-name>
            <surname>Calimeri</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Perri</surname>
          </string-name>
          , E. Zumpano (Eds.),
          <source>CILC</source>
          <year>2020</year>
          ,
          <article-title>Rende</article-title>
          , IT, Oct.
          <volume>13</volume>
          -
          <fpage>15</fpage>
          ,
          <year>2020</year>
          , volume
          <volume>2710</volume>
          <source>of CEUR</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>127</fpage>
          -
          <lpage>140</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>L.</given-names>
            <surname>Giordano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. Theseider</given-names>
            <surname>Dupré</surname>
          </string-name>
          ,
          <article-title>Weighted defeasible knowledge bases and a multipreference semantics for a deep neural network model</article-title>
          ,
          <source>in: Proc. JELIA</source>
          <year>2021</year>
          , May 17-20, volume
          <volume>12678</volume>
          <source>of LNCS</source>
          , Springer,
          <year>2021</year>
          , pp.
          <fpage>225</fpage>
          -
          <lpage>242</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>L.</given-names>
            <surname>Giordano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Gliozzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. T.</given-names>
            <surname>Dupré</surname>
          </string-name>
          ,
          <article-title>A conditional, a fuzzy and a probabilistic interpretation of self-organizing maps</article-title>
          ,
          <source>J. Log. Comput</source>
          .
          <volume>32</volume>
          (
          <year>2022</year>
          )
          <fpage>178</fpage>
          -
          <lpage>205</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>T.</given-names>
            <surname>Kohonen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Schroeder</surname>
          </string-name>
          , T. Huang (Eds.),
          <string-name>
            <surname>Self-Organizing</surname>
            <given-names>Maps</given-names>
          </string-name>
          ,
          <source>Third Edition</source>
          , Springer Series in Information Sciences, Springer,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>S.</given-names>
            <surname>Haykin</surname>
          </string-name>
          ,
          <string-name>
            <surname>Neural Networks - A Comprehensive Foundation</surname>
          </string-name>
          , Pearson,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>L.</given-names>
            <surname>Giordano</surname>
          </string-name>
          ,
          <article-title>On the KLM properties of a fuzzy DL with Typicality</article-title>
          ,
          <source>in: Proc. ECSQARU</source>
          <year>2021</year>
          , Prague, Sept.
          <fpage>21</fpage>
          -
          <lpage>24</lpage>
          ,
          <year>2021</year>
          , volume
          <volume>12897</volume>
          <source>of LNCS</source>
          , Springer,
          <year>2021</year>
          , pp.
          <fpage>557</fpage>
          -
          <lpage>571</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>L.</given-names>
            <surname>Giordano</surname>
          </string-name>
          ,
          <article-title>From weighted conditionals of multilayer perceptrons to a gradual argumentation semantics</article-title>
          ,
          <source>in: 5th Workshop on Advances in Argumentation in Artif. Intell.</source>
          ,
          <year>2021</year>
          , Milan, Italy, Nov.
          <volume>29</volume>
          , volume
          <volume>3086</volume>
          <source>of CEUR Workshop Proc</source>
          .,
          <year>2021</year>
          . URL: http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>3086</volume>
          /paper8.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>M.</given-names>
            <surname>Cerami</surname>
          </string-name>
          , U. Straccia,
          <article-title>On the undecidability of fuzzy description logics with gcis with lukasiewicz t-norm</article-title>
          ,
          <source>CoRR abs/1107</source>
          .4212 (
          <year>2011</year>
          ). URL: http://arxiv.org/abs/1107.4212.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>S.</given-names>
            <surname>Borgwardt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Peñaloza</surname>
          </string-name>
          ,
          <article-title>Undecidability of fuzzy description logics</article-title>
          , in: G. Brewka,
          <string-name>
            <given-names>T.</given-names>
            <surname>Eiter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. A</given-names>
            .
            <surname>McIlraith</surname>
          </string-name>
          (Eds.),
          <source>Proc. KR</source>
          <year>2012</year>
          , Rome, Italy, June 10-14,
          <year>2012</year>
          , AAAI Press,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>L.</given-names>
            <surname>Giordano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. Theseider</given-names>
            <surname>Dupré</surname>
          </string-name>
          ,
          <article-title>An ASP approach for reasoning on neural networks under a nitely many-valued semantics for weighted conditional knowledge bases (</article-title>
          <year>2022</year>
          ). To appear in TPLP, https://doi.org/10.1017/S1471068422000163.
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>F.</given-names>
            <surname>Bartoli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Botta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Esposito</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Giordano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. Theseider</given-names>
            <surname>Dupré</surname>
          </string-name>
          ,
          <article-title>An asp approach for reasoning about the conditional properties of neural networks: an experiment in the recognition of basic emotions</article-title>
          ,
          <source>in: Datalog 2.0 2022: 4th International Workshop on the Resurgence of Datalog in Academia and Industry, September</source>
          <volume>5</volume>
          ,
          <year>2022</year>
          , Genova - Nervi, Italy, volume
          <volume>3203</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2022</year>
          , pp.
          <fpage>54</fpage>
          -
          <lpage>67</lpage>
          . URL: http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>3203</volume>
          /paper4.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>F.</given-names>
            <surname>Bartoli</surname>
          </string-name>
          ,
          <article-title>A Typicality-based Interpretation of Neural Networks: an Experiment on Facial Emotion Recognition</article-title>
          ,
          <source>Master Thesis in Stochastics and Data Science</source>
          , University of Torino,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>A.</given-names>
            <surname>Adadi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Berrada</surname>
          </string-name>
          ,
          <article-title>Peeking inside the black-box: A survey on explainable arti cial intelligence (XAI)</article-title>
          ,
          <source>IEEE Access 6</source>
          (
          <year>2018</year>
          )
          <fpage>52138</fpage>
          -
          <lpage>52160</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>R.</given-names>
            <surname>Guidotti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Monreale</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ruggieri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Turini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Giannotti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Pedreschi</surname>
          </string-name>
          ,
          <article-title>A survey of methods for explaining black box models</article-title>
          ,
          <source>ACM Comput. Surv</source>
          .
          <volume>51</volume>
          (
          <year>2019</year>
          )
          <volume>93</volume>
          :
          <fpage>1</fpage>
          -
          <lpage>93</lpage>
          :
          <fpage>42</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <surname>A. B. Arrieta</surname>
            ,
            <given-names>N. D.</given-names>
          </string-name>
          <string-name>
            <surname>Rodríguez</surname>
            ,
            <given-names>J. D.</given-names>
          </string-name>
          <string-name>
            <surname>Ser</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Bennetot</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Tabik</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Barbado</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>García</surname>
            , S. GilLopez, D. Molina,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Benjamins</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Chatila</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Herrera</surname>
          </string-name>
          ,
          <article-title>Explainable arti cial intelligence (XAI): concepts, taxonomies, opportunities and challenges toward responsible AI, Inf</article-title>
          .
          <source>Fusion</source>
          <volume>58</volume>
          (
          <year>2020</year>
          )
          <fpage>82</fpage>
          -
          <lpage>115</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>V.</given-names>
            <surname>Gliozzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Plunkett</surname>
          </string-name>
          ,
          <article-title>Grounding bayesian accounts of numerosity and variability e ects in a similarity-based framework: the case of self-organising maps</article-title>
          ,
          <source>Journal of Cognitive Psychology</source>
          <volume>31</volume>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>L.</given-names>
            <surname>Zadeh</surname>
          </string-name>
          ,
          <article-title>Probability measures of fuzzy events</article-title>
          ,
          <source>J.Math.Anal.Appl</source>
          <volume>23</volume>
          (
          <year>1968</year>
          )
          <fpage>421</fpage>
          -
          <lpage>427</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>I.</given-names>
            <surname>Montes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Hernández</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Martinetti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Montes</surname>
          </string-name>
          ,
          <article-title>Characterization of continuous t-norms compatible with zadeh's probability of fuzzy events</article-title>
          ,
          <source>Fuzzy Sets Syst</source>
          .
          <volume>228</volume>
          (
          <year>2013</year>
          )
          <fpage>29</fpage>
          -
          <lpage>43</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>G.</given-names>
            <surname>Stoilos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. B.</given-names>
            <surname>Stamou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Tzouvaras</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. Z.</given-names>
            <surname>Pan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I. Horrocks</given-names>
            ,
            <surname>Fuzzy</surname>
          </string-name>
          <string-name>
            <surname>OWL</surname>
          </string-name>
          <article-title>: uncertainty and the semantic web</article-title>
          ,
          <source>in: OWLED*05 Workshop on OWL Galway, Ireland, Nov 11-12</source>
          ,
          <year>2005</year>
          , volume
          <volume>188</volume>
          <source>of CEUR Workshop Proc</source>
          .,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>T.</given-names>
            <surname>Lukasiewicz</surname>
          </string-name>
          , U. Straccia,
          <article-title>Managing uncertainty and vagueness in description logics for the semantic web</article-title>
          ,
          <source>J. Web Semant</source>
          .
          <volume>6</volume>
          (
          <year>2008</year>
          )
          <fpage>291</fpage>
          -
          <lpage>308</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>D. J.</given-names>
            <surname>Lehmann</surname>
          </string-name>
          ,
          <article-title>Another perspective on default reasoning</article-title>
          , Ann. Math. Artif. Intell.
          <volume>15</volume>
          (
          <year>1995</year>
          )
          <fpage>61</fpage>
          -
          <lpage>82</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <string-name>
            <given-names>G.</given-names>
            <surname>Kern-Isberner</surname>
          </string-name>
          ,
          <article-title>Conditionals in Nonmonotonic Reasoning and Belief Revision - Considering Conditionals as Agents</article-title>
          , volume
          <volume>2087</volume>
          <source>of LNCS</source>
          , Springer,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [34]
          <string-name>
            <given-names>G.</given-names>
            <surname>Kern-Isberner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Eichhorn</surname>
          </string-name>
          ,
          <article-title>Structural inference from conditional knowledge bases</article-title>
          ,
          <source>Stud Logica</source>
          <volume>102</volume>
          (
          <year>2014</year>
          )
          <fpage>751</fpage>
          -
          <lpage>769</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [35]
          <string-name>
            <given-names>L.</given-names>
            <surname>Giordano</surname>
          </string-name>
          ,
          <article-title>From weighted conditionals with typicality to a gradual argumentation semantics and back</article-title>
          ,
          <source>in: Proc. 20th International Workshop on Non-Monotonic Reasoning, NMR 2022, Part of FLoC</source>
          <year>2022</year>
          , Haifa, Israel,
          <source>August 7-9</source>
          ,
          <year>2022</year>
          , volume
          <volume>3197</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2022</year>
          , pp.
          <fpage>127</fpage>
          -
          <lpage>138</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          [36]
          <string-name>
            <surname>Thrun</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          et al.,
          <article-title>A Performance Comparison of Di erent Learning Algorithms</article-title>
          ,
          <source>Technical Report CMU-CS-91-197</source>
          , Carnegie Mellon University,
          <year>1991</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          [37]
          <string-name>
            <given-names>P.</given-names>
            <surname>Ekman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Friesen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Hager</surname>
          </string-name>
          , Facial Action Coding System, Research Nexus,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          [38]
          <string-name>
            <given-names>S.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Deng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Du</surname>
          </string-name>
          ,
          <article-title>Reliable crowdsourcing and deep locality-preserving learning for expression recognition in the wild</article-title>
          ,
          <source>in: 2017 IEEE Conference on Computer Vision</source>
          and Pattern Recognition,
          <string-name>
            <surname>CVPR</surname>
          </string-name>
          <year>2017</year>
          ,
          <article-title>Honolulu</article-title>
          ,
          <string-name>
            <surname>HI</surname>
          </string-name>
          , USA, July
          <volume>21</volume>
          -
          <issue>26</issue>
          ,
          <year>2017</year>
          ,
          <year>2017</year>
          , pp.
          <fpage>2584</fpage>
          -
          <lpage>2593</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          [39]
          <string-name>
            <given-names>T.</given-names>
            <surname>Baltrusaitis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Zadeh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y. C.</given-names>
            <surname>Lim</surname>
          </string-name>
          , L. Morency,
          <article-title>Openface 2.0: Facial behavior analysis toolkit</article-title>
          ,
          <source>in: 13th IEEE International Conference on Automatic Face &amp; Gesture Recognition, FG</source>
          <year>2018</year>
          , IEEE Computer Society,
          <year>2018</year>
          , pp.
          <fpage>59</fpage>
          -
          <lpage>66</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          [40]
          <string-name>
            <given-names>B.</given-names>
            <surname>Waller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. C.</given-names>
            <surname>Jr.</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Burrows</surname>
          </string-name>
          ,
          <article-title>Selection for universal facial emotion</article-title>
          ,
          <source>Emotion</source>
          <volume>8</volume>
          (
          <year>2008</year>
          )
          <fpage>435</fpage>
          -
          <lpage>439</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          [41]
          <string-name>
            <surname>A. S. d'Avila Garcez</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Broda</surname>
            ,
            <given-names>D. M.</given-names>
          </string-name>
          <string-name>
            <surname>Gabbay</surname>
          </string-name>
          ,
          <article-title>Symbolic knowledge extraction from trained neural networks: A sound approach</article-title>
          , Artif. Intell.
          <volume>125</volume>
          (
          <year>2001</year>
          )
          <fpage>155</fpage>
          -
          <lpage>207</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref42">
        <mixed-citation>
          [42]
          <string-name>
            <surname>A. S. d'Avila Garcez</surname>
            ,
            <given-names>L. C.</given-names>
          </string-name>
          <string-name>
            <surname>Lamb</surname>
            ,
            <given-names>D. M.</given-names>
          </string-name>
          <string-name>
            <surname>Gabbay</surname>
          </string-name>
          ,
          <string-name>
            <surname>Neural-Symbolic Cognitive</surname>
            <given-names>Reasoning</given-names>
          </string-name>
          , Cognitive Technologies, Springer,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref43">
        <mixed-citation>
          [43]
          <string-name>
            <surname>A. S. d'Avila Garcez</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Gori</surname>
            ,
            <given-names>L. C.</given-names>
          </string-name>
          <string-name>
            <surname>Lamb</surname>
            , L. Sera ni, M. Spranger,
            <given-names>S. N.</given-names>
          </string-name>
          <string-name>
            <surname>Tran</surname>
          </string-name>
          ,
          <article-title>Neuralsymbolic computing: An e ective methodology for principled integration of machine learning and reasoning</article-title>
          ,
          <source>FLAP</source>
          <volume>6</volume>
          (
          <year>2019</year>
          )
          <fpage>611</fpage>
          -
          <lpage>632</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref44">
        <mixed-citation>
          [44]
          <string-name>
            <given-names>M.</given-names>
            <surname>Setzu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Guidotti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Monreale</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Turini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Pedreschi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Giannotti</surname>
          </string-name>
          ,
          <article-title>GlocalX - from local to global explanations of black box AI models</article-title>
          ,
          <source>Artif. Intell</source>
          .
          <volume>294</volume>
          (
          <year>2021</year>
          )
          <article-title>103457</article-title>
          . doi:
          <volume>10</volume>
          .1016/j.artint.
          <year>2021</year>
          .
          <volume>103457</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref45">
        <mixed-citation>
          [45]
          <string-name>
            <given-names>L. C.</given-names>
            <surname>Lamb</surname>
          </string-name>
          , A. S.
          <string-name>
            <surname>d'Avila Garcez</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Gori</surname>
            ,
            <given-names>M. O. R.</given-names>
          </string-name>
          <string-name>
            <surname>Prates</surname>
            ,
            <given-names>P. H. C.</given-names>
          </string-name>
          <string-name>
            <surname>Avelar</surname>
            ,
            <given-names>M. Y.</given-names>
          </string-name>
          <string-name>
            <surname>Vardi</surname>
          </string-name>
          ,
          <article-title>Graph neural networks meet neural-symbolic computing: A survey and perspective</article-title>
          , in: C.
          <string-name>
            <surname>Bessiere</surname>
          </string-name>
          (Ed.),
          <source>Proc. IJCAI</source>
          <year>2020</year>
          ,
          <article-title>ijcai</article-title>
          .org,
          <year>2020</year>
          , pp.
          <fpage>4877</fpage>
          -
          <lpage>4884</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref46">
        <mixed-citation>
          [46]
          <string-name>
            <surname>L.</surname>
          </string-name>
          <article-title>Sera ni, A. S. d'Avila Garcez, Learning and reasoning with logic tensor networks</article-title>
          ,
          <source>in: XVth Int. Conf. of the Italian Association for Arti cial Intelligence</source>
          ,
          <source>AI*IA</source>
          <year>2016</year>
          , Genova, Italy,
          <source>Nov 29 - Dec 1</source>
          , volume
          <volume>10037</volume>
          <source>of LNCS</source>
          , Springer,
          <year>2016</year>
          , pp.
          <fpage>334</fpage>
          -
          <lpage>348</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref47">
        <mixed-citation>
          [47]
          <string-name>
            <given-names>P.</given-names>
            <surname>Hohenecker</surname>
          </string-name>
          , T. Lukasiewicz,
          <article-title>Ontology reasoning with deep neural networks</article-title>
          ,
          <source>J. Artif. Intell. Res</source>
          .
          <volume>68</volume>
          (
          <year>2020</year>
          )
          <fpage>503</fpage>
          -
          <lpage>540</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref48">
        <mixed-citation>
          [48]
          <string-name>
            <given-names>D.</given-names>
            <surname>Le-Phuoc</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Eiter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Le-Tuan</surname>
          </string-name>
          ,
          <article-title>A scalable reasoning and learning approach for neuralsymbolic stream fusion</article-title>
          ,
          <source>in: AAAI</source>
          <year>2021</year>
          ,
          <article-title>February 2-9</article-title>
          , AAAI Press,
          <year>2021</year>
          , pp.
          <fpage>4996</fpage>
          -
          <lpage>5005</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>