<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Model Checking Verification of MultiLayer Perceptrons in Datalog: a Many-valued Approach with Typicality</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Francesco Bartoli</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marco Botta</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Roberto Esposito</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Laura Giordano</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Daniele Theseider Dupré</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>DISIT - Università del Piemonte Orientale</institution>
          ,
          <addr-line>Alessandria</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Dipartimento di Informatica, Università di Torino</institution>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <fpage>54</fpage>
      <lpage>67</lpage>
      <abstract>
        <p>Description logics with typicality have been considered under a “concept-wise” multi-preferential semantics as the basis of a logical interpretation of MultiLayer Perceptrons (MLPs). In this paper we exploit a Datalog-based approach to prove logical properties of a trained network by model checking, starting from its input/output behavior, building a many-valued preferential model for the verification of typicality properties. The model is also used for providing a probabilistic account of MLPs, exploiting typicality concepts and Zadeh's probability of fuzzy events. We report about some experiments to the verification of properties of neural networks for the recognition of basic emotions. This work is a step in the direction of verifying and interpreting knowledge learned by a neural network, to achieve a trustworthy and explainable AI.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Description Logic</kwd>
        <kwd>Typicality</kwd>
        <kwd>Neural Networks</kwd>
        <kwd>Explainability</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Preferential approaches to common sense reasoning [
        <xref ref-type="bibr" rid="ref1 ref2 ref3 ref4 ref5">1, 2, 3, 4, 5</xref>
        ], have been extended to
description logics (DLs) to deal with inheritance with exceptions in ontologies, by allowing
for non-strict inclusions, called typicality or defeasible inclusions, with diferent preferential
semantics, e.g., [
        <xref ref-type="bibr" rid="ref6 ref7">6, 7</xref>
        ] and [
        <xref ref-type="bibr" rid="ref8 ref9">8, 9</xref>
        ], and closure constructions, e.g, [
        <xref ref-type="bibr" rid="ref10 ref11 ref12 ref9">10, 11, 12, 9</xref>
        ].
      </p>
      <p>
        In recent work, a concept-wise multipreference semantics has been proposed [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] as a
semantics for ranked Description Logic (DL) knowledge bases (KBs), i.e. knowledge bases in
which defeasible or typicality inclusions of the form T(C ) ⊑ D (meaning “the typical C ’s
are D’s" or “normally C ’s are D’s"), stemming from KLM conditionals [
        <xref ref-type="bibr" rid="ref1 ref3">1, 3</xref>
        ], are given a rank,
representing their strength, where T is a typicality operator [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], that singles out the typical
instances of concept C.
      </p>
      <p>
        The multi-preferential semantics has been extended [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] to weighted knowledge bases, in
which typicality inclusions have a real (positive or negative) weight, representing plausibility
or implausibility. The semantics has been exploited to provide a preferential interpretation
to Multilayer Perceptrons (MLPs, [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]), an approach previously considered [
        <xref ref-type="bibr" rid="ref16 ref17">16, 17</xref>
        ] for
selforganising maps (SOMs, [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]). In both cases, considering the domain of all input stimuli
presented to the network during training (or in the generalization phase), one can build a
semantic interpretation of the network as a multi-preferential interpretation, where preferences
are associated to concepts. This allows properties of a neural network to be verified by model
checking over a fuzzy preferential interpretation. A MLP can as well be regarded as a weighted
conditional knowledge base [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] (based on a fuzzy concept-wise preferential semantics) by
interpreting synaptic connections as conditional implications. Specifically, the notions of
coherent [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], faithful [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] and φ-coherent [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] models of a weighted ALC knowledge base have
been considered for fuzzy ALC with typicality.
      </p>
      <p>
        In previous work, proof methods for reasoning with weighted conditional KBs have been
studied, in the two-valued case [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] for E L⊥ KBs, and in the finitely many-valued case [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ],
providing an approximation of fuzzy φ-coherent entailment for the boolean fragment. Finitely
many-valued DLs are well-studied in the literature [
        <xref ref-type="bibr" rid="ref23 ref24 ref25 ref26">23, 24, 25, 26</xref>
        ]. In particular, for the boolean
fragment LC of ALC (which does not contain roles, and then neither universal nor existential
restrictions), the finitely many-valued Gödel and Łukasiewicz description logics, GnLC and
ŁnLC, have been extended with a typicality operator, and a semantic closure construction, based
on φn-coherent interpretations, has been introduced to deal with weighted KBs. ASP and asprin
[
        <xref ref-type="bibr" rid="ref27">27</xref>
        ] have been exploited for deciding φn-coherent entailment, a many-valued approximation of
φ-coherence entailment, and the ASP encoding is used to prove that the problem is in Π 2p [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ].
      </p>
      <p>In this paper, we investigate a Datalog-based approach to model checking for verifying the
logical properties of a neural network, by constructing a preferential interpretation of the
network starting from its input/output behavior over a set of input stimuli. We exploit the
activations of units for those stimuli, to define a many-valued interpretation of the concepts
associated with those units (namely, the units of interest for verification).</p>
      <p>
        More specifically, we exploit Datalog with weakly stratified negation [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ] in the construction
of the model of a neural network N over a given domain ∆ of input stimuli. This allows
the verification in polynomial time of typicality properties of the form T(C) ⊑ Dθα , for
θ ∈ {≥ , ≤ , &gt;, &lt;} and α ∈ [
        <xref ref-type="bibr" rid="ref1">0, 1</xref>
        ], as well as to evaluate the conditional probabilities P (D|C),
based on Zadeh’s probability of fuzzy events [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ], also including occurrences of typicality
concepts T(C).
      </p>
      <p>
        We conclude the paper by reporting about some experiments to the verification of properties
of neural networks for the recognition of basic emotions using the Facial Action Coding System
(FACS) [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ].
2. A fuzzy and a finitely many-valued description logic
Fuzzy description logics have been widely studied in the literature for representing vagueness
in DLs [
        <xref ref-type="bibr" rid="ref31 ref32 ref33">31, 32, 33</xref>
        ], based on the idea that concepts and roles can be interpreted as fuzzy sets
and fuzzy relations. In fuzzy logic, formulas have a truth degree from a truth space S, usually
[
        <xref ref-type="bibr" rid="ref1">0, 1</xref>
        ], as in Mathematical Fuzzy Logic [
        <xref ref-type="bibr" rid="ref34">34</xref>
        ] or {0, n1 , . . . , n−n 1 , nn }, for an integer n ≥ 1. The
ifnitely many-valued case is also well studied for DLs [
        <xref ref-type="bibr" rid="ref23 ref24 ref25 ref26">23, 24, 25, 26</xref>
        ]; in the following, we will
also consider a finitely many-valued extension of the boolean fragment of ALC with typicality.
      </p>
      <p>Let LC be the fragment of ALC with no roles, NC be a set of concept names and NI a set of
individual names. The set of LC concepts can be defined inductively as follows: (i) A ∈ NC , ⊤
and ⊥ are concepts; (ii) if C and D are concepts, then C ⊓ D, C ⊔ D, ¬C are concepts.</p>
      <p>
        A fuzzy interpretation for LC is a pair I = ⟨∆ , · I ⟩ where ∆ is a non-empty domain and
· I is fuzzy interpretation function that assigns to each concept name A ∈ NC a function
AI : ∆ → [
        <xref ref-type="bibr" rid="ref1">0, 1</xref>
        ], and to each individual name a ∈ NI an element aI ∈ ∆ . A domain element
x ∈ ∆ belongs to the extension of A to some degree in [
        <xref ref-type="bibr" rid="ref1">0, 1</xref>
        ], i.e., AI is a fuzzy set.
      </p>
      <p>The interpretation function · I is extended to complex concepts as follows:
⊤I (x) = 1
⊥I (x) = 0</p>
      <p>
        (¬C)I (x) = ⊖ CI (x)
(C ⊓ D)I (x) = CI (x) ⊗ DI (x)
(C ⊔ D)I (x) = CI (x) ⊕ DI (x)
where x ∈ ∆ and ⊗ , ⊕ , ▷ and ⊖ are arbitrary but fixed t-norm, s-norm, implication function,
and negation function, chosen among the combination functions of various fuzzy logics (we
refer to [
        <xref ref-type="bibr" rid="ref32">32</xref>
        ] for details). In particular, in Gödel logic a ⊗ b = min{a, b}, a ⊕ b = max{a, b},
a ▷ b = 1 if a ≤ b and b otherwise; ⊖ a = 1 if a = 0 and 0 otherwise. In Łukasiewicz logic,
a ⊗ b = max{a + b − 1, 0}, a ⊕ b = min{a + b, 1}, a ▷ b = min{1 − a + b, 1} and ⊖ a = 1 − a.
      </p>
      <p>The interpretation function · I is also extended to non-fuzzy axioms (i.e., to strict inclusions
and assertions of an LC knowledge base) as follows:
(C ⊑ D)I = infx∈∆ CI (x) ▷ DI (x)</p>
      <p>
        (C(a))I = CI (aI )
A fuzzy LC knowledge base K is a pair (T , A) where T is a fuzzy TBox and A a fuzzy ABox.
A fuzzy TBox is a set of fuzzy concept inclusions of the form C ⊑ D θ α , where C ⊑ D is
an LC concept inclusion axiom, θ ∈ {≥ , ≤ , &gt;, &lt;} and α ∈ [
        <xref ref-type="bibr" rid="ref1">0, 1</xref>
        ]. A fuzzy ABox A is a set of
fuzzy assertions of the form C(a) θα where C is an LC concept, a ∈ NI , θ ∈ {≥ , ≤ , &gt;, &lt;}
and α ∈ [
        <xref ref-type="bibr" rid="ref1">0, 1</xref>
        ]. Following Bobillo and Straccia [
        <xref ref-type="bibr" rid="ref35">35</xref>
        ], we assume that fuzzy interpretations are
witnessed, i.e., the sup and inf are attained at some point of the involved domain.
Definition 1 (Satisfiability and entailment).
axiom E (denoted I |= E), as follows:
- I satisfies a fuzzy LC inclusion axiom C ⊑ D θ α if (C ⊑ D)I θ α ;
- I satisfies a fuzzy LC assertion C(a)θα if CI (aI )θ α ,
for θ ∈ {≥ , ≤ , &gt;, &lt;}.
      </p>
      <p>A fuzzy interpretation I satisfies a fuzzy</p>
      <p>LC
Given a fuzzy KB K = (T , A), a fuzzy interpretation I satisfies T (resp. A) if I satisfies all fuzzy
inclusions in T (resp. all fuzzy assertions in A). A fuzzy interpretation I is a model of K if I
satisfies T and A. A fuzzy axiom E is entailed by a fuzzy knowledge base K, written K |= E, if
for all models I =⟨∆ , · I ⟩ of K, I satisfies E.</p>
      <p>
        In the finitely many-valued case, we assume the truth space to be Cn = {0, n1 , . . . , n−n 1 , nn }, for
an integer n ≥ 1. A finitely many-valued interpretation for LC is a pair I = ⟨∆ , · I ⟩ where: ∆ is
a non-empty domain and · I is an interpretation function that assigns to each a ∈ NI a value
aI ∈ ∆ , and to each A ∈ NC a function AI : ∆ → Cn. In particular, in [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] we have considered
two finitely many-valued cases based on ALC, the finitely many-valued Łukasiewicz description
logic ŁnALC and the finitely many-valued Gödel description logic GnALC, extended with
a standard involutive negation ⊖ a = 1 − a. Such logics are defined along the lines of the
ifnitely many-valued Łukasiewicz description logic SROIQ [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ], the fuzzy extension of the
descrption logic SROIQ that joins Gödel and Zadeh fuzzy logics (called GZ SROIQ) [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ],
and the logic ALC∗ (S) [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ]. In the following we will focus on the LC fragment GnLC of
GnALC. For GnLC the interpretation function · I is extended to complex concepts and fuzzy
axioms as above, and we assume the interpretation of negated concepts exploits involutive
negation, i.e., (¬C)I (x) = ⊖ CI (x) = 1 − CI (x). The notions of knowledge base, satisfiability
and entailment are defined as above.
      </p>
      <sec id="sec-1-1">
        <title>3. Fuzzy LC with typicality and φ-coherent models</title>
        <p>
          Let us consider now fuzzy LC with typicality LCFT, following the approach for ALC in [
          <xref ref-type="bibr" rid="ref14 ref19">14, 19</xref>
          ],
as well as the finite many-valued case. The idea is similar to the extension of ALC with typicality
in the two-valued case [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ], but the degree of membership of domain individuals in a concept C
is used to identify the typical elements of C. The extension allows for the definition of typicality
concepts of the form T(C), corresponding to the set of most typical C-elements.
        </p>
        <p>Note that, in a fuzzy interpretation I = ⟨∆ , · I ⟩, the degree of membership CI (x) of x in a
concept C induces a preference relation &lt;C on ∆ :</p>
        <p>
          x &lt;C y if C I(x) &gt; CI(y)
For a witnessed fuzzy LC interpretation I, each preference relation &lt;C has the properties
of preference relations in KLM-style ranked interpretations [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ], that is, &lt;C is a modular and
well-founded strict partial order. Similarly for a finitely many-valued LC interpretation I. Each
relation &lt;C has the properties of a preference relation in KLM rational interpretations, also
called ranked interpretations. It captures the relative typicality of domain elements wrt concept
C and may then be used to identify the typical C-elements. Let C&gt;I0 = {x ∈ ∆ | CI (x) &gt; 0}.
One can provide a (crisp) interpretation of typicality concepts T(C) in an interpretation I as
follows:
(T(C))I (x) =
︃{ 1
0
if x ∈ min&lt;C (C&gt;I0)
otherwise
(1)
where min&lt;(S) = {u : u ∈ S and ∄z ∈ S s.t. z &lt; u}. When (T(C))I (x) = 1, x is said to be
a typical C-element in I. Let us denote with LCFT the extension of fuzzy LC with typicality,
and with GnLCT the extension of GnLC with typicality.
Definition 2 ( LCFT interpretation). A LCFT interpretation I = ⟨∆ , · I ⟩ is a fuzzy LC
interpretation, extended by interpreting typicality concepts as in (1).
        </p>
        <p>In a similar way, we can define a GnLCT interpretation: a many-valued interpretation I =
⟨∆ , · I ⟩ implicitly defines a multi-preferential interpretation, where any concept C is associated
to a preference relation &lt;C . The notions of model of an LCFT (resp., a GnLCT) KB, and of
LCFT (resp., GnLCT) entailment are defined similarly as for fuzzy LC knowledge bases (see
Section 2).</p>
        <p>
          In [
          <xref ref-type="bibr" rid="ref14 ref19">14, 19</xref>
          ] a notion of weighted ALCFT knowledge base has been considered (and similarly for
the boolean fragment and the many-valued case [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ]), as a tuple ⟨T , TC1 , . . . , TCk , A⟩, where T
is a set of inclusion axioms, A is a set of assertions and TCi = {(dih, whi)} is a set of all weighted
typicality inclusions dih = T(Ci) ⊑ Di,h for Ci, indexed by h, where each inclusion dih has
weight whi, a real number, and Ci and Di,h are LC concepts.
        </p>
        <p>
          Some diferent fuzzy semantics (a coherent [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ], a faithful [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ] and a φ-coherent semantics
[
          <xref ref-type="bibr" rid="ref20">20</xref>
          ]) have been considered for weighted knowledge bases, and have been exploited to provide
a semantic characterization of multilayer perceptrons as weighted knowledge bases. Based
on a notion of φ-coherent entailment and its approximation to the finitely many-valued case,
an ASP based approach has been proposed for the verification of typicality properties of a
weighted conditional knowledge bases [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ] in the logics GnLCT and ŁnLCT. More precisely,
an algorithm for deciding entailment of typicality inclusions T(C) ⊑ D θ α from a weighted
knowledge base in GnLCT (or in ŁnLCT) has been developed by exploiting an ASP encoding
and the asprin framework for answer set preferences [
          <xref ref-type="bibr" rid="ref27">27</xref>
          ].
        </p>
        <p>In this paper, we consider a single interpretation which can be built over the domain of input
stimuli, by exploiting the activity of units, for the diferent inputs. We will see that such a model
can be constructed using Datalog with negation and that the Datalog program can be used for
proving properties of the network by model checking.</p>
      </sec>
      <sec id="sec-1-2">
        <title>4. A fuzzy preferential model of a network N</title>
        <p>
          The idea from [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ] is that a fuzzy multi-preferential interpretation can be associated to a network
N , based on the activity of the network over a set of input stimuli ∆ . Fuzzy and typicality
properties of the network can then be verified by model checking over such an interpretation,
and used for post-hoc explanation.
        </p>
        <p>
          Here, we consider a trained feedforward network N , and associate a concept name Ci ∈ NC
to the units of interest i in N for property verification. They may include input, output or hidden
units. We construct a multi-preference interpretation over a (finite) domain ∆ of input stimuli.
For instance, the input vectors considered for training and/or generalization, or a subset of it.
We assume the activation of units to be in the interval [
          <xref ref-type="bibr" rid="ref1">0, 1</xref>
          ].
        </p>
        <p>
          Assume ∆ is a finite set. Following [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ], we associate to N and ∆ a fuzzy multi-preferential
interpretation as follows.
        </p>
        <p>Definition 3. The fuzzy multi-preferential interpretation of a network N over the domain ∆ ,
is the LCFT interpretation IN∆ = ⟨∆ , · I ⟩ where the interpretation function · I satisfies condition
CkI (x) = yk(x), for all concept names Ck ∈ NC and x ∈ ∆ , where yk(x) is the output signal of
unit k, for input vector x.</p>
        <p>
          As we have seen above, the LCFT interpretation IN∆ is a multi-preferential interpretation, as
the fuzzy interpretation of concepts induces a preference relation associated to each concept,
i.e., to each unit. It has been proven that this interpretation is actually a model of the network
[
          <xref ref-type="bibr" rid="ref14">14</xref>
          ], when the network is regarded as a weighted knowledge base, and under some conditions
on the activation functions of units. It allows the set of typical instances of a concept Ck to be
identified in the obvious way, by selecting the input stimuli x ∈ ∆ with the highest activity
values yk(x), for unit k. For instance, according to the semantics of typicality concepts, the
verification of an inclusion T(Ch) ⊑ D ≥ α over model IN∆ would require to identify typical
Ch-elements and to check whether their membership degree in concept D is greater or equal
than α , according to the choice of the t-norm, s-norm, and negation functions.
        </p>
        <p>In the next section we propose a Datalog-based approach to construct the many-valued
approximation IN∆ ,n of model IN∆ , and to verify concept inclusions, and typicality inclusions,
over such a model.</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>5. Model checking of a neural network in Datalog</title>
      <p>
        We construct a many-valued interpretation IN∆ ,n of a network N over a domain ∆ , by restricting
to the truth space Cn, and approximating values v ∈ [
        <xref ref-type="bibr" rid="ref1">0, 1</xref>
        ] to the nearest value in Cn as follows:
⎧ 0
[v]n = ⎨ ni
⎩ 1
      </p>
      <p>1
iiff v2i−≤ 1 2&lt;n v ≤</p>
      <p>2n
if 2n− 1 &lt; v
2n
2i2+n1 , for 0 &lt; i &lt; n
(2)</p>
      <p>In the following, we will focus on the verification of properties of the network N in the
logic GnLCT, then building a GnLCT interpretation IN∆ ,n. The same approach can be used
for verifying properties of the network in ŁnLCT, with minor diferences in Datalog encoding.
First let us define the interpretation IN∆ ,n.</p>
      <p>Definition 4. The many-valued interpretation IN∆ ,n = ⟨∆ , · I ⟩ of a network N over the domain
∆ , is a GnLCT interpretation such that function · I satisfies, for all concept names Ck ∈ NC and
domain elements x ∈ ∆ , the condition CkI (x) = [yk(x)]n, where yk(x) is the output signal of unit
k, for input vector x.</p>
      <p>The verification that the network satisfies an inclusion of the form C ⊑ D ≥ α , where C and
D are concepts built from the concept names Ci ∈ NC , possibly containing typicality concepts,
can be done by checking whether the inclusion C ⊑ D ≥ α is satisfied in the model IN∆ ,n.
As a special case, one can verify inclusions of the form T(C) ⊑ D ≥ α . Such formulae are
interesting, e.g., in case C is associated to an output unit and D is a boolean combination of
input units, to check whether inputs that are classified as Cs with highest degree, satisfy D
with at least degree α .</p>
      <p>In the following we describe a Datalog encoding of the model checking problem. The encoding
contains a component Π( N , ∆ , n) which describes the interpretation IN∆ ,n, and a component
associated to the formula or the formulae to be checked.</p>
      <p>The program is defined in such a way that its unique stable model, corresponding to the
well-founded model of the program, also corresponds to model IN∆ ,n. The main features of the
program Π( N , ∆ , n) are the following.</p>
      <p>The activation of the relevant units in N for each input stimulus x ∈ ∆ is represented
as follows. Each activation yi(x) is approximated to the nearest value [yi(x)]n in Cn and
transformed to an integer vi = [yi(x)]n × n. Each input stimulus x ∈ ∆ is associated a number
h, and a corresponding constant h in the program. A preprocessing phase will introduce in the
program Π( N , ∆ , n) an atom individual (h, v1 , . . . , vm ) for each input stimulus x in ∆ with
number h, providing the tuple with all the (approximated) activation values for x of the units of
interest (where vi = [yi(x)]n × n).</p>
      <p>The valuation is encoded by a set of atoms of the form inst (x , A, v ), meaning that nv ∈ Cn
is the degree of membership of x in A; val (0 ..n) asserts that 0 ..n are the possible values,
representing Cn. For each concept name Ai associated to a unit of interest, the rule:
inst (X , A′i , Vi )
←
val (Vi ), individual (X , V1 , . . . , Vm ).
where A′i is the constant representing Ai, associates to each input stimulus x a membership
degree Vni ∈ Cn in concept Ai. A rule
ind (X ) ←</p>
      <p>individual (X , V1 , . . . , Vm )
identifies individuals.</p>
      <p>Formulae (concepts and concept inclusions) are represented using, for boolean concepts,
terms such as and(C′, D′) for C ⊓ D, where C′ and D′ represent C and D, and t(C′) for T(C).
Function symbols are used as syntactic sugar, as the grounding of rules is finite.</p>
      <p>
        The valuation is extended to boolean concepts C, and, similarly, to concept inclusions, defining
a predicate eval (C ′, X , V ). As in [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] the definition of the eval predicate depends on the choice
of the combination functions. For example, the rule:
eval (and (A, B ), I , V ) ← ind (I ), conc(and (A, B )),
eval (A, I , V1 ), eval (B , I , V2 ), val (V1 ),
val (V2 ), min(V1 , V2 , V ).
evaluates conjunctions, using a suitably defined min as combination function; conc is used to
make the instantiation of such rules finite, defining formulas of interest, that are the formulas
to be verified and their subformulae.
      </p>
      <p>Typical C-elements and the extension of eval to typicality concepts can be defined using
weakly stratified negation:
typical (X , C ) ← conc(t (C )),
eval (C , X , N ), N ! = 0 ,
hasmaxval (C , N ).
hasmaxval (C , n) ←</p>
      <p>conc(t (C )), someval (C , n).
hasmaxval (C , M ) ← val (M ), M &lt; n,
conc(t (C )), someval (C , M ),
not hasval _geq (C , M + 1 ).
someval (C , M ) ← ind (Y ), conc(t (C )),
eval (C , Y , M ).</p>
      <p>val (M ), conc(t (C )),
hasval _geq (C , M ) ←
someval (C , M ).</p>
      <p>val (M ), conc(t (C )),
hasval _geq (C , M ) ←</p>
      <p>M ! = 0, M &lt; n,
hasval _geq (C , M + 1 ).
eval (t (A), I , n) ← ind (I ), conc(t (A)),</p>
      <p>typical (I , A).
eval (t (A), I , 0 ) ← ind (I ), conc(t (A)),</p>
      <p>not typical (I , A).</p>
      <p>One or more formulae can be verified using, e.g., the following rules, relying on assertions
formula(Name, impl (C ′, D ′), Val ), to verify C ⊑ D ≥ Val , where impl (C ′, D ′) represents
C ⊑ D, and the inclusion is given a (unique) Name; then either ok (Name) or notok (Name)
will be derived, and, in the latter case, notok /2 points out the counterexamples:
conc(C ) ← formula(Name, C , Val ).
notok (X , Fname) ← formula(Fname, F , Val ),</p>
      <p>ind (X ), eval (F , X , V ), V &lt; Val .
notok (Fname) ←
fname(Name) ←
ok (Fname) ←
not notok (Fname).</p>
      <p>formula(Name, F , Val ).
notok (X , Fname).</p>
      <p>fname(Fname),</p>
      <p>
        The soundness and completeness of the Datalog encoding of the model checking problem in
IN∆ ,n, can be proven along the same lines of the one-to-one correspondence between GnLCT
models of a knowledge base and the answer sets of its ASP encoding [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] (Lemma 1 in the
supplementary material). While in [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] the answer sets of the program capture the models of
the conditional knowledge base associated to the network, here program Π( N , ∆ , n) has a
unique weakly perfect model [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ], corresponding to the interpretation IN∆ ,n (as well as a unique
stable model).
      </p>
      <p>
        The size of the Datalog program Π( N , ∆ , n) is linear in |∆ | × | NC | × n), where |∆ | is the
size of the domain of input stimuli considered and |NC | is the number of concepts (units) which
are of interest for the verification. It is easy to prove that the verification of a typicality inclusion
T (C) ⊑ D ≥ α in GnLCT is O(|∆ | × (|C| + |D|) × n).
6. Typicality concepts and the probability of fuzzy events
For the properties T(C) ⊑ D ≥ α , especially in case they do not hold for all stimuli, the
conditional probability of D given T(C), can be evaluated (and then compared with α ) based
on Zadeh’s probability of fuzzy events. In particular, based on a recent characterization of
the continuous t-norms compatible with Zadeh’s probability of fuzzy events (PZ -compatible
t-norms) by Montes et al. [
        <xref ref-type="bibr" rid="ref36">36</xref>
        ], a probabilistic interpretation of SOMs has been provided [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ],
starting from a fuzzy model of SOMs after training. The same approach has been considered as
well for MLPs [
        <xref ref-type="bibr" rid="ref37">37</xref>
        ]. In this section we consider as well typicality concepts, which do not require
a special treatment, except considering their semantics, as for all other concepts.
      </p>
      <p>Assuming a discrete probability distribution p over the domain ∆ of a fuzzy interpretation
I = ⟨∆ , · I ⟩, the probability of the fuzzy set CI , for each DL concept C, can be defined as:
P (CI ) = ∑︁d∈∆ CI (d) p(d). Let us consider the specific interpretation I = IN∆ built from the
trained network N over a set of input stimuli ∆ . In the following we will simply write P (C),
rather than P (CI ).</p>
      <p>
        Following Smets [
        <xref ref-type="bibr" rid="ref38">38</xref>
        ], we let the conditional probability of a fuzzy event C given the fuzzy
event D be P (C | D) = P (D ⊓ C)/P (D) (provided P (D) &gt; 0). As observed by Dubois
and Prade [
        <xref ref-type="bibr" rid="ref39">39</xref>
        ], this generalizes both conditional probability and the fuzzy inclusion index
advocated by Kosko [
        <xref ref-type="bibr" rid="ref40">40</xref>
        ]. Specifically, under the assumption that the probability distribution
p is uniform over the set ∆ of input stimuli, then P (D|C) = M ((D ⊓ C)I )/M (CI ), where
M (CI ) = ∑︁x∈∆ CI (x) is the size of the fuzzy event CI in the interpretation I.
      </p>
      <p>
        Note that, for a concept name Ck ∈ NC , associated to a unit k, and a domain element x ∈ ∆ ,
it holds that P (Ck|{x}) = Ck(x) [
        <xref ref-type="bibr" rid="ref17 ref37">17, 37</xref>
        ] (where {x} stands for the crisp concept containing
only x), which can be interpreted as a subjective probability that x is an instance of Ck [
        <xref ref-type="bibr" rid="ref41">41</xref>
        ],
i.e., the degree of belief that x is an instance of concept Ck.
      </p>
      <p>Computing conditional probabilities requires computing the size of the involved fuzzy sets.
We have extended our rule based approach to compute conditional probabilities P (D|T(C))
over the many-valued approximation IN∆ ,n of model IN∆ , where D and C may be boolean
concepts. In this case, the size of (T(C))I coincides with the number of typical C elements, and
the size of (D ⊓ T(C))I can be computed as ∑︁x∈(T(C))I DI (x) . This allows for the verification
of conditional constraints of the form P (D|T(C))θα over the model IN∆ ,n. The computation
can be performed using aggregates as follows:
numtyp(N , C ) :– conc(t (C )),</p>
      <p>N = #count {X : typical (X , C )}.
fuzzysetcondprob(Name, P ) :–
formula(Name, impl (t (C ), D ), K ),
numtyp(N , C ),
W = #sum{V , X : val (V ), ind (X ),</p>
      <p>typical (X , C ), eval (D , X , V )},</p>
      <p>
        P = (k ∗ W )/N.
where k is, e.g., 1000, to get the decimal part of the result in 3 digits. This use of aggregates
satisfies weak stratification restrictions [
        <xref ref-type="bibr" rid="ref42">42</xref>
        ] and the program has a unique stable model.
      </p>
      <p>We report the results of an experimentation in the next section.</p>
    </sec>
    <sec id="sec-3">
      <title>7. Recognizing basic emotions: an experimentation</title>
      <p>
        In this section, we report about experiments on the verification of properties of neural networks
for the recognition of basic emotions using the Facial Action Coding System (FACS) [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ].
      </p>
      <p>
        The RAF-DB [
        <xref ref-type="bibr" rid="ref43">43</xref>
        ] data set contains almost 30000 images labeled with basic emotions or
combinations of two emotions. The data set was used as input to OpenFace 2.0 [
        <xref ref-type="bibr" rid="ref44">44</xref>
        ], which
detects a subset of the Action Units (AUs) in [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ], i.e., facial muscle contractions. The relations
between such AUs and emotions, studied by psychologists [
        <xref ref-type="bibr" rid="ref45">45</xref>
        ], can be used as a reference for
formulae to be verified on neural networks trained to learn such relations.
      </p>
      <p>
        From the original dataset, we selected the subset of the images that were labelled with only
one emotion in the set { suprise, fear, happiness, anger }. The dataset is highly unbalanced
and this can afect the training of the neural network model; then we preprocessed the data
by subsampling the larger classes and augmenting the minority ones using standard
dataaugmentation techniques (e.g., rotations, flipping, etc.). The processed dataset contains 5 975
images (the number of images was 4 283 before augmentation). The images were input to
OpenFace 2.0; the output intensities were rescaled in order to make their distribution conformant
to the expected one in case AUs were recognized by humans [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ]. The resulting AUs were used
as input to a neural network trained to classify its input as an instance of the four emotions.
The neural network model we used is a fully-connected feed-forward neural network with
three hidden layers having 1 800, 1 200, and 600 nodes. All hidden layers use RELU activation
functions, while the softmax function is used in the output layer. The network was trained using
the Adam [
        <xref ref-type="bibr" rid="ref46">46</xref>
        ] optimizer with an initial learning rate η set to 0.003, and parameters β 1 = 0.895,
β 2 = 0.99, and ϵ = 10− 7. The network has been trained for 150 epochs with a batch size of 128
examples. All hyper-parameters have been tuned on a separated validation set.
      </p>
      <p>
        The model checking approach in section 5 was applied, using the Clingo ASP solver as Datalog
engine, taking, as set ∆ of input stimuli, the test set used in the learning phase, containing 1194
images, and n = 5 (given that AU intensities, when assigned by humans, are on a scale of five
values). Formulae of the form T(E) ⊑ F ≥ k/5 were checked, where E is an emotion and F
is a combination of AUs, using table 1 in [
        <xref ref-type="bibr" rid="ref45">45</xref>
        ] as a reference. Table 1 reports the results, with
the number of typical individuals for the emotion, the number of counterexamples for diferent
values of k, and the value of P (F |T(E)).
      </p>
      <p>For example, the formula T(happiness) ⊑ au1 ⊔ au6 ⊔ au12 ⊔ au14 ≥ 3/5 holds for
all individuals, as well as T(happiness) ⊑ au12 ≥ 2/5, while T(happiness) ⊑ au12 ≥ 3/5
(where au12 is the activation of the lip corner puller muscle, that is, smiling) has 1
counterexample out of 255 instances of T(happiness). The value of P (au12/T(happiness)) is larger
than 4/5, even though there are 35 counterexamples for T(happiness) ⊑ au12 ≥ 4/5.</p>
    </sec>
    <sec id="sec-4">
      <title>8. Conclusions</title>
      <p>In this paper we have described a Datalog approach to evaluate properties of trained MLPs by
model checking. The approach exploits a finitely many-valued approximation of a semantics
for fuzzy description logics with typicality.</p>
      <p>
        As a proof of concept, the proposed approach has been experimented for checking properties
of a trained neural network for the recognition of basic emotions using the Facial Action Coding
System (FACS) [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ].
      </p>
      <p>This work is a step in the direction of verifying and interpreting knowledge learned by a
neural network, in order to achieve a trustworthy and explainable AI. In the case study, there
were expectations, to be verified, on the input-output relation (emotions and AUs); in other
cases, less knowledge could be available in advance, so that the results could turn out to be
more useful, even though more dificult to find.</p>
      <p>Interpreting knowledge learned by a neural network in a logical form also opens the possibility
of combining empirical knowledge with elicited knowledge, e.g., in the form of strict inclusions
and definitions.</p>
      <p>
        We refer to the surveys by Garcez et al. [
        <xref ref-type="bibr" rid="ref47">47</xref>
        ] and by Guidotti et al. [
        <xref ref-type="bibr" rid="ref48">48</xref>
        ] for an outline of current
directions on the explanation of neural models and on the combination of neural networks and
symbolic reasoning.
      </p>
      <p>
        For future work, it would be interesting to investigate whether Fuzzy answer set programming
(FASP) via satisfiability modulo theories (SMT) [
        <xref ref-type="bibr" rid="ref49">49</xref>
        ], can be used for MLPs property verification.
      </p>
      <p>Acknowledgement: This research is partially supported by INDAM-GNCS Project 2022,
“Logiche non-classiche per tool intelligenti ed explainable".</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>S.</given-names>
            <surname>Kraus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Lehmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Magidor</surname>
          </string-name>
          ,
          <article-title>Nonmonotonic reasoning, preferential models and cumulative logics</article-title>
          ,
          <source>Artificial Intelligence</source>
          <volume>44</volume>
          (
          <year>1990</year>
          )
          <fpage>167</fpage>
          -
          <lpage>207</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>J.</given-names>
            <surname>Pearl</surname>
          </string-name>
          , System
          <string-name>
            <surname>Z</surname>
          </string-name>
          :
          <article-title>A natural ordering of defaults with tractable applications to nonmonotonic reasoning</article-title>
          , in: TARK'
          <fpage>90</fpage>
          ,
          <string-name>
            <surname>Pacific</surname>
            <given-names>Grove</given-names>
          </string-name>
          , CA, USA,
          <year>1990</year>
          , pp.
          <fpage>121</fpage>
          -
          <lpage>135</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>D.</given-names>
            <surname>Lehmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Magidor</surname>
          </string-name>
          ,
          <article-title>What does a conditional knowledge base entail?</article-title>
          ,
          <source>Artificial Intelligence</source>
          <volume>55</volume>
          (
          <year>1992</year>
          )
          <fpage>1</fpage>
          -
          <lpage>60</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>S.</given-names>
            <surname>Benferhat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Cayrol</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Dubois</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Prade</surname>
          </string-name>
          ,
          <article-title>Inconsistency management and prioritized syntax-based entailment</article-title>
          ,
          <source>in: Proc. IJCAI'93</source>
          ,
          <string-name>
            <surname>Chambéry</surname>
          </string-name>
          „
          <year>1993</year>
          , pp.
          <fpage>640</fpage>
          -
          <lpage>647</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>D. J.</given-names>
            <surname>Lehmann</surname>
          </string-name>
          ,
          <article-title>Another perspective on default reasoning</article-title>
          , Ann. Math. Artif. Intell.
          <volume>15</volume>
          (
          <year>1995</year>
          )
          <fpage>61</fpage>
          -
          <lpage>82</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>L.</given-names>
            <surname>Giordano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Gliozzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Olivetti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. L.</given-names>
            <surname>Pozzato</surname>
          </string-name>
          , Preferential Description Logics,
          <source>in: LPAR</source>
          <year>2007</year>
          , volume
          <volume>4790</volume>
          <source>of LNAI</source>
          , Springer, Yerevan, Armenia,
          <year>2007</year>
          , pp.
          <fpage>257</fpage>
          -
          <lpage>272</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>L.</given-names>
            <surname>Giordano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Gliozzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Olivetti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. L.</given-names>
            <surname>Pozzato</surname>
          </string-name>
          ,
          <article-title>A NonMonotonic Description Logic for Reasoning About Typicality, Artif</article-title>
          . Intell.
          <volume>195</volume>
          (
          <year>2013</year>
          )
          <fpage>165</fpage>
          -
          <lpage>202</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.artint.
          <year>2012</year>
          .
          <volume>10</volume>
          .004.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>K.</given-names>
            <surname>Britz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Heidema</surname>
          </string-name>
          , T. Meyer, Semantic preferential subsumption, in: G. Brewka, J. Lang (Eds.),
          <source>KR</source>
          <year>2008</year>
          , AAAI Press, Sidney, Australia,
          <year>2008</year>
          , pp.
          <fpage>476</fpage>
          -
          <lpage>484</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>K.</given-names>
            <surname>Britz</surname>
          </string-name>
          , G. Casini, T. Meyer, K. Moodley,
          <string-name>
            <given-names>U.</given-names>
            <surname>Sattler</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Varzinczak</surname>
          </string-name>
          ,
          <article-title>Principles of KLM-style defeasible description logics</article-title>
          ,
          <source>ACM Trans. Comput. Log</source>
          .
          <volume>22</volume>
          (
          <year>2021</year>
          ) 1:
          <fpage>1</fpage>
          -
          <lpage>1</lpage>
          :
          <fpage>46</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>G.</given-names>
            <surname>Casini</surname>
          </string-name>
          , U. Straccia,
          <article-title>Rational Closure for Defeasible Description Logics</article-title>
          , in: T. Janhunen, I. Niemelä (Eds.),
          <source>JELIA</source>
          <year>2010</year>
          , volume
          <volume>6341</volume>
          <source>of LNCS</source>
          , Springer, Helsinki,
          <year>2010</year>
          , pp.
          <fpage>77</fpage>
          -
          <lpage>90</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>G.</given-names>
            <surname>Casini</surname>
          </string-name>
          , U. Straccia,
          <article-title>Defeasible inheritance-based description logics</article-title>
          ,
          <source>Journal of Artificial Intelligence Research (JAIR) 48</source>
          (
          <year>2013</year>
          )
          <fpage>415</fpage>
          -
          <lpage>473</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>L.</given-names>
            <surname>Giordano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Gliozzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Olivetti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. L.</given-names>
            <surname>Pozzato</surname>
          </string-name>
          ,
          <article-title>Semantic characterization of rational closure: From propositional logic to description logics</article-title>
          ,
          <source>Art. Int</source>
          .
          <volume>226</volume>
          (
          <year>2015</year>
          )
          <fpage>1</fpage>
          -
          <lpage>33</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>L.</given-names>
            <surname>Giordano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. Theseider</given-names>
            <surname>Dupré</surname>
          </string-name>
          ,
          <article-title>An ASP approach for reasoning in a concept-aware multipreferential lightweight DL</article-title>
          , TPLP
          <volume>10</volume>
          (
          <issue>5</issue>
          ) (
          <year>2020</year>
          )
          <fpage>751</fpage>
          -
          <lpage>766</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>L.</given-names>
            <surname>Giordano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. Theseider</given-names>
            <surname>Dupré</surname>
          </string-name>
          ,
          <article-title>Weighted defeasible knowledge bases and a multipreference semantics for a deep neural network model</article-title>
          ,
          <source>in: Proc. JELIA</source>
          <year>2021</year>
          , May 17-20, volume
          <volume>12678</volume>
          <source>of LNCS</source>
          , Springer,
          <year>2021</year>
          , pp.
          <fpage>225</fpage>
          -
          <lpage>242</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>S.</given-names>
            <surname>Haykin</surname>
          </string-name>
          ,
          <string-name>
            <surname>Neural Networks - A Comprehensive Foundation</surname>
          </string-name>
          , Pearson,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>L.</given-names>
            <surname>Giordano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Gliozzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. Theseider</given-names>
            <surname>Dupré</surname>
          </string-name>
          ,
          <article-title>On a plausible concept-wise multipreference semantics and its relations with self-organising maps</article-title>
          , in: F.
          <string-name>
            <surname>Calimeri</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Perri</surname>
          </string-name>
          , E. Zumpano (Eds.),
          <source>CILC</source>
          <year>2020</year>
          ,
          <article-title>Rende</article-title>
          , IT, Oct.
          <volume>13</volume>
          -
          <fpage>15</fpage>
          ,
          <year>2020</year>
          , volume
          <volume>2710</volume>
          <source>of CEUR</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>127</fpage>
          -
          <lpage>140</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>L.</given-names>
            <surname>Giordano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Gliozzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. T.</given-names>
            <surname>Dupré</surname>
          </string-name>
          ,
          <article-title>A conditional, a fuzzy and a probabilistic interpretation of self-organizing maps</article-title>
          ,
          <source>J. Log. Comput</source>
          .
          <volume>32</volume>
          (
          <year>2022</year>
          )
          <fpage>178</fpage>
          -
          <lpage>205</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>T.</given-names>
            <surname>Kohonen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Schroeder</surname>
          </string-name>
          , T. Huang (Eds.),
          <string-name>
            <surname>Self-Organizing</surname>
            <given-names>Maps</given-names>
          </string-name>
          ,
          <source>Third Edition</source>
          , Springer Series in Information Sciences, Springer,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>L.</given-names>
            <surname>Giordano</surname>
          </string-name>
          ,
          <article-title>On the KLM properties of a fuzzy DL with Typicality</article-title>
          ,
          <source>in: Proc. ECSQARU</source>
          <year>2021</year>
          , Prague, Sept.
          <fpage>21</fpage>
          -
          <lpage>24</lpage>
          ,
          <year>2021</year>
          , volume
          <volume>12897</volume>
          <source>of LNCS</source>
          , Springer,
          <year>2021</year>
          , pp.
          <fpage>557</fpage>
          -
          <lpage>571</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>L.</given-names>
            <surname>Giordano</surname>
          </string-name>
          ,
          <article-title>From weighted conditionals of multilayer perceptrons to a gradual argumentation semantics</article-title>
          ,
          <source>in: 5th Workshop on Advances in Argumentation in Artif. Intell.</source>
          ,
          <year>2021</year>
          , Milan, Italy, Nov.
          <volume>29</volume>
          , volume
          <volume>3086</volume>
          <source>of CEUR Workshop Proc</source>
          .,
          <year>2021</year>
          . URL: http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>3086</volume>
          /paper8.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>L.</given-names>
            <surname>Giordano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. Theseider</given-names>
            <surname>Dupré</surname>
          </string-name>
          ,
          <article-title>Weighted conditional EL⊥ knowledge bases with integer weights: an ASP approach</article-title>
          ,
          <source>in: Proc. 37th Int. Conf. on Logic Programming</source>
          ,
          <source>ICLP 2021 (Technical Communications)</source>
          , Porto, Sept.
          <fpage>20</fpage>
          -
          <lpage>27</lpage>
          ,
          <year>2021</year>
          , volume
          <volume>345</volume>
          <source>of EPTCS</source>
          ,
          <year>2021</year>
          , pp.
          <fpage>70</fpage>
          -
          <lpage>76</lpage>
          . URL: https://doi.org/10.4204/EPTCS.345.19.
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>L.</given-names>
            <surname>Giordano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. Theseider</given-names>
            <surname>Dupré</surname>
          </string-name>
          ,
          <article-title>An ASP approach for reasoning on neural networks under a finitely many-valued semantics for weighted conditional knowledge bases</article-title>
          ,
          <source>TPLP</source>
          <volume>22</volume>
          (
          <year>2022</year>
          )
          <fpage>589</fpage>
          -
          <lpage>605</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>A.</given-names>
            <surname>García-Cerdaña</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Armengol</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Esteva</surname>
          </string-name>
          ,
          <article-title>Fuzzy description logics and t-norm based fuzzy logics</article-title>
          ,
          <source>Int. J. Approx. Reason</source>
          .
          <volume>51</volume>
          (
          <year>2010</year>
          )
          <fpage>632</fpage>
          -
          <lpage>655</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.ijar.
          <year>2010</year>
          .
          <volume>01</volume>
          .001.
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>F.</given-names>
            <surname>Bobillo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>U.</given-names>
            <surname>Straccia</surname>
          </string-name>
          ,
          <article-title>Reasoning with the finitely many-valued Łukasiewicz fuzzy Description Logic SROIQ, Inf</article-title>
          . Sci.
          <volume>181</volume>
          (
          <year>2011</year>
          )
          <fpage>758</fpage>
          -
          <lpage>778</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.ins.
          <year>2010</year>
          .
          <volume>10</volume>
          .020.
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>F.</given-names>
            <surname>Bobillo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Delgado</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Gómez-Romero</surname>
          </string-name>
          , U. Straccia,
          <article-title>Joining Gödel and Zadeh Fuzzy Logics in Fuzzy Description Logics</article-title>
          ,
          <source>Int. J. Uncertain. Fuzziness Knowl. Based Syst</source>
          .
          <volume>20</volume>
          (
          <year>2012</year>
          )
          <fpage>475</fpage>
          -
          <lpage>508</lpage>
          . doi:
          <volume>10</volume>
          .1142/S0218488512500249.
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>S.</given-names>
            <surname>Borgwardt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Peñaloza</surname>
          </string-name>
          ,
          <article-title>The complexity of lattice-based fuzzy description logics</article-title>
          ,
          <source>J. Data Semant</source>
          .
          <volume>2</volume>
          (
          <issue>2013</issue>
          )
          <fpage>1</fpage>
          -
          <lpage>19</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>G.</given-names>
            <surname>Brewka</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Delgrande</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Romero</surname>
          </string-name>
          , T. Schaub,
          <article-title>asprin: Customizing answer set preferences without a headache</article-title>
          ,
          <source>in: Proc. AAAI</source>
          <year>2015</year>
          ,
          <year>2015</year>
          , pp.
          <fpage>1467</fpage>
          -
          <lpage>1474</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>H.</given-names>
            <surname>Przymusinska</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. C.</given-names>
            <surname>Przymusinski</surname>
          </string-name>
          ,
          <article-title>Weakly perfect model semantics for logic programs</article-title>
          ,
          <source>in: Logic Programming</source>
          ,
          <source>Proceedings of the Fifth International Conference and Symposium</source>
          , Seattle, Washington, USA,
          <year>August</year>
          15-
          <issue>19</issue>
          ,
          <year>1988</year>
          (
          <article-title>2 Volumes)</article-title>
          , MIT Press,
          <year>1988</year>
          , pp.
          <fpage>1106</fpage>
          -
          <lpage>1120</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>L.</given-names>
            <surname>Zadeh</surname>
          </string-name>
          ,
          <article-title>Probability measures of fuzzy events</article-title>
          ,
          <source>J.Math.Anal.Appl</source>
          <volume>23</volume>
          (
          <year>1968</year>
          )
          <fpage>421</fpage>
          -
          <lpage>427</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>P.</given-names>
            <surname>Ekman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Friesen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Hager</surname>
          </string-name>
          , Facial Action Coding System, Research Nexus,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>G.</given-names>
            <surname>Stoilos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. B.</given-names>
            <surname>Stamou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Tzouvaras</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. Z.</given-names>
            <surname>Pan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I. Horrocks</given-names>
            ,
            <surname>Fuzzy</surname>
          </string-name>
          <string-name>
            <surname>OWL</surname>
          </string-name>
          <article-title>: uncertainty and the semantic web</article-title>
          ,
          <source>in: OWLED*05 Workshop on OWL Galway, Ireland, Nov 11-12</source>
          ,
          <year>2005</year>
          , volume
          <volume>188</volume>
          <source>of CEUR Workshop Proc</source>
          .,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>T.</given-names>
            <surname>Lukasiewicz</surname>
          </string-name>
          , U. Straccia,
          <article-title>Description logic programs under probabilistic uncertainty and fuzzy vagueness</article-title>
          ,
          <source>Int. J. Approx. Reason</source>
          .
          <volume>50</volume>
          (
          <year>2009</year>
          )
          <fpage>837</fpage>
          -
          <lpage>853</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <string-name>
            <given-names>S.</given-names>
            <surname>Borgwardt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Peñaloza</surname>
          </string-name>
          ,
          <article-title>Undecidability of fuzzy description logics</article-title>
          , in: G. Brewka,
          <string-name>
            <given-names>T.</given-names>
            <surname>Eiter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. A</given-names>
            .
            <surname>McIlraith</surname>
          </string-name>
          (Eds.),
          <source>Proc. KR</source>
          <year>2012</year>
          , Rome, Italy, June 10-14,
          <year>2012</year>
          , AAAI Press,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [34]
          <string-name>
            <given-names>P.</given-names>
            <surname>Cintula</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Hájek</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.</surname>
          </string-name>
          Noguera (Eds.),
          <source>Handbook of Mathematical Fuzzy Logic</source>
          , volume
          <volume>37</volume>
          -38,
          <string-name>
            <given-names>College</given-names>
            <surname>Publications</surname>
          </string-name>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [35]
          <string-name>
            <given-names>F.</given-names>
            <surname>Bobillo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>U.</given-names>
            <surname>Straccia</surname>
          </string-name>
          ,
          <article-title>Reasoning within fuzzy OWL 2 EL revisited</article-title>
          ,
          <source>Fuzzy Sets Syst</source>
          .
          <volume>351</volume>
          (
          <year>2018</year>
          )
          <fpage>1</fpage>
          -
          <lpage>40</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          [36]
          <string-name>
            <given-names>I.</given-names>
            <surname>Montes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Hernández</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Martinetti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Montes</surname>
          </string-name>
          ,
          <article-title>Characterization of continuous t-norms compatible with zadeh's probability of fuzzy events</article-title>
          ,
          <source>Fuzzy Sets Syst</source>
          .
          <volume>228</volume>
          (
          <year>2013</year>
          )
          <fpage>29</fpage>
          -
          <lpage>43</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          [37]
          <string-name>
            <given-names>L.</given-names>
            <surname>Giordano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. Theseider</given-names>
            <surname>Dupré</surname>
          </string-name>
          ,
          <article-title>Weighted defeasible knowledge bases and a multipreference semantics for a deep neural network model</article-title>
          , CoRR abs/
          <year>2012</year>
          .13421 (
          <year>2021</year>
          ).
          <source>Technical Report</source>
          , https://arxiv.org/abs/
          <year>2012</year>
          .13421v2.
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          [38]
          <string-name>
            <given-names>P.</given-names>
            <surname>Smets</surname>
          </string-name>
          ,
          <article-title>Probability of a fuzzy event: An axiomatic approach</article-title>
          ,
          <source>Fuzzy Sets and Systems</source>
          <volume>7</volume>
          (
          <year>1982</year>
          )
          <fpage>153</fpage>
          -
          <lpage>164</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          [39]
          <string-name>
            <given-names>D.</given-names>
            <surname>Dubois</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Prade</surname>
          </string-name>
          ,
          <article-title>Fuzzy sets and probability: misunderstandings, bridges and gaps</article-title>
          ,
          <source>in: [Proceedings 1993] Second IEEE International Conference on Fuzzy Systems</source>
          ,
          <year>1993</year>
          , pp.
          <fpage>1059</fpage>
          -
          <lpage>1068</lpage>
          vol.
          <volume>2</volume>
          . doi:
          <volume>10</volume>
          .1109/FUZZY.
          <year>1993</year>
          .
          <volume>327367</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          [40]
          <string-name>
            <given-names>B.</given-names>
            <surname>Kosko</surname>
          </string-name>
          ,
          <article-title>Neural networks and fuzzy systems: a dynamical systems approach to machine intelligence</article-title>
          , Prentice Hall,
          <year>1992</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          [41]
          <string-name>
            <given-names>N.</given-names>
            <surname>Friedman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. Y.</given-names>
            <surname>Halpern</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Koller</surname>
          </string-name>
          ,
          <article-title>First-order conditional logic for default reasoning revisited, ACM TOCL</article-title>
          , ACM Press 1
          <article-title>(</article-title>
          <year>2000</year>
          )
          <fpage>175</fpage>
          -
          <lpage>207</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref42">
        <mixed-citation>
          [42]
          <string-name>
            <given-names>K. A.</given-names>
            <surname>Ross</surname>
          </string-name>
          ,
          <article-title>Modular stratification and magic sets for Datalog programs with negation</article-title>
          ,
          <source>J. ACM</source>
          <volume>41</volume>
          (
          <year>1994</year>
          )
          <fpage>1216</fpage>
          -
          <lpage>1266</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref43">
        <mixed-citation>
          [43]
          <string-name>
            <given-names>S.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Deng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Du</surname>
          </string-name>
          ,
          <article-title>Reliable crowdsourcing and deep locality-preserving learning for expression recognition in the wild</article-title>
          ,
          <source>in: 2017 IEEE Conference on Computer Vision</source>
          and Pattern Recognition,
          <string-name>
            <surname>CVPR</surname>
          </string-name>
          <year>2017</year>
          ,
          <article-title>Honolulu</article-title>
          ,
          <string-name>
            <surname>HI</surname>
          </string-name>
          , USA, July
          <volume>21</volume>
          -
          <issue>26</issue>
          ,
          <year>2017</year>
          ,
          <year>2017</year>
          , pp.
          <fpage>2584</fpage>
          -
          <lpage>2593</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref44">
        <mixed-citation>
          [44]
          <string-name>
            <given-names>T.</given-names>
            <surname>Baltrusaitis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Zadeh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y. C.</given-names>
            <surname>Lim</surname>
          </string-name>
          , L. Morency,
          <article-title>Openface 2.0: Facial behavior analysis toolkit</article-title>
          ,
          <source>in: 13th IEEE International Conference on Automatic Face &amp; Gesture Recognition, FG</source>
          <year>2018</year>
          , IEEE Computer Society,
          <year>2018</year>
          , pp.
          <fpage>59</fpage>
          -
          <lpage>66</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref45">
        <mixed-citation>
          [45]
          <string-name>
            <given-names>B.</given-names>
            <surname>Waller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. C.</given-names>
            <surname>Jr.</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Burrows</surname>
          </string-name>
          ,
          <article-title>Selection for universal facial emotion</article-title>
          ,
          <source>Emotion</source>
          <volume>8</volume>
          (
          <year>2008</year>
          )
          <fpage>435</fpage>
          -
          <lpage>439</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref46">
        <mixed-citation>
          [46]
          <string-name>
            <given-names>D. P.</given-names>
            <surname>Kingma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ba</surname>
          </string-name>
          ,
          <article-title>Adam: A method for stochastic optimization</article-title>
          ,
          <source>arXiv preprint arXiv:1412.6980</source>
          (
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref47">
        <mixed-citation>
          [47]
          <string-name>
            <given-names>L. C.</given-names>
            <surname>Lamb</surname>
          </string-name>
          , A. S.
          <string-name>
            <surname>d'Avila Garcez</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Gori</surname>
            ,
            <given-names>M. O. R.</given-names>
          </string-name>
          <string-name>
            <surname>Prates</surname>
            ,
            <given-names>P. H. C.</given-names>
          </string-name>
          <string-name>
            <surname>Avelar</surname>
            ,
            <given-names>M. Y.</given-names>
          </string-name>
          <string-name>
            <surname>Vardi</surname>
          </string-name>
          ,
          <article-title>Graph neural networks meet neural-symbolic computing: A survey and perspective</article-title>
          , in: C.
          <string-name>
            <surname>Bessiere</surname>
          </string-name>
          (Ed.),
          <source>Proc. IJCAI</source>
          <year>2020</year>
          ,
          <article-title>ijcai</article-title>
          .org,
          <year>2020</year>
          , pp.
          <fpage>4877</fpage>
          -
          <lpage>4884</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref48">
        <mixed-citation>
          [48]
          <string-name>
            <given-names>R.</given-names>
            <surname>Guidotti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Monreale</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ruggieri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Turini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Giannotti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Pedreschi</surname>
          </string-name>
          ,
          <article-title>A survey of methods for explaining black box models</article-title>
          ,
          <source>ACM Comput. Surv</source>
          .
          <volume>51</volume>
          (
          <year>2019</year>
          )
          <volume>93</volume>
          :
          <fpage>1</fpage>
          -
          <lpage>93</lpage>
          :
          <fpage>42</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref49">
        <mixed-citation>
          [49]
          <string-name>
            <given-names>M.</given-names>
            <surname>Alviano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Peñaloza</surname>
          </string-name>
          ,
          <article-title>Fuzzy answer set computation via satisfiability modulo theories</article-title>
          ,
          <source>Theory Pract. Log. Program</source>
          .
          <volume>15</volume>
          (
          <year>2015</year>
          )
          <fpage>588</fpage>
          -
          <lpage>603</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>