Extraction of Conditional Belief Bases and the System Z Ranking
                         Model From Multilayer Perceptrons for Binary Classification
                         Marco Wilhelm, Alexander Hahn and Gabriele Kern-Isberner
                         Dept. of Computer Science, TU Dortmund University, Dortmund, Germany


                                           Abstract
                                           We extract propositional conditional belief bases from multilayer perceptrons, a basic type of feedforward neural networks, and
                                           investigate the relation between these two prevalent formalisms from knowledge representation and reasoning (KRR) and machine
                                           learning (ML), respectively. The ultimate goal of our work is to imitate with the extracted belief base the main information flow in the
                                           original multilayer perceptron detached from specific input data. For this, we introduce a notion of sufficient (in)activators of neurons
                                           which reflect the most relevant connections within the multilayer perceptron that lead to the (in)activation of the subsequent neurons.
                                           While focusing on the binary multi-class classification task, we show that our approach produces consistent belief bases from which
                                           principled inferences can be drawn, for instance under System Z. In particular, no inferences are invented by the System Z ranking
                                           model that are not in accordance with the initial neural network.

                                           Keywords
                                           multilayer perceptrons, binary classification, belief base extraction, conditional reasoning, system Z


                         1. Introduction                                                                                                   perceptrons are arranged to at least three fully connected
                                                                                                                                           layers with neurons connected to the other neurons from
                         Neural networks [1] are formal models studied in the re-                                                          the neighboring layers. The extracted belief base reflects the
                         search field of machine learning (ML) which have con-                                                             main information flow within such a multilayer perceptron.
                         tributed significantly to the recent success of AI. In neural                                                        The basic idea of our approach is to identify sets of pre-
                         networks, input data is propagated through a network of                                                           decessors of a neuron 𝑁 the (in)activation of which is suf-
                         neurons where neurons weight the received information                                                             ficient to (in)activate 𝑁 . Hereby, the (in)activation of a
                         and process it to the subsequent neurons. Neural networks                                                         neuron means that an input of the multilayer perceptron
                         are used in nearly every application domain with special                                                          triggers the neuron more (less) than a predefined threshold,
                         abilities in data processing, pattern recognition, data mining,                                                   i.e., the output value of the neuron is larger (smaller) than
                         and, what is in the focus of this paper, binary (multi-class)                                                     this threshold. Therewith, our approach is related to the
                         classification [2]. A drawback of neural networks is that                                                         work in [10] which aims at identifying “most influential”
                         they appear as a black box methodology. Usually, it is not                                                        neurons in neural networks, however without establishing
                         very transparent why input data leads to a specific output.                                                       logical connections between these neurons.
                            In contrast to neural networks, knowledge-based sys-                                                              In more detail, the main contributions of the present paper
                         tems [3] from the field of knowledge representation and rea-                                                      are as follows:
                         soning (KRR) typically provide a transparent and principled
                         way of drawing inferences. A frequently used inference for-                                                               • We introduce a notion of sufficient (in)activators of
                         malism, System Z [4], makes use of conditionals (𝐵|𝐴) in or-                                                                neurons (Definitions 6 and 7).
                         der to represent defeasible statements of the form “if 𝐴 holds,                                                           • We show that sufficient (in)activators are indepen-
                         then usually 𝐵 holds, too” [5, 6]. Ranking functions 𝜅 [7]                                                                  dent of the input of the multilayer perceptron (Propo-
                         like the System Z ranking function give such conditionals                                                                   sitions 2 and 3).
                         a clear semantics by assigning (im)plausibility values to                                                                 • Based on the notion of sufficient (in)activators, we
                         sentences while postulating that the verification of a con-                                                                 extract belief bases from multilayer perceptrons (Def-
                         ditional (𝐵|𝐴) is more plausible than its falsification, in                                                                 inition 9). The extracted belief bases are provably
                         symbols 𝜅(𝐴 ∧ 𝐵) < 𝜅(𝐴 ∧ ¬𝐵). The 𝜅-ranks according                                                                         consistent with respect to ranking semantics (Propo-
                         to System Z are gained by penalizing possible worlds for                                                                    sition 5).
                         falsifying conditionals, where the penalty points are the                                                                 • We use the extracted belief bases and their System Z
                         greater the more specific the falsified conditionals are. Al-                                                               ranking models for binary classification and relate
                         ternative ranking semantics are provided by System P [8]                                                                    their classification behavior to the direct classifica-
                         and c-representations [9].                                                                                                  tion with the initial multilayer perceptrons (Propo-
                            In this paper, we extract conditional belief bases from a                                                                sition 6).
                         specific type of neural networks called multilayer percep-
                         trons. Multilayer perceptrons are feedforward networks in                                                            With our approach we abstract from specific input data
                         which information is always processed towards the output,                                                         and also from overlay effects of less relevant connections
                         hence there are no cycles in the network. In contrast to                                                          in the neural networks. The most relevant connections are
                         general feedforward networks, the neurons in multilayer                                                           formalized in form of easy to understand conditionals. Note
                                                                                                                                           that establishing such formal bridges between neural- and
                          22nd International Workshop on Nonmonotonic Reasoning, November 2-4,                                             logic-based models is a very old enterprise and has been
                          2024, Hanoi, Vietnam                                                                                             pursued in the first papers on neural networks already [11].1
                          $ marco.wilhelm@tu-dortmund.de (M. Wilhelm);                                                                        The rest of the paper is organized as follows. First we
                          alexander.hahn@tu-dortmund.de (A. Hahn);
                          gabriele.kern-isberner@tu-dortmund.de (G. Kern-Isberner)
                                                                                                                                           recall basics on multilayer perceptrons, in particular with
                           0000-0003-0266-2334 (M. Wilhelm); 0009-0008-6114-2594 (A. Hahn);                                               respect to binary multi-class classification, and conditional
                          0000-0001-8689-5391 (G. Kern-Isberner)
                                   © 2024 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribu-   1
                                   tion 4.0 International (CC BY 4.0).                                                                         We thank the anonymous referees for their valuable comments.


CEUR
                  ceur-ws.org
Workshop      ISSN 1613-0073
Proceedings
 Activation function    Specification                Range                 𝑁1 ▶ 𝑦𝑁1

 Identity               𝜑(𝑥) = {︃
                               𝑥                     R                                           𝜈𝑁1 ,𝑁
                                 0, 𝑥 < 0
 Heaviside step         𝜑(𝑥) =                       {0, 1}                                   𝜈𝑁2 ,𝑁
                                 1, 𝑥 ≥ 0                                  𝑁2 ▶ 𝑦𝑁2                         𝑁 ▶ 𝑦𝑁
                                   1
 Logistic function      𝜑(𝑥) =                       (0, 1)
                               1 + 𝑒−𝑥                                                      𝜈𝑁𝑛 ,𝑁
                               𝑒𝑥 − 𝑒−𝑥                                         ..
 Hyperbolic tangent     𝜑(𝑥) = 𝑥                     (−1, 1)                     .
                               𝑒 + 𝑒−𝑥
 ReLU                   𝜑(𝑥) = max(0, 𝑥)             R≥0
                                                                           𝑁 𝑛 ▶ 𝑦 𝑁𝑛
Table 1
Typical activation functions of neural networks.
                                                                  Figure 1: Schema of a neuron 𝑁 .

reasoning based on ranking functions (Section 2). Then, we
discuss related work on extracting belief bases from multi-
                                                                  input data for which the expected output is known. Here, we
layer perceptrons within a Description Logic context and
                                                                  solely consider neural networks which are already trained.
show that a naïve translation to propositional conditional
belief bases works only to a limited extent (Section 3). Even-
tually, we propose our novel approach on extracting belief        Multilayer Perceptrons In neural networks, neurons
bases based on sufficient (in)activators (Section 4) and use      are usually assigned to layers with different functionalities.
this approach for principled binary classification (Section 5).   Neurons in the first layer, the input layer, receive the input
We close the paper with a conclusion that points to future        of the network, and neurons in the last layer, the output
work (Section 6).                                                 layer, return the output. The layers in-between are called
                                                                  hidden layers. If a neural network is represented by an
                                                                  acyclic directed graph, it is called a feedforward network.
2. Preliminaries                                                  In feedforward networks information is always processed
                                                                  towards the output layer. Multilayer perceptrons constitute
In this section, we recall preliminaries on multilayer percep-    an important subclass of feedforward networks with edges
trons with an application to binary multi-class classification    only between adjacent layers and, taking this condition into
first (Section 2.1). Then, we explain basics on reasoning with    account, fully connected neurons. Multilayer perceptrons
conditionals, in particular based on System Z (Section 2.2).      have at least one hidden layer. This hidden layer (as well as
                                                                  a non-linear activation function) is necessary to distinguish
2.1. Multilayer Perceptrons for Binary                            data that is not linearly separable [12].
     Multi-Class Classification                                   Definition 1 (Multilayer Perceptron). A multilayer percep-
Multilayer perceptrons (MLPs) constitute a widely used type       tron ℳ𝜑 is a special neural network which is represented by
of neural networks which expand single perceptrons to sev-        a directed graph (𝒱ℳ𝜑 , ℰℳ𝜑 ) consisting of a set of vertices
eral fully connected layers. We give a brief introduction to                         𝒱ℳ𝜑 = {𝑁𝑖,𝑗 | 𝑖 ∈ [𝑚], 𝑗 ∈ [𝑛𝑖 ]}, 2
neural networks in general and to MLPs in particular. Af-
terwards, we discuss their application to binary multi-class      the neurons in ℳ𝜑 , and a set of edges
classification.
                                                                        ℰℳ𝜑 = {(𝑁𝑖,𝑗 , 𝑁𝑖+1,𝑘 )
Neural Networks Neural networks [1] are formal models                                        | 𝑖 ∈ [𝑚 − 1], 𝑗 ∈ [𝑛𝑖 ], 𝑘 ∈ [𝑛𝑖+1 ]},
used to process information in form of data in modern AI
systems. In the original sense, neural networks are func-         where 𝑚 ∈ N≥2 , and 𝑛𝑖 ∈ N for 𝑖 ∈ [𝑚]. Every edge
tions 𝒩 : R𝑛 → R𝑚 where 𝑛 is the size of the real-valued          (𝑁𝑖,𝑗 , 𝑁𝑖+1,𝑘 ) ∈ ℰℳ𝜑 is assigned a real-valued weight
input vectors ⃗𝑥, and where 𝑚 is the size of the real-valued      𝜈𝑖,𝑗,𝑘 = 𝜈𝑁𝑖,𝑗 ,𝑁𝑖+1,𝑘 , every neuron 𝑁0,𝑗 , 𝑗 ∈ [𝑛0 ], in the
output 𝒩 (𝑥⃗ ). The computation of 𝒩 (𝑥  ⃗ ) is specified by a    input layer is assigned the identity function 𝑓0,𝑗 : R → R
weighted directed graph the nodes of which are called neu-        with 𝑓0,𝑗 (𝑥) = 𝑥, and every further neuron 𝑁𝑖,𝑗 with 𝑖 > 0,
rons. The functionality of neurons is as follows. Neurons 𝑁       𝑗 ∈ [𝑛𝑖 ], is assigned a function 𝑓𝑁𝑖,𝑗 : R𝑛𝑖−1 +1 → R with
receive information encoded as real numbers 𝑦𝑁𝑖 from their                                    ∑︁
                                                                    𝑓𝑁𝑖,𝑗 (𝑥
                                                                           ⃗ ) = 𝜑(𝛽𝑖,𝑗 +                                ⃗ )), (1)
                                                                                                     𝜈𝑖−1,ℎ,𝑗 · 𝑓𝑁𝑖−1,ℎ (𝑥
parent nodes/neurons 𝑁𝑖 ∈ pa𝑁 , or the input vector ⃗𝑥 of
                                                                                                ℎ∈[𝑛𝑖−1 ]
the network, process this information based on an activation
function 𝜑𝑁 : R → R and possibly a bias 𝛽𝑁 ∈ R, and send          where 𝜑 is the activation function of ℳ𝜑 and 𝛽𝑖,𝑗 ∈ R is
the processed information                                         the bias of 𝑁𝑖,𝑗 . The input of ℳ𝜑 is any vector ⃗𝑥 ∈ R𝑛0 +1
                               ∑︁                                 whereby the 𝑗-th component of ⃗𝑥 is passed to the neuron 𝑁0,𝑗 ,
           𝑦𝑁 = 𝜑𝑁 (𝛽𝑁 +             𝜈𝑁𝑖 ,𝑁 · 𝑦𝑁𝑖 )               and the output of ℳ𝜑 is
                              𝑁𝑖 ∈pa𝑁
                                                                         ℳ𝜑 (𝑥
                                                                             ⃗ ) = (𝑓𝑁𝑚,0 (𝑥                    ⃗ )) ∈ R𝑛𝑚 +1 .
                                                                                           ⃗ ), . . . , 𝑓𝑁𝑚,𝑛𝑚 (𝑥
to their child nodes/neurons. Hereby, 𝜈𝑁𝑖 ,𝑁 ∈ R is the
weight of the edge from 𝑁𝑖 to 𝑁 (cf. Figure 1). Neurons             Figure 2 shows a schema of a multilayer perceptron with
without child nodes return the output of the neural network.      one hidden layer (𝑚 = 2). For a neuron 𝑁 ∈ ℳ𝜑 , we will
Typical activation functions of neural networks are shown         denote the set of its parent nodes by pa𝑁 which will help
in Table 1. The weights of a neural network and the biases        us to avoid indices.
of the neurons are usually derived from training data, i.e.,      2
                                                                      For 𝑚 ∈ N, we abbreviate [𝑚] = {0, 1, . . . , 𝑚}.
    𝑁0,0                     𝑁1,0                          𝑁2,0        𝑁0,0                      𝑁1,0                      𝑁2,0


      ..                        ..                          ..
       .                         .                           .

    𝑁0,𝑖                     𝑁1,𝑗                          𝑁2,𝑘        𝑁0,1                      𝑁1,1                      𝑁2,1


      ..                        ..                          ..
       .                         .                           .

   𝑁0,𝑛0                     𝑁1,𝑛1                     𝑁2,𝑛2           𝑁0,2                      𝑁1,2                      𝑁2,2


Figure 2: Multilayer perceptron with one hidden layer.              Figure 3: Multilayer perceptron from Example 1. Edges with
                                                                    negative weights are dashed.


Binary Multi-Class Classification A possible applica-                    𝑁𝑖,𝑗                𝜈𝑖,𝑗,0          𝜈𝑖,𝑗,1          𝜈𝑖,𝑗,2
tion of neural networks in general and multilayer percep-                𝑁0,0               −1.27            0.91           −0.44
trons in particular is binary (multi-class) classification [2].          𝑁0,1                1.23            0.81            0.27
For instance, the input ⃗𝑥 of a multilayer perceptron ℳ𝜑                 𝑁0,2               −0.91           −0.09            1.96
could represent medical patient data, and we could ask                   𝑁1,0                1.62           −0.96            1.31
for therapies that are suited to cure the patient. In the                𝑁1,1               −1.19            1.15            1.46
easiest case, the neurons in the output layer of ℳ𝜑 rep-                 𝑁1,2                0.14           −1.18           −0.14
resent the different therapies and are equipped with the
Heaviside step function as activation function 𝜑 such               Table 2
that ℳ𝜑 (𝑥 ⃗ ) ∈ {0, 1}𝑚 for some 𝑚 ∈ N. Then, 𝑦𝑖 = 1,              Weights of the multilayer perceptron from Example 1.
where 𝑦𝑖 is the outcome of neuron 𝑁𝑖 in the output layer,
can be interpreted as “the therapy 𝑁𝑖 is suited to cure the
patient represented by ⃗𝑥,” and 𝑦1 = 0 can be understood as         Example 1. We consider the multilayer perceptron ℳex       log
the opposite.                                                       from Figure 3 with the edge weights from Table 2 as a running
   In practice, one usually uses sigmoid functions like the lo-     example. Further, we assume that the neurons in ℳex    log are
gistic function (cf. Table 1) for classification, instead, which    unbiased (𝛽𝑁𝑖,𝑗 = 0), and let 𝜏 = 0.3. For instance, for the
range over the interval (0, 1) and, thus, allow for a grad-         input vector ⃗𝑥 = (0.9, 0.8, 0.1), we obtain
ual answer behavior. Furthermore, the Heaviside function
cannot be used for gradient-based training because it is not                               𝑦𝑁2,2 ≈ 0.844
differentiable at 0 and the derivative is 0 at all other points,
while the logistics function can be differentiated any number       so that ⃗𝑥 is classified as an instance of class 𝒞𝑁2,2 when the
of times which makes it particularly suited for numerical           tolerance factor 𝜏 is equal to or greater than 0.156.
methods. In this paper, we equip multilayer perceptrons                Besides the fact that sigmoid functions like the logistic
with the logistic function as an activation function and de-        function are common activation functions for classification
note this by ℳlog . Our approach works with any sigmoid             tasks, we will utilize in some proofs that logistic functions
function, though. We consider the following three-valued            are bounded between 0 and 1 (cf. the proofs of Proposi-
interpretation of the output of neurons in ℳlog .                   tions 2 and 3).
Definition 2 ((In)active Neurons). Let ℳlog be a multilayer         Definition 3 (Classification Scheme). Let ℳlog be a multi-
perceptron, let 𝑁 be a neuron in ℳlog , let ⃗𝑥 be an input vector   layer perceptron with the logistic function as activation func-
of ℳlog , and let 𝜏 ∈ [0, 0.5). We call 𝜏 a tolerance factor,       tion, and let 𝜏 be a tolerance factor. Then, we call (ℳlog , 𝜏 )
and say that neuron 𝑁 is (cf. (1))                                  a classification scheme.
     • activated by ⃗𝑥 wrt. 𝜏 , or active for short, iff
                                                                       Within our approach on extracting conditional belief
                               ⃗ ) ≥ 1 − 𝜏,
                           𝑓𝑁 (𝑥                                    bases from multilayer perceptrons, we will focus on the
                                                                    task of binary multi-class classification.
     • inactivated by ⃗𝑥 wrt. 𝜏 , or inactive for short, iff

                                 ⃗ ) ≤ 𝜏,
                             𝑓𝑁 (𝑥                                  2.2. Conditionals and System Z
                                                                    Within the field of nonmonotonic reasoning, conditionals [13]
     • ambiguous otherwise.                                         constitute a widely used representation of defeasible knowl-
   With Definition 2, we can say that an input vector ⃗𝑥            edge resp. beliefs. Here, we consider conditionals defined
of ℳlog is classified as an instance of class 𝒞𝑁 , represented      over a propositional language and interpret them via so-
by the neuron 𝑁 in the output layer of ℳlog , if 𝑁 is acti-         called ranking functions, in particular the System Z ranking
vated by ⃗𝑥, and ⃗𝑥 is declassified as an instance of class 𝒞𝑁      model.
if 𝑁 is inactivated by ⃗𝑥. Otherwise, the membership to 𝒞𝑁
is ambiguous. We give an example.
Conditional Reasoning Let ℒ(Σ) be a propositional lan-                     The resulting System Z ranking model is
guage defined over a finite signature Σ as usual.3 A condi-                                   ⎧
tional (𝐵|𝐴) with 𝐴, 𝐵 ∈ ℒ(Σ) is a formal representation                                      ⎨0, 𝜔 ∈ {𝑏𝑓 𝑝, 𝑏𝑓 𝑝, 𝑏𝑓 𝑝}
                                                                                              ⎪
of the defeasible statement: “If 𝐴 holds, then usually 𝐵                            𝜅𝑍Δ (𝜔) =   1, 𝜔 ∈ {𝑏𝑓 𝑝, 𝑏𝑓 𝑝}       .
holds, too.” Finite sets of conditionals serve as belief bases.                               ⎪
                                                                                              ⎩
                                                                                                2, 𝜔 ∈ {𝑏𝑓 𝑝, 𝑏𝑓 𝑝, 𝑏𝑓 𝑝}
The semantics of conditionals is based on possible worlds.
Here, possible worlds 𝜔 ∈ Ω(Σ) are the propositional inter-                      System Z coincides with rational closure [14].
pretations of ℒ(Σ) represented as complete conjunctions
of literals. That is, every atom from Σ occurs in a possible
world once, either positive or negated. A ranking func-                    3. Related Work and Synaptic
tion 𝜅 : Ω(Σ) → N0 ∪ {∞} [7] maps possible worlds to a                        Conditionals
degree of implausibility while satisfying the normalization
condition 𝜅−1 (0) ̸= ∅. The higher the rank 𝜅(𝜔), the less                 In this section, we briefly recall the extraction of beliefs from
plausible the possible world 𝜔 is. Hence, 𝜅−1 (0) is the set               neural networks as presented in [15] and provide a naïve
of the most plausible possible words. Ranking functions are                translation of this approach to propositional conditionals.
extended to propositions via                                               We also discuss why this naïve translation is too simple
                                                                           to capture the essential streams of information of a neural
                     𝜅(𝐴) =         min        𝜅(𝜔)                        network.
                               𝜔∈Ω(Σ) : 𝜔|=𝐴
                                                                              In [15], an extraction of belief bases from neural net-
and accept a conditional (𝐵|𝐴) if 𝜅(𝐴𝐵) < 𝜅(𝐴𝐵). A                         works is proposed where the belief bases are defined over
ranking function 𝜅 is a ranking model of a belief base Δ if 𝜅              defeasible subsumptions of Description Logic concepts.4
accepts all conditionals in Δ. If Δ has a ranking model, then              Neurons 𝑁𝑖 are represented as atomic concepts 𝐶𝑖 , and an
it is called consistent. Ranking models 𝜅 of Δ yield a non-                edge from a neuron 𝑁𝑖 to a neuron 𝑁𝑗 is represented as the
monotonic inference relation between Δ and conditionals                    defeasible subsumption T(𝐶𝑖 ) ⊑ 𝐶𝑗 , expressing that input
(𝐵|𝐴) in the following sense:                                              vectors ⃗𝑥 that typically activate 𝑁𝑖 also activate 𝑁𝑗 . This no-
                                                                           tion of representing the structure of a neural network using
       Δ |∼𝜅 (𝐵|𝐴) iff 𝜅(𝐴𝐵) < 𝜅(𝐴𝐵) or 𝜅(𝐴) = ∞.                          uncertain connections between atoms can be carried over
                                                                           to propositional conditional logic, utilizing atomic proposi-
System Z A sophisticated ranking model of consistent                       tions 𝐴𝑖 to represent neurons and conditionals (𝐴𝑖 |𝐴𝑗 ) to
belief bases is provided by System Z [4] which is based                    encode connections between them. Then, a (partial) possi-
on the notion of tolerance. A conditional (𝐵|𝐴) is toler-                  ble world 𝜔 encodes a possible state of the neural network,
ated by a belief base Δ if there is a possible world 𝜔 such                with 𝜔 |= 𝐴𝑖 (𝜔 |= 𝐴𝑖 ) meaning that the neuron 𝑁𝑖 is ac-
that 𝜔 |= 𝐴𝐵 (“the conditional (𝐵|𝐴) is verified in 𝜔”)                    tive (inactive) in the neural network. From another point of
and 𝜔 |= 𝐴′ 𝐵 ′ ∨ 𝐴′ for all conditionals (𝐵 ′ |𝐴′ ) in Δ (“the            view, 𝜔 can be seen as a representation of all input vectors ⃗𝑥
conditional (𝐵 ′ |𝐴′ ) is verified or not applicable in 𝜔”). An            that cause the same neurons to be (in)active. Together, the
ordered partition (Δ0 , Δ1 , . . . , Δ𝑚 ) of Δ is called a toler-          possible worlds in Ω(Σ) partition the set of input vectors
ance partition of Δ if every conditional in Δ0 is tolerated                based on their (abstracted) activation of neurons.
by Δ and (Δ1 , . . . , Δ𝑚 ) is a tolerance partition of Δ ∖ Δ0 .              We formalize the extraction of propositional conditionals
It is a well-known result that Δ is consistent iff Δ has a                 in analogy to the defeasible subsumptions in [15] now. For
tolerance partition. If the partitioning sets are chosen in-               this, and in the rest of this paper, we will use the same
clusion maximally, beginning from Δ0 , then the resulting                  symbol 𝑁 to denote both a neuron in the neural network and
tolerance partition 𝑍(Δ) = (Δ0 , Δ1 , . . . , Δ𝑚 ) is unique               the atomic proposition representing the neuron. Moreover,
and called Z-partition of Δ. Via the Z ranks 𝑍Δ (𝛿) = 𝑖 of
                                                                                                    ′
conditionals 𝛿 ∈ Δ where 𝑖 is the index of the partitioning                                 pa+
                                                                                              𝑁 = {𝑁 ∈ pa𝑁 | 𝜈𝑁 ′ ,𝑁 > 0},
set from 𝑍(Δ) with 𝛿 ∈ Δ𝑖 , the Z-partition of Δ allows one                                 pa−     ′
                                                                                              𝑁 = {𝑁 ∈ pa𝑁 | 𝜈𝑁 ′ ,𝑁 < 0},
to define the following System Z ranking model of consistent
belief bases Δ:                                                            denote the sets of the parent nodes 𝑁 ′ of 𝑁 within a neu-
              {︃                                                           ral network 𝒩 with positive and negative weights 𝜈𝑁 ′ ,𝑁 ,
                 0                            falΔ (𝜔) = ∅                 respectively.
   𝜅𝑍Δ (𝜔) =                                                 ,
                 1 + max𝛿∈falΔ (𝜔) 𝑍Δ (𝛿) otherwise
                                                                           Definition 4 (Synaptic Conditionals). Let 𝒩 be a neural net-
where 𝜔 ∈ Ω(Σ), and falΔ (𝜔) = {(𝐵|𝐴) ∈ Δ𝜔 |= 𝐴𝐵}                          work. Then we define for each neuron 𝑁 ∈ 𝒩 the backward
is the set of conditionals falsified in 𝜔.                                 synaptic conditionals as follows:
                                                                                                  {︀ ′          ′
Example 2. A typical example to illustrate System Z is the                            Δ+← (𝑁 ) = (𝑁 |𝑁 ) | 𝑁 ∈ pa𝑁 ,
                                                                                                                      + }︀

Tweety example. Let Δ = {𝛿1 , 𝛿2 , 𝛿3 } with                                          Δ−
                                                                                                  {︀
                                                                                                        ′       ′     − }︀
                                                                                        ← (𝑁 ) = (𝑁 |𝑁 ) | 𝑁 ∈ pa𝑁 .

          𝛿1 = (𝑏|𝑝),         𝛿2 = (𝑓 |𝑏),        𝛿3 = (𝑓 |𝑝),             Analogously, we define forward synaptic conditionals:
state that penguins like Tweety are usually birds and birds                          Δ+
                                                                                                  {︀     ′    ′      + }︀
                                                                                        → (𝑁 ) = (𝑁 |𝑁 ) | 𝑁 ∈ pa𝑁 ,
usually fly, but penguins usually do not fly. The System Z
                                                                                     Δ−                  ′    ′      − }︀
                                                                                                  {︀
tolerance partition of Δ is 𝑍(Δ) = (Δ0 , Δ1 ) with                                      → (𝑁 ) = (𝑁 |𝑁 ) | 𝑁 ∈ pa𝑁 .


                  Δ0 = {𝛿2 },         Δ1 = {𝛿1 , 𝛿3 }.                        Note that backward synaptic conditionals are abductive
3
                                                                           in nature. The idea of backward synaptic conditionals is that
    In order to shorten logical expressions, we use the abbreviations 𝐴𝐵
                                                                           4
    for conjunctions 𝐴∧𝐵 and 𝐴 for negations ¬𝐴 where 𝐴, 𝐵 ∈ ℒ(Σ).             Please see [16] for an introduction to Description Logics.
if a neuron 𝑁 is active, the positive inputs of 𝑁 must have       Example 3. We consider the multilayer perceptron ℳex  log
outweighed the negative inputs of 𝑁 (modulo the bias 𝛽𝑁 ).        from Example 1. The synaptic belief bases extracted from
Therefore, it is plausible to assume that parents with positive   ℳexlog are
connections are generally active, while parents with nega-
tive connections are generally inactive, even if exceptions          Δ←
                                                                      ℳex
                                                                        log
                                                                            = {(𝑁0,0 |𝑁1,0 ), (𝑁0,1 |𝑁1,0 ), (𝑁0,2 |𝑁1,0 ),
are possible (and likely). Forward synaptic conditionals, on
                                                                                  (𝑁0,0 |𝑁1,1 ), (𝑁0,1 |𝑁1,1 ), (𝑁0,2 |𝑁1,1 ),
the other hand, are predictive: Given that a neuron 𝑁 has
an active parent with a positive connection (and without                          (𝑁0,0 |𝑁1,2 ), (𝑁0,1 |𝑁1,2 ), (𝑁0,2 |𝑁1,2 ),
any additional information about the other parents), it is                        (𝑁1,0 |𝑁2,0 ), (𝑁1,1 |𝑁2,0 ), (𝑁1,2 |𝑁2,0 ),
plausible to assume that this positive influence will cause 𝑁
to be active as well.                                                             (𝑁1,0 |𝑁2,1 ), (𝑁1,1 |𝑁2,1 ), (𝑁1,2 |𝑁2,1 ),
   We can now define belief bases containing synaptic con-                        (𝑁1,0 |𝑁2,2 ), (𝑁1,1 |𝑁2,2 ), (𝑁1,2 |𝑁2,2 )},
ditionals.
                                                                  and
Definition 5 (Synaptic Belief Bases). Let 𝒩 be a neural
network. We define the backward/forward synaptic belief              Δ→
                                                                      ℳex
                                                                        log
                                                                            = {(𝑁1,0 |𝑁0,0 ), (𝑁1,1 |𝑁0,0 ), (𝑁1,2 |𝑁0,0 ),
bases as the union of all synaptic conditionals that share the                    (𝑁1,0 |𝑁0,1 ), (𝑁1,1 |𝑁0,1 ), (𝑁1,2 |𝑁0,1 ),
same direction, i.e.,
                      ⋃︁ (︀ +                                                     (𝑁1,0 |𝑁0,2 ), (𝑁1,1 |𝑁0,2 ), (𝑁1,2 |𝑁0,2 ),
           Δ←              Δ← (𝑁 ) ∪ Δ−
                                                )︀
              𝒩 =                        ← (𝑁 ) ,                                 (𝑁2,0 |𝑁1,0 ), (𝑁2,1 |𝑁1,0 ), (𝑁2,2 |𝑁1,0 ),
                    𝑁 ∈𝒩
                ⋃︁ (︀ +                                                           (𝑁2,0 |𝑁1,1 ), (𝑁2,1 |𝑁1,1 ), (𝑁2,2 |𝑁1,1 ),
           Δ→        Δ→ (𝑁 ) ∪ Δ−
                                      )︀
            𝒩 =                 → (𝑁 ) .
                    𝑁 ∈𝒩                                                          (𝑁2,0 |𝑁1,2 ), (𝑁2,1 |𝑁1,2 ), (𝑁2,2 |𝑁1,2 )}.
   The synaptic belief bases capture the information that         In both cases (backward/forward), the Z-partition collapses:
is immediately available from the structure of the neural
network, namely the positive or negative influence neurons           𝑍(Δ←
                                                                        ℳex
                                                                          log
                                                                              ) = (Δ←
                                                                                    ℳex
                                                                                      log
                                                                                          ),        𝑍(Δ→
                                                                                                       ℳex
                                                                                                         log
                                                                                                             ) = (Δ→
                                                                                                                   ℳex
                                                                                                                     log
                                                                                                                         ),
have on each other based on the trained synaptic weights.
From a formal perspective, the direction of the conditionals      and we have, with 𝜓(𝑁2,2 ) = 𝑁0,0 ∧ 𝑁0,1 ∧ 𝑁0,2 ,
is arbitrary. As long as the two directions are not mixed, the
                                                                                      Δ |̸ ∼𝜅𝑍 (𝑁2,2 |𝜓(𝑁2,2 ))
synaptic belief base extracted from a multilayer perceptron                                 Δ

is consistent.                                                    regardless of whether Δ = Δ←           →
                                                                                             ℳex or Δ = Δℳex because
                                                                                                    log              log
Proposition 1. For every multilayer perceptron ℳ𝜑 , the
synaptic belief bases Δ←       →
                       ℳ𝜑 and Δℳ𝜑 are consistent.                  𝜅𝑍                               𝑍
                                                                    Δ (𝑁2,2 ∧ 𝜓(𝑁2,2 )) = 1 ̸< 0 = 𝜅Δ (𝑁2,2 ∧ 𝜓(𝑁2,2 ))


Proof. We prove the proposition for Δ←   ℳ𝜑 by showing that       for Δ = Δ←
                                                                           ℳex , and
                                                                                log
the layers of the multilayer perceptron ℳ𝜑 induce a toler-
ance partition of Δ←
                   ℳ𝜑 . Let (𝑚 + 1) ∈ N be the number of           𝜅𝑍                               𝑍
                                                                    Δ (𝑁2,2 ∧ 𝜓(𝑁2,2 )) = 1 ̸< 1 = 𝜅Δ (𝑁2,2 ∧ 𝜓(𝑁2,2 ))
layers in ℳ𝜑 and let 𝒩𝑖 be the set of neurons in the 𝑖-th
layer of ℳ𝜑 . Then, (Δ0 , . . . , Δ𝑚−1 ) defined by               for Δ = Δ→    ℳex   . Thus, In both cases this contradicts the
                                                                                  log
                                                                  fact that the input vector ⃗𝑥 = (0.9, 0.8, 0.1) triggers the
          Δ𝑘 = {(𝑁˙ ′ |𝑁 ) ∈ Δ←
                              ℳ | 𝑁 ∈ 𝒩𝑘+1 }                      neurons 𝑁0,0 , 𝑁0,1 , and 𝑁0,2 and is classified as an instance
                                                                  of 𝒞𝑁2,2 by ℳex log (cf. Example 1). Hence, we come to different
partitions Δ←  ℳ𝜑 . Now, we show that every conditional           conclusions if we either classify ⃗𝑥 = (0.9, 0.8, 0.1) by ℳexlog
in Δ𝑘 is tolerated by 𝑙 : 𝑘≤𝑙<𝑚 Δ𝑙 . Let Δ𝑘 and 𝑁 ∈ 𝒩𝑘+1
                      ⋃︀
                                                                  directly or classify ⃗𝑥 based on the synaptic belief bases.
be arbitrary but fixed. We choose a possible world 𝜔
with the following properties: (1) 𝜔 |= 𝑁 , (2) 𝜔 |= 𝑁 ′             The example above shows that belief bases consisting
if (𝑁 ′ |𝑁 ) ∈ Δ𝑘 for every 𝑁 ′ ∈ 𝒩𝑘 , and (3) 𝜔 |= 𝑁 ′′ for      of synaptic conditionals (only) are too basic to give any
every 𝑁 ′′ ∈ 𝒩𝑝 with 𝑘 < 𝑝 ≤ 𝑚 and 𝑁 ̸= 𝑁 ′′ . It can be          guarantees with respect to reasoning behavior when using
quickly checked that all three properties concern different       System Z. It is to be expected that a qualitative belief base
neurons and, hence, can be satisfied by 𝜔 at the same time.       cannot provide inferences on the same level of detail like
The properties (1) and (2) together ensure that 𝜔 verifies all    the original neural network. The example also shows that
conditionals with antecedent 𝑁 ; property (3) ensures that 𝜔      the belief base introduces new inferences which cannot be
is indifferent with respect to all other conditionals in all Δ𝑙   obtained from the neural network. This can be considered
with 𝑘 ≤ 𝑙 < 𝑚. Since Δ𝑘 and 𝑁 were chosen arbitrarily,           undesirable. Therefore, in order to make better use of the
this proves that every conditional in every Δ𝑘 is tolerated       quantitative information learned by the neural network, we
by all Δ𝑙 (with 0 ≤ 𝑘 ≤ 𝑙 < 𝑚).                                   make the extracted conditionals more complex to capture
   The proof for Δ→  ℳ is analogous; only the order of the
                                                                  relevant influences among the neurons better in the next
partition needs to be reversed.                                   section.

   In contrast to [15], which makes use of fuzzy Description
Logics, the synaptic belief bases are purely qualitative repre-   4. Sufficient (In)activators for Belief
sentations of the connections in neural networks. Naturally,         Base Extraction
this means that all information about how strong individual
connections between neurons are missing. The following            Now, we propose a more sophisticated approach than synap-
example shows that this can lead to different inferences.         tic conditionals for extracting conditional belief bases from
multilayer perceptrons. On the one hand, this means an ab-            Proof. (⇐) Assume that (2) and 𝑦𝑁 ′ ≥ 1 − 𝜏 for 𝑁 ′ ∈ 𝐴+
straction from specific input data to generalized defeasible          and 𝑦𝑁 ′ ≤ 𝜏 for 𝑁 ′ ∈ 𝐴′ hold. Then,
rules, here conditionals. On the other hand, the embedding                                ∑︀
of the essential information flow of multilayer perceptrons                     𝜑(𝛽𝑁 + 𝑁 ′ ∈pa𝑁 𝜈𝑁 ′ ,𝑁 · 𝑦𝑁 ′ )
                                                                                          ∑︀
into a logical framework allows us to draw principled infer-                  = 𝜑(𝛽𝑁 + 𝑁 ′ ∈pa+ ∩𝐴+ 𝜈𝑁 ′ ,𝑁 · 𝑦𝑁 ′
                                                                                                 𝑁
ences of verifiable quality.
                                                                                          ∑︀
                                                                                       + 𝑁 ′ ∈pa+ ∖𝐴+ 𝜈𝑁 ′ ,𝑁 · 𝑦𝑁 ′
                                                                                          ∑︀     𝑁
                                                                                       + 𝑁 ′ ∈pa− ∩𝐴− 𝜈𝑁 ′ ,𝑁 · 𝑦𝑁 ′
                                                                                                 𝑁
4.1. Basic Idea and Preconditions
                                                                                          ∑︀
                                                                                       + 𝑁 ′ ∈pa− ∖𝐴− 𝜈𝑁 ′ ,𝑁 · 𝑦𝑁 ′ )
                                                                                                 𝑁∑︀

The basic idea of our method is to extract conditionals                       ≥ 𝜑(𝛽𝑁 +(1 − 𝜏 ) · 𝑁 ′ ∈pa+ ∩𝐴+ 𝜈𝑁 ′ ,𝑁
                                                                                                          𝑁
  𝜏,+             𝜏,+
                      ) from a multilayer perceptron ℳlog
                                                                                             ∑︀
𝛿𝑁      = (𝑁 |𝜓𝑁                                                                       +𝜏 · 𝑁 ′ ∈pa− ∩𝐴− 𝜈𝑁 ′ ,𝑁
                                                                                                    𝑁
where the consequence 𝑁 refers to a neuron from ℳlog
                                                                                          ∑︀
                                                                                       + 𝑁 ′ ∈pa− ∖𝐴− 𝜈𝑁 ′ ,𝑁 )
and the premise 𝜓𝑁    𝜏,+
                          to sets of parent nodes of 𝑁 which                                     𝑁
                                                                              ≥ 1 − 𝜏.
are (in combination) “most relevant” for the activation of 𝑁 .
Relevance here means that the conditional (𝑁 |𝜓𝑁       𝜏,+
                                                           ) is ef-   Hereby, we used 𝑁 ′ ∈pa+ ∖𝐴+ 𝜈𝑁 ′ ,𝑁 · 𝑦𝑁 ′ ≥ 0. Thus,
                                                                                        ∑︀
fective, i.e., 𝜓𝑁 is true, only if it is guaranteed that the
                𝜏,+                                                                                  𝑁
                                                                      (𝐴+ , 𝐴− ) is a sufficient activator of 𝑁 .
neuron 𝑁 is sufficiently highly activated. Hence, it is reli-
                                                                        (⇒) We prove the contraposition. Assume that
ably justified to infer 𝑁 . Analogously, we extract condition-
als 𝛿𝑁𝜏,−
          = (𝑁 |𝜓𝑁  𝜏,−
                        ) wrt. the inactivation of 𝑁 . The “most
                                                                                                          ∑︁
                                                                                 𝜑(𝛽𝑁 + (1 − 𝜏 ) ·                𝜈𝑁 ′ ,𝑁
relevant” parents nodes of neurons in ℳlog are identified
                                                                                                           𝑁 ′ ∈pa+ ∩𝐴+
based on the notion of sufficient (in)activators.                                                                 𝑁

   We assume that the input of the multilayer percep-
                                                                                   ∑︁                          ∑︁
                                                                        +𝜏 ·                  𝜈𝑁 ′ ,𝑁 +                  𝜈𝑁 ′ ,𝑁 ) < 1 − 𝜏
tron ℳlog is normalized to ⃗𝑥 ∈ [0, 1]𝑛 and that the activa-                   𝑁 ′ ∈pa− ∩𝐴−               𝑁 ′ ∈pa− ∖𝐴−
                                                                                      𝑁                          𝑁
tion function used in ℳlog is the logistic function which
ensures that the output of all neurons in ℳlog is within the          holds. We have to show that there is 𝑦𝑁 ′ ∈ [0, 1] for 𝑁 ′ ∈
range [0, 1] again. Given a tolerance factor 𝜏 , this allows for      pa𝑁 with 𝑦𝑁 ′ ≥ 1 − 𝜏 for 𝑁 ′ ∈ 𝐴+ and 𝑦𝑁 ′ ≤ 𝜏 for
an interpretation of the activation of all neurons in ℳlog            𝑁 ′ ∈ 𝐴− such that
as in Definition 2.                                                                       ∑︁
                                                                               𝜑(𝛽𝑁 +          𝜈𝑁 ′ ,𝑁 · 𝑦𝑁 ′ ) < 1 − 𝜏.
                                                                                          𝑁 ′ ∈pa𝑁
4.2. Sufficient (In)activators
                                                                      With
Based on the concept of active and inactive neurons, we de-                        ⎧
fine (sets of) parent nodes of neurons in a multilayer percep-                     ⎪
                                                                                   ⎪1−𝜏           if 𝑁 ′ ∈ pa+
                                                                                                             𝑁 ∩𝐴
                                                                                                                  +

tron ℳlog which are sufficient to activate resp. deactivate
                                                                                   ⎪
                                                                                   ⎨0             if 𝑁 ′ ∈ pa+
                                                                                                             𝑁 ∖𝐴
                                                                                                                 +
                                                                                   ⎪
                                                                                   ⎪
                                                                                   ⎪
the neurons, independent of the specific input vector ⃗𝑥.                    𝑦𝑁 ′ = 𝜏             if 𝑁 ∈ pa𝑁 ∩ 𝐴−
                                                                                                       ′     −

Definition 6 (Sufficient Activator). Let (ℳlog , 𝜏 ) be a                                         if 𝑁 ′ ∈ pa−   −
                                                                                   ⎪
                                                                                    1                        𝑁 ∖𝐴
                                                                                   ⎪
                                                                                   ⎪
                                                                                   ⎪
                                                                                   ⎪
classification scheme. Further, let 𝑁 be a neuron in ℳlog                                         if 𝑁 ∈ pa𝑁 ∖ (pa+
                                                                                                       ′                 −
                                                                                   ⎪
                                                                                                                   𝑁 ∪ pa𝑁 )
                                                                                   ⎩0
from a hidden layer or the output layer. We call a tuple
(𝐴+ , 𝐴− ) ⊆ pa2𝑁 with 𝐴+ ∩ 𝐴− = ∅ a sufficient acti-                 it follows that
vator of 𝑁 wrt. 𝜏 , if the activation of the neurons in 𝐴+ and                            ∑︀
                                                                                  𝜑(𝛽𝑁 + 𝑁 ′ ∈pa𝑁 𝜈𝑁 ′ ,𝑁 · 𝑦𝑁 ′ )
the inactivation of the neurons in 𝐴− implies the activation                              ∑︀
                                                                                = 𝜑(𝛽𝑁 + 𝑁 ′ ∈pa+ ∩𝐴+ 𝜈𝑁 ′ ,𝑁 · 𝑦𝑁 ′
of 𝑁 ; formally, if 𝑦𝑁 ′ ≥ 1 − 𝜏 for 𝑁 ′ ∈ 𝐴+ and 𝑦𝑁 ′ ≤ 𝜏                                ∑︀      𝑁

for 𝑁 ′ ∈ 𝐴− implies                                                                     + 𝑁 ′ ∈pa+ ∖𝐴+ 𝜈𝑁 ′ ,𝑁 · 𝑦𝑁 ′
                                                                                          ∑︀      𝑁
                                                                                         + 𝑁 ′ ∈pa− ∩𝐴− 𝜈𝑁 ′ ,𝑁 · 𝑦𝑁 ′
                                                                                                  𝑁
                       ∑︁
           𝜑(𝛽𝑁 +            𝜈𝑁 ′ ,𝑁 · 𝑦𝑁 ′ ) ≥ 1 − 𝜏.
                                                                                          ∑︀
                                                                                         + 𝑁 ′ ∈pa− ∖𝐴− 𝜈𝑁 ′ ,𝑁 · 𝑦𝑁 ′ )
                                                                                                  𝑁∑︀
                     𝑁 ′ ∈pa𝑁
                                                                                = 𝜑(𝛽𝑁 +(1 − 𝜏 ) · 𝑁 ′ ∈pa+ ∩𝐴+ 𝜈𝑁 ′ ,𝑁
                                                                                             ∑︀           𝑁
We denote the set of the sufficient activators of 𝑁 wrt. 𝜏                               +𝜏 · 𝑁 ′ ∈pa− ∩𝐴− 𝜈𝑁 ′ ,𝑁
                                                                                                     𝑁
by 𝒮𝒜𝜏 (𝑁 ).
                                                                                          ∑︀
                                                                                         + 𝑁 ′ ∈pa− ∖𝐴− 𝜈𝑁 ′ ,𝑁 )
                                                                                                  𝑁

   The idea of the sufficient activators in 𝒮𝒜𝜏 (𝑁 ) is that                    < 1 − 𝜏,
the output of the neurons 𝑁 ′ ∈ pa𝑁 with 𝑁 ′ ∈   / 𝐴+ ∪𝐴− is          which finishes the proof. Note that the choice of 𝑦𝑁 ′ = 0 in
irrelevant for the activation of 𝑁 , regardless of the concrete       case of 𝑁 ′ ∈ pa𝑁 ∖ (pa+𝑁 ∪ pa𝑁 ) is not mandatory because
                                                                                                      −
input of ℳlog , as captured in the next proposition.                  𝜈𝑁 ′ ,𝑁 = 0 holds in this case anyway.
Proposition 2. Let (ℳlog , 𝜏 ) be a classification scheme,               In this proof of Proposition 2 we have exploited that the
and let 𝑁 be a neuron in ℳlog from a hidden layer or the              logistic function is non-negative. If one wants to apply
output layer. Then, (𝐴+ , 𝐴− ) ⊆ pa2𝑁 with 𝐴+ ∩ 𝐴− = ∅                similar techniques to arbitrary sigmoid functions which are
is a sufficient activator of 𝑁 iff                                    not necessarily non-negative    but bounded by (𝑎, 𝑏) ⊂ ℛ
                                                                      one can rewrite 𝜑(𝛽𝑁 + 𝑁 ′ ∈pa𝑁 𝜈𝑁 ′ ,𝑁 · 𝑦𝑁 ′ ) beforehand
                                                                                                ∑︀
                                                                      to
                        ∑︀
   𝜑(𝛽𝑁 + (1 − 𝜏 ) · 𝑁 ′ ∈pa+ ∩𝐴+ 𝜈𝑁 ′ ,𝑁                                                             ∑︁
                                 𝑁                                                            ′                       ′
                                                                                𝜑𝑁 ((𝑏 − 𝑎)(𝛽𝑁   +         𝜈𝑁 ′ ,𝑁 · 𝑦𝑁 ′ ))
                                                       (2)
                 ∑︀
           + 𝜏 · 𝑁 ′ ∈pa− ∩𝐴− 𝜈𝑁 ′ ,𝑁
              ∑︀          𝑁                                                                               𝑁 ′ ∈pa𝑁
           + 𝑁 ′ ∈pa− ∖𝐴− 𝜈𝑁 ′ ,𝑁         ) ≥ 1 − 𝜏.
                      𝑁
                                                                      with 𝛽𝑁 ′      1
                                                                                         · (𝛽𝑁 + 𝑎 · 𝑁𝑖 ∈pa𝑁 𝜈𝑁 ′ ,𝑁 ) and 𝑦𝑁
                                                                                                                            ′
                                                                                                    ∑︀
                                                                                 = 𝑏−𝑎                                        ′ =
                                                                      𝑦𝑁 ′ −𝑎
                                                                       𝑏−𝑎
                                                                                where 𝑦𝑁 ′ is bounded by (0, 1) for all 𝑁𝑖 ∈ pa𝑁 .
                                                                                       ′
Note that in this case the thresholds for neurons being              Example 5. Again, we consider the multilayer perceptron
(in)active have to be adjusted from 1 − 𝜏 and 𝜏 to 𝑏 − 𝜏 and         ℳexlog from Example 1 (cf. Table 2) and the tolerance fac-
𝑎 + 𝜏 as well, now with 𝜏 ∈ [0, 𝑏−𝑎2
                                      ).                             tor 𝜏 = 0.3. Then, ({𝑁0,0 , 𝑁0,2 }, {𝑁0,1 }) is a sufficient
                                                                     inactivator of 𝑁1,0 because
  Proposition 2 can be used to compute sufficient activators.
For a neuron 𝑁 one generates each pair (𝐴+ , 𝐴− ) with                  𝜑(0.7 · (−1.27 − 0.91) + 0.3 · 1.23) ≈ 0.239 ≤ 0.3,
        𝑁 and 𝐴      ∈ pa−𝑁 and tests whether (2) holds or
                  −
𝐴+ ∈ pa+
not.                                                                 where 𝜑 is the logistic function (cf. Table 1). Note that
                                                                     ({𝑁0,0 , 𝑁0,2 }, ∅) is not a sufficient inactivator of 𝑁1,0 be-
Example 4. We consider the multilayer perceptron ℳex          log    cause
from Example 1 (cf. Table 2) and the tolerance factor 𝜏 = 0.3.
Then, for instance, ({𝑁0,0 , 𝑁0,1 }, ∅) is a sufficient activator         𝜑(0.7 · (−1.27 − 0.91) + 1.23) ≈ 0.427 > 0.3.
of 𝑁1,1 because                                                        For tuples of sets (𝑆1 , 𝑆2 ) and (𝑇1 , 𝑇2 ) we write
                                                                     (𝑆1 , 𝑆2 ) ⊑ (𝑇1 , 𝑇2 ) iff 𝑆1 ⊆ 𝑇1 and 𝑆2 ⊆ 𝑇2 . Obvi-
      𝜑(0.7 · (0.91 + 0.81) − 0.09) ≈ 0.753 ≥ 0.7,
                                                                     ously, if (𝐴+ , 𝐴− ) is a sufficient activator of 𝑁 and, for
where 𝜑 is the logistic function (cf. Table 1). Note that            (𝐴′+ , 𝐴′− ) ∈ pa(𝑁 )2 , (𝐴+ , 𝐴− ) ⊑ (𝐴′+ , 𝐴′− ) holds,
({𝑁0,0 }, ∅) is not a sufficient activator of 𝑁1,1 , instead, be-    then (𝐴′+ , 𝐴′− ) is a sufficient activator of 𝑁 , too. A similar
cause                                                                result holds for sufficient inactivators.
                                                                     Proposition 4. Let (ℳlog , 𝜏 ) be a classification scheme,
           𝜑(0.7 · (0.91) − 0.09) ≈ 0.633 < 0.7.
                                                                     and let 𝑁 be a neuron in ℳlog from a hidden layer or the
   Analogously to sufficient activators, we can define suffi-        output layer. Then,
cient inactivators.                                                       • if (𝐴+ , 𝐴− ) is a sufficient activator of 𝑁 , then
                                                                            (𝐴′+ , 𝐴′− ) ∈ pa2𝑁 with (𝐴+ , 𝐴− ) ⊑ (𝐴′+ , 𝐴′− )
Definition 7 (Sufficient Inactivator). Let (ℳlog , 𝜏 ) be a
                                                                            is a sufficient activator of 𝑁 , too,
classification scheme, and let 𝑁 be a neuron in ℳ from a hid-
den layer or the output layer. We call a tuple (𝐼 + , 𝐼 − ) ⊆ pa2𝑁        • if (𝐼 + , 𝐼 − ) is a sufficient inactivator of 𝑁 , then
with 𝐼 + ∩ 𝐼 − = ∅ a sufficient inactivator of 𝑁 wrt. 𝜏 if,                 (𝐼 ′+ , 𝐼 ′− ) ∈ pa2𝑁 with (𝐼 + , 𝐼 − ) ⊑ (𝐼 ′+ , 𝐼 ′− ) is
the activation of the neurons in 𝐼 + and the inactivation of                a sufficient inactivator of 𝑁 , too.
the neurons in 𝐼 − implies the inactivation of 𝑁 ; formally, if      Proof. Let (𝐴+ , 𝐴− ) be a sufficient activator of 𝑁 , and let
𝑦𝑁 ′ ≥ 1 − 𝜏 for 𝑁 ′ ∈ 𝐼 + and 𝑦𝑁 ′ ≤ 𝜏 for 𝑁 ′ ∈ 𝐼 − implies        (𝐴′+ , 𝐴′− ) ∈ pa2𝑁 with (𝐴+ , 𝐴− ) ⊑ (𝐴′+ , 𝐴′− ). Further,
                         ∑︁                                          let 𝑦𝑁 ′ ≥ 1 − 𝜏 for 𝑁 ′ ∈ 𝐴′+ and 𝑦𝑁 ′ ≤ 𝜏 for 𝑁 ′ ∈ 𝐴′− .
             𝜑(𝛽𝑁 +             𝜈𝑁 ′ ,𝑁 · 𝑦𝑁 ′ ) ≤ 𝜏.                From 𝐴+ ⊆ 𝐴′+ and 𝐴− ⊆ 𝐴′− it follows that 𝑦𝑁 ′ ≥ 1−𝜏
                      𝑁 ′ ∈pa(𝑁 )
                                                                     for 𝑁 ′ ∈ 𝐴+ and 𝑦𝑁 ′ ≤ 𝜏 for 𝑁 ′ ∈ 𝐴− holds as well.
We denote the set of the sufficient inactivators of 𝑁 wrt. 𝜏         Then, because (𝐴+ , 𝐴− ) is a sufficient activator,
by 𝒮ℐ 𝜏 (𝑁 ).                                                                             ∑︁
                                                                               𝜑(𝛽𝑁 +           𝜈𝑁 ′ ,𝑁 · 𝑦𝑁 ′ ) ≥ 1 − 𝜏.
  Similar to sufficient activators, the idea of sufficient in-                           𝑁 ′ ∈pa𝑁

activators (𝐼 + , 𝐼 − ) of neurons 𝑁 is that the output of the
                                                                     The proof for sufficient inactivators is analogous.
neurons 𝑁 ′ ∈ pa𝑁 with 𝑁 ′ ∈    / 𝐼 + ∪ 𝐼 − is irrelevant for the
inactivation of 𝑁 , regardless of the concrete input of ℳlog .          Proposition 4 suggests to define minimal sufficient
                                                                     (in)activators.
Proposition 3. Let (ℳlog , 𝜏 ) be a classification scheme,
and let 𝑁 be a neuron in ℳlog from a hidden layer or the             Definition 8 (Minimal Sufficient (In)activators). Let
output layer. Then, (𝐼 + , 𝐼 − ) ⊆ pa2𝑁 with 𝐼 + ∩ 𝐼 − = ∅ is a      (ℳlog , 𝜏 ) be a classification scheme, and let 𝑁 be a neuron
sufficient inactivator of 𝑁 iff                                      in ℳlog from a hidden layer or the output layer. Then,
                                                                          • A sufficient activator (𝐴+ , 𝐴− ) of 𝑁 is minimal if no
                    ∑︀
       𝜑(𝛽𝑁 + 𝜏 · 𝑁 ′ ∈pa+ ∩𝐼 − 𝜈𝑁 ′ ,𝑁
                              𝑁
                                                                            (𝐴′+ , 𝐴′− ) ∈ pa2𝑁 with (𝐴′+ , 𝐴′− ) ⊑ (𝐴+ , 𝐴− )
                                                            (3)
                 ∑︀
              + 𝑁 ′ ∈pa+ ∖𝐼 − 𝜈𝑁 ′ ,𝑁
                          𝑁∑︀                                               and (𝐴′+ , 𝐴′− ) ̸= (𝐴+ , 𝐴− ) is a sufficient activator
              + (1 − 𝜏 ) · 𝑁 ′ ∈pa− ∩𝐼 + 𝜈𝑁 ′ ,𝑁 ) ≤ 𝜏.
                                    𝑁                                       of 𝑁 ,
                                                                          • A sufficient inactivator (𝐼 + , 𝐼 − ) of 𝑁 is minimal if
Proof. The proof is similar to∑︀the proof of Proposition 2. For
                                                                            no (𝐼 ′+ , 𝐼 ′− ) ∈ pa2𝑁 with (𝐼 + , 𝐼 − ) ⊑ 𝐼 ′+ , 𝐼 ′− )
the direction (⇐) note that 𝑁 ′ ∈pa− ∖𝐼 + 𝜈𝑁 ′ ,𝑁 · 𝑦𝑁 ′ ≤ 0.
                                       𝑁                                    and (𝐼 + , 𝐼 − ) ̸= (𝐼 ′+ , 𝐼 ′− ) is a sufficient inactivator
For the proof of the contraposition of (⇒), we select                       of 𝑁 .
              ⎧
              ⎪
              ⎪ 𝜏       if 𝑁 ′ ∈ pa+ 𝑁 ∩𝐼
                                            −                        We denote the set of the minimal sufficient activators of 𝑁
                                                                     wrt. 𝜏 with 𝒮𝒜𝜏min (𝑁 ) and the set of the minimal sufficient
              ⎪
              ⎪1        if 𝑁 ∈ pa𝑁 ∖ 𝐼
                              ′      +     −
              ⎪
              ⎪
                                                                     inactivators of 𝑁 wrt. 𝜏 with 𝒮ℐ 𝜏min (𝑁 ).
              ⎨
      𝑦𝑁 ′ = 1 − 𝜏 if 𝑁 ′ ∈ pa−      𝑁 ∩𝐼
                                            +

                        if 𝑁 ∈ pa𝑁 ∖ 𝐼
                              ′      −     +                            We consider our running example.
              ⎪
              ⎪0
              ⎪
              ⎪
              ⎪
                        if 𝑁 ′ ∈ pa𝑁 ∖ (pa+          −
              ⎪
                                              𝑁 ∪ pa𝑁 )
              ⎩0
                                                                     Example 6. The minimal sufficient (in)activators of the neu-
                                                                     rons in ℳexlog from Example 1 (cf. Table 2) with respect to
  Again, this proposition can be used to compute sufficient          the tolerance factor 𝜏 = 0.3 are shown in Table 3 resp. Ta-
inactivators as the next example shows.                              ble 4. Minimal sufficient (in)activators allow for a graphi-
                                                                     cal representation (cf. Figure 4). For instance, the minimal
                               𝑁0,0                                  𝑁1,0                                 𝑁2,0


                               𝑁0,1                                  𝑁1,1                                 𝑁2,1


                               𝑁0,2                                  𝑁1,2                                 𝑁2,2


          Figure 4: Minimal sufficient (in)activators of the neurons in ℳex
                                                                         log from Example 1. Solid lines indicate activation and dashed
          lines inactivation.


  𝑁𝑖,𝑗                        𝒮𝒜𝜏min (𝑁𝑖,𝑗 )                                every neuron 𝑁 in ℳlog its relationship to its sufficient
  𝑁1,0                              ∅                                       (in)activators by a conditional which states that if the neu-
  𝑁1,1                    {({𝑁0,0 , 𝑁0,1 }, ∅)}                             rons in one of the sufficient activators (inactivators) of 𝑁 are
  𝑁1,2                       {({𝑁0,2 }, ∅)}                                 (in)activated, then the neuron 𝑁 is usually active (inactive),
  𝑁2,0                {({𝑁1,0 , 𝑁1,2 }, {𝑁1,1 })}
                                                                            too.
  𝑁2,1                            ∅                                         Definition 9 (Belief Base Δ𝜏ℳlog ). Let (ℳlog , 𝜏 ) be a clas-
  𝑁2,2             {({𝑁1,0 }, {𝑁1,2 }), ({𝑁1,1 }, ∅)}                       sification scheme, and let 𝑁 be a neuron from a hidden layer
Table 3                                                                     or the output layer of ℳlog . Then, we define the conditionals
                                                                              𝜏,+          𝜏,+        𝜏,−         𝜏,−
Minimal sufficient activators of the neurons in the hidden resp.            𝛿𝑁    = (𝑁 |𝜓𝑁     ) and 𝛿𝑁   = (𝑁 |𝜓𝑁    ) via
output layer of ℳex
                  log from Example 1 wrt. 𝜏 = 0.3.
                                                                                                           ⎛                          ⎞
                                                                                               ⋁︁              ⋀︁            ⋀︁
                                                                             𝜓𝑁𝜏,+
                                                                                    =                      ⎝        𝑁′ ∧          𝑁 ′⎠ ,
  𝑁𝑖,𝑗                        𝒮ℐ 𝜏min (𝑁𝑖,𝑗 )                                        (𝐴+ ,𝐴− )∈𝒮𝒜𝜏
                                                                                                 min (𝑁 )          𝑁 ′ ∈𝐴+         𝑁 ′ ∈𝐴−
                                                                                                               ⎛                              ⎞
  𝑁1,0                 {({𝑁0,0 , 𝑁0,2 }, {𝑁0,1 })}                                             ⋁︁                  ⋀︁              ⋀︁
                                                                             𝜏,−                                             ′
  𝑁1,1                             ∅                                        𝜓𝑁   =                             ⎝          𝑁 ∧               𝑁 ′⎠ ,
  𝑁1,2                             ∅                                                 (𝐼 + ,𝐼 − )∈𝒮ℐ 𝜏          𝑁 ′ ∈𝐼 +          𝑁 ′ ∈𝐼 −
                                                                                                    min (𝑁 )
  𝑁2,0                             ∅
  𝑁2,1                 {({𝑁1,0 , 𝑁1,2 }, {𝑁1,1 })}                          provided that
  𝑁2,2                             ∅                                                                                     𝜏,+
                                                                                            𝒮𝒜𝜏min (𝑁 ) ̸= ∅ in case of 𝛿𝑁   ,
                                                                                               𝜏                         𝜏,−                         (*)
Table 4                                                                                     𝒮ℐ min (𝑁 ) ̸= ∅ in case of 𝛿𝑁 .
Minimal sufficient inactivators of the neurons in the hidden resp.
output layer of ℳex                                                         Note that the conditionals depend on the tolerance factor 𝜏
                   log from Example 1 wrt. 𝜏 = 0.3.
                                                                            because the sets of (minimal) sufficient (in)activators depend
                                                                            on 𝜏 . However, the conditionals are not dependent on any input
                                                                            vector of ℳlog , since 𝜏 abstracts from that. Based on that, we
sufficient inactivator ({𝑁0,0 , 𝑁0,2 }, {𝑁0,1 }) of 𝑁1,0 can be             define the extraction of the belief base Δ𝜏ℳlog from ℳlog via
visualized as three outgoing edges from 𝑁0,0 , 𝑁0,1 , and 𝑁0,2 ,
respectively, which conjointly result in 𝑁1,0 . The dashed line                Δ𝜏ℳlog = {𝛿𝑁
                                                                                          𝜏,+
                                                                                              | 𝑁 ∈ 𝒩 + } ∪ {𝛿𝑁
                                                                                                              𝜏,−
                                                                                                                  | 𝑁 ∈ 𝒩 − },
in Figure 4 after these three edges have met indicates that
({𝑁0,0 , 𝑁0,2 }, {𝑁0,1 }) is a sufficient inactivator (and not an           where 𝒩 𝜏,+ is the set of neurons 𝑁 for which the condi-
                                                                                    𝜏,+
activator) of 𝑁1,0 and the dashed line from 𝑁0,1 indicates                  tional 𝛿𝑁   exists, and where 𝒩 𝜏,− is the set of neurons 𝑁
                                                                                                       𝜏,−
that 𝑁0,1 has a negative influence on the inactivation of 𝑁1,0              for which the conditional 𝛿𝑁   exists, i.e., (*) applies.
(because the weight 𝜈0,1,0 is positive).
                                                                               The number of conditionals in Δ𝜏ℳlog is bounded by
   Altogether, (minimal) sufficient activators and inactiva-                the number of neurons in ℳlog (minus the input layer)
tors make it possible to abstract from the concrete input data              which means a higher degree of abstraction than prevalent
of a multilayer perceptron ℳlog and reveal the essential                    in synaptic belief bases (cf. Definition 5) the cardinality of
streams of information within ℳlog . This is the motivation                 which is bounded by the number of edges in ℳlog . Fur-
for our following extraction of conditional belief bases from               thermore, the condition (*) in Definition 9 ensures that the
multilayer perceptrons.                                                     conditionals 𝛿𝑁𝜏,+
                                                                                                (resp. 𝛿𝑁𝜏,−
                                                                                                             ) are added to Δ𝜏ℳlog only
                                                                            if 𝑁 has sufficient activators (inactivators). This prevents
                                                                            from conditionals of the form (𝑁 |⊥) and (𝑁 |⊥) in Δ𝜏ℳlog
4.3. Belief Base Extraction
                                                                            which would cause inconsistencies according to our accep-
Now, we describe our approach on extracting a conditional                   tance definition of conditionals. If there is a neuron 𝑁 with
belief base Δ𝜏ℳlog from a multilayer perceptron ℳlog based                  𝛿𝑁𝜏,+    𝜏,−
                                                                                  , 𝛿𝑁   / Δ𝜏ℳlog , then one can increase 𝜏 in order to
                                                                                         ∈
on sufficient (in)activators. In Δ𝜏ℳlog we formalize for                    improve the chance of obtaining such a conditional.
Example 7. We consider ℳex      log from Example 1 and the        for 𝑗 = 1, . . . , 𝑚 is a partition of Δ𝜏ℳlog (modulo empty
tolerance factor 𝜏 = 0.3. The minimal sufficient (in)activators   sets). Let 𝛿 ∈ Δ𝑗 , provided⋃︀hat Δ𝑗 ̸= ∅. We have to
of the neurons in ℳex
                    log are shown in Table 3 resp. Table 4 from   show that 𝛿 is tolerated by 𝑚      𝑘=𝑗 Δ𝑘 . For this, let 𝛿 be
which we can derive the belief base Δ0.3
                                       ℳlog . The conditionals    of the form 𝛿𝑁 for some 𝑁 ∈ 𝒩𝑗 . The proof for 𝛿 of
                                                                                  𝜏,+

in Δ0.3
     ℳlog are                                                     the form 𝛿𝑁 𝜏,−
                                                                                   is analogous. By construction of 𝛿𝑁  𝜏,+
                                                                                                                            , there
                                                                  is (𝐴 , 𝐴 ) ∈ 𝒮𝒜min (𝑁⋀︀
                                                                        +     −          𝜏
                                                                                                 ) and a (partial) possible  world
            0.3,−
           𝛿𝑁     = (𝑁1,0 |𝑁0,0 ∧ 𝑁0,2 ∧ 𝑁0,1 ),                  𝜔 ∈ Ω(𝒩𝑗−1 ) with 𝜔 |= 𝑁 ′ ∈𝐴+ 𝑁 ′ ∧ 𝑁 ′ ∈𝐴− 𝑁 ′ (𝐴+
                                                                                                                ⋀︀
              1,0
            0.3,+
           𝛿𝑁     = (𝑁1,1 |𝑁0,0 ∧ 𝑁0,1 ),                         and 𝐴− are disjoint).
              1,1
                                                                     Thanks to Lemma 1, we can extend 𝜔 to a (partial) possible
            0.3,+
           𝛿𝑁 1,2
                  = (𝑁1,2 |𝑁0,2 ),                                world 𝜔 ′ ∈ Ω(𝒩𝑗−1 ∪ 𝒩𝑗 ) such that all conditionals in Δ𝑗
            0.3,+                                                 are either not applicable or verified by concatenating 𝑁 ′
           𝛿𝑁     = (𝑁2,0 |𝑁1,0 ∧ 𝑁1,2 ∧ 𝑁1,1 ),
              2,0                                                 to 𝜔 in case of 𝜔 |= 𝜓𝑁     ′ or concatenating 𝑁 to 𝜔 in case
                                                                                            𝜏,+                      ′
            0.3,−
           𝛿𝑁 2,1
                  = (𝑁2,1 |𝑁1,0 ∧ 𝑁1,2 ∧ 𝑁1,1 ),                  of 𝜔 |= 𝜓𝑁 ′ for 𝑁 ∈ 𝒩𝑗 . In particular, 𝜔 ′ verifies 𝛿𝑁
                                                                              𝜏,−        ′                                      𝜏,+
                                                                                                                                    .
            0.3,+                                                 By a repeated application of this argument,    we  can construct
           𝛿𝑁     = (𝑁2,2 |𝑁1,0 ∧ 𝑁1,2 ∨ 𝑁1,1 ).
                                                                  a (partial) possible world 𝜔 ′′ ∈ Ω( 𝑚   𝑘=𝑗−1 𝒩𝑘 ) which veri-
              2,2
                                                                                                        ⋃︀

                                                                  fies 𝛿𝑁
                                                                        𝜏,+
                                                                             and falsifies no conditional from 𝑚    𝑘=𝑗 Δ𝑘 . Even-
                                                                                                                 ⋃︀
                                                       0.3,+
In particular, note the disjunction in the premise of 𝛿𝑁 2,2
because of the two (different) minimal sufficient activators      tually, this (partial) ⋃︀possible world can be extended to a
of 𝑁2,2 .                                                         possible world in Ω( 𝑚    𝑘=0 𝒩𝑘 ) by the concatenation of the
                                                                  remaining ground atoms, either positive or negated which
 The belief base Δ𝜏ℳlog is consistent. To show this, we           can be chosen freely.
make use of the following lemma.
                                                                      Note that the belief base Δ𝜏ℳlog might be empty, namely
Lemma 1. Let (ℳlog , 𝜏 ) be a classification scheme. Then,        if for all neurons in ℳlog there is no sufficient (in)activator.
for every neuron 𝑁 from a hidden layer or the output layer        On the contrary, if a neuron 𝑁 can be (in)activated, then
of ℳlog it holds that (cf. Definition 9)                          there is a sufficient (in)activator of 𝑁 so that there is a
                      𝜏,+    𝜏,−                                  conditional wrt. 𝑁 in Δ𝜏ℳlog . Thus, Δ𝜏ℳlog reflects the
                     𝜓𝑁   ∧ 𝜓𝑁   ≡ ⊥.                             most important information flow in ℳlog .
Proof. Assume that 𝜓𝑁   𝜏,+
                            ∧ 𝜓𝑁 𝜏,−
                                     ̸≡ ⊥ holds. Then, there
is a possible world 𝜔, a sufficient activator (𝐴+ , 𝐴− ) of 𝑁     5. Binary Classification with Δ𝜏ℳlog
wrt. 𝜏 , and a sufficient inactivator (𝐼 + , 𝐼 − ) of 𝑁 wrt. 𝜏
such that                                                         Now, we discuss how to perform binary (multi-class) clas-
  𝜔 |=
          ⋀︁
               𝑁′ ∧
                        ⋀︁
                              𝑁′ ∧
                                      ⋀︁
                                           𝑁′ ∧
                                                     ⋀︁
                                                         𝑁 ′.     sification based on the belief base Δ𝜏ℳlog which we have
                                                                  extracted from a multilayer perceptron ℳlog (cf. Defini-
       𝑁 ′ ∈𝐴+       𝑁 ′ ∈𝐴−          𝑁 ′ ∈𝐼 +   𝑁 ′ ∈𝐼 −
                                                                  tion 9). Recall that, following Definition 2, we can say that
It follows that (𝐴+ ∪ 𝐼 + ) ∩ (𝐴− ∪ 𝐼 − ) = ∅. Otherwise, 𝜔       an input vector ⃗𝑥 of ℳlog is classified (resp. declassified)
would mention an atom both negated and positive. From             as 𝒞𝑁 represented by the neuron 𝑁 from the output layer
this and Proposition 4 it follows that (𝐴+ ∪ 𝐼 + , 𝐴− ∪ 𝐼 − )     of ℳlog if ℳlog (𝑥 ⃗ ) ≥ 1 − 𝜏 (resp. ℳlog (𝑥⃗ ) ≤ 𝜏 ) where 𝜏
is both a sufficient activator and a sufficient inactivator       is a tolerance factor. We denote this with
of 𝑁 wrt. 𝜏 because (𝐴+ , 𝐴− ) ⊑ (𝐴+ ∪ 𝐼 + , 𝐴− ∪ 𝐼 − )
and (𝐼 + , 𝐼 − ) ⊑ (𝐴+ ∪ 𝐼 + , 𝐴− ∪ 𝐼 − ) hold. According to                 ℳlog , ⃗𝑥 |∼𝜏 𝑁 iff ℳlog (𝑥
                                                                                                       ⃗) ≥ 1 − 𝜏
the definitions of sufficient (in)activators, for appropriate                ℳlog , ⃗𝑥 |∼𝜏 𝑁 iff ℳlog (𝑥
                                                                                                       ⃗ ) ≤ 𝜏.
values 𝑦𝑁 ′ for 𝑁 ′ ∈ pa(𝑁 ),
                                                                  We lift this idea of classifying ⃗𝑥 from ℳlog to the belief
                                                                  base Δ𝜏ℳlog . Thereby, we make use of the System Z ranking
                             ∑︁
         1 − 𝜏 ≤ 𝜑(𝛽𝑁 +            𝜈𝑁 ′ ,𝑁 · 𝑦𝑁 ′ ) ≤ 𝜏
                           𝑁 ′ ∈pa𝑁                               model 𝜅𝑍 Δ𝜏
                                                                            ℳ
                                                                                  of Δ𝜏ℳlog .
                                                                               log

follows. This implies 1 − 𝜏 ≤ 𝜏 or, equivalent 0.5 ≤ 𝜏 ,
                                                                  Definition 10 (Z-Classification). Let (ℳlog , 𝜏 ) be a clas-
which contradicts 𝜏 ∈ [0, 0.5).
                                                                  sification scheme, let Δ𝜏ℳlog be the belief base extracted
  Lemma 1 states that there is no neuron 𝑁 in ℳlog for            from ℳlog , and let 𝜅𝑍             𝑍
                                                                                         ℳlog ,𝜏 = 𝜅Δ𝜏 ℳ
                                                                                                             be its System Z
                                                                                                               log
which both 𝛿𝑁𝜏,+
                 (supporting 𝑁 ) and 𝛿𝑁
                                      𝜏,−
                                          (supporting 𝑁 )         ranking model. With 𝐴⃗𝜏𝑥 we denote the set of neurons from
can be applicable at the same time.                               the input layer of ℳlog which are activated by ⃗𝑥 wrt. 𝜏 , and
Proposition 5. Let (ℳlog , 𝜏 ) be a classification scheme.        with 𝐼⃗𝑥𝜏 the set of neurons which are inactivated. Then, we
Then, the belief base Δ𝜏ℳlog extracted from ℳlog is consis-       say that an input vector ⃗𝑥 of ℳlog is
tent.                                                                  • Z-classified as 𝒞𝑁 wrt. a neuron 𝑁 from the output
                                                                         layer of ℳlog , denoted by
Proof. We show that Δ𝜏ℳlog has a tolerance partition from
which its consistency follows. Let 𝑚 + 1 be the number of
                                                                             Δ𝜏ℳlog , ⃗𝑥 |∼𝑍         𝑍
                                                                                           𝜏 𝑁, iff 𝜅ℳlog ,𝜏 accepts
layers in ℳlog and, for 𝑗 = 0, 1, . . . , 𝑚, let 𝒩𝑗 be the set
of neurons in the 𝑗-th layer. Then, (Δ1 , . . . , Δ𝑚 ) with
                                                                                                     ⋀︁          ⋀︁
                                                                                              (𝑁 |       𝑁′ ∧        𝑁 ′ ),
                                                                                                     𝑁 ′ ∈𝒜𝜏
                                                                                                           ⃗
                                                                                                           𝑥         𝑁 ′ ∈ℐ𝑥
                                                                                                                           𝜏
                                                                                                                           ⃗
         𝜏,+
  Δ𝑗 = {𝛿𝑁   ∈ Δ𝜏ℳlog | 𝑁 ∈ 𝒩𝑗 }
                               𝜏,−                                     • Z-declassified as 𝒞𝑁 , denoted by
                           ∪ {𝛿𝑁   ∈ Δ𝜏ℳlog | 𝑁 ∈ 𝒩𝑗 }
           Δ𝜏ℳlog , ⃗𝑥 |∼𝑍          𝑍
                         𝜏 𝑁 , iff 𝜅ℳlog ,𝜏 accepts                    Further, the Z-partition of Δ0.3        0.3       0.3
                                                                                                    ℳlog is 𝑍(Δℳex ) = (Δℳex ),
                                                                                                                    log          log

                                                                       so that, for (𝑁2,2 |𝜒𝑁2,2 ) with 𝜒𝑁2,2 = 𝑁0,0 ∧ 𝑁0,1 ∧ 𝑁0,2 ,
                                    ⋀︁          ⋀︁
                             (𝑁 |       𝑁′ ∧        𝑁 ′ ).
                                    𝑁 ′ ∈𝒜𝜏
                                          ⃗
                                          𝑥        𝑁 ′ ∈ℐ𝑥
                                                         𝜏
                                                         ⃗             we have, with Δ = Δ0.3 ℳex ,
                                                                                                log


   We obtain the following central result stating                          𝜅𝑍                            𝑍
                                                                            Δ (𝑁2,2 ∧ 𝜒𝑁2,2 ) = 0 < 1 = 𝜅Δ (𝑁2,2 ∧ 𝜒𝑁2,2 )
that 𝜅𝑍ℳlog ,𝜏 does not “invent” inferences but yields
inferences that can be drawn from ℳlog only. Instead,                  Thus, we classify ⃗𝑥 as an instance of 𝒞𝑁2,2 in accordance with
inferences drawn from 𝜅𝑍 ℳlog ,𝜏 can be understood, in some            the result from Example 1.
sense, as the most reliable inferences from ℳlog .
                                                                          Our approach focuses attention on the main dependencies
Proposition 6. Let (ℳlog , 𝜏 ) be a classification scheme,             among the neurons in multilayer perceptrons. In contrast
let Δ𝜏ℳlog be the belief base extracted from ℳlog , let 𝜅𝑍
                                                         ℳlog ,𝜏       to the synaptic conditionals in Section 3, the influence of
be its System Z ranking model, and let ⃗𝑥 be an input vector           several parent nodes on a neuron 𝑁 is aggregated, with
of ℳlog . Then,                                                        the guarantee that the aggregated parent nodes are able to
                                                                       (in)active 𝑁 . A depiction of these aggregated influences is
         Δ𝜏ℳlog , ⃗𝑥 |∼𝑍
                       𝜏 𝑁 implies ℳlog , ⃗
                                          𝑥 |∼𝜏 𝑁,                     shown in Figure 4 for our running example. Figure 4 can be
                                                                       understood as a visualization of the main information flow
and, analogously,                                                      in ℳexlog .

         Δ𝜏ℳlog , ⃗𝑥 |∼𝑍
                       𝜏 𝑁 implies ℳlog , ⃗
                                          𝑥 |∼𝜏 𝑁 .

Proof. Let 𝒜⃗𝜏𝑥 and ℐ⃗𝑥𝜏 be the sets of the neurons from the
                                                                       6. Conclusions
input layer of ℳlog which are activated resp. inactivated              We proposed an approach on extracting propositional con-
by the input ⃗𝑥 wrt. 𝜏 (cf. Definition 10). Further, let 𝑚 + 1         ditional belief bases from multilayer perceptrons (MLPs) for
be the number of layers in ℳlog , and, for 𝑗 = 0, 1, . . . , 𝑚,        binary multi-class classification. The conditionals relate to
let 𝒩𝑗 be the set of neurons in the 𝑗-th layer of ℳlog . We            the main information flow in the multilayer perceptron de-
prove that Δ𝜏ℳlog , ⃗𝑥 |∼𝑍𝜏 𝑁 implies ℳlog , ⃗  𝑥 |∼𝜏 𝑁 . The          tached from specific input vectors. Therewith, our approach
proof that Δ𝜏ℳlog , ⃗𝑥 |∼𝑍
                         𝜏 𝑁 implies ℳlog , ⃗  𝑥 |∼𝜏 𝑁 is anal-        abstracts from both the input data as well as overlay effects
ogous.                                                                 in the network and rebuilds the backbone of the network
  Let Δ𝜏ℳlog , ⃗𝑥 |∼𝑍
                    𝜏 𝑁 , i.e., by definition, 𝜅ℳlog ,𝜏 accepts
                                                𝑍
                                                                       within a prevalent KRR formalism. The main idea of our ap-
the conditional (𝑁 |𝜒𝑁 ) with                                          proach is to exploit sufficient (in)activators of neurons 𝑁 the
                                                                       (in)activation of which guarantees that 𝑁 is (in)activated
                                                                       as well. The extracted conditional belief base allows for
                         ⋀︁             ⋀︁
              𝜒𝑁 =              𝑁′ ∧         𝑁 ′.
                              𝜏
                        𝑁 ′ ∈𝒜⃗
                              𝑥
                                             𝜏
                                       𝑁 ′ ∈ℐ⃗
                                             𝑥
                                                                       drawing inferences in a principled way, for instance, under
                                                                       System Z. It is guaranteed that the belief base is consistent
Following the construction of possible worlds in the proof             and does not invent inferences that cannot be drawn from
of Proposition 5, every (partial) possible world 𝜔 ∈ Ω(𝒩0 )            the multilayer perceptron.
with 𝜔 ⋃︀|= 𝜒𝑁 can be extended to a possible world                        In recent work [17] it has been shown that there is a tight
𝜔 ′ ∈ Ω( 𝑚    𝑗=0 𝒩𝑗 ) such that no conditional from Δℳlog             connection between multilayer perceptrons and quantitative
                                                              𝜏

is falsified. Hence, 𝜅ℳlog ,𝜏 (𝜔 ) = 0. Because 𝜅ℳlog ,𝜏 ac-
                         𝑍           ′                   𝑍             bipolar argumentation frameworks. Roughly speaking, MLPs
cepts the conditional (𝑁 |𝜒𝑁 ), none of these extensions 𝜔 ′           can be seen as specific argumentation frameworks under a
satisfies 𝑁 . Otherwise, 𝜅𝑍                                            so-called MLP-semantics. To make this connection useful
                               ℳlog ,𝜏 (𝑁 ∧ 𝜒𝑁 ) = 0 would hold
                                                                       for explanations, some ideas on sparsification have been
which contradicts the acceptance of (𝑁 |𝜒𝑁 ). As a con-
                                                                       considered [18]. In future work, we want to investigate
sequence, the conditional 𝛿𝑁      𝜏,+
                                        (cf. Definition 9) must be
                                                                       the connections between our approach and the approaches
in Δ𝜏ℳlog which is the only possibility to exclude 𝑁 from
                                                                       from [17, 18]. Exploiting sparsified networks may simplify
the extensions 𝜔 ′ (and which is also accepted in all the ex-
                                                                       the computation of conditional belief bases. The other way
tensions 𝜔 ′ ). Otherwise, there is no reason why not to have
                                                                       round, the qualitative conditionals could perhaps be used to
an extension 𝜔 ′ with 𝜔 ′ |= 𝑁 .
                                                                       construct argumentation frameworks in order to simulate
    In more detail, either there is an extension 𝜔 ′ of 𝜔 with
                                                                       the MLPs that are easier to interpret than the argumentation
𝜔 |= 𝑁 and 𝜅𝑍
  ′
                  ℳlog ,𝜏 (𝜔 ) = 0 which contradicts the accep-
                             ′
                                                                       frameworks obtained from the current approaches.
tance of (𝑁 |𝜒𝑁 ), or 𝜅ℳlog ,𝜏 (𝜔 ′ ) > 0 for all such exten-
                            𝑍
                                                                          Also in future work, we want to extract conditionals
sion 𝜔 ′ which requires a conditional in Δ𝜏ℳlog that is falsi-         from multilayer perceptrons that are based on “necessary
fied in 𝜔 ′ . The only candidate for such a conditional would          (in)activators” and can be used for explaining classifications
be 𝛿𝑁 𝜏,+
          . As a consequence of the acceptance of 𝛿𝑁       𝜏,+
                                                               , the   that are made by the multilayer perceptrons. Therewith,
input vector ⃗𝑥 activates at least one sufficient activator of 𝑁 .     we expect to be able to bound all possible classifications
From this, it follows that ⃗𝑥 also activates 𝑁 in ℳlog .               from two directions (upper and lower bound) which, as we
                                                                       hope, can help to better understand the essence of binary
   We recall our running example to illustrate this proposi-           multi-class classification based on multilayer perceptrons.
tion.                                                                  Further research directions could be to investigate how the
                                                                       choice of the tolerance factor influences the shape of the
Example 8. We consider the same scenario as in Exam-
                                                                       conditional belief base and how different inference opera-
ple 1, i.e., the multilayer perceptron ℳex
                                        log , the tolerance factor
                                                                       tors, e.g., based on System P [8], lexicographic closure [19],
𝜏 = 0.3, and the input vector ⃗𝑥 = (0.9, 0.8, 0.1). Then,
                                                                       or c-representations [9], relate to the binary multi-class
          𝒜⃗0.3                        ℐ⃗𝑥0.3 = {𝑁0,2 }.               classification with multilayer perceptrons.
            𝑥 = {𝑁0,0 , 𝑁0,1 },
Acknowledgments                                                       telligence, EAAI 2021, Virtual Event, February 2-9,
                                                                      2021, AAAI Press, 2021, pp. 6463–6470.
This work was supported by DFG Grant KE 1413/14-1 of the         [18] H. Ayoobi, N. Potyka, F. Toni, Sparx: Sparse argumen-
German Research Foundation (DFG) awarded to Gabriele                  tative explanations for neural networks, in: K. Gal,
Kern-Isberner.                                                        A. Nowé, G. J. Nalepa, R. Fairstein, R. Radulescu (Eds.),
                                                                      ECAI 2023 - 26th European Conference on Artificial
                                                                      Intelligence, September 30 - October 4, 2023, Kraków,
References                                                            Poland - Including 12th Conference on Prestigious Ap-
 [1] K. Gurney, An Introduction to Neural Networks, UCL               plications of Intelligent Systems (PAIS 2023), volume
     Press, 1997.                                                     372 of Frontiers in Artificial Intelligence and Applica-
 [2] S. Yang, C. Zhang, W. Wu, Binary output layer of                 tions, IOS Press, 2023, pp. 149–156.
     feedforward neural networks for solving multi-class         [19] D. Lehmann, Another perspective on default reason-
     classification problems, IEEE Access 7 (2019) 5085–              ing, Ann. Math. Artif. Intell. 15 (1995) 61–82.
     5094.
 [3] A. Rajendra, P. Sajja, Knowledge-Based Systems, Jones
     & Bartlett Learning, 2009.
 [4] J. Pearl, System Z: A natural ordering of defaults with
     tractable applications to nonmonotonic reasoning, in:
     R. Parikh (Ed.), Proceedings of the 3rd Conference on
     Theoretical Aspects of Reasoning about Knowledge,
     Pacific Grove, CA, USA, March 1990, Morgan Kauf-
     mann, 1990, pp. 121–135.
 [5] B. d. Finetti, La logique de la probabilité, The Journal
     of Symbolic Logic 2 (1937) 31–39.
 [6] E. W. Adams, The Logic of Conditionals, Springer,
     1975.
 [7] W. Spohn, The Laws of Belief - Ranking Theory and
     Its Philosophical Applications, Oxford UP, 2014.
 [8] S. Kraus, D. Lehmann, M. Magidor, Nonmonotonic
     reasoning, preferential models and cumulative logics,
     Artif. Intell. 44 (1990) 167–207.
 [9] G. Kern-Isberner, A thorough axiomatization of a
     principle of conditional preservation in belief revision,
     Ann. Math. Artif. Intell. 40 (2004) 127–164.
[10] A. Ghorbani, J. Y. Zou, Neuron shapley: Discovering
     the responsible neurons, in: H. Larochelle, M. Ranzato,
     R. Hadsell, M. Balcan, H. Lin (Eds.), Advances in Neural
     Information Processing Systems 33: Annual Confer-
     ence on Neural Information Processing Systems 2020,
     NeurIPS 2020, December 6-12, 2020, virtual, 2020.
[11] W. S. McCulloch, W. H. Pitts, A logical calculus of the
     ideas immanent in nervous activity, The bulletin of
     mathematical biophysics 5 (1943) 115–133.
[12] G. Cybenko, Approximation by superpositions of a
     sigmoidal function, Math. Control. Signals Syst. 2
     (1989) 303–314.
[13] D. Nute, Topics in Conditional Logic, Springer, 2011.
[14] M. Goldszmidt, J. Pearl, On the Relation Between Ra-
     tional Closure and System Z, CSD (Series), UCLA Com-
     puter Science Department, 1991.
[15] L. Giordano, D. Theseider Dupré, Weighted defeasible
     knowledge bases and a multipreference semantics for
     a deep neural network model, in: Logics in Artificial
     Intelligence, Springer International Publishing, 2021,
     pp. 225–242.
[16] F. Baader, I. Horrocks, C. Lutz, U. Sattler, An Introduc-
     tion to Description Logic, Cambridge UP, 2017.
[17] N. Potyka, Interpreting neural networks as quanti-
     tative argumentation frameworks, in: Thirty-Fifth
     AAAI Conference on Artificial Intelligence, AAAI
     2021, Thirty-Third Conference on Innovative Applica-
     tions of Artificial Intelligence, IAAI 2021, The Eleventh
     Symposium on Educational Advances in Artificial In-