<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Stanford
University, Palo Alto, California, USA, March</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Neural-Symbolic Integration for Fairness in AI</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Benedikt Wagner</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Artur d'Avila Garcez</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>City, University of London</institution>
          ,
          <addr-line>Northampton Square, London, EC1V0HB</addr-line>
          ,
          <country country="UK">United Kingdom</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2021</year>
      </pub-date>
      <volume>2</volume>
      <fpage>2</fpage>
      <lpage>24</lpage>
      <abstract>
        <p>Deep learning has achieved state-of-the-art results in various application domains ranging from image recognition to language translation and game playing. However, it is now generally accepted that deep learning alone has not been able to satisfy the requirement of fairness and, ultimately, trust in Artiifcial Intelligence (AI). In this paper, we propose an interactive neural-symbolic approach for fairness in AI based on the Logic Tensor Network (LTN) framework. We show that the extraction of symbolic knowledge from LTN-based deep networks combined with fairness constraints ofer a general method for instilling fairness into deep networks via continual learning. Explainable AI approaches which otherwise could identify but not fix fairness issues are shown to be enriched with an ability to improve fairness results. Experimental results on three real-world data sets used to predict income, credit risk and recidivism in financial applications show that our approach can satisfy fairness metrics while maintaining state-of-the-art classification performance.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Neurosymbolic AI</kwd>
        <kwd>Deep Learning with Knowledge Representation</kwd>
        <kwd>Fairness</kwd>
        <kwd>Explainability</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        of XAI approaches with varied results, measured in diferent ways, towards achieving a better
understanding of the behaviour of black box AI and ML systems, often aimed at
discovering undesired model properties such as unfair treatment based on protected attributes. While
some approaches attempt to extract global decision-making information from a complex model
(known as the teacher model) by learning a simpler (student) model - starting with the
seminal TREPAN approach [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] -, others have sought to provide explanations, which have become
known as local explanations, by describing specific representative cases (e.g. [
        <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
        ]). Despite
the increasing adoption of such XAI methods known as distillation [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], the connection with
symbolic AI and knowledge representation has been largely ignored recently. In this paper, by
taking a hybrid (neurosymbolic) approach, we seek to maintain a correspondence between the
deep network and its counterpart symbolic description. Through the use of a neural-symbolic
approach known as Logic Tensor Networks (LTN) [
        <xref ref-type="bibr" rid="ref6 ref7 ref8">6, 7, 8</xref>
        ], we can achieve interactive
explainability by querying the deep network for symbolic knowledge in the form of first-order logic
rules. At the same time, we can ofer an approach for achieving fairness of an end-to-end
learning system by intervening at the symbolic counterpart of the system with the addition
of fairness constraints. With the use of the neural-symbolic cycle (Fig.2), we seek to bridge
low-level information processing such as perception and pattern recognition with reasoning
and explanation at a higher-level of abstract knowledge [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. The framework introduced in this
paper seeks to improve fairness via continual learning with symbolical LTN constraints.
      </p>
      <p>
        The contributions of this paper are:
(1) We introduce a method that allows one to act on information extracted by any XAI
approach in order to prevent the learning model from learning unwanted behaviour or bias
discovered by the XAI approach. We demonstrate how our method can leverage an existing XAI
method, known as SHAP [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], to discover and address undesired model behaviour.
      </p>
      <p>(2) We implement and outline the use of LTN for continual learning and iterative querying
by caching the learned representations and by using network querying in first-order logic to
check for knowledge learned by the deep neural network.</p>
      <p>
        (3) We apply the proposed method and tool to the field of quantitative fairness in finance.
Experimental results reported in this paper show comparable or improved accuracy across
three data sets while achieving fairness based on two fairness metrics, including a 7.1% average
increase in accuracy in comparison with a state-of-the-art neural network-based approach [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
      </p>
      <p>In Section 2, we position the paper in the context of the related work. In Section 3, we
introduce the interactive continual-learning LTN method. In Section 4, we present and discuss
the experimental results achieving fairness. In Section 5, we conclude the paper and discuss
directions for future work.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        Explainability: The goals of achieving comprehensible Machine Learning systems are diverse
[
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. There have been many recent proposals to tackle the problem diferently, focusing either
on the system before training or during training to obtain inherently interpretable models or
post-training analyses and knowledge extraction. We shall focus on the latter two as they
have gained the most attention recently and connect to our approach more closely. The most
common way to diferentiate XAI methods currently in practice utilises two categories: global
and local explanations [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. Most of the approaches that seek to incorporate an inherent level
of interpretability are global1. Whereas inherently interpretable models come with stringent
architectural constraints on the model itself, our approach is model-agnostic since LTN as a
framework simply requires the ability to query any deep network (or any ML model) for its
behaviour, that is, observing the value of an output given a predefined input, thereupon used
as part of a constraint-based regularisation [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. The predictive model itself can be chosen
independently, with the LTN acting as an interface.
      </p>
      <p>
        Post-training methods seek to achieve explanation by approximating the behaviour of complex
black-box systems. One of the most prominent methods at present, based on this idea, is called
LIME, which stands for Local Interpretable Model-Agnostic Explanation [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. As the name
suggests, these explanations describe specific instances by approximating local variations in the
neighbourhood of a prediction. Although LIME can give very intuitive insights into
predictions, it remains unclear how widely applicable local explanations are, how problematic the
assumption of linearity is, and what may constitute a valid definition of soundness and
measure of closeness. The reader is referred to [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] for a critique of LIME.
      </p>
      <p>
        A method that has gained traction recently in finance is the Shapley value approach. Initially
proposed by [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], it has recently been adapted to ML models by [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. The goal is to capture the
average marginal contribution of a feature value across diferent possible combinations. A
single Shapley value for a feature of this specific input denotes the contribution of such a feature
to the prediction w.r.t. the average prediction for the data set [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. The authors propose
determining such a value by calculating the average change in the prediction by randomly adding
features to the model. The Shapley value works for both classification and regression tasks.
[
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] contributed to a significant boost in the popularity of this method by unifying various
1In [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], the XAI methods that are inherently interpretable are further diferentiated into rule-based,
prototypebased, and others.
feature attribution approaches into one framework and publishing a user-friendly
implementation. We refer the reader to [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] for an extensive survey of more methods.
      </p>
      <p>While the above XAI methods make a noticeable contribution to the obstacle of
explainability, none of them address the problem of how one should act upon the extracted information.
Consequently, we do not see the LTN approach as a method to be compared directly with
the above XAI approaches but to complement them. By applying the neural-symbolic cycle
multiple times, partial symbolic descriptions of the knowledge encoded in the deep network
will be checked and, through a human-in-the-loop approach, incorporated into the cycle as a
constraint on the learning process. This will enable an interactive integration of a desired
behaviour, notably fairness constraints, by checking and incorporating knowledge at each cycle,
instead of (global or local) XAI serving only to produce a one-of description of a static system.
Fairness: One of the main goals of the recent advancements in explainability encompasses
considerations of the fairness of automated classification systems. Although the discovery of
such unwanted behaviour is essential and useful, in this paper, we can address specific
undesired properties, discovered or specified symbolically, and alter the learned model towards a
fairer description.</p>
      <p>
        There have been a few methods addressing fair representation or classification: [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] seek to
achieve fair representation. [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] seek to achieve fair classification by proposing a reductionist
approach that translates the problem onto a sequence of cost-sensitive classification tasks [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ].
[
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] study fairness in naive Bayes classifiers and propose an interactive method for
discovering and eliminating discrimination patterns. [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] integrate fairness into neural networks by
including complex and non-decomposable loss functions into the optimisation. Fairness
remains a significant challenge for Machine Learning. For an overview of the variety of fairness
notions, we refer the reader to [
        <xref ref-type="bibr" rid="ref18 ref21">21, 18</xref>
        ]. For an extensive overview of various fairness-oriented
ML methods, we refer the reader to [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] and [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ].
      </p>
      <p>The above methods are related because they introduce constraints either on the data or
the model during learning. The LTN-based approach used here introduces constraints as a
regularisation which therefore may apply to any model or data set. Also, in LTN, additional
fairness axioms can be specified during training time by the user, which may be unrelated
to the existing fairness axioms. Finally, at test time, the protected variables defined by such
axioms are not used, so that a final customer of the ML system will not be asked for sensitive
information on gender, race, etc.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Method</title>
      <p>
        The framework used in this paper is that of Logic Tensor Networks [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] and [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. However,
instead of treating the learning of the parameters from data and knowledge as a single process,
we emphasise the dynamic and flexible nature of the process of training from data, querying
the trained model for knowledge, and adding knowledge in the form of constraints for further
training, as part of a cycle whose stopping criteria are governed by a fairness metric.
Furthermore, we focus on the core of the LTN approach: constraint-based learning from data and
ifrst-order logic knowledge, and we make it iterative by saving the learned parametrisation
at each cycle in our implementation, while removing unnecessary constraints such as the use
with LTN of Neural Tensor Networks. In our experiments, we use standard feedforward neural
networks.
      </p>
      <sec id="sec-3-1">
        <title>3.1. Language</title>
        <p>(∀, ∃).</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Grounding</title>
        <p>
          Logic Tensor Networks [
          <xref ref-type="bibr" rid="ref24 ref6 ref8">6, 24, 8</xref>
          ] implement a many-valued first-order logic (FOL) language ,
which consists of a set of constants , variables  , function symbols  and predicate symbols
. Logical formulas in  allow to specify background knowledge related to the task at hand.
The syntax in LTN is that of FOL, with formulas consisting of predicate symbols and the
connectives for negations (¬), conjunction, disjunction and implication (∧, ∨, →) and quantifiers
As for the semantics of , LTN deviates from the standard abstract semantics of FOL and
proposes a concrete semantics where domains are interpreted in the Real field
maps 
Real Logic [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]. To emphasize that symbols are interpreted according to their grounding onto
real numbers, LTN uses the term grounding, denoted by , in place of interpretation. Every
object denoted by a constant, variable or term is grounded onto a tensor of real numbers.
Function symbols are grounded as functions in the vector space, that is, an  -ary function
vectors of real numbers to one vector of real numbers. Predicates are grounded as
functions that map onto the interval [
          <xref ref-type="bibr" rid="ref1">0, 1</xref>
          ] representing the predicate’s degree of truth.
        </p>
        <p>
          The semantics for the connectives is defined according to fuzzy logic semantics:
conjuncℝ as defined in
tions are approximated by t-norms (e.g. 
(, 
)
), disjunctions by t-conorms (e.g. 
negation by fuzzy negation (e.g. 1− ) and implication by fuzzy implications (e.g. 
The semantics of the quantifiers is defined by aggregation functions. For instance, in the
sen(1−,  )
(,  )
),
).
tence ∃ ( ( ) ∧  ( )), ∃ can be implemented using max and ∧. Krieken et al. [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ] analysed
various fuzzy operators and recommended those suitable for diferentiable learning. In this
paper, we approximate binary connectives using the product t-norm and the corresponding
t-conorm and S implication. The universal quantifier is defined as the generalised mean, also
referred to as p-mean2 by Krieken et al. [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ].
        </p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Learning</title>
        <p>The objects denoted by LTN constants and variables can be learned from data. LTN functions
and predicates are also learnable. Thus, the grounding of symbols depend on a set of parameters
 . With a choice of a multilayer perceptron to model each logical predicate, the parametrization
2p-mean( 1, … ,   ) = (   =1
1 ∑   )

 
1
.
used in this paper is:
 (1)( ) =  (1) (   (1) +  (1)</p>
        <p>)
 (2)( ) =  (2) ( (1)( )  (2) +  (2)</p>
        <p>
          )
( )( ) =  ( (2)( )  (3) +  (3)
)
where each  ( ) is an activation function, e.g. ReLU,  ( ) is a  weight matrix, and  ( ) a bias
vector;  denotes the sigmoid activation function which ensures that predicate  is mapped
from ℝ to a truth-value in [
          <xref ref-type="bibr" rid="ref1">0, 1</xref>
          ].
        </p>
        <p>Since the grounding of a formula  ( ) denotes the degree of truth of  , one natural training
signal is the degree of truth of the formulas in the knowledge-base . The objective function
is therefore to maximize the satisfiability of all formulas in :
 ∗ = arg max Sat ( ())</p>
        <p>∈
which is subject to an aggregation  of all formulas, e.g. the above-mentioned p-mean.</p>
        <p>
          Notice that in the above formulation we have substituted the Neural Tensor Network used
in [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] with a multilayer perceptron for simplicity. Notice also that the approach described
in Figure 2 is model-agnostic. The main idea from LTN used here is that of learning with
knowledge-base constraints and querying with many-valued first-order logic.
        </p>
      </sec>
      <sec id="sec-3-4">
        <title>3.4. Continuous Querying</title>
        <p>LTN inference using first-order logic clauses is not a post-hoc explanation in the traditional
sense. In this paper, we argue that inference should form an integral part of an iterative process
allowing for incremental explanation through distillation of knowledge which is guided by
data. We achieve this by computing the value of a grounding  (  ), given a trained network
(set of parameters  ), for a user-defined query   .</p>
        <p>Specifically, we save and reinstate the learned parameters stored in the LTN
implementation. This is done by storing the parameters  resulting from  ∗ = arg max ∈ Sat  (). This
also means that changes made to the knowledge-base followed by further training will not
reinitialise parameters, but will instead start from saved  ∗. Having this functionality allows
us to continually query and guide the learning process according to added knowledge  ,
an approach akin to that of continual learning.</p>
        <p>
          A query is any logical formula expressed in first-order-logic. Queries are evaluated by
calculating the grounding  of any formula whose predicates are already grounded in the multilayer
perceptron, or even by defining a predicate in terms of existing predicates. For example, the
logical formula ∀ ∶ ( ( ) →  ( )) can be evaluated by applying the values of  , obtained
from the data set, to the trained perceptron, obtaining the values of output neurons  and 
in [
          <xref ref-type="bibr" rid="ref1">0,1</xref>
          ] (corresponding to the truth-values of predicates  and  , respectively), and
calculating the implication with the use of the Reichenbach-norm and aggregating for all  using the
p-mean. For an extensive analysis, we refer the reader to [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ].
        </p>
        <p>Algorithm 1 illustrates the steps we take to continuously refine  with a
human-in-theloop. The queries derive from questions a user might have about the model’s response: how
does the model behave for a specific group? How does the model behave for particular
edgecases? These questions can be translated relatively easily into FOL-queries. Simultaneously,
an XAI method further informs the user about possible undesired model behaviour which may
not be as apparent as the above common questions. This can be accomplished by a variety of
XAI methods which may give insight into the functionality of a black box model. In Figure 2,
XAI method SHAP reports a discrepancy in how the variable reported income is used by the ML
system for men and women. This can be changed by adding knowledge to  and retraining,
as will be illustrated in the experiments.</p>
      </sec>
      <sec id="sec-3-5">
        <title>3.5. Fairness</title>
        <p>
          Quantitative fairness metrics seek to introduce mathematical precision to the definition of
fairness in ML. Nevertheless, fairness is rooted in ethical principles and context-dependent human
value judgements. This functional dependence on value judgements is perhaps manifested in
the existence of mutually incompatible definitions of fairness [
          <xref ref-type="bibr" rid="ref11 ref26">26, 11</xref>
          ]. Rather than comparing
diferent notions of fairness, this paper focuses on achieving fairness as a desired outcome of
explainability and therefore it evaluates both the classical demographic parity metric and the
legal notion of disparate impact within the proposed framework of Algorithm 1.
The majority of fairness approaches in ML can be considered to target group fairness, meaning
parity among groups on aggregate.3 We adopt the following definitions of group fairness to
measure and compare our approach with other methods. Following [
          <xref ref-type="bibr" rid="ref12 ref19">19, 12</xref>
          ], we consider a
binary classification setting where the training examples consist of triples ( , ,  ) where  ∈ 
3By contrast, [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ] advocate a more fine-grained individual fairness where similar individuals should be treated
similarly. Although our focus in this paper is on group fairness for the sake of achieving comparative results, we
believe that it should be possible to apply the approach proposed here to individual fairness.
is a feature vector,  ∈  is a protected attribute, and  ∈ {0, 1} is a label.
        </p>
        <p>Definition 3.1. Demographic Parity (DP): A classifier ℎ satisfies demographic parity under a
distribution on ( , ,  ) if its predictions ℎ( ) are independent of the protected attribute  .
That is, ∀ ∈  :</p>
        <p>[ℎ( ) =  ∣  =  ] =  [ℎ( ) =  ]
Since  ∈ {0, 1}, we can say that:</p>
        <p>∀ ∶  [ℎ( ) ∣  =  ] =  [ℎ( )]
The metric itself is typically reported as the diference between the above expected values
which should converge to zero for a fair classifier.</p>
        <p>Definition 3.2. Disparate Impact (DI): Given ( , , 
disparate impact if:
) as specified above, a classifier
ℎ has
 (ℎ( ) &gt; 0 ∣  = 0)</p>
        <p>
          ≤ 
 (ℎ( ) &gt; 0 ∣  = 1)
Adopting the "80%-rule" from industry [
          <xref ref-type="bibr" rid="ref27">27</xref>
          ] would set the arbitrarily threshold for acceptable
adverse impact to at least 80% outcome alignment. This metric compares the proportion of
individuals that receive a positive output from an unprivileged and a privileged group and
converges towards value 1.0 for full removal of DI between groups.
        </p>
        <p>Algorithm 1: LTN-active learning cycle</p>
        <p>Input: Data-set, Knowledge (in the form of FOL)
Output: Model satisfiability measured as overall sat-level
for each predicate  in  do</p>
        <p>Initialize  (P)
for epoch &lt; num-epochs do
max sat  ( )</p>
        <p>
while Revision do
for each FOL-query   do</p>
        <p>Calculate  (  )
if  (  )&lt; 
then</p>
        <p>Add   to 
Apply XAI-method
for each predicate  do</p>
        <p>Inquire  (P)
if  has undesired property  ( ) then</p>
        <p>Revise  () to 
if  ≠ ∅ then
 ←  ∪ 
 ∗ = arg max ∈  ()
// each  can be a multilayer perceptron or output neuron
// optimize  to achieve max satisfiability of </p>
        <p>
          // user-defined Boolean
// query the network to obtain the truth-value of  
//  in [
          <xref ref-type="bibr" rid="ref1">0,1</xref>
          ] is a user-defined minimum sat value
        </p>
        <p>// we use Shapley values
// query predicate-specific groundings
// user-determined desiderata
// method-dependent revision
// re-train the network</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Experimental Results</title>
      <p>
        The following experiments illustrate how LTN can be used to obtain insight into a neural
network model and interactively address an undesired behaviour by adding new knowledge to the
background knowledge as illustrated in Figure 2. Background knowledge is used to provide a
meaningful semantics to the explanations, facilitating human-machine interaction, while being
injected into the neural network to achieve a desired property [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ]. Experiment 1 is presented
in a simulated environment to observe if one can achieve a desired behaviour in an idealised
world. Experiment 2 provides a practical translation of the idea onto real data and reports
comparisons with the state-of-the-art constraint-based learning methods for fairness. 4
      </p>
      <p>
        Experiment 1. Fairness using objective features: This experiment draws a parallel with
a current state-of-the-art method in the area of explainability. We demonstrate how traditional
XAI methods are able to benefit from a neural-symbolic approaches. Most importantly, we
demonstrate how the LTN method can remove any undesired disparities in a model-agnostic
approach when having access to objective features as proposed in [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ].
      </p>
      <p>
        We use the same example as the authors of the popular SHAP library connecting XAI and
fairness [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. They aim to dissect the model’s input features to understand the disparities based on
quantitative fairness metrics in the context of a credit underwriting scenario. A data generation
process allows one to ensure that the labels are statistically independent of the protected class
and any remaining disparities result from measurement, labelling or model errors. We
generate four hypothetical causal factors scaled between [
        <xref ref-type="bibr" rid="ref1">0-1</xref>
        ] (income stability, income amount,
spending restraint and consistency), which influence the observable features (job history,
reported income, credit inquiries and late payments). The customer quality for securing credit is
the product of all the factors that consequently determines the label as high-customer quality
by being strong simultaneously in all factors. The observable features are subject to a bias
introduced to obtain disparities in the system. This bias influences the mapping of the
underlying factors to the observable features and therefore simulate an under-reporting of errors for
women (the implementation contains further detailed explanation).
      </p>
      <p>
        We compare the demographic disparity between the gender groups by calculating their Shapley
values. We use such values as a popular way of gaining insight into model behaviour, although
other explainability methods could have been used, and show that one can intervene in the
model by adding knowledge for further training of the LTN to reduce disparities.
Since the SHAP method uses the same units as the original model output, we can decompose
the model output using SHAP and calculate each feature’s parity diference using their
respective Shapley value. Then, by adding clauses to LTN which seek to enforce equality as a soft
constraint, the neural network will be trained to reduce the diference (axioms 3-7 below).
Reapplying SHAP would then give a measure of the success of the approach on parity diference.
Axioms 3-7 are created based on the idea of treating similar people similarly from [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. It is
argued that finding an objective similarity or distance metric in practice can be challenging but
should be possible5.
      </p>
      <p>First, we split the data into two subsets for the protected ( ) and unprotected ( ) group,
re4The code for the experiments can be found here: http://github.com/benediktwagner/LTN_fairness.
5The authors further advocate making such metric public to allow for transparency. They propose the use of a
normative approach to fairness as the absolute guarantee of fairness.</p>
      <p>Experiment 1 is summarised below.</p>
      <p>Predicate:  for the positive class (i.e. credit default)
spectively, and create five subsets within each group, denoted
quantile-based discretisation of customer quality. 6. The five axioms (3 to 7) then state,
according to the discretisation, that if a member ( ) of set 
 defaults on credit, i.e. ℎ( ) = 1, then a
member ( ) of set </p>
      <p>should also default, ℎ( ) = 1, and vice-versa. Given the diferent groups,
one may wish to specify that equality in prediction is required, e.g. for the bottom 20% of the
protected group w.r.t. the unprotected group according to a fairness measure. In our approach,
the use of the generalised p-mean lends itself very well to this task by allowing for diferent
forms of aggregation for each equality sub-group (referred to as customer quality 1 to 5 below).
As a result, the user can specify in the system how strictly each fairness axiom is expected to
be satisfied (using the p-mean parameter  and the satisfiability threshold  , c.f. Algorithm 1).

 and 
 ,
1 ≤  ≤ 5, using
Training data:  , a set of individuals who credit default;  , a set of individuals who do not

credit default;  1, ...,  5 ⊂ { ∪  }, a set of female individuals with customer quality
1 to 5;  1, ...,  5 ⊂ { ∪ }, a set of male individuals with the same customer quality.
Axioms:
∀ ∈  ∶
∀ ∈  ∶
∀ ∈  1,  ∈  1 ∶
∀ ∈  2,  ∈  2 ∶
∀ ∈  3,  ∈  3 ∶
∀ ∈  4,  ∈  4 ∶
∀ ∈  5,  ∈  5 ∶
 ( )
¬ ( )
 ( ) ↔  ( )
 ( ) ↔  ( )
 ( ) ↔  ( )
 ( ) ↔  ( )
 ( ) ↔  ( )
(1)
(2)
(3)
(4)
(5)
(6)
(7)
Model: ℎ( ) (denoting  ( )) is the multilayer perceptron described in Section 3.3.
We initially train the multilayer perceptron on the data alone (without axioms 3 to 7) and
observe the disparities shown in the SHAP XAI chart shown in 3 (left). The network learned
undesired disparities among gender groups as a result of under-reporting errors in the data
(equivalent to using only axioms 1 and 2 with an LTN trained for 1000 epochs).
We subsequently add axioms 3 to 7 to the knowledge-base and re-train the LTN with these
axioms. As shown in the SHAP XAI chart of Figure 3 (right), this decreases disparity
considerably, having reduced the Disparity Impact from 0.64 to less than 0.001. This illustrates
the ability of LTN to account for fairness after having observed disparities in common XAI
methods by adding appropriate fairness axioms for further training. Despite its usefulness as
proof-of-concept, we acknowledge that the ideal notion of similarity among sub-groups can be
impracticable. Next, we continue our investigation with additional real-world data and derive
a notion of similarity automatically.</p>
      <p>6This choice should be attribute-independent and application specific, e.g. based on reported income from very
low, low and medium, to high and very high, diferent groups may require diferent interventions, i.e. diferent
axioms according to policy and the situation in the real-world</p>
      <p>
        Experiment 2. Fairness and Direct Comparisons: We compare results on three
publiclyavailable data sets used in the evaluation of fairness, obtained from the UCI machine learning
repository: the Adult data set for predicting income, German for credit risk, and COMPASS for
recidivism. We follow the experimental setup used in [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], although they perform extensive
hyper-parameter tuning whilst our models are simpler. We compare our LTN-based approach
with another neural network-based approach that integrates fairness constraints into the loss
function using Lagrange multipliers [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], and with an approach for naive-Bayes classifiers [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ].
Gender is the protected variable in the Adult and German data sets, and race in the COMPASS
data set. We train a neural network with two hidden layers of 100 and 50 neurons, respectively
([
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] trains networks of up to 500 neurons per layer). We use the Adam optimiser with a
learning rate of 0.001 trained for a maximum of 5000 epochs. As in [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], we report results
averaged over 5-fold cross-validation.
      </p>
      <p>As before, we first train the network without fairness constraints, while before we relied on an
objective notion of similarity that made it possible to split the individuals into sub-groups, now
we use a continual learning approach (when such objective notion is not present). The trained
network is queried to return the truth-value of the predicate used for the classification task
( ( )) for the entire training set  . The output helps determine, as a proxy for similarity,
the fairness constraints. As done in the previous experiment, a quantile-based discretisation
is carried out, but this time according to the result of querying the network, after splitting the
data into two subsets for each class according to the protected variable. Therefore, we obtain
equally-sized groups for each protected and unprotected variable. Again, five sub-groups are
used and the axioms from Experiment 1 apply.</p>
      <p>
        Querying axioms 3 to 7 reveals a low sat level at first as an indication of an unfair model
(   &lt; 0.5). This is confirmed by measuring the fairness metrics with DI ≤ 0.4 and DP ≥ 0.03
across all data sets 7. The results are shown in Figure 4 which also includes the results of the
approach to account for fairness in naive Bayes classifiers [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. The comparison with [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] is not
as straightforward as the comparison with FNNC [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] because [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] use 10-fold cross-validation
and do not measure DI or DP. Nevertheless, we include the results of our approach using
10fold cross-validation also in Figure 4 for the data sets. Since both LTN and FNNC are based on
neural networks, we can make a more direct comparison with FNNC [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
      </p>
      <p>7this is also revealed in SHAP value disparity. We measure the fairness here using pre-defined metrics as
fairness is a known issue in these benchmarking datasets and therefore does not require an XAI method for detection</p>
      <p>
        As illustrated in Figure 4, we are able to outperform other state-of-the-art methods and achieve
a lower variability across all data sets, and pass the DI and DP fairness thresholds proposed by
[
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. All experiments were carried out using the same hyper-parameters as reported above and
aggregation parameter  = 5. Finally, we would like to emphasize the flexibility of our approach
w.r.t. diferent notions of fairness and its potential use with alternative fairness constraint
constructions. The approach is not applicable exclusively to the metrics used here. With the
increasing number and complexity of equality groups with larger p-values for aggregation, and
the currently-evolving many notions of fairness being developed, we argue that rich languages
such as FOL will be needed to capture more fine-grained notions, possibly converging towards
individual fairness (with the generalised mean converging towards the  value).
      </p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion &amp; Future Work</title>
      <p>Combining XAI methods with neural-symbolic approaches allows us to not only learn about
the undesired behaviour of a model but also intervene to address discrepancies which is
ultimately the goal of Explainability. We have proposed an interactive model-agnostic method
and algorithm for fairness and have shown how one can remove demographic disparities from
trained neural networks by using a continual learning LTN-based framework. While
experiment 1 demonstrated the efectiveness on addressing undesired gender-based disparities on
simulated data, we have investigated such efectiveness on real-world data in experiment 2 and
compared to other methods.</p>
      <p>In the future, we plan to investigate the suitability of diferent XAI methods. In particular,
knowledge extraction methods that extract rules directly by querying from general to specific
knowledge, following the FOL language of LTN, could prove to be highly useful in practice.
Furthermore, we aim to adapt this approach to varying notions of fairness as the proposed
method itself is adaptable and there are lively discussions about the appropriateness of
varying definitions.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>We are thankful to Samy Badreddine, Michael Spranger and Luciano Serafini for their
comments and useful discussions.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>European</given-names>
            <surname>Union</surname>
          </string-name>
          ,
          <year>Regulation 2016</year>
          /
          <article-title>679 of the European parliament and the Council of the European Union</article-title>
          ,
          <source>Oficial Journal of the European Communities</source>
          (
          <year>2016</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M. W.</given-names>
            <surname>Craven</surname>
          </string-name>
          ,
          <source>Extracting Comprehensible Models From Trained Neural Networks, Ph.D. thesis</source>
          , University of Wisconsin, Madison,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>B.</given-names>
            <surname>Kim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Wattenberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Gilmer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Cai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wexler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Viegas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Sayres</surname>
          </string-name>
          ,
          <article-title>Interpretability beyond feature attribution: Quantitative Testing with Concept Activation Vectors (TCAV)</article-title>
          ,
          <source>in: 35th International Conference on Machine Learning</source>
          ,
          <string-name>
            <surname>ICML</surname>
          </string-name>
          <year>2018</year>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M. T.</given-names>
            <surname>Ribeiro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Guestrin</surname>
          </string-name>
          ,
          <article-title>"Why Should I Trust You?": Explaining the Predictions of Any Classifier</article-title>
          , arXiv:
          <fpage>1602</fpage>
          .04938 (
          <year>2016</year>
          ). URL: http://arxiv.org/abs/1602.04938.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>N.</given-names>
            <surname>Frosst</surname>
          </string-name>
          , G. Hinton,
          <article-title>Distilling a Neural Network Into a Soft Decision Tree (</article-title>
          <year>2017</year>
          ). URL: http://arxiv.org/abs/1711.09784.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>L.</given-names>
            <surname>Serafini</surname>
          </string-name>
          , A. d. Garcez,
          <article-title>Logic tensor networks: Deep learning and logical reasoning from data and knowledge</article-title>
          ,
          <source>arXiv preprint arXiv:1606.04422</source>
          (
          <year>2016</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>M.</given-names>
            <surname>Diligenti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Gori</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Saccà</surname>
          </string-name>
          ,
          <article-title>Semantic-based regularization for learning and inference</article-title>
          ,
          <source>Artificial Intelligence</source>
          (
          <year>2017</year>
          ). doi:
          <volume>10</volume>
          .1016/j.artint.
          <year>2015</year>
          .
          <volume>08</volume>
          .011.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>S.</given-names>
            <surname>Badreddine</surname>
          </string-name>
          , A. d. Garcez,
          <string-name>
            <given-names>L.</given-names>
            <surname>Serafini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Spranger</surname>
          </string-name>
          , Logic Tensor Networks (
          <year>2020</year>
          ). URL: http://arxiv.org/abs/
          <year>2012</year>
          .13635.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>A. D.</given-names>
            <surname>Garcez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. R.</given-names>
            <surname>Besold</surname>
          </string-name>
          , L. De Raedt,
          <string-name>
            <given-names>P.</given-names>
            <surname>Foldiak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Hitzler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Icard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. U.</given-names>
            <surname>Kiihnberger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. C.</given-names>
            <surname>Lamb</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Miikkulainen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. L.</given-names>
            <surname>Silver</surname>
          </string-name>
          ,
          <article-title>Neural-symbolic learning and reasoning: Contributions and challenges</article-title>
          ,
          <source>in: AAAI Spring Symposium - Technical Report</source>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>A. S. D'Avila Garcez</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Broda</surname>
            ,
            <given-names>D. M.</given-names>
          </string-name>
          <string-name>
            <surname>Gabbay</surname>
          </string-name>
          ,
          <article-title>Symbolic knowledge extraction from trained neural networks: A sound approach</article-title>
          ,
          <source>Artificial Intelligence</source>
          (
          <year>2001</year>
          ). doi:
          <volume>10</volume>
          .1016/ S0004-
          <volume>3702</volume>
          (
          <issue>00</issue>
          )
          <fpage>00077</fpage>
          -
          <lpage>1</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>S. M.</given-names>
            <surname>Lundberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. I.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <article-title>A unified approach to interpreting model predictions</article-title>
          ,
          <source>in: Advances in Neural Information Processing Systems</source>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>M.</given-names>
            <surname>Padala</surname>
          </string-name>
          , S. Gujar, FNNC:
          <article-title>Achieving Fairness through Neural Networks</article-title>
          , in: C.
          <string-name>
            <surname>Bessiere</surname>
          </string-name>
          (Ed.),
          <source>Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence</source>
          , {IJCAI-
          <volume>20</volume>
          },
          <source>International Joint Conferences on Artificial Intelligence Organization</source>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>A.</given-names>
            <surname>Adadi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Berrada</surname>
          </string-name>
          ,
          <article-title>Peeking inside the black-box: A survey on Explainable Artificial Intelligence (XAI)</article-title>
          ,
          <source>IEEE Access 6</source>
          (
          <year>2018</year>
          )
          <fpage>52138</fpage>
          -
          <lpage>52160</lpage>
          . doi:
          <volume>10</volume>
          .1109/ACCESS.
          <year>2018</year>
          .
          <volume>2870052</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>R.</given-names>
            <surname>Guidotti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Monreale</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Turini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Pedreschi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Giannotti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A Survey</given-names>
            <surname>Of Methods For Explaining Black Box Models</surname>
          </string-name>
          , ACM Computing Surveys (
          <year>2018</year>
          )
          <fpage>1</fpage>
          -
          <lpage>45</lpage>
          . URL: https: //doi.org/10.1145/3236009.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>A.</given-names>
            <surname>White</surname>
          </string-name>
          , A. d. Garcez,
          <article-title>Measurable Counterfactual Local Explanations for Any Classifier</article-title>
          ,
          <source>in: 24th European Conference on Artificial Intelligence</source>
          ,
          <year>2020</year>
          . URL: http://arxiv.org/abs/
          <year>1908</year>
          .03020.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>L. S.</given-names>
            <surname>Shapley</surname>
          </string-name>
          ,
          <article-title>Contributions to the Theory of Games, in: Annals of Mathematical Studies v</article-title>
          .
          <volume>28</volume>
          ,
          <year>1953</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>E.</given-names>
            <surname>Štrumbelj</surname>
          </string-name>
          , I. Kononenko,
          <article-title>Explaining prediction models and individual predictions with feature contributions, Knowledge and Information Systems (</article-title>
          <year>2014</year>
          ). doi:
          <volume>10</volume>
          .1007/ s10115-013-0679-x.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>C.</given-names>
            <surname>Dwork</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hardt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Pitassi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Reingold</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Zemel</surname>
          </string-name>
          ,
          <article-title>Fairness through awareness</article-title>
          , in: ITCS 2012 - Innovations in Theoretical Computer Science Conference,
          <year>2012</year>
          . doi:
          <volume>10</volume>
          . 1145/2090236.2090255.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>A.</given-names>
            <surname>Agarwal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Beygelzimer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dudfk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Langford</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Hanna</surname>
          </string-name>
          ,
          <article-title>A reductions approach to fair classification</article-title>
          ,
          <source>in: 35th International Conference on Machine Learning</source>
          ,
          <string-name>
            <surname>ICML</surname>
          </string-name>
          <year>2018</year>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Choi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Farnadi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Babaki</surname>
          </string-name>
          , G. V. d. Broeck,
          <article-title>Learning Fair Naive Bayes Classifiers by Discovering and Eliminating Discrimination Patterns</article-title>
          ,
          <source>in: Proceedings of the AAAI Conference on Artificial Intelligence</source>
          ,
          <year>2019</year>
          . URL: http://arxiv.org/abs/.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>R.</given-names>
            <surname>Zemel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Swersky</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Pitassi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Dwork</surname>
          </string-name>
          ,
          <article-title>Learning fair representations</article-title>
          ,
          <source>in: 30th International Conference on Machine Learning</source>
          ,
          <string-name>
            <surname>ICML</surname>
          </string-name>
          <year>2013</year>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>S. A.</given-names>
            <surname>Friedler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Choudhary</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Scheidegger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. P.</given-names>
            <surname>Hamilton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Venkatasubramanian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Roth</surname>
          </string-name>
          ,
          <article-title>A comparative study of fairness-enhancing interventions in machine learning</article-title>
          ,
          <source>in: FAT* 2019 - Proceedings of the 2019 Conference on Fairness, Accountability, and Transparency</source>
          ,
          <year>2019</year>
          . doi:
          <volume>10</volume>
          .1145/3287560.3287589.
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>N.</given-names>
            <surname>Mehrabi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Morstatter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Saxena</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Lerman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Galstyan</surname>
          </string-name>
          ,
          <string-name>
            <surname>A</surname>
          </string-name>
          <article-title>Survey on Bias and Fairness in Machine Learning (</article-title>
          <year>2019</year>
          ). URL: http://arxiv.org/abs/.
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>I.</given-names>
            <surname>Donadello</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Serafini</surname>
          </string-name>
          ,
          <string-name>
            <surname>A. D'Avila Garcez</surname>
          </string-name>
          ,
          <article-title>Logic tensor networks for semantic image interpretation</article-title>
          ,
          <source>in: IJCAI International Joint Conference on Artificial Intelligence</source>
          ,
          <year>2017</year>
          . doi:
          <volume>10</volume>
          .24963/ijcai.
          <year>2017</year>
          /221.
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>E. v.</given-names>
            <surname>Krieken</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Acar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F. v.</given-names>
            <surname>Harmelen</surname>
          </string-name>
          ,
          <string-name>
            <surname>Analyzing Diferentiable Fuzzy Logic Operators</surname>
          </string-name>
          (
          <year>2020</year>
          ). URL: https://arxiv.org/abs/
          <year>2002</year>
          .06100v1.
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>J.</given-names>
            <surname>Kleinberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ludwig</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Mullainathan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Rambachan</surname>
          </string-name>
          , Algorithmic Fairness,
          <source>AEA Papers and Proceedings</source>
          (
          <year>2018</year>
          ). doi:
          <volume>10</volume>
          .1257/pandp.20181018.
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>M.</given-names>
            <surname>Feldman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. A.</given-names>
            <surname>Friedler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Moeller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Scheidegger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Venkatasubramanian</surname>
          </string-name>
          ,
          <article-title>Certifying and removing disparate impact</article-title>
          ,
          <source>in: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining</source>
          ,
          <year>2015</year>
          . doi:
          <volume>10</volume>
          .1145/2783258. 2783311.
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>A. S. D.</given-names>
            <surname>Garcez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. C.</given-names>
            <surname>Lamb</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. M.</given-names>
            <surname>Gabbay</surname>
          </string-name>
          ,
          <source>Neural-symbolic cognitive reasoning</source>
          , Springer Science &amp; Business
          <string-name>
            <surname>Media</surname>
          </string-name>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>