<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Fairness in Graph-theoretical Optimization Problems⋆</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Christopher Hojny</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Frits Spieksma</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sten Wessel</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Eindhoven University of Technology</institution>
          ,
          <addr-line>P.O. Box 513, 5600 MB Eindhoven</addr-line>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>There is arbitrariness in optimum solutions of graph-theoretic problems that can give rise to unfairness. Incorporating fairness in such problems, however, can be done in multiple ways. For instance, fairness can be defined on an individual level, for individual vertices or edges of a given graph, or on a group level. In this work, we analyze in detail two individual-fairness measures that are based on finding a probability distribution over the set of solutions. One measure guarantees uniform fairness, i.e., entities have equal chance of being part of the solution when sampling from this probability distribution. The other measure maximizes the minimum probability for every entity of being selected in a solution. In particular, we reveal that computing these individual-fairness measures is in fact equivalent to computing the fractional covering number and the fractional partitioning number of a hypergraph. In addition, we show that for a general class of problems that we classify as independence systems, these two measures coincide. We also analyze group fairness and how this can be combined with the individual-fairness measures.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;fairness</kwd>
        <kwd>column generation</kwd>
        <kwd>matching</kwd>
        <kwd>independent set</kwd>
        <kwd>set systems</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Traditionally, when confronted with an instance of an optimization problem, the instance is
considered solved when a provably optimum solution has been found. After all, what more can
be wished for? We should acknowledge that there can be a large amount of arbitrariness in
selecting an optimum solution, and that this can be perceived as a source of unfairness.</p>
      <p>Imagine one is a patient with end-stage renal disease, and that there is a donor who is willing
to donate a kidney but who is incompatible with you. Imagine further that you and your
incompatible donor enter a kidney exchange program; we will refer to the pair entering the
program as a node. Entering this program means that at regular moments, runs of software are
made, and the output of such a run consists of a set of exchanges between nodes. A patient
from a node involved in such an exchange receives a kidney, other patients do not. Clearly,
it is quite relevant whether one is included in such an exchange or not. While the software
may find an optimum solution (according to some criterion, say maximizing the number of
exchanges), it can be the case that some node is not selected for an exchange while being part
of some optimum solution. In such cases it can be hard to explain that the software simply
selects an optimum solution which somehow favors some nodes at the expense of others. In this
contribution, we aim to pursue this matter from a fairness point of view, using kidney exchange
as a running example.</p>
      <p>
        We analyze an approach that has been used by, among others, Farnadi et al. [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and Flanigan
et al. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Instead of proposing a fixed solution, the idea is to consider a pool of solutions from
which a solution is sampled according to some probability distribution, which we call procedural
or ex-ante fairness. In this way, diferent entities of the optimization problem (e.g., patients in
the kidney exchange example) are contained in a selected solution with a certain probability.
To model fairness, the probability distribution is required to satisfy some additional criteria.
      </p>
      <p>
        Although this sampling approach is not new, it has mostly been studied computationally. In
this article, we take a fresh look into this approach. While existing results in the literature mostly
considered the approach by Flanigan et al. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] for particular problems, we aim to understand
the approach on a structural level. For more details on our results, we refer to [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. A generic framework for modeling fairness in graphs</title>
      <p>
        We formulate a model for finding a probability distribution over the set of feasible solutions of
a graph-theoretical optimization problem. This model resembles fairness models for concrete
problems that exist in the literature [
        <xref ref-type="bibr" rid="ref2 ref3">2, 3</xref>
        ]. Let  be a given ground set, and let  be a family of
subsets of the ground set . We assume that ∅ ∈  and that every element  ∈  is contained
in some set  ∈  . We call the tuple (,  ) a set system. In the kidney exchange example,
the ground set  coincides with the nodes, whereas  coincides with the collection of subsets
of nodes that are covered by matchings in the compatibility graph.
      </p>
      <p>To achieve individual fairness, we construct a probability distribution {}∈ over  ,
such that if we sample according to , every element  ∈  has equal probability  to be in
the sampled subset. Our aim is to maximize this probability, which we call  (for uniform
probability). For any  ∈ , let  ⊆  denote the collection of subsets from  that contain
the . We model the problem of maximizing  with the following linear program (LP).
 =
maximize
subject to</p>
      <p>∑︁  = 
∈
∑︁  = 1,
∈
 ≥ 0
 ∈ R.</p>
      <p>∀ ∈ ,
∀ ∈  ,
(1a)
(1b)
(1c)
(1d)
(1e)
Constraints (1b) ensure that each element  ∈  is selected with uniform probability, while
Constraint (1c) and Constraints (1d) ensure that  is a probability distribution over  .</p>
      <p>
        We also consider a variant of this problem, where our notion of fairness is relaxed to
maximizing the minimum probability  that a ground set element is selected. This is also referred
to as Rawlsian justice, based on fairness principles introduced by Rawls [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], and can be modeled
with an LP that is very similar to (1), where Constraints (1b) are replaced with ∑︀∈  ≥ 
for all  ∈ . For some applications, Rawlsian justice may be more applicable than uniform
fairness as a fairness measure, or vice versa, depending on the objective of the decision maker.
      </p>
      <sec id="sec-2-1">
        <title>2.1. Relation to fractional hypergraph theory</title>
        <p>
          The fairness measures  and  are related to concepts developed in fractional hypergraph
theory. We refer to Scheinerman and Ullman [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ], and Füredi [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] for an overview of fractional
graph and hypergraph theory. The main idea is to see the set system (,  ) as a hypergraph ℋ
with vertex set  and hyperedges  . The fractional partitioning number =(ℋ) is the minimum
weight of a fractional partition: an assignment of non-negative weights  ∈ R+ to the
hyperedges such that for every vertex the sum of weights of the incident hyperedges is exactly 1.
When the weights sum to at least 1 for every vertex, we call the assignment of weights a
fractional cover and consequently the minimum weight of a fractional cover is the fractional
covering number ≥ (ℋ) of the hypergraph.
        </p>
        <p>One can show that determining  can then be alternatively formulated as determining the
fractional partitioning number and similarly that measure  can be formulated as determining
the fractional covering number, as summarized in the following result.</p>
        <p>Lemma 1. Let ℋ = (,  ) be a hypergraph. Then it holds that  =</p>
        <p>1
 = =(ℋ) if =(ℋ) is finite, and  = 0 otherwise.</p>
        <p>In fact, when we apply this to the kidney exchange example, one can show that this implies
the following, rather high, lower bound on  when it is positive.</p>
        <p>Corollary 1. For fair matching for vertices, if  &gt; 0, then  ≥ 23 .</p>
        <p>1 . Furthermore,
≥ (ℋ)</p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Computational complexity</title>
        <p>
          A fair probability distribution with respect to  and  can be found by solving the LP (1)
for  and , respectively. Linear programs are solved routinely in practice by the simplex
algorithm [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ], and algorithms such as the ellipsoid method [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] and interior point methods [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]
have a provable polynomial running time. Note, however, that this does not imply that the
LPs can be solved in (poly(||)) time, as the running time of the ellipsoid and interior point
methods depend polynomially on the number of variables | |, which can depend exponentially
on ||. We make use of a more advanced technique for solving LPs: column generation. The
idea of column generation is to select a small subset  ′ ⊆  of variables and solve the LP
restricted to this subset of variables. Afterward, it is checked whether the solution is optimal
w.r.t. the entire set of variables  . If this is the case, the algorithm stops and returns an optimal
solution. Otherwise, it adds a variable from  ∖  ′ to  ′ that can improve the objective and
the method starts anew. Using column generation, we can observe that  and  can be found
in polynomial time whenever the corresponding optimization problem over  can be solved in
polynomial time, based on the seminal result by Grötschel, Lovász, and Schrijver [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ].
Theorem 2. Let  be a finite set and let  be a collection of subsets of  such that ∅ ∈  . If,
for every  ∈ Q, a set  ∈  that minimizes ∑︀∈  can be found in time polynomial in ||,
then  and  can be found in (poly(||)) time.
        </p>
        <p>Examples where minimizing ∑︀∈  can be done in polynomial time are matching, shortest
paths, spanning trees, and many other combinatorial problems.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Independence systems</title>
      <p>The framework that determines  and  (Section 2) can be applied to arbitrary set
systems (,  ). The system (,  ) is an independence system when ∅ ∈  and for every  ∈  ,
we have ′ ∈  for all ′ ⊆ . Independence systems are also known as simplicial complexes
or downward-closed set systems. Many independence systems are known; we mention here: edge
sets that form matchings, edge sets that form forests, vertex sets that are independent sets, and
matroids. While in general there is no relation between  and  other than  ≤ , one of
our main contributions is that for independence systems that these two measures coincide.
Theorem 3. For any independence system (,  ),  = .</p>
    </sec>
    <sec id="sec-4">
      <title>4. Group fairness</title>
      <p>Other than arbitrariness in selecting a solution, unfairness can also be introduced through
systemic bias in the definition of the model or the description of the problem. For the kidney
exchange example, patients with certain blood types are more likely to be matched, simply
because they are compatible with a larger group of donors. To achieve this so-called group
fairness among diferent types of patients, one can ensure representation of protected groups in
the solution. Not much existing work combines individual and group fairness, while we show
that both approaches can be combined in a single framework.</p>
      <p>Our problem input now not only consists of a set system (,  ), but also of a collection of
groups  = {1, . . . , }, with  ⊆  subsets of the ground set , for all  ∈ []. We assume
that the groups are pairwise disjoint; when considering groups that arise from a single sensitive
attribute, this is in many cases a natural assumption. We impose group fairness constraints on
the possible solutions, i.e., we require ex-post group fairness. We thus restrict the set  to only
solutions that satisfy the group fairness constraints, gf ⊆  . Although it is possible to also
handle group fairness in an ex-ante setting as opposed to ex-post, group fairness constraints are
often hard conditions imposed on solutions, and in some cases required by regulations or law.</p>
      <p>We consider two types of group fairness constraints. Absolute constraints enforce that the
number of individuals from a group is between a given lower and upper bound. Relative
constraints bound the ratio between the number of individuals for a pair of groups, e.g., to
impose that a solution must contain the same number of individuals from each group. For the
fair matching for vertices problem in the kidney exchange example, we obtain the following.
Theorem 4. Let  = (, ) be a graph with a constant number of pairwise disjoint groups of
vertices. Let gf denote the collection of vertex-subsets covered by matchings in , restricted to
absolute and/or relative group fairness constraints. Then determining  and  for gf can be
done in polynomial time.
This research is supported by NWO Gravitation Project NETWORKS, Grant Number 024.002.003.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>C.</given-names>
            <surname>Hojny</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Spieksma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Wessel</surname>
          </string-name>
          ,
          <article-title>Fairness in graph-theoretical optimization problems</article-title>
          ,
          <year>2023</year>
          . doi:
          <volume>10</volume>
          .48550/arXiv.2311.15953.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>G.</given-names>
            <surname>Farnadi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>St-Arnaud</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Babaki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Carvalho</surname>
          </string-name>
          ,
          <article-title>Individual fairness in kidney exchange programs</article-title>
          ,
          <source>Proceedings of the AAAI Conference on Artificial Intelligence</source>
          <volume>35</volume>
          (
          <year>2021</year>
          )
          <fpage>11496</fpage>
          -
          <lpage>11505</lpage>
          . doi:
          <volume>10</volume>
          .1609/aaai.v35i13.
          <fpage>17369</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>B.</given-names>
            <surname>Flanigan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Gölz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Hennig</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. D.</given-names>
            <surname>Procaccia</surname>
          </string-name>
          ,
          <article-title>Fair algorithms for selecting citizens' assemblies</article-title>
          ,
          <source>Nature</source>
          <volume>596</volume>
          (
          <year>2021</year>
          )
          <fpage>548</fpage>
          -
          <lpage>552</lpage>
          . doi:
          <volume>10</volume>
          .1038/s41586-021-03788-6.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>J.</given-names>
            <surname>Rawls</surname>
          </string-name>
          ,
          <source>A Theory of Justice</source>
          , Belknap Press,
          <year>1971</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>E. R.</given-names>
            <surname>Scheinerman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. H.</given-names>
            <surname>Ullman</surname>
          </string-name>
          ,
          <article-title>Fractional graph theory: a rational approach to the theory of graphs</article-title>
          ,
          <source>Wiley-Inerscience Series in Discrete Mathematics and Optimization</source>
          , Wiley, New York, NY, USA,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Füredi</surname>
          </string-name>
          , Matchings and covers in hypergraphs,
          <source>Graphs and Combinatorics</source>
          <volume>4</volume>
          (
          <year>1988</year>
          )
          <fpage>115</fpage>
          -
          <lpage>206</lpage>
          . doi:
          <volume>10</volume>
          .1007/BF01864160.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>G. B.</given-names>
            <surname>Dantzig</surname>
          </string-name>
          ,
          <article-title>Maximization of a linear function of variables subject to linear inequalities</article-title>
          ,
          <source>Activity analysis of production and allocation 13</source>
          (
          <year>1951</year>
          )
          <fpage>339</fpage>
          -
          <lpage>347</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>L. G.</given-names>
            <surname>Khachiyan</surname>
          </string-name>
          ,
          <article-title>A polynomial algorithm in linear programming</article-title>
          ,
          <source>in: Soviet Mathematics Doklady</source>
          , volume
          <volume>20</volume>
          ,
          <year>1979</year>
          , pp.
          <fpage>191</fpage>
          -
          <lpage>194</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>N.</given-names>
            <surname>Karmarkar</surname>
          </string-name>
          ,
          <article-title>A new polynomial-time algorithm for linear programming</article-title>
          ,
          <source>in: Proceedings of the Sixteenth Annual ACM Symposium on Theory of Computing</source>
          , STOC '84,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computing Machinery, New York, NY, USA,
          <year>1984</year>
          , p.
          <fpage>302</fpage>
          -
          <lpage>311</lpage>
          . doi:
          <volume>10</volume>
          .1145/800057. 808695.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>M.</given-names>
            <surname>Grötschel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Lovász</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Schrijver</surname>
          </string-name>
          ,
          <article-title>The ellipsoid method and its consequences in combinatorial optimization</article-title>
          ,
          <source>Combinatorica</source>
          <volume>1</volume>
          (
          <year>1981</year>
          )
          <fpage>169</fpage>
          -
          <lpage>197</lpage>
          . doi:
          <volume>10</volume>
          .1007/BF02579273.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>