<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Data Quality Dimensions for Fair AI ⋆</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Camilla</forename><surname>Quaresmini</surname></persName>
							<email>camilla.quaresmini@polimi.it</email>
							<affiliation key="aff0">
								<orgName type="department">Department of Electronics, Information and Bioengineering</orgName>
								<orgName type="institution">Politecnico di Milano</orgName>
								<address>
									<addrLine>Piazza Leonardo da Vinci 32</addrLine>
									<postCode>20133</postCode>
									<settlement>Milan</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Giuseppe</forename><surname>Primiero</surname></persName>
							<email>giuseppe.primiero@unimi.it</email>
							<affiliation key="aff1">
								<orgName type="department">Department of Philosophy</orgName>
								<orgName type="laboratory">LUCI Lab and PhilTech Research Center</orgName>
								<orgName type="institution">Università degli Studi di Milano</orgName>
								<address>
									<addrLine>Via Festa del Perdono 7</addrLine>
									<postCode>20122</postCode>
									<settlement>Milan</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
							<affiliation key="aff2">
								<orgName type="institution" key="instit1">MIRAI</orgName>
								<orgName type="institution" key="instit2">Srl</orgName>
							</affiliation>
						</author>
						<title level="a" type="main">Data Quality Dimensions for Fair AI ⋆</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">73357500A81AF62EEA4F8FF6A5C110E1</idno>
					<idno type="arXiv">arXiv:1901.04966.</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T16:36+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Bias mitigation</term>
					<term>Fairness</term>
					<term>Information Quality</term>
					<term>Mislabeling</term>
					<term>Timeliness</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Artificial Intelligence (AI) systems are not intrinsically neutral and biases trickle in any type of technological tool. In particular when dealing with people, the impact of AI algorithms' technical errors originating with mislabeled data is undeniable. As they feed wrong and discriminatory classifications, these systems are not systematically guarded against bias. In this article we consider the problem of bias in AI systems from the point of view of data quality dimensions. We highlight the limited model construction of bias mitigation tools based on accuracy strategy, illustrating potential improvements of a specific tool in gender classification errors occurring in two typically difficult contexts: the classification of non-binary individuals, for which the label set becomes incomplete with respect to the dataset; and the classification of transgender individuals, for which the dataset becomes inconsistent with respect to the label set. Using formal methods for reasoning about the behavior of the classification system in presence of a changing world, we propose to reconsider the fairness of the classification task in terms of completeness, consistency, timeliness and reliability, and offer some theoretical results.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Machine Learning (ML) models trained on huge amounts of data are intrinsically biased when dealing with people. Common face recognition systems used in surveillance tasks generate false positives labeling innocent people as suspects. Social credit systems link individuals to the state of their social credit, making decisions based on that score. In all of those cases, subjects suffer a credibility deficit due to prejudices related to their social identity <ref type="bibr" target="#b0">[1]</ref>: a dark-skinned man could be characterized by a higher risk of recidivism after being arrested; a short-haired skinny young woman -or a long-haired boy with feminine traits -might be the target of transphobic attacks following misgendering. Through the deployment of these technologies, society makes the gap separating rich from poor, cisnormative from non-cisnormative individuals, more constitutive as automatized and standardized.</p><p>Already before the explosion of ML algorithms, <ref type="bibr" target="#b1">[2]</ref> offered a framework for understanding three categories of bias in computer systems, assuming the absence of bias as necessary to define their quality. Later on, the emergence of contemporary, data-driven AI systems based on learning has significantly worsened the situation, see e.g. <ref type="bibr" target="#b2">[3,</ref><ref type="bibr" target="#b3">4]</ref>. On this basis, the development and deployment of fairer Artificial Intelligence (AI) systems has been increasingly demanded. Such request appears especially relevant in certain application contexts. For example, as examined in <ref type="bibr" target="#b4">[5]</ref>, face is commonly used as a legitimate mean of gender classification, and this is operazionalized and automatized in technologies such as Automatic Gender Recognition (AGR), which algorithmically derives gender from faces' physical traits to perform classification <ref type="bibr" target="#b5">[6,</ref><ref type="bibr" target="#b6">7]</ref>. This technique relies on the assumption that gender identity can be computationally derived from facial traits. However, a recent study <ref type="bibr" target="#b7">[8]</ref> shows that the most famous AGR systems are not able to classify non-binary genders, also performing poorly on transgender individuals. This is due to the fact that AGR incapsulates a binary, cisnormative conception of gender, adopting a male/female scheme which invalidates non-binary identities.</p><p>We declare ourselves against the use of gender classification, as considering face as a proxy for detecting gender identity seems to resonate with phrenology and physiognomy, and we believe that the process of automatic gender recognition can easily lead to mismatches between the theoretical understanding of constructs underlying identity and their operationalization <ref type="bibr" target="#b8">[9]</ref>, especially when it comes to classification of individuals who recognise themselves outside of binarism. However, we note that this kind of classification is already happening <ref type="bibr" target="#b9">[10]</ref>, spreading with commercial systems offering gender classification as a standard feature, causing a huge impact on the lives of misgendered individuals. Therefore there are contexts in which it is potentially inevitable that classification exists, and in these contexts it must be fairer. This translates into asking whether there is a strategy to ensure that the labels assigned during classification are as less stereotypical and archetypal as possible. While this paper does not investigates the ethical aspects of AGR, we aim at addressing the issues related to the classification strategies to make them fairer, as an initial study to prepare for implementing mitigation strategies.</p><p>An important task, common to technology and philosophy, is therefore the identification and verification of criteria that may help developing fairness conditions for AI systems. While a number of techniques are available to mitigate bias, their primary focus on purely statistical analysis to control accuracy across sensitive classes is clearly insufficient to control social discrimination. A different approach is represented by the explicit formulation of ethical principles to be verified across protected attributes, combining statistical measures with logical reasoning, as formally defined in <ref type="bibr" target="#b10">[11,</ref><ref type="bibr" target="#b11">12,</ref><ref type="bibr" target="#b12">13,</ref><ref type="bibr" target="#b13">14,</ref><ref type="bibr" target="#b14">15]</ref> and implemented by the BRIO tool in <ref type="bibr" target="#b15">[16,</ref><ref type="bibr" target="#b16">17]</ref>. In this latter context, an important direction to explore for a refined definition of ethically-laden verification criteria is the study of quality dimensions and associated biases. In the following of this paper, we offer a theoretical contribution in this direction, preparing the ground for a future implementation. We argue that, even if maximizing data quality and fairness simultaneously can be hard as improving one can deteriorate the other <ref type="bibr" target="#b17">[18]</ref>, the task of bias mitigation tools can be supported by reasoning on quality dimensions that so far have been left ignored. In particular, we offer examples to show how dimensions of consistency, completeness, timeliness and reliability can be used to establish fairer AI classification systems. This research is in line with the quest for integrating useful empirical metrics on fairness in AI with asking key (conceptual) questions, see <ref type="bibr" target="#b18">[19]</ref>.</p><p>The paper is structured as follows. In Section 2 we offer an overview of fainess definitions and bias types relevant for this work. In Section 3 we briefly overview the technical details of </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Bias type</head><p>Definition Literature</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Data Bias</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Behavioral bias</head><p>User's behavior can be different across contexts <ref type="bibr" target="#b37">[38]</ref> </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Exclusion bias</head><p>Systematic exclusion of some data <ref type="bibr" target="#b38">[39]</ref> </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Historical bias</head><p>Cultural prejudices are included into systematic processes <ref type="bibr" target="#b39">[40]</ref> </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Time interval bias</head><p>Data collection in a too limited time range <ref type="bibr" target="#b40">[41]</ref> Label Bias</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Chronological bias</head><p>Distortion due to temporal changes in the world which data are supposed to represent <ref type="bibr" target="#b38">[39]</ref> Historical bias Cultural prejudices are included into systematic processes <ref type="bibr" target="#b39">[40]</ref> Misclassification bias Data points are assigned to incorrect categories <ref type="bibr" target="#b41">[42]</ref> a particular bias mitigation tool to illustrate what we consider essential limitations of purely statistical analyses. In Section 4 we introduce data quality dimensions arguing for reconsidering their relevance in the task of evaluating the fairness of classification systems, presenting two examples to justify this requirement. In Section 5 we propose a definition of fair AI classification that includes such dimensions and formulate some theoretical results. Section 6 concludes the work illustrating future research lines.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Fairness and Bias in ML</head><p>Despite a unique definition missing in the literature <ref type="bibr" target="#b1">[2,</ref><ref type="bibr" target="#b2">3,</ref><ref type="bibr" target="#b19">20,</ref><ref type="bibr" target="#b20">21,</ref><ref type="bibr" target="#b21">22,</ref><ref type="bibr" target="#b22">23,</ref><ref type="bibr" target="#b23">24,</ref><ref type="bibr" target="#b24">25]</ref>, fairness is often presented as corresponding to the avoidance of bias <ref type="bibr" target="#b25">[26]</ref>. This can be formulated at two distinct levels: first, identifying and correcting problems in datasets <ref type="bibr" target="#b26">[27,</ref><ref type="bibr" target="#b27">28,</ref><ref type="bibr" target="#b28">29,</ref><ref type="bibr" target="#b29">30,</ref><ref type="bibr" target="#b30">31,</ref><ref type="bibr" target="#b31">32]</ref>, as a model trained with a mislabeled dataset will provide biased outputs; second, correcting the algorithms <ref type="bibr" target="#b20">[21,</ref><ref type="bibr" target="#b32">33]</ref>, as even in the design of algorithms biases can emerge <ref type="bibr" target="#b33">[34]</ref>. In the present section we are interested in considering datasets and their labels. Indeed, bias may also affect the label set <ref type="bibr" target="#b34">[35,</ref><ref type="bibr" target="#b35">36]</ref>. Accordingly, we talk about label quality bias when errors hit the quality of labels. As shown in <ref type="bibr" target="#b36">[37]</ref>, the most well-known AI datasets are full of labeling errors. A crucial task is therefore the development of conceptual strategies and technical tools to mitigate bias emergence in both data and label sets.</p><p>A variety of approaches and contributions is available in the literature focusing on identifying bias in datasets and labels. Here we list the types of bias which are relevant to the present work, see Table <ref type="table" target="#tab_1">2</ref>. Albeit not exhaustive, these lists of biases represent a good starting point to investigate quality dimensions required to address them. We now analyze a common mitigation strategy used by existing tools addressing the issue of bias in data, showing their limitations. We then study the bias in the classification algorithm (i.e., bias in labels) of the mitigation tool. A correct label for the datapoint 𝑑 𝜋 Threshold variable</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Mitigating Bias</head><p>A bias mitigation algorithm is a procedure for reducing unwanted bias in training datasets or models, with the aim to improve the fairness metrics. Those algorithms can be classified into three categories <ref type="bibr" target="#b42">[43]</ref>: pre-processing, when the training data is modified; in-processing, when the learning algorithm is modified; post-processing, when the predictions are modified. Several tools are available to audit and mitigate biases in datasets, thereby attempting to implement diversity and to reach fairness. Among the most common are AIF360 <ref type="bibr" target="#b21">[22]</ref>, Aequitas <ref type="bibr" target="#b43">[44]</ref> and Cleanlab <ref type="bibr" target="#b44">[45]</ref>. Recently a post-hoc evaluation model for bias mitigation has been proposed by the tool BRIO <ref type="bibr" target="#b15">[16,</ref><ref type="bibr" target="#b16">17]</ref>. In this article, we consider Cleanlab as a testbed, illustrating below in Section 4 its limitations in view of data quality dimensions. Instead, we propose a theoretical frame for the resolution of such limitation in Section 5, further illustrating the possibility to implement the present analysis in the tool BRIO. For an overview of the symbols used from now on, see Table <ref type="table" target="#tab_1">2</ref>.</p><p>Cleanlab is a framework to find label errors in datasets. It uses Confident Learning (CL), an approach which focuses on label quality with the aim to address uncertainty in dataset labels using three principles: counting examples that are likely to belong to another class using the confident joint and probabilistic thresholds to find label errors and to estimate noise; pruning noisy data; and ranking examples to train with confidence on clean data. The three approaches are combined by an initial assumption of a class-conditional noise process, to directly estimate the joint distribution between noisy given labels and uncorrupted unknown ones. For every class, the algorithm learns the probability of it being mislabeled as any other class. This assumption may have exceptions but it is considered reasonable. For example, a "cat" is more likely to be mislabeled as "tiger" than as "airplane". This assumption is provided by the classification noise process (CNP, <ref type="bibr" target="#b45">[46]</ref>), which leads to the conclusion that the label noise only depends on the latent true class, not on the data. CL <ref type="bibr" target="#b44">[45]</ref> exactly finds label errors in datasets by estimating the joint distribution of noisy and true labels. The idea is that when the predicted probability of an example is greater than a threshold per class, we confidently consider that example as actually belonging to the class of that threshold, where the thresholds for each class are the average predicted probability of examples in that class. Given ỹ ∈ [𝑚] takes an observed, noisy label (potentially flipped to an incorrect class); and 𝑦 * ∈ [𝑚] takes the unknown (latent), true, uncorrupted label (latent true label), CL assumes that for every example it exists a correct label 𝑦 * and defines a class-conditional noise process mapping 𝑦 * → ỹ , such that every label in class 𝑗 ∈ [𝑚] may be independently mislabeled as class 𝑖 ∈ [𝑚], with probability 𝑝( ỹ = 𝑖 | 𝑦 * = 𝑗). So, maps are associations of data to wrong labels. Then CL estimates 𝑝( ỹ | 𝑦 * ) and 𝑝(𝑦 * ) jointly, evaluating the joint distribution of label noise 𝑝( ỹ , 𝑦 * ) between noisy given labels and uncorrupted unknown labels. CL aims to estimate every 𝑝( ỹ , 𝑦 * ) as a matrix 𝑄 ỹ ,𝑦 * to find all mislabeled examples 𝑥 in dataset 𝑋, where 𝑦 * ≠ ỹ . Given as inputs the out-of-sample predicted probabilities P 𝑘,𝑖 and the vector of noisy labels ỹ 𝑘 , the procedure is divided into three steps: estimation of Q ỹ ,𝑦 * to characterize class-conditional label noise, filtering of noisy examples, training with the errors found.</p><p>To estimate Q ỹ ,𝑦 * i.e. the joint distribution of noisy labels ỹ and true labels 𝑦 * , CL counts examples that may belong to another class using a statistical data structure named confident joint 𝐶 ỹ ,𝑦 * , formally defined as follows</p><formula xml:id="formula_0">𝐶 ỹ ,𝑦 * [𝑖][𝑗] ∶=| X ỹ =𝑖,𝑦 * =𝑗 |<label>(1)</label></formula><p>In other words, the confident joint estimates the set 𝑋 ỹ =𝑖,𝑦 * =𝑗 of examples with noisy label i which actually have true label j by making a partition of the dataset 𝑋 into bins X ỹ =𝑖,𝑦 * =𝑗 , namely the set of examples labeled ỹ = 𝑖 with large enough expected probability p ( ỹ = 𝑗; 𝑥, 𝜃) to belong to class 𝑦 * = 𝑗, determined by a per-class threshold 𝑡 𝑗 , where 𝜃 is the model. This kind of tools are extremely useful in estimating label error probabilities. However they have some limitations, and it is easy to formulate examples for which their strategy seems unsound. A first problem arises from the initial assumption of the categoricity of data. Take for example the case of gender labeling of facial images, which is typically binary (i.e. with values male, female). For each datapoint, a classification algorithm calculates the projected probability that an image is assigned to the respective label. Consider though two very noisy cases: images of non-binary individuals; images of transgender individuals. In the former case, the label set becomes incomplete with respect to the dataset; in the second case, the dataset is inconsistent with respect to the label set. Hence, there can be datapoints that have either 1) none of the available labels as the correct one, or 2) at different times they can be under different labels. By definition, if we have disjoint labels there can be high accuracy but only on those datapoints which identify themselves in the disjointed categories. In situations like these, it appears that the dimension of accuracy alone does no longer satisfy the correctness of the classification algorithm. In terms of quality dimensions, the possibility of an uncategorical datapoint or that of a moving datapoint is no longer only an accuracy problem. Hence, the identification of other data quality dimensions to be implemented in tools for bias mitigation may help achieve more fairness in the classification task. In the next section we suggest an improvement of the classification strategy by adding dimensions that should be considered when evaluating the fairness of the classification itself.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Extending Data Dimensions for Fair AI</head><p>In the literature, data quality dimensions are defined both informally and qualitatively. Metrics can be associated as indicators of the dimension's quality. However, there is no single and objective vision of data quality dimensions, nor a universal definition for each dimension. This is because often dimensions escape or exceed a formal definition. The cause of the large amount of dimensions <ref type="bibr" target="#b46">[47,</ref><ref type="bibr" target="#b47">48]</ref> also lies in the fact that data aim to represent all spatial, temporal and social phenomena of the real world <ref type="bibr" target="#b48">[49]</ref>. Furthermore, they are constantly evolving in response to continuous development of new data-driven technologies.</p><p>For the purposes of our analysis, we focus on the following basic set of data quality dimensions which is the focus of the majority of authors in the literature <ref type="bibr" target="#b49">[50,</ref><ref type="bibr" target="#b50">51]</ref>:</p><p>• Accuracy, i.e. the closeness between a value 𝑣 and a value 𝑣 ′ , where the latter is the correct representation of the real-life phenomenon that 𝑣 aims to represent <ref type="bibr" target="#b46">[47]</ref>; • Completeness, i.e. the level at which data have the sufficient breadth, depth, and scope for their task <ref type="bibr" target="#b47">[48,</ref><ref type="bibr" target="#b51">52,</ref><ref type="bibr" target="#b46">47]</ref>; • Consistency, i.e. the coherence dimension: it amounts to check whether or not the semantic rules defined on a set of data elements have been respected <ref type="bibr" target="#b46">[47]</ref>; • Timeliness, the data freshness over time for a specific task <ref type="bibr" target="#b52">[53,</ref><ref type="bibr" target="#b53">54]</ref>.</p><p>We thus indicate them as potential candidates to be implemented in the context of bias mitigation strategies. In particular, we argue that, as data are characterized by evolution over time, the timeliness dimension <ref type="bibr" target="#b46">[47]</ref> can be taken as basis for other categories of data quality. We aim at suggesting improvements on errors identification in the classification of datapoints, using the gender attribute as an illustrative case. We thus suggest the extension of classification with dimensions of completeness, consistency and timeliness and then return to Cleanlab to illustrate how this extension could be practically implemented.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1.">Incomplete Label Set and Inconsistent Labeling</head><p>Consider the first example of a datapoint which represents a non-binary individual. This kind of identity is rarely considered in technology <ref type="bibr" target="#b54">[55]</ref>. Non-binary identities do not recognize themselves within the binary approach characteristic of classification systems. As such, individual identity is not correctly recognized by the classification system, highlighting the insufficiency of the model which flattens the gender identity umbrella on the two options of male/female.</p><p>The conceptual solution would be to simply assume the label set as incomplete. This means that the bias origin is in the pre-processing phase, and a possible strategy is to extend the partition of the labels adding categories as appropriate, e.g. "non binary". The problem is here reduced to the consideration of the completeness of the label set. <ref type="bibr" target="#b7">[8]</ref> can be considered a first attempt in this direction.</p><p>Consider now a transgender datapoint whose identity shifts over time, being a fluid datapoint by definition. Currently AI systems operationalize gender in a way which is completely transexclusive, see e.g. <ref type="bibr" target="#b6">[7,</ref><ref type="bibr" target="#b5">6]</ref>. However, identity is not static: it may move with respect to the labels we have, leading the datapoint to be configured in a label or in a different one during a selected time range. In this case, any extension of the label set is misleading, or at least insufficient.</p><p>Here we cannot just add more categories, but we have to find a logical solution to changing the label of the same datapoint at different timepoints.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2.">Enter Time</head><p>The two problems above can be formulated adding to completeness and consistency the dimension of temporality. Thus, an important starting point is represented by adding the dimension of timeliness, which concerns the degree to which data represent reality within a certain defined time range for a given population.</p><p>We suggest here considering the labeling task within a given time frame, whose length depends on the dataset and the classification task over the pairing of datapoints to labels, to measure a probability of a label-change over time. Intuitively, if the analysis is performed less than a certain number of timestamps away from the last data labeling, then we consider the labeling still valid. Otherwise, a new analysis with respect to both completeness of the dataset and label set must be performed. Technically, this means associating temporal parameters to labels and to compute the probability that a given label might change over the given time frame. The probability of a label being correct (its accuracy) decreases within the respective temporal window. In particular, reasoning on the temporal evolution of the dataset could allow us to model the evolution of the label partitions. Two fundamental theses are suggested for evaluation: the correctness of the task does no longer assume static completeness of the label set, i.e. given the label set is complete at time 𝑡 𝑛 , it can be incomplete at time 𝑡 𝑛+𝑚 ; the labeling does no longer assume static perseverance of the labels, that is, given a label 𝑖 that is correct at a time 𝑡 𝑛 for a datapoint 𝑑, it could be incorrect at a later time, and conversely if it is incorrect it could become correct.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.3.">Back to Cleanlab</head><p>Considering a possible implementation in Cleanlab able to account for such differences implies renouncing the starting assumption on the categoricity of the data. Instead, assume that the probability of assigning a label may change over time. This can be formulated in two distinct ways. First, the probability value of a given label 𝑖 being wrong, given a label 𝑗 is correct (their distance) may change over time. The task is now to give a mapping of all the label-variable pairs, i.e. given a mapping 𝑦 * → ỹ between variables, where 𝑦 * is the correct label and ỹ the wrong one, compute the probability over the time frame</p><formula xml:id="formula_1">𝒯 ∶= {𝑡 1 , … , 𝑡 𝑛 } 𝑝 𝒯 [( ỹ = 𝑖) 𝑡 𝑛 | (𝑦 * = 𝑗) 𝑡 𝑛−𝑚 ] (2)</formula><p>such that label 𝑖 is wrong at time 𝑡 𝑛 , given that label 𝑗 was correct at time 𝑡 𝑛−𝑚 . This probability can increase or decrease, depending on the dataset and on the label set. For the definition of the confident joint, this means taking the evaluation of all the elements that have an incorrect label 𝑖 when their correct label is 𝑗, and then associate the wrong label to a time 𝑡 𝑛 and the correct label to a previous time. This estimate must be made on all time points, so for every 𝑚 &lt; 𝑛. Given a timepoint 𝑛 at which the label is wrong, the estimate on all pairs of probabilities for that point with a previous point in which another label can be correct has to be computed</p><formula xml:id="formula_2">𝐶 ỹ ,𝑦 * [𝑖, 𝑗, 𝒯 ] ∶= 𝑛∈𝒯 ∑ 1≤𝑚&lt;𝑛∈𝒯 | X ỹ =𝑖 𝑡 𝑛 ,𝑦 * =𝑗 𝑡 𝑛−𝑚 |<label>(3)</label></formula><p>Second, given a mapping 𝑦 * → ỹ between variables, where 𝑦 * is the correct label and ỹ the wrong one, what is the probability</p><formula xml:id="formula_3">𝑝 𝒯 [( ỹ = 𝑖) 𝑡 𝑛 | (𝑦 * = 𝑖) 𝑡 𝑛−𝑚 ]<label>(4)</label></formula><p>such that label 𝑖 is wrong at time 𝑡 𝑛 , given that the same label 𝑖 was correct at time 𝑡 𝑛−𝑚 ? In this case, the same label is fixed and the probability that it becomes incorrect can be calculated. The definition of confident joint thus becomes</p><formula xml:id="formula_4">𝐶 ỹ ,𝑦 * [𝑖, 𝒯 ] ∶= 𝑛∈𝒯 ∑ 1≤𝑚&lt;𝑛∈𝒯 | X ỹ =𝑖 𝑡 𝑛 ,𝑦 * =𝑖 𝑡 𝑛−𝑚 |<label>(5)</label></formula><p>To illustrate the point we consider a toy example. Compute </p><p>i.e. the error rate of 𝑦 * = 𝑚𝑎𝑙𝑒 has to be determined. First, a confusion matrix is constructed to analyze errors. Suppose to have a dataset of 10 datapoints, see Figure <ref type="figure" target="#fig_1">1</ref>. From the matrix, 𝑝(𝑦 * = 𝑗) = 5/10 and 𝑝( ỹ = 𝑖) = 4/10. So there are 5 women, of which 2 are incorrectly labeled "male" and 3 are correctly labeled "female", and 5 men of which 1 is incorrectly labeled "female" and 4 are correctly labeled "male". Replacing the values in Equation <ref type="formula" target="#formula_5">6</ref>, 𝑝( ỹ = 𝑖 | 𝑦 * = 𝑗) = 0.2. The obtained value represents the error rate of the "male" label, i.e. the probability of a male datapoint being labeled "female". Looking at the diagonals, the true positive rate TPR = 70% and the false positive rate FPR = 30%.</p><p>Consider now the same dataset at a later time 𝑡 𝑛+𝑚 , see Figure <ref type="figure" target="#fig_2">2</ref>. The labels might have changed. From the matrix, 𝑝(𝑦 * = 𝑗) = 5/10 and that 𝑝( ỹ = 𝑖) = 5/10. Now there are 5 women, of which 3 are incorrectly labeled "male" and 2 are correctly labeled "female", and 5 men of which 3 are incorrectly labeled"female" and 2 are correctly labeled "male". Replacing again the values in 6, 𝑝 ′ ( ỹ = 𝑖 | 𝑦 * = 𝑗) = 0.6. In this case the true positive rate TPR = 40% and the false positive rate FPR = 60%.  To understand how the error rate changes, the difference between the two matrices has to be considered. Thus, the change rate can be computed as 𝜀 = p </p><formula xml:id="formula_6">𝑝 𝒯 [( ỹ = 𝑖) 𝑡 𝑛+𝑚 | (𝑦 * = 𝑗) 𝑡 𝑛 ] = 𝑝[(𝑦 * = 𝑗) | ( ỹ = 𝑖)] 𝑡 𝑛+𝑚 ⋅ [𝑝( ỹ = 𝑖)𝑡 𝑛 ± 𝜀] 𝑝(𝑦 * = 𝑗) 𝑡 𝑛 = 0.288<label>(7)</label></formula><p>This value represents the (highest) probability that a given label is wrong at a given time, provided it was correct at some previous time. Indirectly, this also expresses the probability that the labeling set is applied to a dataset containing a point for which the labeling becomes inconsistent over time.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Temporal-based Fairness in AI</head><p>We have argued that a more general discussion on the data dimensions to be adopted in bias mitigation tools is needed, and in particular that the dimension of timeliness is crucial. In this section we summarise our proposal and offer non-exhaustive criteria for fairness in AI based on such temporal approach along with some basic theoretical results.</p><p>The first metric that has been addressed in this work is completeness as applied to the label set. In a world where gender classification is actually changing, the present strategy includes the completeness dimension in the quality assessment, verifying that the label set is complete with respect to the ontology of the world at the time this assessment is made. The solution here is to extend the label set as desired adding new labels for the classification task, as already suggested in <ref type="bibr" target="#b7">[8]</ref>. Additionally, we suggest an explicit temporal parametrization: completeness can be considered as a relationship between a label set and an individual 𝑝 belonging to a certain population 𝑃, where 𝑝 is any domain item that enters 𝑃 at a time 𝑡. We must ensure that a correct label 𝑙 exists for each datapoint in the dataset at each time. In other words, the completeness of a dataset over a time frame is granted if for every datapoint representing an element in the population of interest there exists at any two possibly consecutive points in time a correct label for it.</p><p>Next, we considered consistency of the label set with respect to datapoints possibly shifting in categorization. The method here again is to reduce consistency to timeliness. We suggest to compute the probability of an inconsistency arising from a correct label change. Accuracy, albeit the most used metric for evaluating classification models' performances due to its easy calculability and interpretation, is reductive, trivial and incorrect in some contexts. For example, if the distribution of the class is distorted, accuracy is no longer a useful, nor a relevant metric. Even worse, sometimes greater accuracy leads to greater unfairness <ref type="bibr" target="#b55">[56]</ref>: some labels like race or gender may allow models to be more predictive, although it seems to be often controversial to use such categories to increase predictive performance. We have suggested to consider temporal accuracy <ref type="bibr" target="#b56">[57]</ref> as a function of the error rate over time.</p><p>The ability to compute the variance in the error rate across time is functional to determine the reliability of AI systems. This metric is linked to the notion of accuracy, as it is considered as a measure of data correctness, see <ref type="bibr" target="#b46">[47]</ref>. In <ref type="bibr" target="#b47">[48]</ref> and <ref type="bibr" target="#b56">[57]</ref> reliability is even contained in the definition of accuracy itself: data must be reliable to satisfy the accuracy dimension. Overall, it seems that reliability is not actually controlled beyond physical reliability, as in the literature on data quality there is no formal definition to compute it. However, following <ref type="bibr" target="#b57">[58]</ref> the previously provided temporal approach is again useful: evaluating reliability is based on the revisions which show how close the initial estimate of accuracy is to the following ones. In this sense, reliability can be reduced to accuracy over time in terms of a threshold on the error rate: Definition 2 (Reliability of a classification algorithm). A classification algorithm in a AI system 𝑋 is considered reliable over a time frame 𝒯 ∶= {𝑡 1 , … , 𝑡 𝑛 } denoted as 𝑅𝑒𝑙 𝒯 (𝑋 ) iff 𝜀 𝒯 (𝑋 ) &lt; 𝜋, for some safe value 𝜋.</p><p>The change rate 𝜀 we have computed shows how much the system's accuracy deteriorates. If it exceeds a fixed safe value 𝜋, the system is no longer accurate. Plain accuracy is the numerical measure at some time 𝑡 ∈ 𝒯 ∶= {𝑡 1 , … , 𝑡 𝑛 }. If this value does not deteriorate over a certain fixed threshold, the system is considered reliable, and therefore accurate with respect to time.</p><p>The two previous definitions offer non-exhaustive criteria for the identification of fair AI systems: Definition 3 (Fairness for AI classification systems). 𝐹 𝑎𝑖𝑟 𝒯 (𝑋 ) only if 𝑅𝑒𝑙 𝒯 (𝑋 ) and 𝐶𝑜𝑚𝑝𝑙 𝒯 (𝐿(𝑋 )).</p><p>Hence we claim that fairness requires the system's ability to give reliable and correct outcomes over time. While we do not consider these properties sufficient, we believe they are necessary. On this basis, we can formulate two immediate theoretical results: Theorem 1. Given a label set 𝐿 complete at time 𝑡, a classification algorithm guarantees a fair classification at time 𝑡 ′ &gt; 𝑡 if and only if the change rate determined with respect to 𝐿 is 𝜖 &lt; 𝜋.</p><p>Proof. Assume 𝐶𝑜𝑚𝑝𝑙 𝑡 (𝐿(𝑋 )), then for 𝐹 𝑎𝑖𝑟 𝑡 ′ (𝑋 ) we need to show 𝑅𝑒𝑙 𝑡 ′ (𝑋 ) for 𝑡 ′ &gt; 𝑡 ∈ 𝒯. Assume 𝜖 &gt; 𝜋, then by Definition 2 reliability is not satisfied; hence, if 𝑅𝑒𝑙 𝒯 (𝑋 ), it must be the case that 𝜖 &lt; 𝜋.</p><p>Theorem 2. Given a fixed change rate 𝜖 &lt; 𝜋, a classification algorithm with fair behaviour at time 𝑡 remains fair at time 𝑡 ′ &gt; 𝑡 if and only if the change to make the label set complete at time 𝑡 ′ does not exceed an 𝜖 ′ such that 𝜖 + 𝜖 ′ &gt; 𝜋.</p><p>Proof. Consider 𝐹 𝑎𝑖𝑟 𝑡 (𝑋 ) with change rate 0 &lt; 𝜋 as a base case, then by Definition 3 𝑅𝑒𝑙 𝑡 (𝑋 ) and 𝐶𝑜𝑚𝑝𝑙 𝑡 (𝐿(𝑋 )). Now consider 𝑡 ′ &gt; 𝑡 and a required change 𝜖 ′ in 𝐶𝑜𝑚𝑝𝑙 𝑡 (𝐿(𝑋 )) such that 𝑅𝑒𝑙 𝑡 ′ (𝑋 ) holds. This obviously holds only if 0 + 𝜖 ′ &lt; 𝜋. Generalize for any 𝜖 &gt; 0.</p><p>Note that in these results the value of 𝜖, respectively 𝜖 ′ , is a proxy for how much the world has changed at 𝑡 ′ with respect to 𝐶𝑜𝑚𝑝𝑙 𝑡 (𝐿(𝑋 )).</p><p>In the context of an incomplete label set, a detected label bias can originate from an exclusion bias in data, which can also result from a time interval bias. In the case of label-changing datapoints a chronological bias occurs. Then, misclassification bias can be reduced to the two previous types. In the context of use, emergent bias can arise as a result of changes in societies and cultures. It might appear in data as chronological, historical or behavioral bias. Here, a different value bias occurs for example when the users are different from the assumed ones during the system's development. This is the case of ontology switching, to which a label set must adapt. These types of bias can be mitigated by implementing the proposed framework. The tool BRIO <ref type="bibr" target="#b15">[16,</ref><ref type="bibr" target="#b16">17]</ref> works as a post-hoc model evaluation, taking in input the test dataset of the model under investigation and its output. The tool allows to investigate behavioural differences of the model both with respect to an internal analysis on the classes of interest, and externally with respect to chosen reference metrics. Morever, it allows to measure bias amplification comparing the bias present in the dataset and how that manifests itself in the output. While the present work does not aim at offering a full implementation of our theoretical analysis for the BRIO tool, some remarks are appropriate. The time-based analysis of completeness and reliability offered in Definitions 1 and 2, in turn grounding a notion of fairness in Definition 3 are easily implementable in BRIO: both completeness and reliability require the definition of a timeframe to check respectively that any given datapoint of interest is matched against a desirable label and that the overall change rate of error for one or more classes of interest does not surpass a certain threshold. Both features rely on the user for the identification of the desirable label for any datapoint and for the admissible distance.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">Conclusion</head><p>We presented some recommendations for AI systems design, focusing on timeliness as a founding dimension for developing fairer and more inclusive classification tools. Despite the crucial importance of accuracy as shown by significant works such as <ref type="bibr" target="#b3">[4]</ref> and <ref type="bibr" target="#b58">[59]</ref>, the problem of unfairness in AI systems is much broader and more foundational. This can be expressed in terms of data quality: AI systems are limited in that they maximize accuracy, and even if systems become statistically accurate some problems remain unsolved. This is exemplified by the case of binary gender labeling, which leads to inaccurate simplistic classifications <ref type="bibr" target="#b59">[60]</ref>. Furthermore, as the work of classification is always a reflection of culture, the completeness of the label set and the (constrained) consistency of labeling have an epistemological value: constructing AIs requires us to understand society, and society reflects an ontology of individuals. For this reason, misgendering is first of all an ontological error <ref type="bibr" target="#b5">[6]</ref>.</p><p>We suggested that timeliness is a crucial dimension for the definition of gender identity. If we are ready to consider gender as a property that shifts over time <ref type="bibr" target="#b60">[61]</ref>, and which can also be declined in the plural, as an individual may identify under more than one -not mutually exclusive -labels, then a change of paradigm is required. Design limitations such as binarism and staticity invalidate identities which do not fit into this paradigm. They must be addressed if fairer classifications and more inclusive models of gender are to be designed.</p><p>Further work in this direction includes: an implementation and empirical validation of the proposed model through the BRIO tool; and the design of an extension to compute the probability of incorrect labels becoming correct over time, i.e. the dual case of what presently addressed.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head></head><label></label><figDesc>𝑝( ỹ = 𝑖 | 𝑦 * = 𝑗) = 𝑝(𝑦 * = 𝑗 | ỹ = 𝑖) ⋅ 𝑝( ỹ = 𝑖) 𝑝(𝑦 * = 𝑗) = [𝑝(𝑦 * =𝑗∧ ỹ =𝑖)] 𝑝( ỹ =𝑖) ⋅ 𝑝( ỹ = 𝑖) 𝑝(𝑦 * = 𝑗)</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Confusion matrix at time 𝑛.</figDesc><graphic coords="9,99.71,84.19,187.51,69.48" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: Confusion matrix at time 𝑛 + 𝑚.</figDesc><graphic coords="9,308.06,84.19,187.51,69.48" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head></head><label></label><figDesc>′ ( ỹ ; 𝑥 𝑖 ; 𝜃) − p ( ỹ ; 𝑥 𝑖 ; 𝜃) = 0.4. Now 𝑝 𝒯 [( ỹ = 𝑖) 𝑡 𝑛 | (𝑦 * = 𝑗) 𝑡 𝑛−𝑚 ] can be written as 𝑝 𝒯 [( ỹ = 𝑖) 𝑡 𝑛+𝑚 | (𝑦 * = 𝑗) 𝑡 𝑛 ]. Thus, at a time 𝑡 𝑛 we have 𝑝 𝑡 𝑛 (𝑦 * = 𝑗) = 1 − 𝑝( ỹ = 𝑖) 𝑡 𝑛 ). At a subsequent time 𝑡 𝑛+𝑚 we have 𝑝 𝑡 𝑛+𝑚 (𝑦 * = 𝑗) = 1 − 𝑝( ỹ = 𝑖) 𝑡 𝑛+𝑚 . Equation 6 can be computed with respect to time as</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1</head><label>1</label><figDesc>Data and Label Bias.</figDesc><table /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2</head><label>2</label><figDesc>Symbols used in the present work.</figDesc><table><row><cell>𝑡 𝑛</cell><cell>Time index</cell></row><row><cell>𝒯 ∶= {𝑡 1 , … , 𝑡 𝑛 }</cell><cell>Time frame</cell></row><row><cell>𝑑</cell><cell>Generic datapoint</cell></row><row><cell>𝑖, 𝑗, 𝑙</cell><cell>Data Labels</cell></row><row><cell>𝑦  *</cell><cell>Discrete random variable correctly labeled</cell></row><row><cell>ỹ</cell><cell>Discrete random variable wrongly labeled</cell></row><row><cell>[𝑚]</cell><cell>The set of unique class labels</cell></row><row><cell>𝑦 *  → ỹ</cell><cell>A mapping between variables</cell></row><row><cell>𝑝 𝒯 [( ỹ = 𝑖) 𝑡 𝑛 | (𝑦 *  = 𝑗) 𝑡 𝑛−𝑚 ]</cell><cell>The probability of label 𝑖 being wrong at time 𝑡 𝑛 , given that label 𝑗 was</cell></row><row><cell></cell><cell>correct at time 𝑡 𝑛−𝑚</cell></row><row><cell>𝐶 ỹ ,𝑦  *  [𝑖, 𝑗, 𝒯 ]</cell><cell>Temporal confident joint, where the correct label can change from 𝑖 to 𝑗</cell></row><row><cell></cell><cell>in time frame 𝒯</cell></row><row><cell>𝐶 ỹ ,𝑦 *  [𝑖, 𝒯 ]</cell><cell>Temporal confident joint, where the correctness of the same fixed label</cell></row><row><cell></cell><cell>𝑖 can change in time frame 𝒯</cell></row><row><cell>𝜀</cell><cell>Change rate</cell></row><row><cell>p ′ ( ỹ ; 𝑥 𝑖 ; 𝜃)</cell><cell>Predicted probability of label ỹ for variable 𝑥 𝑖 and model parameters 𝜃</cell></row><row><cell>𝐿</cell><cell>Label set</cell></row><row><cell>𝑋</cell><cell>AI system</cell></row><row><cell>𝐿 𝑡</cell><cell></cell></row></table><note>1 ∶= {𝑙 1 , … , 𝑙 𝑛 } Partition of the label set 𝑃 Population of interest 𝑝 An element from 𝑃 𝑑(𝑋 ) 𝒯 A datapoint in system 𝑋 over time frame 𝒯 𝑦 * (𝑑)</note></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head></head><label></label><figDesc>Definition 1 (Completeness of a label set). A label set 𝐿 for a classification algorithm in a AI system 𝑋 is considered complete over a time frame 𝒯 ∶ {𝑡 1 , … , 𝑡 𝑛 } denoted as 𝐶𝑜𝑚𝑝𝑙 𝒯 (𝐿(𝑋 )) iff given two partitions 𝐿 𝑡 1 ∶= {𝑙 1 , … , 𝑙 𝑛 } and 𝐿 𝑡 𝑛 ∶= {𝑙 ′ 1 , … , 𝑙 ′ 𝑛 }, where possibly 𝐿 𝑡 1 ∩ 𝐿 𝑡 𝑛 ≠ ∅ for all (𝑝 ∈ 𝑃) 𝒯 s.t. 𝑝 ∈ 𝑑(𝑋 ) 𝒯 there is 𝑙 ∈ 𝐿 𝑡 1 ∪ 𝐿 𝑡 𝑛 s.t. 𝑦 * (𝑑) = 𝑙.</figDesc><table /></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>This research has been partially funded by the Projects: PRIN2020 BRIO (2020SSKZ7R), PRIN2022 SMARTEST (20223E8Y4X), "Departments of Excellence 2023-2027" of the Department of Philosophy "Piero Martinetti" of the University of Milan, all awarded by the Italian Ministry of University and Research (MUR); and MUSA -Multilayered Urban Sustainability Action, funded by the European Union -NextGenerationEU, under the National Recovery and Resilience Plan (NRRP) Mission 4 Component 2 Investment Line 1.5: Strenghtening of research structures and creation of R&amp;D "innovation ecosystems", set up of "territorial leaders in R&amp;D".</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<title level="m" type="main">Epistemic Injustice: Power and the Ethics of Knowing</title>
		<author>
			<persName><forename type="first">M</forename><surname>Fricker</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2007">2007</date>
			<publisher>Oxford University Press</publisher>
			<pubPlace>New York</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Bias in computer systems</title>
		<author>
			<persName><forename type="first">B</forename><surname>Friedman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Nissenbaum</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ACM Trans. Inf. Syst</title>
		<imprint>
			<biblScope unit="volume">14</biblScope>
			<biblScope unit="page" from="330" to="347" />
			<date type="published" when="1996">1996</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<title level="m" type="main">A survey on bias and fairness in machine learning</title>
		<author>
			<persName><forename type="first">N</forename><surname>Mehrabi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Morstatter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Saxena</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Lerman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Galstyan</surname></persName>
		</author>
		<idno>CoRR abs/1908.09635</idno>
		<ptr target="http://arxiv.org/abs/1908.09635.arXiv:1908.09635" />
		<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Gender shades: Intersectional accuracy disparities in commercial gender classification</title>
		<author>
			<persName><forename type="first">J</forename><surname>Buolamwini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Gebru</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Conference on Fairness, Accountability and Transparency, FAT 2018</title>
				<meeting><address><addrLine>New York, NY, USA</addrLine></address></meeting>
		<imprint>
			<publisher>PMLR</publisher>
			<date type="published" when="2018-02-24">23-24 February 2018. 2018</date>
			<biblScope unit="volume">81</biblScope>
			<biblScope unit="page" from="77" to="91" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Auto-essentialization: Gender in automated facial analysis as extended colonial project</title>
		<author>
			<persName><forename type="first">A</forename><surname>Hanna</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Pape</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">K</forename><surname>Scheuerman</surname></persName>
		</author>
		<idno type="DOI">10.1177/20539517211053712</idno>
	</analytic>
	<monogr>
		<title level="j">Big Data and Society</title>
		<imprint>
			<biblScope unit="volume">8</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">The misgendering machines: Trans/HCI implications of automatic gender recognition</title>
		<author>
			<persName><forename type="first">O</forename><surname>Keyes</surname></persName>
		</author>
		<idno type="DOI">10.1145/3274357</idno>
		<idno>doi:</idno>
		<ptr target="10.1145/3274357" />
	</analytic>
	<monogr>
		<title level="j">Proc. ACM Hum.-Comput. Interact</title>
		<imprint>
			<biblScope unit="volume">2</biblScope>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Gender recognition or gender reductionism? the social implications of embedded gender recognition systems</title>
		<author>
			<persName><forename type="first">F</forename><surname>Hamidi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">K</forename><surname>Scheuerman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">M</forename><surname>Branham</surname></persName>
		</author>
		<idno type="DOI">10.1145/3173574.3173582</idno>
		<idno>doi:10.1145/3173574.3173582</idno>
		<ptr target="https://doi.org/10.1145/3173574.3173582" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, CHI &apos;18</title>
				<meeting>the 2018 CHI Conference on Human Factors in Computing Systems, CHI &apos;18<address><addrLine>New York, NY, USA</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computing Machinery</publisher>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="1" to="13" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">How computers see gender: An evaluation of gender classification in commercial facial analysis services</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">K</forename><surname>Scheuerman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Paul</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Brubaker</surname></persName>
		</author>
		<idno type="DOI">10.1145/3359246</idno>
	</analytic>
	<monogr>
		<title level="j">Proceedings of the ACM on Human-Computer Interaction</title>
		<imprint>
			<biblScope unit="volume">3</biblScope>
			<biblScope unit="page" from="1" to="33" />
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">Z</forename><surname>Jacobs</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">M</forename><surname>Wallach</surname></persName>
		</author>
		<idno>CoRR abs/1912.05511</idno>
		<ptr target="http://arxiv.org/abs/1912.05511.arXiv:1912.05511" />
		<title level="m">Measurement and fairness</title>
				<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<monogr>
		<title level="m" type="main">Machine learning algorithms for gender prediction</title>
		<author>
			<persName><forename type="first">A</forename><surname>Ramon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Olaoye</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Luz</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Probabilistic typed natural deduction for trustworthy computations</title>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">A</forename><surname>D'asaro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Primiero</surname></persName>
		</author>
		<ptr target="http://ceur-ws.org/Vol-3022/paper3.pdf" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 22nd International Workshop on Trust in Agent Societies (TRUST 2021) Co-located with the 20th International Conferences on Autonomous Agents and Multiagent Systems (AAMAS 2021)</title>
		<title level="s">CEUR Workshop Proceedings</title>
		<editor>
			<persName><forename type="first">D</forename><surname>Wang</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">R</forename><surname>Falcone</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><surname>Zhang</surname></persName>
		</editor>
		<meeting>the 22nd International Workshop on Trust in Agent Societies (TRUST 2021) Co-located with the 20th International Conferences on Autonomous Agents and Multiagent Systems (AAMAS 2021)<address><addrLine>London, UK</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2021">May 3-7, 2021. 2021</date>
			<biblScope unit="volume">3022</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Proof-checking bias in labeling methods</title>
		<author>
			<persName><forename type="first">G</forename><surname>Primiero</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">A</forename><surname>D'asaro</surname></persName>
		</author>
		<ptr target="https://ceur-ws.org/Vol-3319/paper1.pdf" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of 1st Workshop on Bias, Ethical AI, Explainability and the Role of Logic and Logic Programming (BEWARE 2022) co-located with the 21th International Conference of the Italian Association for Artificial Intelligence (AI*IA 2022)</title>
		<title level="s">CEUR Workshop Proceedings</title>
		<editor>
			<persName><forename type="first">G</forename><surname>Boella</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">F</forename><forename type="middle">A</forename><surname>D'asaro</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Dyoub</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">G</forename><surname>Primiero</surname></persName>
		</editor>
		<meeting>1st Workshop on Bias, Ethical AI, Explainability and the Role of Logic and Logic Programming (BEWARE 2022) co-located with the 21th International Conference of the Italian Association for Artificial Intelligence (AI*IA 2022)<address><addrLine>Udine, Italy</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2022-12-02">December 2, 2022. 2022</date>
			<biblScope unit="volume">3319</biblScope>
			<biblScope unit="page" from="9" to="19" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Modelling accuracy and trustworthiness of explaining agents</title>
		<author>
			<persName><forename type="first">A</forename><surname>Termine</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Primiero</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">A</forename><surname>D'asaro</surname></persName>
		</author>
		<idno type="DOI">10.1007/978-3-030-88708-7_19</idno>
		<idno>doi:</idno>
		<ptr target="10.1007/978-3-030-88708-7\_19" />
	</analytic>
	<monogr>
		<title level="m">Logic, Rationality, and Interaction -8th International Workshop, LORI 2021</title>
		<title level="s">Lecture Notes in Computer Science</title>
		<editor>
			<persName><forename type="first">S</forename><surname>Ghosh</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">T</forename><surname>Icard</surname></persName>
		</editor>
		<meeting><address><addrLine>Xi&apos;ian, China</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2021">October 16-18, 2021. 2021</date>
			<biblScope unit="volume">13039</biblScope>
			<biblScope unit="page" from="232" to="245" />
		</imprint>
	</monogr>
	<note>Proceedings</note>
</biblStruct>

<biblStruct xml:id="b13">
	<monogr>
		<title level="m" type="main">Checking trustworthiness of probabilistic computations in a typed natural deduction system</title>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">A</forename><surname>D'asaro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Genco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Primiero</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2206.12934</idno>
		<imprint>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">A possible worlds semantics for trustworthy non-deterministic computations</title>
		<author>
			<persName><forename type="first">E</forename><surname>Kubyshkina</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Primiero</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.ijar.2024.109212</idno>
		<ptr target="https://doi.org/10.1016/j.ijar.2024.109212" />
	</analytic>
	<monogr>
		<title level="j">International Journal of Approximate Reasoning</title>
		<imprint>
			<biblScope unit="page">109212</biblScope>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Brioxalkemy: a bias detecting tool</title>
		<author>
			<persName><forename type="first">G</forename><surname>Coraglia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">A</forename><surname>D'asaro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">A</forename><surname>Genco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Giannuzzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Posillipo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Primiero</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Quaggio</surname></persName>
		</author>
		<ptr target="https://ceur-ws.org/Vol-3615/paper4.pdf" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2nd Workshop on Bias, Ethical AI, Explainability and the role of Logic and Logic Programming co-located with the 22nd International Conference of the Italian Association for Artificial Intelligence (AI*IA 2023)</title>
		<title level="s">CEUR Workshop Proceedings</title>
		<editor>
			<persName><forename type="first">G</forename><surname>Boella</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">F</forename><forename type="middle">A</forename><surname>D'asaro</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Dyoub</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">L</forename><surname>Gorrieri</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">F</forename><forename type="middle">A</forename><surname>Lisi</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">C</forename><surname>Manganini</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">G</forename><surname>Primiero</surname></persName>
		</editor>
		<meeting>the 2nd Workshop on Bias, Ethical AI, Explainability and the role of Logic and Logic Programming co-located with the 22nd International Conference of the Italian Association for Artificial Intelligence (AI*IA 2023)<address><addrLine>Rome, Italy</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2023-11-06">November 6, 2023. 2023</date>
			<biblScope unit="volume">3615</biblScope>
			<biblScope unit="page" from="44" to="60" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<monogr>
		<title level="m" type="main">Evaluating ai fairness in credit scoring with the brio tool</title>
		<author>
			<persName><forename type="first">G</forename><surname>Coraglia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">A</forename><surname>Genco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Piantadosi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Bagli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Giuffrida</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Posillipo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Primiero</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2406.03292</idno>
		<imprint>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Data quality and fairness: Rivals or friends?</title>
		<author>
			<persName><forename type="first">F</forename><surname>Azzalini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Cappiello</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Criscuolo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Cuzzucoli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Dangelo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Sancricca</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Tanca</surname></persName>
		</author>
		<ptr target="https://ceur-ws.org/Vol-3478/paper68.pdf" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 31st Symposium of Advanced Database Systems</title>
		<title level="s">CEUR Workshop Proceedings</title>
		<editor>
			<persName><forename type="first">S</forename><forename type="middle">M</forename></persName>
		</editor>
		<editor>
			<persName><forename type="first">G</forename><surname>Silvello</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">L</forename><surname>Tanca</surname></persName>
		</editor>
		<meeting>the 31st Symposium of Advanced Database Systems<address><addrLine>Galzingano Terme, Italy</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2023">0001. July 2nd to 5th, 2023. 2023</date>
			<biblScope unit="volume">3478</biblScope>
			<biblScope unit="page" from="239" to="247" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Non-empirical problems in fair machine learning</title>
		<author>
			<persName><forename type="first">T</forename><surname>Scantamburlo</surname></persName>
		</author>
		<idno type="DOI">10.1007/s10676-021-09608-9</idno>
		<ptr target="https://doi.org/10.1007/s10676-021-09608-9.doi:10.1007/s10676-021-09608-9" />
	</analytic>
	<monogr>
		<title level="j">Ethics Inf. Technol</title>
		<imprint>
			<biblScope unit="volume">23</biblScope>
			<biblScope unit="page" from="703" to="712" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<monogr>
		<title level="m" type="main">Fairness through awareness</title>
		<author>
			<persName><forename type="first">C</forename><surname>Dwork</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Hardt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Pitassi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Reingold</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">S</forename><surname>Zemel</surname></persName>
		</author>
		<idno>CoRR abs/1104.3913</idno>
		<ptr target="http://arxiv.org/abs/1104.3913.arXiv:1104.3913" />
		<imprint>
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<monogr>
		<title level="m" type="main">The case for process fairness in learning: Feature selection for fair decision making</title>
		<author>
			<persName><forename type="first">N</forename><surname>Grgic-Hlaca</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">B</forename><surname>Zafar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">P</forename><surname>Gummadi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Weller</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<monogr>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">K E</forename><surname>Bellamy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Dey</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Hind</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">C</forename><surname>Hoffman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Houde</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Kannan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Lohia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Martino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Mehta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Mojsilovic</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Nagar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">N</forename><surname>Ramamurthy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Richards</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Saha</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Sattigeri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Singh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">R</forename><surname>Varshney</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Zhang</surname></persName>
		</author>
		<title level="m">Ai fairness 360: An extensible toolkit for detecting, understanding, and mitigating unwanted algorithmic bias</title>
				<imprint>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Bias mitigation with aif360: A comparative study</title>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">H</forename><surname>Aasheim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Hufthammer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Ånneland</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Brynjulfsen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Slavkovik</surname></persName>
		</author>
		<ptr target="http://www.nik.no/" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the NIK-2020 Conference</title>
				<meeting>the NIK-2020 Conference</meeting>
		<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<monogr>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">J</forename><surname>Kusner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">R</forename><surname>Loftus</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Russell</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Silva</surname></persName>
		</author>
		<idno type="arXiv">arXiv:1703.06856</idno>
		<title level="m">Counterfactual fairness</title>
				<imprint>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<monogr>
		<title level="m" type="main">Equality of opportunity in supervised learning</title>
		<author>
			<persName><forename type="first">M</forename><surname>Hardt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Price</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Srebro</surname></persName>
		</author>
		<idno>CoRR abs/1610.02413</idno>
		<ptr target="http://arxiv.org/abs/1610.02413.arXiv:1610.02413" />
		<imprint>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">Fair AI: Challenges and opportunities</title>
		<author>
			<persName><forename type="first">S</forename><surname>Feuerriegel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Dolata</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Schwabe</surname></persName>
		</author>
		<idno type="DOI">10.1007/s12599-020-00650-3</idno>
	</analytic>
	<monogr>
		<title level="j">Business Information Systems Engineering</title>
		<imprint>
			<biblScope unit="volume">62</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<analytic>
		<title level="a" type="main">Data pre-processing techniques for classification without discrimination</title>
		<author>
			<persName><forename type="first">F</forename><surname>Kamiran</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Calders</surname></persName>
		</author>
		<idno type="DOI">10.1007/s10115-011-0463-8</idno>
	</analytic>
	<monogr>
		<title level="j">Knowledge and Information Systems</title>
		<imprint>
			<biblScope unit="volume">33</biblScope>
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b27">
	<analytic>
		<title level="a" type="main">-db: Functional dependencies to discover data bias and enhance data equity</title>
		<author>
			<persName><forename type="first">F</forename><surname>Azzalini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Criscuolo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Tanca</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E-Fair</forename></persName>
		</author>
		<idno type="DOI">10.1145/3552433</idno>
		<ptr target="https://doi.org/10.1145/3552433.doi:10.1145/3552433" />
	</analytic>
	<monogr>
		<title level="j">J. Data and Information Quality</title>
		<imprint>
			<biblScope unit="volume">14</biblScope>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b28">
	<monogr>
		<author>
			<persName><forename type="first">H</forename><surname>Weerts</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Dudík</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Edgar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Jalali</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Lutz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Madaio</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2303.16626</idno>
		<title level="m">Fairlearn: Assessing and improving fairness of ai systems</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b29">
	<monogr>
		<title level="m" type="main">Optimized data pre-processing for discrimination prevention</title>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">P</forename><surname>Calmon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Wei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">N</forename><surname>Ramamurthy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">R</forename><surname>Varshney</surname></persName>
		</author>
		<idno type="arXiv">arXiv:1704.03354</idno>
		<imprint>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b30">
	<monogr>
		<author>
			<persName><forename type="first">M</forename><surname>Feldman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Friedler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Moeller</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Scheidegger</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Venkatasubramanian</surname></persName>
		</author>
		<idno type="arXiv">arXiv:1412.3756</idno>
		<title level="m">Certifying and removing disparate impact</title>
				<imprint>
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b31">
	<analytic>
		<title level="a" type="main">Learning fair representations</title>
		<author>
			<persName><forename type="first">R</forename><surname>Zemel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Swersky</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Pitassi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Dwork</surname></persName>
		</author>
		<ptr target="https://proceedings.mlr.press/v28/zemel13.html" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 30th International Conference on Machine Learning</title>
				<editor>
			<persName><forename type="first">S</forename><surname>Dasgupta</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">D</forename><surname>Mcallester</surname></persName>
		</editor>
		<meeting>the 30th International Conference on Machine Learning<address><addrLine>PMLR, Atlanta, Georgia, USA</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2013">2013</date>
			<biblScope unit="volume">28</biblScope>
			<biblScope unit="page" from="325" to="333" />
		</imprint>
	</monogr>
	<note>Proceedings of Machine Learning Research</note>
</biblStruct>

<biblStruct xml:id="b32">
	<monogr>
		<title level="m" type="main">Fairness-aware classifier with prejudice remover regularizer</title>
		<author>
			<persName><forename type="first">T</forename><surname>Kamishima</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Akaho</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Asoh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Sakuma</surname></persName>
		</author>
		<idno type="DOI">10.1007/978-3-642-33486-3_3</idno>
		<imprint>
			<date type="published" when="2012">2012</date>
			<biblScope unit="page" from="35" to="50" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b33">
	<analytic>
		<title level="a" type="main">Moving beyond algorithmic bias is a data problem</title>
		<author>
			<persName><forename type="first">S</forename><surname>Hooker</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Patterns</title>
		<imprint>
			<biblScope unit="volume">2</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b34">
	<monogr>
		<title level="m" type="main">Fairlabel: Correcting bias in labels</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">H</forename><surname>Sengamedu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Pham</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2311.00638</idno>
		<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b35">
	<monogr>
		<title level="m" type="main">Identifying and correcting label bias in machine learning</title>
		<author>
			<persName><forename type="first">H</forename><surname>Jiang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Nachum</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b36">
	<monogr>
		<title level="m" type="main">Pervasive label errors in test sets destabilize machine learning benchmarks</title>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">G</forename><surname>Northcutt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Athalye</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Mueller</surname></persName>
		</author>
		<ptr target="https://arxiv.org/pdf/2103.14749.pdf" />
		<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
	<note type="report_type">Preprint</note>
</biblStruct>

<biblStruct xml:id="b37">
	<analytic>
		<title level="a" type="main">Social data: Biases, methodological pitfalls, and ethical boundaries</title>
		<author>
			<persName><forename type="first">A</forename><surname>Olteanu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Castillo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Diaz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Kiciman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Frontiers in Big Data</title>
		<imprint>
			<biblScope unit="volume">2</biblScope>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b38">
	<monogr>
		<title level="m" type="main">A survey on bias in visual datasets</title>
		<author>
			<persName><forename type="first">S</forename><surname>Fabbrizzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Papadopoulos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Ntoutsi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Kompatsiaris</surname></persName>
		</author>
		<idno>CoRR abs/2107.07919</idno>
		<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b39">
	<analytic>
		<title level="a" type="main">A framework for understanding sources of harm throughout the machine learning life cycle</title>
		<author>
			<persName><forename type="first">H</forename><surname>Suresh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Guttag</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Equity and Access in Algorithms, Mechanisms, and Optimization</title>
				<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b40">
	<monogr>
		<author>
			<persName><surname>Certnexus</surname></persName>
		</author>
		<ptr target="https://www.coursera.org/learn/promote-ethical-data-driven-technologies/lecture/5Ufbp/data-collection-bias" />
		<title level="m">Promote the ethical use of data-driven technologies</title>
				<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b41">
	<monogr>
		<title level="m" type="main">Centre for Evidence-Based, Catalogue of bias</title>
		<author>
			<persName><forename type="first">U</forename><forename type="middle">O O</forename></persName>
		</author>
		<ptr target="https://catalogofbias.org/biases/" />
		<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b42">
	<analytic>
		<title level="a" type="main">Conscientious classification: A data scientist&apos;s guide to discrimination-aware classification</title>
		<author>
			<persName><forename type="first">B</forename><surname>D'alessandro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>O'neil</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Lagatta</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Big Data</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="page" from="120" to="134" />
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b43">
	<monogr>
		<author>
			<persName><forename type="first">P</forename><surname>Saleiro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Kuester</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Hinkson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>London</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Stevens</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Anisfeld</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">T</forename><surname>Rodolfa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Ghani</surname></persName>
		</author>
		<idno type="arXiv">arXiv:1811.05577</idno>
		<title level="m">Aequitas: A bias and fairness audit toolkit</title>
				<imprint>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b44">
	<analytic>
		<title level="a" type="main">Confident learning: Estimating uncertainty in dataset labels</title>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">G</forename><surname>Northcutt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Jiang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><forename type="middle">L</forename><surname>Chuang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Artificial Intelligence Research (JAIR)</title>
		<imprint>
			<biblScope unit="volume">70</biblScope>
			<biblScope unit="page" from="1373" to="1411" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b45">
	<analytic>
		<title level="a" type="main">Learning from noisy examples</title>
		<author>
			<persName><forename type="first">D</forename><surname>Angluin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">D</forename><surname>Laird</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Mach. Learn</title>
		<imprint>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page" from="343" to="370" />
			<date type="published" when="1987">1987</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b46">
	<monogr>
		<author>
			<persName><forename type="first">C</forename><surname>Batini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Scannapieco</surname></persName>
		</author>
		<title level="m">Data Quality: Concepts, Methodologies and Techniques</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2006">2006</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b47">
	<analytic>
		<title level="a" type="main">Beyond accuracy: What data quality means to data consumers</title>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">Y</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">M</forename><surname>Strong</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">J. Manag. Inf. Syst</title>
		<imprint>
			<biblScope unit="volume">12</biblScope>
			<biblScope unit="page" from="5" to="33" />
			<date type="published" when="1996">1996</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b48">
	<analytic>
		<title level="a" type="main">Towards a contextual approach to data quality</title>
		<author>
			<persName><forename type="first">S</forename><surname>Canali</surname></persName>
		</author>
		<idno type="DOI">10.3390/data5040090</idno>
	</analytic>
	<monogr>
		<title level="j">Data</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="page">90</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b49">
	<analytic>
		<title level="a" type="main">Methodologies for data quality assessment and improvement</title>
		<author>
			<persName><forename type="first">C</forename><surname>Batini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Cappiello</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Francalanci</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Maurino</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ACM computing surveys (CSUR)</title>
		<imprint>
			<biblScope unit="volume">41</biblScope>
			<biblScope unit="page" from="1" to="52" />
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b50">
	<analytic>
		<title level="a" type="main">Data quality under a computer science perspective</title>
		<author>
			<persName><forename type="first">M</forename><surname>Scannapieco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Catarci</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of The ACM -JACM</title>
		<imprint>
			<biblScope unit="volume">2</biblScope>
			<date type="published" when="2002">2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b51">
	<analytic>
		<title level="a" type="main">Data quality assessment</title>
		<author>
			<persName><forename type="first">L</forename><surname>Pipino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><forename type="middle">W</forename><surname>Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">Y</forename><surname>Wang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Commun. ACM</title>
		<imprint>
			<biblScope unit="volume">45</biblScope>
			<biblScope unit="page" from="211" to="218" />
			<date type="published" when="2002">2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b52">
	<monogr>
		<author>
			<persName><forename type="first">A</forename><surname>Rula</surname></persName>
		</author>
		<title level="m">Time-related quality dimensions in linked data</title>
				<imprint>
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b53">
	<analytic>
		<title level="a" type="main">Quality assessment for linked data: A survey</title>
		<author>
			<persName><forename type="first">A</forename><surname>Zaveri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Rula</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Maurino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Pietrobon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Lehmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Auer</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Semantic Web</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="page" from="63" to="93" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b54">
	<monogr>
		<title level="m" type="main">Patching gender: Non-binary utopias in hci</title>
		<author>
			<persName><forename type="first">K</forename><surname>Spiel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Keyes</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Barlas</surname></persName>
		</author>
		<idno type="DOI">10.1145/3290607.3310425</idno>
		<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="1" to="11" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b55">
	<monogr>
		<title level="m" type="main">Practical Fairness</title>
		<author>
			<persName><forename type="first">A</forename><surname>Nielsen</surname></persName>
		</author>
		<ptr target="http://gen.lib.rus.ec/book/index.php?md5=F9752B2F9693C98855A51504FE224DF6" />
		<imprint>
			<date type="published" when="2020">2020</date>
			<publisher>O&apos;Reilly Media, Inc</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b56">
	<analytic>
		<title level="a" type="main">From data quality to big data quality</title>
		<author>
			<persName><forename type="first">C</forename><surname>Batini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Rula</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Scannapieco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Viscusi</surname></persName>
		</author>
		<idno type="DOI">10.4018/JDM.2015010103</idno>
	</analytic>
	<monogr>
		<title level="j">Journal of Database Management</title>
		<imprint>
			<biblScope unit="volume">26</biblScope>
			<biblScope unit="page" from="60" to="82" />
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b57">
	<monogr>
		<author>
			<persName><forename type="first">A</forename><surname>Black</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Van Nederpelt</surname></persName>
		</author>
		<title level="m">Dimensions of Data Quality (DDQ)</title>
				<imprint>
			<publisher>DAMA NL Foundation</publisher>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b58">
	<monogr>
		<title level="m" type="main">Machine bias</title>
		<author>
			<persName><forename type="first">J</forename><surname>Angwin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Larson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Mattu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Kirchner</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2016">2016</date>
			<publisher>ProPublica</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b59">
	<analytic>
		<title level="a" type="main">An epistemic lens on algorithmic fairness</title>
		<author>
			<persName><forename type="first">E</forename><surname>Edenberg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Wood</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Eaamo &apos;23: Proceedings of the 3Rd Acm Conference on Equity and Access in Algorithms, Mechanisms, and Optimization</title>
				<imprint>
			<date type="published" when="2023">2023</date>
			<biblScope unit="page" from="1" to="10" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b60">
	<analytic>
		<title level="a" type="main">Data for queer lives: How LGBTQ gender and sexuality identities challenge norms of demographics</title>
		<author>
			<persName><forename type="first">B</forename><surname>Ruberg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Ruelos</surname></persName>
		</author>
		<idno type="DOI">10.1177/2053951720933286</idno>
	</analytic>
	<monogr>
		<title level="j">Big Data and Society</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
