<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">AI-fairness and equality of opportunity: a case study on educational achievement</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Ángel</forename><forename type="middle">S</forename><surname>Marrero</surname></persName>
							<email>amarrerl@ull.edu.es</email>
							<affiliation key="aff0">
								<orgName type="department">Department of Economics and Research Center of Social Inequality and Governance</orgName>
								<orgName type="institution">University of La Laguna</orgName>
								<address>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Gustavo</forename><forename type="middle">A</forename><surname>Marrero</surname></persName>
							<email>gmarrero@ull.edu.es</email>
						</author>
						<author>
							<persName><forename type="first">Carlos</forename><surname>Bethencourt</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Department of Economics and Research Center of Social Inequality and Governance</orgName>
								<orgName type="institution">University of La Laguna</orgName>
								<address>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Liam</forename><surname>James</surname></persName>
							<email>liam.james2@unibo.it</email>
							<affiliation key="aff0">
								<orgName type="department">Department of Economics and Research Center of Social Inequality and Governance</orgName>
								<orgName type="institution">University of La Laguna</orgName>
								<address>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="department">Department of Computer Science and Engineering Alma Mater Studiorum</orgName>
								<orgName type="institution">Univerisità di Bologna</orgName>
								<address>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Roberta</forename><surname>Calegari</surname></persName>
							<email>roberta.calegari@unibo.it</email>
							<affiliation key="aff1">
								<orgName type="department">Department of Computer Science and Engineering Alma Mater Studiorum</orgName>
								<orgName type="institution">Univerisità di Bologna</orgName>
								<address>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff2">
								<address>
									<settlement>Santiago de Compostela</settlement>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">AI-fairness and equality of opportunity: a case study on educational achievement</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">1A2F148792CC5456C9105AEA32756A0F</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T16:37+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>AI-fairness, socioeconomic equality of opportunity, AI-ethics Marrero)</term>
					<term>0000-0003-4030-0078 (G. A. Marrero)</term>
					<term>0000-0002-5605-9576 (C. Bethencourt)</term>
					<term>0009-0001-7809-7514 (L. James)</term>
					<term>0000-0003-3794-2942 (R. Calegari)</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>This study focuses on predicting students' academic performance, examining how AI predictive models often reflect socioeconomic inequalities influenced by factors such as parental socioeconomic status and home environment, which affect the fairness of predictions. We compare three AI models aimed at performing an ablation study to understand how these sensitive features (referred to as circumstances) influence predictions. Our findings reveal biases in predictions that favor advantaged groups, depending on whether the goal is to identify excellence or underperformance. Additionally, a two-stage estimation procedure is proposed in the third model to mitigate the impact of sensitive features on predictions, thereby offering a model that can be considered fair with respect to inequality of opportunity.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Academic performance in primary school is a good predictor of an individual's future income and well-being <ref type="bibr" target="#b0">[1]</ref>. Anticipating low academic performance levels is relevant to implementing corrective policies at early ages, and anticipating high academic performance is also relevant to applying incentive mechanisms to achieve excellence <ref type="bibr" target="#b1">[2]</ref>. To achieve this, it is necessary to have good predictive models, in terms of accuracy. However, it is equally important for these models to generate fair predictions <ref type="bibr" target="#b2">[3]</ref>. This means they should provide consistent predictions across different groups, ensuring no disparity in awarding excellence prizes to students regardless of their social background or their parents' educational levels. It is therefore essential to have models that ensure fairness in their predictions.</p><p>In this study, which aims to predict 6th grade educational performance using data relative to the 3rd grade performances, we seek to link the outcomes of predictive models and their degree of bias (as traditionally measured within the AI fairness community) to the socio-economic concept of inequality of opportunity <ref type="bibr" target="#b3">[4]</ref> reflected in the data and real life. Inequality of opportunity in educational achievement is measured as the inequality that is explained by factors that are beyond the control of the individual, such as the socioeconomic status of the parents, the cultural environment at home, the immigrant status, the state of health at birth, the neighborhood of birth, etc. (in the terminology of the AI fairness literature these are the sensitive variables). In particular, we analyze how the existence of certain sensitive variables (referred to as circumstances in economics) that explain a relevant percentage of the inequality in educational achievement are the cause of generating unfair predictions, as long as they influence the predictive models directly or indirectly (i.e., through other predictors). Thus, for a model to generate fair predictions, the predictors included must be free of the influence of variables that generate inequality of opportunities in society.</p><p>To this end, in this paper, we provide a comparison of three AI models that allow us to conduct an ablation study to understand how circumstances can influence predictions in predictive models and how this type of analysis leads to strategies for detecting unfairness in predictions. A two stage estimation procedure is proposed. In the first stage, the predictors are cleaned of the influence of circumstances.</p><p>In the second stage, the target variable is predicted (or adjusted, depending on the objective) using a model that includes predictors not dependent on the sensitive variables. We also observe that a model incorporating sensitive variables that explain the existing inequality of opportunities generates biases in the predictions, favoring one class or another depending on the part of the distribution being predicted. Thus, if the aim is to predict the upper tail of the distribution (to detect excellence), the worst predictions occur for those classes most disadvantaged by the most relevant circumstances (i.e., students whose parents have lower levels of education or who live in homes with a poorer cultural environment). The opposite occurs when the objective is to predict the lower tail of the distribution (i.e., students with worse educational performance). The opposite occurs when the objective is to predict the lower tail of the distribution (i.e., students with worse educational performance). Accordingly, the paper is organized as follows. Section 2 describes the dataset. Section 3 reports the results and discusses them. Finally, Section 4 concludes this preliminary work.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Data and Method</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.">Database</head><p>In this paper, we use the data on primary and secondary education provided by the Canary Agency for University Quality and Educational Evaluation (ACCUEE) from academic years 2015/16 to 2018/19. ACCUEE is a public organization founded with the purpose of continuously improving education at both the university level and other educational stages within the Canary Islands. Among its activities, ACCUEE is responsible for the evaluation and accreditation of programs implemented in various educational centers. It collects data on students' academic performance and relevant variables that determine their environment, considering census characteristics and demographic context, offering a more reliable approach than other national or even international statistics.</p><p>One of the main advantages of this database is that the surveys conducted by ACCUEE on the student population are longitudinal in nature, allowing for the evaluation of the progress and development of the individuals under study. This longitudinal data collection helps to control for factors intrinsic to temporal circumstances while enhancing data quality and enabling estimates and comparisons of the individual over time.</p><p>The database provided by ACCUEE comprises 83,857 observations. Each row refers to a single student at a given grade and academic year. Primary education data for the academic year 2015/16 and 2018/19 is gathered through a comprehensive census of the entire population. For other grades and academic years, the data is collected through sampling. Longitudinal data is also included: students in 3rd grade (primary school) during the 2015/2016 academic year are sampled again in their 6th grade, and this is the information that we use in the application of this paper. The database contains information on 561 variables (columns) for each student, representing data collected from various contextual questionnaires and performance tests across different subjects. Specifically, these columns are categorized into seven thematic blocks:</p><p>• Block 1: ID Variables. This block consists of variables that identify the individual through different approaches (organizational, educational center, academic year studied, survey ID etc. ). • Block 2: Informative Variables. These include codes that identify whether the surveyed individuals responded to the different blocks of questions. • Block 3: Grades Obtained. This block comprises the grades obtained by students in the reference subjects, using a continuous or categorical classification. • Block 4: Student Questionnaire. This block includes questions aimed at understanding the level of agreement (categorical) or the situation (coding/continuous) of the surveyed student. • Block 5: Principal Questionnaire. This block includes questions aimed at understanding the level of agreement (categorical) or the situation (coding/continuous) of the principal of the student's school. Coherence is assumed for the same educational centers. • Block 6: Family Questionnaire. This block includes questions aimed at understanding the level of agreement (categorical) or the situation (coding/continuous) of the family of the surveyed student. Coherence is assumed for the same family units. • Block 7: Teacher Questionnaire. This block includes questions aimed at understanding the level of agreement (categorical) or the situation (coding/continuous) of the tutor of the surveyed student.</p><p>Coherence is assumed for the same tutors.</p><p>The dataset consists of 47,043,777 data points (resulting from the multiplication of rows and columns). However, 21,225,226 of these are classified as missing values, constituting 45.12the database. The fact that this database is purely Canary in nature reduces statistical issues such as sample representativeness or selection bias. However, there are still potential biases in the database. The most relevant ones are:</p><p>• Family Questionnaire Bias. In each grade and academic year, there is a high percentage of students (around 20 -40%) without information on their family situation (Block 6). There may be unexamined correlations between not responding to the family questionnaire and the target variable (academic performance). • Missing Values Bias. As mentioned, there is a large amount of missing data, which could indicate correlations between the student's situation (unobservable) and the target variable. • Sample Bias. Since data collection takes place in the Canary Islands, transferring the obtained results to other regions might not account for characteristics unique to this specific territory.</p><p>We have a target variable 𝑌 𝑡 , which is the performance of students in the 6th grade of primary education. 𝑌 𝑡 − 1, the achievement observed for students (in the same school) in previous years (in 3rd grade in our application), is the most widely used predictor in the related literature. We have a set of sensitive factors (or circumstances), denoted by 𝐶 1 , . . . , 𝐶 𝑛 (see Table <ref type="table" target="#tab_0">1</ref>). These factors are all beyond a student's control, hence any inequality resulting from these factors must be considered as unfair and therefore corrected or compensated. Specifically, the considered circumstances are shown in Table <ref type="table" target="#tab_0">1</ref>.</p><p>Our protected and unprotected groups are defined according to the circumstances considered. Each circumstance is split into two categories, which form the groups. We consider the left category as the unprotected group, while the right category forms the protected groups. These groups can be seen in Table <ref type="table" target="#tab_1">2</ref>. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">Inequality of Opportunity</head><p>To estimate the inequality of opportunity in educational achievement (IOEA) in 3rd grade, we follow the ex-ante approach proposed by <ref type="bibr" target="#b4">[5]</ref> and recently used by <ref type="bibr" target="#b5">[6,</ref><ref type="bibr" target="#b6">7]</ref>, and estimate the following reduced form equation:</p><formula xml:id="formula_0">𝑌 𝑖,𝑡−1 = 𝛼 + 𝐾 ∑︁ 𝑘=1 𝛽 𝑘 𝐶 𝑖,𝑘,𝑡−1 + 𝑣 𝑖,𝑡−1<label>(1)</label></formula><p>From a practical point of view, we want to measure the IOEA at the time policy makers would take policy actions; that is, at the end of the 3rd grade, in order to correct potential unfairness in the 6th grade.</p><p>We estimate equation 1 by ordinary least square (OLS), recover the fitted part, and our measure of IOEA would be the ratio of the variance of 𝑌 𝑡−1 explained by the set of circumstances (i.e., the variance of the fitted part) with respect to the total variance. In a linear model, this ratio is exactly the 𝑅 2 of the estimates. <ref type="bibr" target="#b4">[5]</ref> explains that, when using standardized achievement measures such as those used in this paper, we must disregard using standard inequality indices such as the Gini or the MLD, and a convenient one is using the variance.</p><p>Next, we want to estimate the relative importance of each circumstance on educational achievement, and we use a multivariate regression-based decomposition approach <ref type="bibr" target="#b7">[8,</ref><ref type="bibr" target="#b8">9]</ref>, which adapts the decomposition of <ref type="bibr" target="#b9">[10]</ref> to determine the contribution of each factor (or group of factors) to explaining educational achievement. The relative factor IOEA weight for any circumstance 𝐶 𝑘 is given by:</p><formula xml:id="formula_1">𝑆 𝐶 𝑘 = 𝑐𝑜𝑣[𝛽 𝑘 ˆ𝐶𝑘 , 𝑌 ˆ] 𝜎 ˆ2 𝑌 ^= 𝛽 ˆ𝑘 𝜎 ˆ𝐶𝑘 𝜎 ˆ𝑌 ^𝑐𝑜𝑟[𝑌, 𝑌 ˆ]<label>(2)</label></formula><p>where 𝛽 ˆ𝑘 is the estimated OLS coefficient from 1 associated with circumstance 𝐶 𝑘 , and 𝜎 ˆ𝑌 ^is the variance of the achievement in the 3rd grade (i.e. the fitted target variable in 1).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3.">Predictive Models</head><p>In this section, all the models used in the use case will be defined. The models employed will be based on regression models. We want to start with a simple linear framework to better understand the relevance of the different variables (features) included in the model. This analysis can be easily extended to more sophisticated predictive models, such as conditional inference trees, random forests, or neural network approaches. Our baseline (linear) predicted model is the following (Model 1):</p><formula xml:id="formula_2">𝑌 𝑖,𝑡 = 𝛼 + 𝛽 1 𝑌 𝑖,𝑡−1 + 𝜖 𝑖,𝑡<label>(3)</label></formula><p>where 𝑌 𝑖,𝑡−1 is the academic performance in mathematics at time 𝑡 (6th grade of primary education), 𝑌 𝑖,𝑡−1 is the academic performance at time 𝑡 − 1 (3rd grade of primary education), 𝛽 1 is a parameter to be estimated and 𝜖 𝑖,𝑡 is an error term.</p><p>In order to understand the origin of potential AI unfairness, we also use the following variants. Model 2 extend model 1 including the set of circumstances (protected features) measured at 𝑡 − 1: in addition to including the student's academic performance at 𝑡 − 1, we also include a set 𝑘 of circumstances.</p><formula xml:id="formula_3">𝑌 𝑖,𝑡 = 𝛼 + 𝛽 1 𝑌 𝑖,𝑡−1 + 𝑘 ∑︁ 𝑗=1 𝛽 2𝑗 𝐶 𝑖,𝑗,𝑡−1 + 𝜖 𝑖,𝑡<label>(4)</label></formula><p>Notice that the interpretation of 𝛽 1 in 4 is different than its interpretation in model 1, since 𝑌 𝑖,𝑡−1 correlates with 𝐶 𝑗,𝑡 . Now, the 𝛽 1 is capturing the impact of the educational achievement in 3rd and in 6th, but taking into consideration the potential (unfair) differences generated by our set of circumstances in 3rd grade and extended to 6th and probably to the future. Estimated 𝛽 2𝑗 represent the impact of circumstances in 3rd affecting the achievement in 6th not being channeled through its achievement in 3rd. If the entire impact of circumstances is channeled through its effect on the achievement in 3rd, the estimated 𝛽 2𝑗 coefficients should be close to zero.</p><p>Model 3 decomposes the predictor 𝑌 𝑖,𝑡−1 in the part explained by circumstances and the part nonexplained by circumstances (a residual term). In the IO literature, see <ref type="bibr" target="#b10">[11]</ref> this residual term is associated with the part of achievement not associated with (observed) circumstances and instead associated with effort-related aspects and non-observed circumstances. They might correlate differently with the target variable. For instance, <ref type="bibr" target="#b10">[11]</ref> and <ref type="bibr" target="#b11">[12]</ref>, show that the circumstance part is negatively related to posterior economic growth, while the effort component is positively correlated.</p><p>To estimate Model 3, we start from estimates of model <ref type="bibr" target="#b0">(1)</ref>. Then, we decompose 𝑌 𝑖,𝑡 into its fitted part (the part explained by circumstances and associated with the IO estimated in (1), 𝑌 ˆ𝑖,𝑡−1 and the residual part, 𝑣 ˆ𝑖,𝑡−1 , which captures other factors not included in the model uncorrelated with the considered circumstances. To simplify notation, we call this residual term "Effort". We then estimate Model 3 as follows:</p><p>𝑌 𝑖,𝑡 = 𝛼 + 𝛽 4 𝑌 ˆ𝑖,𝑡−1 + 𝛽 5 𝑣 ˆ𝑖,𝑡−1 + 𝜖 𝑖,𝑡</p><p>By doing that, we want to distinguish predictions of 𝑌 𝑖,𝑡 due exclusively to circumstances (𝛼 + 𝛽 4 𝑌 ˆ𝑖,𝑡−1 ) and due exclusively to effort (𝛼 + 𝛽 4 𝑌 ˆ𝑖,𝑡−1 + 𝛽 5 𝑣 ˆ𝑖,𝑡−1 ), where 𝑌 ˆ𝑖,𝑡−1 represents the average value of the predictions 𝑌 ˆ𝑖,𝑡−1 to have the same average levels in both predictions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.4.">AI fairness metrics</head><p>We evaluate the models in terms of fairness using the equalized odds metric. This metric is satisfied when the model's predictions ensure that students from both protected and unprotected groups (e.g., females and males) have equal recall. Recall is defined as the ratio of True Positives (TP) to the sum of True Positives (TP) and False Negatives (FN).</p><p>For each model, we estimate the student's academic performance prediction (𝑌 𝑡 ˆ). Since this prediction is continuous, we discretize it into quartiles to create our predicted classes. Likewise, we categorize the actual academic performance variable into quartiles to establish our True Class.</p><p>Next, to evaluate the models' predictive fairness for each group, we construct confusion matrices for different quartiles of academic performance. For instance, the confusion matrix for the first quartile of academic performance (below 25th percentile) is shown in Table <ref type="table" target="#tab_2">3</ref>:</p><p>We specifically calculate the equalized odds metric for the low tail of the academic performance distribution (Q1, below the 25th percentile), the center of the distribution (between 25th and 75th percentiles), and the high tail of the distribution (above 75th percentile). For each sensitive feature, we calculate the equalized odds metric, whose values can be associated with fair or unfair model predictions. Fair: Odds are close to 1, thus the model is predicting equally well all groups within each circumstance. Unfair: Odds are far from 1; we might have Odds lower than 1, which means that the predictions will benefit protected groups (lower categories); Odds above 1, which means that they are benefiting unprotected groups (upper categories).</p><p>We will show that the degree of AI unfairness depends on the model used (the type of variables included in the model) and the part of the distributions we look at (upper, middle, or lower).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Results</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Inequality of opportunity estimates</head><p>We estimate equation 1 through OLS. Table <ref type="table">4</ref> shows the relative factor shares: father education, the number of books at home (as proxy of cultural environment), followed by mother's education, general socioeconomic status of the household and the start schooling age, are the most relevant circumstances explaining achievement variability in 3rd grade.</p><p>In the AI terminology, a circumstance must always be considered as a sensitive feature, since, in a fair society, they should not be correlated with its achievement. How does the relevance of each circumstance correlate with AI-(un)fairness measure associated with each sensitive factor?</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">Discussion</head><p>We show below the equalized-odds measure for each circumstance (sensitive feature) obtained for each of the three models used (and four predictions generated): model 1, which includes only achievement in 3rd grade; model 2, which assumes model 1 extended with circumstances; model 3, in which we show the predictions of the part only using circumstances (which would be the inequality of opportunity model) and the part using the effort component (which would be the effort model). We do this for each circumstance and for the prediction of three parts of the distribution.</p><p>• Looking at performance below the 25th percentile, it could be relevant if the policy is aimed at giving reinforcement in 3rd grade to the most disadvantaged students to reduce educational failure in 6th grade. • Looking at performance in the upper tail (with performance above the 75th percentile), which could be relevant if the objective is to reward excellence with a scholarship program, for example. • Looking at intermediate performance, to detect a group of students representative of a class or school to select case and control groups for an education policy experiment.</p><p>The results are shown in Figures <ref type="figure" target="#fig_3">1-3</ref> for the different predicted percentiles. The detailed results for each sensitive variable (circumstance) are presented in Tables <ref type="table" target="#tab_5">5-7</ref>.</p><p>All detailed results can be found in the appendix. We summarize the main results below, which are also recapped in Table <ref type="table" target="#tab_3">5</ref>.</p><p>The model using the predictions of our effort proxy is the only one that achieves AI fairness regardless of which part of the distribution we want to predict. For all sensitive variables, the odd-ratio is very close to one, indicating that the model (whether good or bad) predicts almost equally well one group as another.</p><p>On the other hand, when we consider models (the rest of the predictions) that include circumstances in the model, the predictions come out differently depending on the group we consider and the part of the performance distribution we are predicting.</p><p>Thus, for example, when we seek to predict the upper part of the distribution (i.e., to predict whether the student will be excellent in 6th grade), the baseline model and the rest of the models that include information from the sensitive variables predict worse for the most disadvantaged category (i.e., lower socioeconomic status, with lower educational level of the parents, with fewer books in the home, etc.) than for the favored categories. Using these predictions for decision making (e.g., to award prizes for excellence) would generate a clear injustice in favor of those who have a more advantageous starting point, with the consequent increase (most likely) of unequal opportunities in the future. The model generates more unfair predictions the greater the weight of circumstances in the model (in our case the model that only uses circumstances as a predictor).</p><p>Moreover, the greatest differences in the predictions are seen among the circumstances that turn out to be most relevant in explaining the inequality of opportunities in the initial year. Thus, the model that generates the greatest injustice is when we compare the predictions between students of high and low educated parents, or when we compare students in homes with high and low cultural environments.</p><p>On the contrary, if the aim is to predict the lower tail of the distribution (for example, to give classes to reinforce learning), the prediction model that includes circumstances generates injustices (in the sense that it predicts better for one class than for another) but, in this case, the favored group (that generates better predictions) is the most disadvantaged. That is, it predicts school failure better for children from lower classes or worse circumstances than for children from better circumstances. Applying the predictions of this model would probably help to reduce inequality of opportunity in the future, but applying a bad policy: it improves inequality of opportunity by improving the most disadvantaged, and worsening the most advantaged.</p><p>Ideally, across the entire distribution, predictive models should be fair in the sense that they do not discriminate one group of different circumstances against another, and that this will eventually equalize the outcome of individuals.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Conclusion</head><p>In this study, we have explored the complexities of predicting students' academic performance using AI models, with a particular focus on addressing socioeconomic inequalities that influence predictive outcomes. Our analysis revealed significant biases in predictions favoring advantaged groups, particularly   Through a comparative evaluation of three AI models and an ablation study, we demonstrated how these sensitive features, also known as circumstances, can distort predictions, leading to unfair outcomes in educational assessments. Importantly, we proposed a two-stage estimation procedure in our third model to mitigate these biases.</p><p>The findings underscore the critical importance of integrating fairness considerations into predictive modeling practices, particularly in educational settings where equitable outcomes are essential. Future research should further refine and validate these methodologies across diverse datasets and educational contexts to foster more inclusive and equitable predictive models. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A. Complete Results</head><p>In the tables shown in the appendix, numbers or letters are used in brackets to distinguish results achieved by the different models presented in this work:  </p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Predictions below the 25th percentile. Odds of protected groups</figDesc><graphic coords="8,163.68,282.10,265.42,226.77" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 2 :Figure 3 :</head><label>23</label><figDesc>Figure 2: Predictions between the 25th and 75th percentiles. Odds of protected groups</figDesc><graphic coords="9,136.03,113.72,320.72,223.31" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>• 1 :</head><label>1</label><figDesc>refers to model 1 (Equation 3) • 2: refers to model 2 (Equation 4) • C: refers to the circumstance-based model • E: refers to the effort-based model (for this and the previous bullet point see the discussion regarding Model 3 in Section 2.3)</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1</head><label>1</label><figDesc>Set of circumstances (or sensitive features)</figDesc><table><row><cell>Circumstance</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2</head><label>2</label><figDesc>Protected and unprotected groups</figDesc><table><row><cell>Circumstance</cell><cell>Group</cell></row><row><cell>Gender</cell><cell>Male, Female</cell></row><row><cell>Mother Education</cell><cell>Tertiary education, Rest of educational levels</cell></row><row><cell>Father education</cell><cell>Tertiary education, Rest of educational levels</cell></row><row><cell>Mother occupation</cell><cell>High occupation, Rest of occupational levels</cell></row><row><cell>Father occupation</cell><cell>High occupation, Rest of occupational levels</cell></row><row><cell>Immigrant status)</cell><cell>No, Yes</cell></row><row><cell>Start schooling age</cell><cell>Less than 3 years old, More than 3 years old</cell></row><row><cell>Number of books in household</cell><cell>More than 1000 books, Less than 1000 books</cell></row><row><cell>Public/private school</cell><cell>Public, Private</cell></row><row><cell>Capital/non capital island</cell><cell>Yes (Capital), No</cell></row><row><cell>See adults reading books</cell><cell>Daily, Not daily</cell></row><row><cell>Teachers who change school each year</cell><cell>Almost no change, Change</cell></row><row><cell>Socioeconomic status</cell><cell>Above the median, Below the median</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 3</head><label>3</label><figDesc>Protected and unprotected groups</figDesc><table><row><cell>Circumstance</cell><cell>Group</cell></row><row><cell cols="2">Positive (Below 25) Negative (Above 25)</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 5</head><label>5</label><figDesc>Summary of results</figDesc><table><row><cell></cell><cell>Below 25th percentile</cell><cell>Between 25th and 75th percentiles</cell><cell>Above 75th percentile</cell></row><row><cell></cell><cell>Tail of the</cell><cell>Center of the</cell><cell>High tail of the</cell></row><row><cell></cell><cell>distribution</cell><cell>distribution</cell><cell>distribution</cell></row><row><cell></cell><cell>Unfairly favours</cell><cell></cell><cell>Unfairly favours</cell></row><row><cell>Model 1 (baseline)</cell><cell>protected groups</cell><cell>?</cell><cell>unprotected groups</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_4"><head>Table 6</head><label>6</label><figDesc>Prediction below 25th percentile</figDesc><table><row><cell></cell><cell></cell><cell></cell><cell>Eq.</cell><cell></cell><cell>Eq.</cell><cell></cell><cell>Eq.</cell><cell></cell><cell>Eq.</cell></row><row><cell>Circumstance</cell><cell>Category</cell><cell>Recall (1)</cell><cell>Odds (1)</cell><cell>Recall (2)</cell><cell>Odds (2)</cell><cell>Recall (C)</cell><cell>Odds (C)</cell><cell>Recall (E)</cell><cell>Odds (E)</cell></row><row><cell>Males</cell><cell></cell><cell>0.50</cell><cell>1.00</cell><cell>0.49</cell><cell>1.00</cell><cell>0.31</cell><cell>1.00</cell><cell>0.49</cell><cell>1.00</cell></row><row><cell>Gender</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>Females</cell><cell></cell><cell>0.47</cell><cell>0.94</cell><cell>0.50</cell><cell>1.02</cell><cell>0.40</cell><cell>1.31</cell><cell>0.43</cell><cell>0.89</cell></row><row><cell cols="2">Tertiary education</cell><cell>0.40</cell><cell>1.00</cell><cell>0.32</cell><cell>1.00</cell><cell>0.03</cell><cell>1.00</cell><cell>0.51</cell><cell>1.00</cell></row><row><cell>Mother education</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="2">Rest of educational levels</cell><cell>0.53</cell><cell>1.33</cell><cell>0.59</cell><cell>1.84</cell><cell>0.53</cell><cell>18.44</cell><cell>0.43</cell><cell>0.8</cell></row><row><cell cols="2">Tertiary education</cell><cell>0.39</cell><cell>1.00</cell><cell>0.24</cell><cell>1.00</cell><cell>0.02</cell><cell>1.00</cell><cell>0.55</cell><cell>1.00</cell></row><row><cell>Father education</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="2">Rest of educational levels</cell><cell>0.51</cell><cell>1.32</cell><cell>0.57</cell><cell>2.40</cell><cell>0.46</cell><cell>19.63</cell><cell>0.43</cell><cell>0.78</cell></row><row><cell cols="2">High occupation</cell><cell>0.43</cell><cell>1.00</cell><cell>0.45</cell><cell>1.00</cell><cell>0.22</cell><cell>1.00</cell><cell>0.48</cell><cell>1.00</cell></row><row><cell>Mother Occupation</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="2">Rest of occupational levels</cell><cell>0.50</cell><cell>1.15</cell><cell>0.51</cell><cell>1.14</cell><cell>0.39</cell><cell>1.77</cell><cell>0.45</cell><cell>0.95</cell></row><row><cell cols="2">High occupation</cell><cell>0.44</cell><cell>1.00</cell><cell>0.43</cell><cell>1.00</cell><cell>0.25</cell><cell>1.00</cell><cell>0.46</cell><cell>1.0</cell></row><row><cell>Father Occupation</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="2">Rest of occupational levels</cell><cell>0.50</cell><cell>1.13</cell><cell>0.52</cell><cell>1.19</cell><cell>0.39</cell><cell>1.58</cell><cell>0.46</cell><cell>0.99</cell></row><row><cell cols="2">More than 100</cell><cell>0.38</cell><cell>1.00</cell><cell>0.29</cell><cell>1.00</cell><cell>0.02</cell><cell>1.00</cell><cell>0.54</cell><cell>1.0</cell></row><row><cell>Books in household</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="2">Less than 100</cell><cell>0.51</cell><cell>1.32</cell><cell>0.53</cell><cell>1.83</cell><cell>0.42</cell><cell>17.60</cell><cell>0.44</cell><cell>0.8</cell></row><row><cell cols="2">Almost no change</cell><cell>0.43</cell><cell>1.00</cell><cell>0.43</cell><cell>1.00</cell><cell>0.16</cell><cell>1.00</cell><cell>0.50</cell><cell>1.0</cell></row><row><cell>Teachers changing school each year</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>Change</cell><cell></cell><cell>0.51</cell><cell>1.18</cell><cell>0.52</cell><cell>1.21</cell><cell>0.44</cell><cell>2.67</cell><cell>0.44</cell><cell>0.9</cell></row><row><cell>No</cell><cell></cell><cell>0.48</cell><cell>1.00</cell><cell>0.50</cell><cell>1.00</cell><cell>0.35</cell><cell>1.00</cell><cell>0.46</cell><cell>1.09</cell></row><row><cell>Immigration status</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>Yes</cell><cell></cell><cell>0.52</cell><cell>1.09</cell><cell>0.46</cell><cell>0.93</cell><cell>0.38</cell><cell>1.08</cell><cell>0.50</cell><cell>1.0</cell></row><row><cell>Private</cell><cell></cell><cell>0.40</cell><cell>1.00</cell><cell>0.35</cell><cell>1.00</cell><cell>0.09</cell><cell>1.00</cell><cell>0.50</cell><cell>1.00</cell></row><row><cell>Public/private school</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>Public</cell><cell></cell><cell>0.51</cell><cell>1.26</cell><cell>0.53</cell><cell>1.51</cell><cell>0.43</cell><cell>4.94</cell><cell>0.45</cell><cell>0.9</cell></row><row><cell>Yes</cell><cell></cell><cell>0.47</cell><cell>1.00</cell><cell>0.49</cell><cell>1.00</cell><cell>0.32</cell><cell>1.00</cell><cell>0.45</cell><cell>1.00</cell></row><row><cell>Capital island</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>No</cell><cell></cell><cell>0.57</cell><cell>1.20</cell><cell>0.55</cell><cell>1.12</cell><cell>0.58</cell><cell>1.81</cell><cell>0.49</cell><cell>1.08</cell></row><row><cell cols="2">Less than 3 years old</cell><cell>0.47</cell><cell>1.00</cell><cell>0.44</cell><cell>1.00</cell><cell>0.26</cell><cell>1.00</cell><cell>0.48</cell><cell>1.00</cell></row><row><cell>Start schooling age</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="2">More than 3 years old</cell><cell>0.50</cell><cell>1.06</cell><cell>0.57</cell><cell>1.29</cell><cell>0.51</cell><cell>2.00</cell><cell>0.43</cell><cell>0.88</cell></row><row><cell>Daily</cell><cell></cell><cell>0.48</cell><cell>1.00</cell><cell>0.47</cell><cell>1.00</cell><cell>0.29</cell><cell>1.00</cell><cell>0.48</cell><cell>1.00</cell></row><row><cell>See adults reading books</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="2">More rarely (not daily)</cell><cell>0.49</cell><cell>1.02</cell><cell>0.52</cell><cell>1.11</cell><cell>0.40</cell><cell>1.40</cell><cell>0.45</cell><cell>0.94</cell></row><row><cell cols="2">Above the median</cell><cell>0.41</cell><cell>1.00</cell><cell>0.34</cell><cell>1.00</cell><cell>0.07</cell><cell>1.00</cell><cell>0.51</cell><cell>1.00</cell></row><row><cell>Socioeconomic status</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="2">Below the median</cell><cell>0.54</cell><cell>1.32</cell><cell>0.60</cell><cell>1.79</cell><cell>0.55</cell><cell>8.44</cell><cell>0.43</cell><cell>0.84</cell></row><row><cell>Average</cell><cell></cell><cell></cell><cell>1.18</cell><cell></cell><cell>1.41</cell><cell></cell><cell>6.36</cell><cell></cell><cell>0.92</cell></row><row><cell>Weighted Average</cell><cell></cell><cell></cell><cell>1.29</cell><cell></cell><cell>1.89</cell><cell></cell><cell>14.68</cell><cell></cell><cell>0.83</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_5"><head>Table 7</head><label>7</label><figDesc>Predictions between the 25th and 75th percentiles</figDesc><table><row><cell></cell><cell></cell><cell></cell><cell>Eq.</cell><cell></cell><cell>Eq.</cell><cell></cell><cell>Eq.</cell><cell></cell><cell>Eq.</cell></row><row><cell>Circumstance</cell><cell>Category</cell><cell>Recall (1)</cell><cell>Odds (1)</cell><cell>Recall (2)</cell><cell>Odds (2)</cell><cell>Recall (C)</cell><cell>Odds (C)</cell><cell>Recall (E)</cell><cell>Odds (E)</cell></row><row><cell>Males</cell><cell></cell><cell>0.56</cell><cell>1.00</cell><cell>0.58</cell><cell>1.00</cell><cell>0.51</cell><cell>1.00</cell><cell>0.55</cell><cell>1.00</cell></row><row><cell>Gender</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>Females</cell><cell></cell><cell>0.54</cell><cell>0.96</cell><cell>0.55</cell><cell>0.95</cell><cell>0.51</cell><cell>1.00</cell><cell>0.52</cell><cell>0.95</cell></row><row><cell cols="2">Tertiary education</cell><cell>0.58</cell><cell>1.00</cell><cell>0.62</cell><cell>1.00</cell><cell>0.53</cell><cell>1.00</cell><cell>0.52</cell><cell>1.00</cell></row><row><cell>Mother education</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="2">Rest of educational levels</cell><cell>0.53</cell><cell>0.92</cell><cell>0.52</cell><cell>0.83</cell><cell>0.49</cell><cell>0.92</cell><cell>0.55</cell><cell>1.06</cell></row><row><cell cols="2">Tertiary education</cell><cell>0.56</cell><cell>1.00</cell><cell>0.61</cell><cell>1.00</cell><cell>0.41</cell><cell>1.00</cell><cell>0.50</cell><cell>1.00</cell></row><row><cell>Father education</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="2">Rest of educational levels</cell><cell>0.55</cell><cell>0.97</cell><cell>0.54</cell><cell>0.89</cell><cell>0.55</cell><cell>1.32</cell><cell>0.55</cell><cell>1.12</cell></row><row><cell cols="2">High occupation</cell><cell>0.56</cell><cell>1.00</cell><cell>0.56</cell><cell>1.00</cell><cell>0.42</cell><cell>1.00</cell><cell>0.53</cell><cell>1.00</cell></row><row><cell>Mother Occupation</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="2">Rest of occupational levels</cell><cell>0.55</cell><cell>0.99</cell><cell>0.56</cell><cell>1.01</cell><cell>0.53</cell><cell>1.27</cell><cell>0.54</cell><cell>1.02</cell></row><row><cell cols="2">High occupation</cell><cell>0.56</cell><cell>1.00</cell><cell>0.60</cell><cell>1.00</cell><cell>0.44</cell><cell>1.00</cell><cell>0.53</cell><cell>1.00</cell></row><row><cell>Father Occupation</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="2">Rest of occupational levels</cell><cell>0.55</cell><cell>0.99</cell><cell>0.55</cell><cell>0.92</cell><cell>0.53</cell><cell>1.21</cell><cell>0.54</cell><cell>1.02</cell></row><row><cell cols="2">More than 100</cell><cell>0.59</cell><cell>1.00</cell><cell>0.62</cell><cell>1.00</cell><cell>0.41</cell><cell>1.00</cell><cell>0.53</cell><cell>1.00</cell></row><row><cell>Books in household</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="2">Less than 100</cell><cell>0.54</cell><cell>0.92</cell><cell>0.55</cell><cell>0.88</cell><cell>0.53</cell><cell>1.32</cell><cell>0.54</cell><cell>1.02</cell></row><row><cell cols="2">Almost no change</cell><cell>0.58</cell><cell>1.00</cell><cell>0.60</cell><cell>1.00</cell><cell>0.50</cell><cell>1.00</cell><cell>0.53</cell><cell>1.00</cell></row><row><cell>Teachers changing school each year</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>Change</cell><cell></cell><cell>0.54</cell><cell>0.93</cell><cell>0.54</cell><cell>0.91</cell><cell>0.51</cell><cell>1.03</cell><cell>0.54</cell><cell>1.03</cell></row><row><cell>No</cell><cell></cell><cell>0.55</cell><cell>1.00</cell><cell>0.56</cell><cell>1.00</cell><cell>0.51</cell><cell>1.00</cell><cell>0.54</cell><cell>1.00</cell></row><row><cell>Immigration status</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>Yes</cell><cell></cell><cell>0.55</cell><cell>0.99</cell><cell>0.62</cell><cell>1.11</cell><cell>0.54</cell><cell>1.07</cell><cell>0.53</cell><cell>0.99</cell></row><row><cell>Private</cell><cell></cell><cell>0.59</cell><cell>1.00</cell><cell>0.62</cell><cell>1.00</cell><cell>0.46</cell><cell>1.00</cell><cell>0.55</cell><cell>1.00</cell></row><row><cell>Public/private school</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>Public</cell><cell></cell><cell>0.54</cell><cell>0.91</cell><cell>0.55</cell><cell>0.89</cell><cell>0.52</cell><cell>1.12</cell><cell>0.53</cell><cell>0.97</cell></row><row><cell>Yes</cell><cell></cell><cell>0.56</cell><cell>1.00</cell><cell>0.56</cell><cell>1.00</cell><cell>0.51</cell><cell>1.00</cell><cell>0.53</cell><cell>1.00</cell></row><row><cell>Capital island</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>No</cell><cell></cell><cell>0.52</cell><cell>0.94</cell><cell>0.58</cell><cell>1.04</cell><cell>0.48</cell><cell>0.93</cell><cell>0.55</cell><cell>1.04</cell></row><row><cell cols="2">Less than 3 years old</cell><cell>0.56</cell><cell>1.00</cell><cell>0.59</cell><cell>1.00</cell><cell>0.52</cell><cell>1.00</cell><cell>0.52</cell><cell>1.00</cell></row><row><cell>Start schooling age</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="2">More than 3 years old</cell><cell>0.55</cell><cell>0.98</cell><cell>0.52</cell><cell>0.88</cell><cell>0.48</cell><cell>0.91</cell><cell>0.57</cell><cell>1.09</cell></row><row><cell>Daily</cell><cell></cell><cell>0.56</cell><cell>1.00</cell><cell>0.59</cell><cell>1.00</cell><cell>0.51</cell><cell>1.00</cell><cell>0.53</cell><cell>1.00</cell></row><row><cell>See adults reading books</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="2">More rarely (not daily)</cell><cell>0.55</cell><cell>0.98</cell><cell>0.54</cell><cell>0.91</cell><cell>0.50</cell><cell>0.98</cell><cell>0.54</cell><cell>1.03</cell></row><row><cell cols="2">Above the median</cell><cell>0.57</cell><cell>1.00</cell><cell>0.61</cell><cell>1.00</cell><cell>0.54</cell><cell>1.00</cell><cell>0.52</cell><cell>1.00</cell></row><row><cell>Socioeconomic status</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="2">Below the median</cell><cell>0.54</cell><cell>0.94</cell><cell>0.52</cell><cell>0.85</cell><cell>0.48</cell><cell>0.90</cell><cell>0.55</cell><cell>1.07</cell></row><row><cell>Average</cell><cell></cell><cell></cell><cell>0.96</cell><cell></cell><cell>0.93</cell><cell></cell><cell>1.08</cell><cell></cell><cell>1.03</cell></row><row><cell>Weighted Average</cell><cell></cell><cell></cell><cell>0.94</cell><cell></cell><cell>0.87</cell><cell></cell><cell>1.15</cell><cell></cell><cell>1.06</cell></row></table></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>This paper was partially supported by the "AEQUITAS" project funded by the European Union's Horizon Europe research and innovation programme under grant number 101070363.</p></div>
			</div>

			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0" />			</div>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Factors predicting the subjective well-being of nations</title>
		<author>
			<persName><forename type="first">E</forename><surname>Diener</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">L</forename><surname>Diener</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Diener</surname></persName>
		</author>
		<ptr target="https://api.semanticscholar.org/CorpusID:20833520" />
	</analytic>
	<monogr>
		<title level="j">Journal of personality and social psychology</title>
		<imprint>
			<biblScope unit="volume">69</biblScope>
			<biblScope unit="page" from="851" to="864" />
			<date type="published" when="1995">1995</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">F</forename><surname>Ladd</surname></persName>
		</author>
		<ptr target="https://api.semanticscholar.org/CorpusID:21305932" />
		<title level="m">Holding schools accountable: Performance-based reform in education</title>
				<imprint>
			<date type="published" when="1996">1996</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Towards accurate and fair prediction of college success: Evaluating different sources of student data</title>
		<author>
			<persName><forename type="first">R</forename><surname>Yu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Q</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Fischer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Doroudi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Xu</surname></persName>
		</author>
		<ptr target="https://api.semanticscholar.org/CorpusID:220486717" />
	</analytic>
	<monogr>
		<title level="m">Educational Data Mining</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Inequality of opportunity for higher education</title>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">H</forename><surname>Sewell</surname></persName>
		</author>
		<ptr target="https://api.semanticscholar.org/CorpusID:34820586" />
	</analytic>
	<monogr>
		<title level="j">American Sociological Review</title>
		<imprint>
			<biblScope unit="volume">36</biblScope>
			<biblScope unit="page">793</biblScope>
			<date type="published" when="1971">1971</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">H G</forename><surname>Ferreira</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Gignoux</surname></persName>
		</author>
		<ptr target="https://api.semanticscholar.org/CorpusID:260636749" />
		<title level="m">The measurement of educational inequality : Achievement and opportunity 1</title>
				<imprint>
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Unfair inequality and growth</title>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">A</forename><surname>Marrero</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">G</forename><surname>Rodríguez</surname></persName>
		</author>
		<ptr target="https://api.semanticscholar.org/CorpusID:258023458" />
	</analytic>
	<monogr>
		<title level="j">The Scandinavian Journal of Economics</title>
		<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Inequality of opportunity in educational achievement in western europe: contributors and channels</title>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">A</forename><surname>Marrero</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">C</forename><surname>Palomino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Sicilia</surname></persName>
		</author>
		<ptr target="https://api.semanticscholar.org/CorpusID:265183210" />
	</analytic>
	<monogr>
		<title level="j">The Journal of Economic Inequality</title>
		<imprint>
			<biblScope unit="volume">22</biblScope>
			<biblScope unit="page" from="383" to="410" />
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<author>
			<persName><forename type="first">M</forename><surname>Brewer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Wren-Lewis</surname></persName>
		</author>
		<ptr target="https://api.semanticscholar.org/CorpusID:19612310" />
		<title level="m">Accounting for changes in income inequality: Decomposition analyses for the uk</title>
				<imprint>
			<date type="published" when="1978">1978. 2016</date>
		</imprint>
	</monogr>
	<note>ERN: Other Econometrics: Mathematical Methods &amp; Programming (Topic)</note>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">S</forename><surname>Fields</surname></persName>
		</author>
		<ptr target="https://api.semanticscholar.org/CorpusID:202257605" />
		<title level="m">Accounting for income inequality and its change: A new method, with application to the distribution of earnings in the united states</title>
				<imprint>
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Inequality decompositions by factor components</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">F</forename><surname>Shorrocks</surname></persName>
		</author>
		<ptr target="https://api.semanticscholar.org/CorpusID:7478703" />
	</analytic>
	<monogr>
		<title level="j">Econometrica</title>
		<imprint>
			<biblScope unit="volume">50</biblScope>
			<biblScope unit="page" from="193" to="211" />
			<date type="published" when="1982">1982</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<monogr>
		<title level="m" type="main">Inequality of opportunity and growth</title>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">A</forename><surname>Marrero</surname></persName>
		</author>
		<ptr target="https://api.semanticscholar.org/CorpusID:67827959" />
		<imprint>
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<monogr>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">A</forename><surname>Marrero</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">G</forename><surname>Rodríguez</surname></persName>
		</author>
		<ptr target="https://api.semanticscholar.org/CorpusID:154345034" />
		<title level="m">Inequality of opportunity in europe, Microeconomics: Welfare Economics &amp; Collective Decision-Making eJournal</title>
				<imprint>
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
