<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Gender violence's models and discrimination-aware data mining?</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Ponti cia Universidad Catolica del Peru</institution>
          ,
          <addr-line>Av. Universitaria 1801, San Miguel Lima-</addr-line>
          <country country="PE">Peru</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The violence against women, bases in the inequity of opportunities in every social stratum. According to national surveys, women and girls are considered a vulnerable population due to the inequalities in access to essential services such as education, economic independence and technology. Currently, governments show interest to address this problem and provide parity of opportunities to contribute to the development of society. Therefore, several social studies analysed the situation of women and their impact on welfare indicators regarding the development of the population, in this context, some techniques propose solutions through Data Mining to measure and recognise possible discrimination and violence. The thesis work intends to develop data mining models to identi ed discrimination mainly towards women and girls, taking into account the environmental factors such as individual, community, social and institutions.</p>
      </abstract>
      <kwd-group>
        <kwd>Discrimination aware association rules Spatial-Data Mining Gender violence</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        The World Health Organisation de nes gender violence as behaviour towards a
person, which caused physical, psychological or sexual harm. Around the world,
one out of every three women over 15 years old, is a victim of physical or verbal
aggression [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. Gender Violence is considered a global problem, categorised as
a hate crime according to the legislation of some countries [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] and as part of a
series of consequences related to social stability.
      </p>
      <p>
        The relationship between discrimination and violence has been de ned by
[
        <xref ref-type="bibr" rid="ref18">18</xref>
        ], since the feeling of disagreement, whether by the oppressor or the relegated
group, often generates a violent response.
      </p>
      <p>However, most of social studies seek global di erences rather than analysed
gender attribute in isolation, also gender discrimination can appear in speci c
contexts, because of this, registries from discrimination facts are scares and
comes from di erent references, complicating the analysis.</p>
      <p>
        In contrast, the use of social networks and digital devices (mobile phones,
banking transactions, etc.), leave a trace which records the behaviour of people
? Supported by Universidad del Pac co-Early Stage
[
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], making possible to describe the context of interaction, through an
automated data collection in higher volume of data.
      </p>
      <p>
        For instance, the notion of ubiquity shows inequity can be compare on the
level of mobility in men and women. It is not possible to a rm that the role of
a housewife is rooted in a particular geographic space, but through studies of
trajectories, we can see the distances that women travel with respect to men are
considerably smaller [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] or like shopping habits [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ], where women with regard
to men register higher expenses in supermarkets and grocery stores.
      </p>
      <p>This context leads us to pose the following question: &gt;How to measure gender
discrimination in society through digital records?.</p>
      <p>This article is organised as follow: The state of the art gathers research
antecedents from social sciences and data sciences, to identify phenomena and
factors related to violence; the proposal and approximations regarding the state
of the art, the methodology to validate the hypothesis; so far, the results show
information about environmental factors and nally the discussion about the
di culties that this research face it.
2</p>
    </sec>
    <sec id="sec-2">
      <title>State-of-the-art</title>
      <p>
        Our research lines has two main groups, the social sciences and the data sciences.
In this context, International Institutions are interest to address this problem on
detail. They develop indicators about environment factors, which might arise the
persistence of violence in emerging societies or not. [
        <xref ref-type="bibr" rid="ref5 ref6">5, 6</xref>
        ]. These studies describes
a large gap for access to basic services of housing, health, food and education
[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>
        Another view, mention that lifestyle have an in uence of men who report
having perpetrated physical violence towards their couple throughout his life
[
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. For instance, multi-variant analysis of community factors, also found that
men who were witness of family violence from father to mother, are more likely
to engage in gender violence, reinforcing the theory of the transmission of
intergenerational violence [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. In an attempt to predict gender gaps in children,
they included the participation of parents and their perspective of gender roles,
nding that parents with gender role paradigms, will have children with the
same stereotypes [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>
        They concluded, that it is necessary to understand how inequity a ects the
development of societies, the di culties to face these problems and the factors
predisposing to disparity [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ].
      </p>
      <p>
        Although most of these studies propose indicators to face inequity [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], they
are not in agreement with each other, due to a variety of variables, which is valid
and does not turn out to be a problem in itself. However, they are not addressing
the interaction of these factors nor the in uence on discrimination, that might
give emphasises a particular element to establish priorities. The proposal refers
to this problem, to incorporated data mining techniques to improve the
granularity of the rules that support decision making process.
In contrast, Data Mining techniques allow a variate of task (managing data,
measuring and predicting social phenomena), as long as it relies on information
of the real context (variable and labels) to build models.
      </p>
      <p>
        The measures to determine discrimination, in general, have not been fully
developed by the data sciences, although, the Discrimination-Awareness Data mining
(DADM) [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ] address this issue becoming into an ever-increasing eld,
discovering discrimination hidden in a volume of historical records for decision making,
recognising direct discrimination 1 and indirect 2 according to social context.
      </p>
      <p>
        As indirect discrimination is not explicit in electronic digital records, to infer
direct discrimination it is necessary to know the external and individual factors
of the study sample (sex, race, pregnancy status, age, ethnicity, religion, state
civil). Thus, some research papers [
        <xref ref-type="bibr" rid="ref2 ref21">2, 21</xref>
        ] nd a relationship of inequity between
social factors such as illiteracy, child malnutrition, access to contraceptives, and
external factors (climate, geography, etc.), in geographic spaces. Methodological
works proposed another measures of discrimination [
        <xref ref-type="bibr" rid="ref12 ref15 ref23 ref9">9, 12, 15, 23</xref>
        ], considering the
social and individual factors, concerning the causes of direct segregation towards
people. Some variations of the data transformation process, suggests using the
evenly distributed information to improve the representativeness of all the groups
in the datasets [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ].
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Proposed Approach</title>
      <p>
        The thesis work proposes two goals within the framework of state-of-the-art:
First, to develop models to measure discrimination based on digital records.
The guiding is the DADM, to build classi cation rules that recognise direct and
indirect discrimination, as long as there will two elements: the class that de nes
a discriminating rules and the context to validate the elements of the rule are
discriminant, as shown in Figure 1 [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. where it is necessary know the background
to unveil discrimination.
      </p>
      <p>
        Because discrimination records are scare, we include historical information
from two di erent periods and various geographical spaces. According
state-ofart, the rules generated from databases with a known class type, provide enough
information to classi ed the rules into potentially discriminant (PD) and
nondiscriminating(PND). However, is not the case of our data, so it is necessary
to develop new strategies to determine the class. These rules are relationships
and will add quality to the results like as co-occurrences, which means that it
is likely they would be related to social context. The second goal, refers to the
validation of the models carried out in the rst stage through socio-demographic
context. However, it would be necessary to demonstrate that these variables are
su cient or in any case, to experiment and applying engineering features.
1 Direct discrimination: explicit, impose barriers between a group. Ex. Ethnicity:
indigenous, History: Good Credit: Poor
2 Indirect discrimination: not explicit, but impose barriers between a group (conscious
or unconscious). Ex. Provenance: Rural, History: Good Credit: Bad; if it is known
that the rural population has a high percentage of indigenous people.
The KDD (Knowledge discovery databases) process of extracting patterns from
databases is composed of four stages [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. The process begins obtaining the data
and ends with the validation of the results (patterns). These stages will serve as
the guiding thread for the realisation of the research project.
      </p>
    </sec>
    <sec id="sec-4">
      <title>Results</title>
      <p>
        The rst approach to the fusion of social sciences to data sciences for this work,
was the prepossessing of qualitative information extracted from interviews with
university females students [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. In that work, relevant testimonies of the
interviewees were extracted through a process based on the use of TF-IDF (Term
frequency - Inverse document frequency).
      </p>
      <p>
        The research mentioned above, aims to know which agents were involved
in some episodes of discrimination within university atmosphere. To this end,
a survey of open questions about events of discrimination was developed, such
as: "Did you ever hear in the university environment mentioning that women
are di erent concerning to their professional performance? Mention what you
heard and the person who said it". This survey was formulated by social experts
who manage the variables according to the-state-of-the-art mentioned in [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
In sum, through the extraction of relevant words from each testimony, it is
possible to know which testimonials represent others in a set of documents. Our
methodology is available in Github 3.
      </p>
      <p>In this proposal, the propose method named A0 are more likely to contain
sets of di erent words related to the topic. In contrast, Topic Modeling methods
are memory expensive compared to methods based on TF-IDF. We conclude
that the process A0 is e cient in the extraction of relevant words, optimising
the retrieval information for qualitative research in simple and complex data.
However, the e ciency of A0 is linked to the improvement of data pre-processing
techniques.</p>
      <p>The sample for that work is small (214 records). Although the extracted
testimonies may represent the discrimination su ered by the students of that
school, it is not possible to generalise what happens with all the women in the
same space. Due to this, the decision was made to collect structured information
available in di erent resources and to unify it to be processed later.</p>
      <p>For this article, databases were compiled related to the individual and
environmental factors that in uence gender violence. As seen in Tables 1, 2 and 3,
each factor mentioned by the studies described in Section 2, it corresponds to a
series of les that contain information related to the description of the variables.</p>
      <p>The description of this data has been crucial to understand how the
individual, community, social and institutional factors are related, as well as the
recognition of variables between the automatic and socio-demographic
information.</p>
      <p>For instance, suppose that the amount of monthly expenditure sustains the
purchasing power of the people and is de ned by the type of work that they
have. A well-paid job requires specialised skills and a high degree of training
or education. Although it is not possible to assure that there is a relationship
between educational level and the amount of monthly expenses in our data. As
Figure 2 shows, inequality in educational level between men and women from 0
to 89 years (y axis) in Peru in 2016, the green bars represent the data of women
3 Source code: https://github.com/bitmapup/violencePatterns
who, as observed, have a numerical di erence with respect to their male pairs
at each level of the x axis (blue bars). In contrast, Figure 5 shows population
Factor Vfiaocletonrce Name of datasets Columns Records wFeiigleht tDyapteas Concatleenated
Individual IWndomivaidnual ab..rreecch00151 ab.. 75 ccoolluummnnss ab.. 3354302002 ab.. 11..93 MMBB ab.. iinntt6644((64)),,oobbjejecct(t1(1)) CRWoeecliuogmrhdtns:s:2:3.8150M84B
Home Community ab.. rreeccv081411 ab.. 752ccoolulummnnss ab.. 3334106082 ab.. 118.3.2MMBB ab.. inota6t46(44()7,1o)b,joebcjte(c1t)(1) CRWoeecliuogmrhdtns:s:5:1.68165M43B
Table 1. Characteristics of the ENDES datasets associated with gender violence factors
Violence
Factor</p>
      <p>Name</p>
      <p>Weight</p>
      <p>Columns Records of the le
Institutional Chapter100 37 columns 1177
Data Concatenated
types le</p>
      <p>Weight: 285.1 MB
340.4 MB int64(28), object(3) Records: 1177</p>
      <p>
        Columns: 31
Where n is the total of transactions and P u is the accumulated average monthly
expenditure per user; class indicates the social class associated with that type of
behaviour and monthly expense amount. This research will not speci cally deal
with the veri cation of this formula, but it is necessary to point out that in [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ],
there are certain parameters to follow to nd this variable social class. As Figure
5, di erences of socio-economic class between man(blue bars) and women(green
bars) are remarkable repect to the amount of expenditure (axisy). Error showed
is caused due to few register with outlier measure.
6
      </p>
    </sec>
    <sec id="sec-5">
      <title>Discussion</title>
      <p>In contrast to DADM studies so far, information used in the thesis proposal is
varied and voluminous, coming from di erent sources of known space-time
contexts, which in some way allows knowing the continuity of particular phenomena
(prediction of events). The discrimination metrics proposed by the social sciences
were formulated from multivariate analysis and classical statistics with some
aspects and factors. In this research, the maximum number of possible variables
that could or could not appear according to the proposed model will be used
simultaneously, forming strong association rules automatically and objectively.
So, is recommendable begin through the de nition of discrimination measures
in vast and varied information, that contributes to social research in the search
for explanations of social events such as gender violence.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>H.</given-names>
            <surname>Alatrista-Salas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Hidalgo-Leon</surname>
          </string-name>
          ,
          <article-title>Nun~ez-del-</article-title>
          <string-name>
            <surname>Prado</surname>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>Documents Retrieval for qualitative Research: Gender Discrimination Analysis</article-title>
          .
          <string-name>
            <surname>LA-CCI</surname>
            <given-names>IEEE</given-names>
          </string-name>
          , conference.(
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Bosco</surname>
          </string-name>
          ,
          <string-name>
            <surname>Claudio</surname>
          </string-name>
          , et al.
          <article-title>"Exploring the high-resolution mapping of genderdisaggregated development indicators</article-title>
          .
          <source>" Journal of The Royal Society Interface 14.129</source>
          (
          <year>2017</year>
          ):
          <fpage>20160825</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Braveman</surname>
            , Paula, and
            <given-names>Eleuther</given-names>
          </string-name>
          <string-name>
            <surname>Tarimo</surname>
          </string-name>
          .
          <article-title>"Social inequalities in health within countries: not only an issue for a uent nations</article-title>
          .
          <source>" Social science medicine 54.11</source>
          (
          <year>2002</year>
          ):
          <fpage>1621</fpage>
          -
          <lpage>1635</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Croft</surname>
          </string-name>
          ,
          <string-name>
            <surname>Alyssa</surname>
          </string-name>
          , et al.
          <article-title>"The second shift re ected in the second generation: Do parents' gender roles at home predict children's aspirations?</article-title>
          .
          <source>" Psychological Science 25.7</source>
          (
          <year>2014</year>
          ):
          <fpage>1418</fpage>
          -
          <lpage>1428</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5. Centro de Investigacion y Desarrollo (
          <article-title>CIDE), "Factores asociados a la presencia de violencia hacia la mujer"</article-title>
          <source>Talleres de la O cina Tecnica de Administracion del INEI</source>
          ,
          <fpage>000</fpage>
          -
          <string-name>
            <surname>OTA-INEI</surname>
          </string-name>
          (
          <year>2002</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <article-title>Centro internacional para la prevencion de la criminalidad (CIPC), "Informe Internacional sobre la prevencion de la criminalidad y la seguridad cotidiana: Tendencias y perspectiva"</article-title>
          , (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Chant</surname>
          </string-name>
          , Sylvia.
          <article-title>"Re-thinking the \feminization of poverty" in relation to aggregate gender indices</article-title>
          .
          <source>" Journal of human development 7</source>
          .2 (
          <year>2006</year>
          ):
          <fpage>201</fpage>
          -
          <lpage>220</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Choy</surname>
          </string-name>
          ,
          <string-name>
            <surname>Olivia</surname>
          </string-name>
          , et al.
          <article-title>"Explaining the gender gap in crime: The role of heart rate</article-title>
          .
          <source>" Criminology 55.2</source>
          (
          <year>2017</year>
          ):
          <fpage>465</fpage>
          -
          <lpage>487</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>D.</given-names>
            <surname>Pedreschi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ruggieri</surname>
          </string-name>
          , and
          <string-name>
            <given-names>F.</given-names>
            <surname>Turini</surname>
          </string-name>
          , \
          <article-title>Integrating Induction and Deduction for Finding Evidence of Discrimination,"</article-title>
          <source>Proc. 12 th ACM Int'l Conf. Arti cial Intelligence and Law (ICAIL '09)</source>
          , pp.
          <fpage>157</fpage>
          -
          <lpage>166</lpage>
          , (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Devries</surname>
          </string-name>
          ,
          <string-name>
            <surname>Karen</surname>
            <given-names>M.</given-names>
          </string-name>
          , et al.
          <article-title>"The global prevalence of intimate partner violence against women</article-title>
          .
          <source>" Science 340.6140</source>
          (
          <year>2013</year>
          ):
          <fpage>1527</fpage>
          -
          <lpage>1528</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Di</surname>
            <given-names>Clemente</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Riccardo</surname>
          </string-name>
          , et al.
          <article-title>"Sequence of purchases in credit card data reveal life styles in urban populations</article-title>
          .
          <source>" arXiv preprint arXiv:1703.00409</source>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <given-names>F.</given-names>
            <surname>Kamiran</surname>
          </string-name>
          and
          <string-name>
            <given-names>T.</given-names>
            <surname>Calders</surname>
          </string-name>
          , \
          <article-title>Classi cation with no Discrimination by Preferential Sampling,"</article-title>
          <source>Proc.19th Machine Learning Conf. Belgium and The Netherlands</source>
          , (
          <year>2010</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Fayyad</surname>
          </string-name>
          ,
          <string-name>
            <surname>Usama</surname>
            <given-names>M.</given-names>
          </string-name>
          , et al.
          <article-title>"The KDD process for extracting useful knowledge from volumes of data</article-title>
          .
          <source>"</source>
          (
          <year>1996</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Fleming</surname>
            ,
            <given-names>Paul J.</given-names>
          </string-name>
          , et al.
          <article-title>"Risk factors for men's lifetime perpetration of physical violence against intimate partners: results from the international men and gender equality survey (IMAGES) in eight countries</article-title>
          .
          <source>" PloS one 10.3</source>
          (
          <year>2015</year>
          ):
          <fpage>e0118639</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Hajian</surname>
          </string-name>
          , Sara, and
          <string-name>
            <surname>Josep</surname>
          </string-name>
          Domingo-Ferrer.
          <article-title>"A methodology for direct and indirect discrimination prevention in data mining." IEEE transactions on knowledge and data engineering 25.7 (</article-title>
          <year>2013</year>
          ):
          <fpage>1445</fpage>
          -
          <lpage>1459</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Jayachandran</surname>
            ,
            <given-names>Seema. "</given-names>
          </string-name>
          <article-title>The roots of gender inequality in developing countries</article-title>
          .
          <source>" economics 7</source>
          .1 (
          <year>2015</year>
          ):
          <fpage>63</fpage>
          -
          <lpage>88</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Johnson</surname>
          </string-name>
          , Wendi L., et al.
          <article-title>"The age{IPV curve: Changes in the perpetration of intimate partner violence during adolescence and young adulthood</article-title>
          .
          <source>" Journal of youth and adolescence 44.3</source>
          (
          <year>2015</year>
          ):
          <fpage>708</fpage>
          -
          <lpage>726</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Karlsen</surname>
          </string-name>
          ,
          <article-title>Sa ron</article-title>
          , and
          <string-name>
            <surname>James</surname>
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Nazroo</surname>
          </string-name>
          .
          <article-title>"Relation between racial discrimination, social class, and health among ethnic minority groups."</article-title>
          <source>American journal of public health 92.4</source>
          (
          <year>2002</year>
          ):
          <fpage>624</fpage>
          -
          <lpage>631</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Leo</surname>
          </string-name>
          ,
          <string-name>
            <surname>Yannick</surname>
          </string-name>
          , et al.
          <article-title>"Correlations of consumption patterns in social-economic networks</article-title>
          .
          <source>" Proceedings of the 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining</source>
          . IEEE Press,
          <article-title>(</article-title>
          <year>2016</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Muller</surname>
          </string-name>
          ,
          <string-name>
            <surname>Robert</surname>
            <given-names>T.</given-names>
          </string-name>
          , John E. Hunter, and
          <string-name>
            <given-names>Gary</given-names>
            <surname>Stollak</surname>
          </string-name>
          .
          <article-title>"The intergenerational transmission of corporal punishment: A comparison of social learning and temperament models</article-title>
          .
          <source>" Child Abuse Neglect</source>
          <volume>19</volume>
          .11 (
          <year>1995</year>
          ):
          <fpage>1323</fpage>
          -
          <lpage>1335</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Pappalardo</surname>
          </string-name>
          ,
          <string-name>
            <surname>Luca</surname>
          </string-name>
          , et al.
          <article-title>"Using big data to study the link between human mobility and socio-economic development." Big Data (Big Data)</article-title>
          ,
          <source>2015 IEEE International Conference on. IEEE</source>
          ,(
          <year>2015</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Romei</surname>
            , Andrea,
            <given-names>Salvatore</given-names>
          </string-name>
          <string-name>
            <surname>Ruggieri</surname>
            , and
            <given-names>Franco</given-names>
          </string-name>
          <string-name>
            <surname>Turini</surname>
          </string-name>
          .
          <article-title>"Discovering gender discrimination in project funding." Data Mining Workshops (ICDMW</article-title>
          ),
          <source>2012 IEEE 12th International Conference on. IEEE</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Ruggieri</surname>
            , Salvatore,
            <given-names>Dino</given-names>
          </string-name>
          <string-name>
            <surname>Pedreschi</surname>
            , and
            <given-names>Franco</given-names>
          </string-name>
          <string-name>
            <surname>Turini</surname>
          </string-name>
          .
          <article-title>"Data mining for discrimination discovery." ACM Transactions on Knowledge Discovery from Data (TKDD) 4</article-title>
          .2 (
          <year>2010</year>
          ):
          <fpage>9</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>