<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Case Study of Natural Gender Phenomena in Translation A Comparison of Google Translate, Bing Microsoft Translator and DeepL for English to Italian, French and Spanish</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Argentina Anna Rescigno</string-name>
          <email>a.rescigno1@studenti.unior.it</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Eva Vanmassenhove</string-name>
          <email>e.o.j.vanmassenhove@tilburguniversity.edu</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Johanna Monti</string-name>
          <email>jmonti@unior.it</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andy Way</string-name>
          <email>andy.way@adaptcentre.ie</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>ADAPT Centre Dublin City University</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of CSAI, Tilburg University</institution>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>UNIOR NLP Research Group, University of Naples L'Orientale</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <issue>3</issue>
      <fpage>222</fpage>
      <lpage>232</lpage>
      <abstract>
        <p>This paper presents the results of an evaluation of Google Translate, DeepL and Bing Microsoft Translator with reference to natural gender translation and provides statistics about the frequency of female, male and neutral forms in the translations of a list of personality adjectives, and nouns referring to professions and bigender nouns. The evaluation is carried out for English!Spanish, English!Italian and English!French.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>
        Gender manifests itself in a language in many ways,
and different languages use different linguistic
devices to mark (or sometimes ‘not mark’) gender.
When dealing with language, three types of
gender come into play: natural gender, grammatical
gender and social gender. Natural gender is
generally based on the sex of a person or an animal
realised by means of the male/female polarity or
on the absence of sex for neutral nouns.
Grammatical gender, instead, is not always coherent with
the semantic categorization of a word and can vary
from language to language since it depends on the
representation of objects in the world on the
basis of specific properties attributed to them in a
specific cultural context. Social gender is used in
relation to the properties of a word on the basis
of which the speakers of a language associate the
natural gender of a person to a word
        <xref ref-type="bibr" rid="ref8">(Hellinger
and Motschenbacher, 2015)</xref>
        : this mainly happens
with names of professions, such as for instance
doctor or nurse which are interpreted according
to social stereotypes concerning the roles of males
and females in the society. Gender is present in
the data we use to train MT systems due to the
demographic features of the human training data and
because of the nature of stereotypes and biases we
communicate in our day-to-day communications.
As most state-of-the-art MT systems handle
translations on the sentence-level, gender phenomena
are, usually, resolved on statistics inferred from the
training data. Mistranslations of gender
information occur more frequently when translating from
gender-neutral languages, such as English1, into
morphological-rich languages, such as Italian or
French, which explicitly mark gender and require
additional information to correctly translate gender
phenomena. When such additional information or
context is not provided, the system will pick the
most likely variant. A recent study by Prates et al.
(2018) showed how Google Translate (GT) yields
more male defaults than what ought to be expected
when looking at demographic data on its own,
alluding that there might be a phenomenon they refer
to as machine bias
        <xref ref-type="bibr" rid="ref11 ref13">(Prates et al., 2018;
Vanmassenhove et al., 2019)</xref>
        . In this paper, we systematically
evaluate: (a) single-word queries, containing
personality adjectives and profession nouns, and (b)
bigender nouns2 in an EN ! IT, FR, ES translation
setting for GT, DeepL (DL) and Bing Microsoft
Translator (BMT) to verify the diversity in
translations provided by these MT providers.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>As recent years have seen an increase in literature
on bias in NLP, we focus particularly on work that
has attempted controlling the seemingly random
fluctuations in terms of gender in the translations
provided by large MT providers, with a specific
focus on neural approaches. Rabinovich et al. (2016)
conducted a more elaborate series of experiments
very similar to the work by Bawden et al. (2016).
Their work on preserving original author traits
fo1Aside from pronouns such as ’she’ and ’he’ or some
exceptions such as ‘actress’ vs ‘actor’.</p>
      <p>
        2Bigender nouns do not have a fixed grammatical
gender; their gender is determined by the context and without
any further context, they are valid for both male and female
referents.
cuses particularly on gender. They treated
personalizing PB-SMT systems as a domain-adaptation task
where the female and male gender are two separate
domains. In NMT, Vanmassenhove et al. (2018)
experimented with the insertion of an artificial
token at the beginning of the sentence, indicating the
gender of the speaker
        <xref ref-type="bibr" rid="ref1 ref13 ref18">(Vanmassenhove and
Hardmeier, 2018)</xref>
        . This approach is similar to Sennrich
et al. (2016) who added an ‘informal’ or ‘polite’
tag indicating the level of politeness expressed to
the training sentences.
      </p>
      <p>The work by Elaraby et al. (2018) presents a
technique for the translation of speech-like texts
focusing particularly on English-to-Arabic. They
train a baseline on generic data (4M sentences) and
use a set of gender-labelled sentences (900k) in
order to tune the system towards generating
translations with correct gender agreement.</p>
      <p>
        More recently, Moryossef et al. (2019) presented
a simple yet effective black-box approach to control
the NMT system’s translations in case of gender
ambiguity. Instead of appending a token, they
concatenate unambiguous artificial antecedents with
information on the speaker and the interlocutors to
ambiguous English sentences. Some recent studies
have addressed the problem of the scarcity of
publicly available corpora and create corpora
specifically designed to evaluate or to test MT
performance with respect to gender translation
        <xref ref-type="bibr" rid="ref11 ref12 ref5 ref7">(Font and
Costa-Jussa, 2019; Di Gangi et al., 2019)</xref>
        . Finally,
Monti (2020) provides an overview of outstanding
issues and topics related to gender in MT and Sun
et al. (2019) a literature review of work related to
gender bias in the field of NLP.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Experimental setup</title>
      <p>For the experiments, we compiled a dataset with
the translation of both English single-word queries
and short sentences into Italian (IT), French (FR)
and Spanish (ES) with GT, BMT and DL. We
experimented with both single-words and short
sentences as e.g. GT provides, since end 20183
gendered translations for single-word queries (limited
to nouns and adjectives) for EN–FR, EN–ES and
EN–IT. Similarly, BMT and DL provide users with
alternative translations on the user interface.
Differently from GT, these are simply alternative
translations for the word in question. As such, GT is the
3https://www.blog.
google/products/translate/
reducing-gender-bias-google-translate/
only system that currently, to some extent, deals
with gendered variants in a systematic way.
3.1</p>
      <sec id="sec-3-1">
        <title>Compilation of Datasets</title>
        <p>The datasets4 are compiled on the basis of a list
of nouns and adjectives collected from different
sources (see Table 1). The translations generated
and their manual evaluations are also part of the
datasets. The setup for this experiment consists
of both words and sentences. We collected a set
of 136 personality adjectives and 107 nouns of
professions from three different sources and 30 of
the most common bigender nouns in the Italian
language. We tested these separate sets and
analysed the behaviour of the major state-of-the-art MT
systems, comparing the translations of the three
language pairs. The first two sets of words have
been assessed alone, without any context, while
the last set has been examined within the sentence
level.</p>
        <p>Alongside with the number of adjectives and
nouns retrieved, Table 1 provides more detailed
information on the sources and the original
language in which the data was retrieved.</p>
        <p>
          # Sources
Adjectives 136
          <xref ref-type="bibr" rid="ref10 ref5 ref9">(I, 2019a)</xref>
          ;
          <xref ref-type="bibr" rid="ref19 ref20 ref5">(II, 2019a)</xref>
          ;(III, 2019)
Professions 107
          <xref ref-type="bibr" rid="ref10 ref9">(I, 2019b)</xref>
          ;
          <xref ref-type="bibr" rid="ref19 ref20">(II, 2019b)</xref>
          Bigender 30
          <xref ref-type="bibr" rid="ref3">(Cacciari et al., 1997)</xref>
          ;
          <xref ref-type="bibr" rid="ref4">(Cacciari et al., 2011)</xref>
          <xref ref-type="bibr" rid="ref17">(Thornton and Anna, 2004)</xref>
          GT: Launched in 2003 as a statistical MT
system, GT has switched to a NMT system in 2016
          <xref ref-type="bibr" rid="ref21">(Monti, 2017)</xref>
          . The translations are generated at the
sentence level. Since 2018, Google provides two
alternatives when translating ’ambiguous’ or
underspecified English words into languages that have
malenfemale alternatives for various languages (e.g.
Italian, French and Spanish). The malenfemale
variants are listed alphabetically (i.e. first the
female variant, then the male one).
        </p>
        <p>
          4available at https://github.com/
argentina-res/gender_project.git
BMT: MT system owned by Microsoft that
originally used a statistical approach
          <xref ref-type="bibr" rid="ref21">(Monti, 2017)</xref>
          but
more recently switched to a neural system
          <xref ref-type="bibr" rid="ref1">(Almahasees, 2018)</xref>
          . Unlike GT, BMT does not provide
alternatives in the translation box itself. However, it
does give synonyms in the “Other ways to say”
section and provides examples of usage in the “How
to use...” section, where sometimes the female
form of a word is listed (by chance rather than in a
consistent way).
        </p>
        <p>
          DL: The most recent platform launched in
August 2017 by a German company, DeepL GmbH.
DL uses convolutional neural networks based on
the Linguee database
          <xref ref-type="bibr" rid="ref11 ref12 ref7">(Mora´n Vallejo and others,
2019)</xref>
          . Even though it only supports nine
languages (all Indoeuropean), DL is, according to
a recent study, outperforming the other
competitors
          <xref ref-type="bibr" rid="ref11 ref12 ref7">(Mora´n Vallejo and others, 2019)</xref>
          . The layout
of the interface is similar to that of GT.
Nevertheless, its suggestions for alternatives are not
systematically morphological variants (although they
are often somehow included in the alternatives
provided). Underneath the actual translations, there
is also a dictionary-like section when translating
words in isolation.
4
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Results</title>
      <p>In this section, the results obtained with our
experiments and our subsequent evaluation will be
presented. A more in-depth analysis and a
discussion of some concrete examples is provided in
Section 4.1. All the manual evaluations were
conducted in November, 20195.</p>
      <sec id="sec-4-1">
        <title>The adjectives and profession nouns were</title>
        <p>both evaluated in single-word settings. The
bigender nouns were evaluated in short sentences.
Bigender nouns in Italian, such as for example the
nouns giornalista, pianista are by themselves not
marked for gender. As such, a single-word query
would not reveal any gender marking. However,
the articles, adjectives, verbs, etc. that agree with
these bigender nouns are (often) marked for gender.
Therefore, these specific nouns were evaluated in
short sentence settings.</p>
        <p>We manually evaluated all the outputs and will
report on the percentage of male (M), female (F)
and ‘neutral’ (N) or ‘covered’ (C) (i.e. no
explicit gender in the target language) variants for
the single-word and sentence queries.
Additionally, we report on the errors (e.g. untranslated or
mistranslated words).</p>
        <p>As mentioned earlier, GT is currently the only
system that provides both male and female
variants when given a single-word ambiguous query
(EN!FR, ES, IT). The alternatives are limited to
adjectives and nouns6. DL and BMT provide the
user with multiple alternatives underneath the
primary translation for both nouns and adjectives but
not in a systematic way. As such, these
alternatives do not necessarily consist of morphological
variants in terms of gender.</p>
        <p>For our evaluation, only the main translations
offered by the MT systems were considered, i.e. we
evaluated both gendered forms for GT, but did not
evaluate the list of alternative translations provided
by BMT and DL as they can be alternatives of any
kind and they differ depending on the query.</p>
        <p>In the interest of clarity and order, we will
separately consider the different test-sets, single words
(i.e. adjectives and nouns), and sentences with
bi-gender nouns. Each set has been tested in the
previously stated systems and language pairs. In
the tests, we were especially investigating – apart
from any translation error – the occurrence of
female forms, to see if there is somehow a bias
towards the gender. From a practical point of view,
we will consider only the first output as a valid
result, since the systems also give “alternatives”
for single-word translations. For Bing, it is quite
straightforward: adjectives do not present
alternatives, while nouns sometimes do. Google Translate
produces now two different results, marked with
the gender (feminine/masculine, in this sequence
for alphabetical order). DeepL, in particular,
provides at least three alternative translations, for both
nouns and adjectives. However, even though we
are not considering alternatives, we will explain or
anyway mention the ones that are significative in
this research. All the results have been recorded in
November 2019. However, the systems continually
improve, so results may vary also in a short time.</p>
        <p>Table 2 presents the results for the single-word
translations consisting of adjectives: for all three
systems, the male variant was the most common.
This was especially so for BMT, where only 1.5%
of the adjectives were translated into a female
variant. The ‘other’ category consists of translations:
5The translations were evaluated by a native Italian
speakernlinguist for Italian, and by a professional linguist
for French and Spanish.</p>
        <p>6In the target languages certain verbs are also sometimes
marked for gender, e.g. reciprocal verbs, passive constructions,
past participles...
(a) that were ambiguous (e.g. mean was translated
as a verb instead of as an adjective), (b) words that
were not translated by the systems and (c) errors.</p>
        <p>We ought to note that, none of the single words
were mistranslated by GT. The only single-word
queries that caused a divergence between male
and female forms were queries that were a
compound noun in either the source or the target
(e.g.good-tempered). They are not treated as
a single unit by GT and thus the system fails
to render both variants. From the Table 2, it is
clear that GT performs best in terms of balanced
single-word adjective translations. Table 3 presents
a similar set of results but for the nouns indicating
professions. Like Table 2 GT generated the most
diverse translations, while BMT the least. As far
as the set with sentences is concerned, we used
bigender nouns from the Italian language. We
used 30 common bigender nouns in two different
contexts: (a) first, in a minimal sentence that would
allow us to infer the gender based on the article
in the target language “I am a(n)..” and (b) with
a referring adjective. We used beautiful, efficient,
intelligent, sad and famous. In Table 4, our results
are presented for the bigender nouns on minimal
sentences (“I am a(n)...”) and in combination
with the aforementioned adjectives (“I am a(n) +
adj...”). In the results, we oppose the translation
where we added beautiful as an adjective as they
differed considerably from the others. Table 4
presents the results for the translations generated
by BMT for bigender nouns in sentences for
Italian, French and Spanish. It can be noted that
the male translation is the most common in the
simple sentences that do not contain an adjective
in all three languages. However, when adding the
adjective beautiful to the phrase, the female forms
are the most common for all three languages. An
example of such sentences is given below:
(a) EN: “I am a pianist”</p>
        <p>IT: “Sono un pianista.”
FR: “Je suis pianiste.”</p>
        <p>ES: “Soy pianista.”
(b) EN “I am a beautiful pianist.” (N)
IT “Sono una bellissima pianista.” (F)
FR “Je suis une belle pianiste.” (F)
ES “Soy una hermosa pianista.” (F)
(c) EN ‘I am a famous pianist.”</p>
        <p>IT “Sono un famoso pianista.”
FR “Je suis un pianiste ce´le`bre.”
ES “Soy un pianista famoso.”
(N)
(M)
(N)
(N)
(N)
(M)
(M)
(M)</p>
        <p>The results obtained for DL are very similar to
the ones obtained with BMT except for the fact that
DL generates overall more female forms than BMT.
Interestingly, among all systems, GT is the most
biased towards male forms when evaluating entire
sentences for all three language pairs, with the male
forms being the dominant ones for all categories
and for several set-ups we observe more than 90%
male variants.</p>
      </sec>
      <sec id="sec-4-2">
        <title>NOUN</title>
        <p>F
M
N
Other
Total
In the analysis we compare the results of the three
systems, comparing the occurrences of the
femalegendered translated forms in terms of the different
systems and the different languages.</p>
        <p>GT: for the single nouns and adjectives, GT
provides both male and female forms. The system,
however, produces more male outputs as
sometimes the alternative is not provided (e.g. for
compound nouns). One of the provided nouns was
ambiguous (printer) in English. The system
translated this as the object instead of the profession.
Whenever an ambiguous word was translated
accurately, yet not in the way we intended it to be
translated, we included it into the ‘other’ category.
Considering the adjectives, we observed one
incorrect translation where the adjective supportive was
translated into an Italian noun (‘supporto’ meaning
support). Two other adjectives were ambiguous
(mean and kind) and were translated into a verb
and a noun respectively by GT. For the
sentenceevaluation, GT has the strongest preference for
translations using male-endings.</p>
        <p>BMT: for the nouns, BMT has a strong tendency
to output male variants. The only two exceptions
are nurse for IT, FR and ES and makeup artist for
FR and ES. Besides, words such as newsreader,
translator, warder were not translated by BMT
and others were mistranslated, e.g. garbage man
and window cleaner, for which the translations
provided were too literal in IT (uomo spazzatura) and
pulizia finestre where “pulizia” is the equivalent
of “cleaning”). Concerning the adjectives, unlike
DL and GT, BMT rarely offers alternatives and
the majority of the translations generated are in
the male form. Exceptional female variants were
found in: (a) Italian for: devious/subdola and
joyful/gioiosa; (b) Spanish for: artistic, bossy, calm,
diplomatic, dynamic, extrovert, humorous,
industrious, placid. We ought to note that a considerable
amount of the adjectives (33.1% for Italian and
29.4% for Spanish) have a translation with a
bigender adjective. These words have the same form for
both genders and are included in the ’N’ (neutral)
group. A small number of adjectives were
translated with an expression, such as good-tempered
! IT di buon umore, FR de bonne humeur, SP de
buen humor. These expressions can be assigned to
both male/female referents and are thus considered
covered/neutral. We did not observe any errors
except for the adjective frank (without a capital
letter), which was left untranslated by DL (as if it
were the first-name “Frank”). However, the most
appropriate translation was among the alternatives
suggested. Moreover, sometimes the adjectives are
ambiguous, therefore the system has opted for a
“non-gendered” alternative (e.g. mean was
translated as a verb instead of an adjective).</p>
        <p>DL: DL provides multiple options, for both
nouns and adjectives, but for only 7.5% of nouns
the female form is the first result, as, for example,
assistant and nurse for French and Italian, doctor,
secretary and shop assistant for Spanish and Italian,
soldier and teacher for Italian. For the adjectives,
instead, the number of female forms increases to
22.8%. Similarly to some of the observations for
BMT and GT, some of our intended nouns were
ambiguous (‘model’ can be a noun or a verb) and
the system opted for the verb translation. The only
error we observed is the translation of tailor which
was incorrectly translated into the adjective
sartoriale.
5</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusions and Future Work</title>
      <p>In future work, we would like to conduct a larger
evaluation comprising of more language pairs and
a more diverse set of words. Furthermore, we aim
to compile a challenge set focusing specifically on
gender phenomena in language that can be used and
automatically evaluated. We also envisage training
our own state-of-the-art MT system to verify how
and whether machine bias indeed influences the
output of the translations generated.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <given-names>Zakaryia</given-names>
            <surname>Mustafa Almahasees</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Assessment of google and microsoft bing translation of journalistic texts</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>Rachel</given-names>
            <surname>Bawden</surname>
          </string-name>
          , Guillaume Wisniewski, and He´le`ne Maynard.
          <year>2016</year>
          .
          <article-title>Investigating gender adaptation for speech translation</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <given-names>Cristina</given-names>
            <surname>Cacciari</surname>
          </string-name>
          , Manuel Carreiras, and Cristina Barbolini Cionini.
          <year>1997</year>
          .
          <article-title>When words have two genders: Anaphor resolution for italian functionally ambiguous words</article-title>
          .
          <source>Journal of memory and language</source>
          ,
          <volume>37</volume>
          (
          <issue>4</issue>
          ):
          <fpage>517</fpage>
          -
          <lpage>532</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <given-names>Cristina</given-names>
            <surname>Cacciari</surname>
          </string-name>
          , Paola Corradini, Roberto Padovani, and
          <string-name>
            <given-names>Manuel</given-names>
            <surname>Carreiras</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>Pronoun resolution in italian: The role of grammatical gender and context</article-title>
          .
          <source>Journal of Cognitive Psychology</source>
          ,
          <volume>23</volume>
          (
          <issue>4</issue>
          ):
          <fpage>416</fpage>
          -
          <lpage>434</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Mattia A Di Gangi</surname>
            , Roldano Cattoni, Luisa Bentivogli, Matteo Negri, and
            <given-names>Marco</given-names>
          </string-name>
          <string-name>
            <surname>Turchi</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Must-c: a multilingual speech translation corpus</article-title>
          .
          <source>In 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</source>
          , pages
          <fpage>2012</fpage>
          -
          <lpage>2017</lpage>
          .
          <article-title>Association for Computational Linguistics</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <given-names>Mostafa</given-names>
            <surname>Elaraby</surname>
          </string-name>
          , Ahmed Y Tawfik, Mahmoud Khaled, Hany Hassan, and
          <string-name>
            <given-names>Aly</given-names>
            <surname>Osama</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Gender aware spoken language translation applied to english-arabic</article-title>
          .
          <source>In 2018 2nd International Conference on Natural Language and Speech Processing (ICNLSP)</source>
          , pages
          <fpage>1</fpage>
          -
          <lpage>6</lpage>
          , April.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <given-names>Joel</given-names>
            <surname>Escude</surname>
          </string-name>
          ´ Font and
          <string-name>
            <surname>Marta R Costa-Jussa</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Equalizing gender biases in neural machine translation with word embeddings techniques</article-title>
          . arXiv preprint arXiv:
          <year>1901</year>
          .03116.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <given-names>Marlis</given-names>
            <surname>Hellinger</surname>
          </string-name>
          and
          <string-name>
            <given-names>Heiko</given-names>
            <surname>Motschenbacher</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Gender across languages. the linguistic representation of women and men</article-title>
          , volume
          <volume>4</volume>
          . amsterdam &amp; philadelphia.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Adjectives Sources</surname>
            <given-names>I.</given-names>
          </string-name>
          <year>2019a</year>
          .
          <article-title>Personality adjectives source i</article-title>
          . https://www. esolcourses.com/content/exercises/ grammar/adjectives/personality/ words
          <article-title>-for-describing-personality</article-title>
          . html.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Professions Source</surname>
            <given-names>I.</given-names>
          </string-name>
          <year>2019b</year>
          .
          <article-title>Professions source i</article-title>
          . https://www.scribd.com/doc/ 82021393/List-of-Common-Jobs.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <given-names>Alberto</given-names>
            <surname>Mora´n Vallejo</surname>
          </string-name>
          et al.
          <year>2019</year>
          .
          <article-title>The translation of spanish agri-food texts into english and italian using machine translation engines: A contrastive study</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <given-names>Amit</given-names>
            <surname>Moryossef</surname>
          </string-name>
          , Roee Aharoni, and
          <string-name>
            <given-names>Yoav</given-names>
            <surname>Goldberg</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Filling gender &amp; number gaps in neural machine translation with black-box context injection</article-title>
          . arXiv preprint arXiv:
          <year>1903</year>
          .03467.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Marcelo OR Prates</surname>
          </string-name>
          ,
          <string-name>
            <surname>Pedro H Avelar</surname>
            , and Lu´ıs
            <given-names>C</given-names>
          </string-name>
          <string-name>
            <surname>Lamb</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Assessing gender bias in machine translation: a case study with google translate</article-title>
          .
          <source>Neural Computing and Applications</source>
          , pages
          <fpage>1</fpage>
          -
          <lpage>19</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <given-names>Ella</given-names>
            <surname>Rabinovich</surname>
          </string-name>
          , Shachar Mirkin, Raj Nath Patel, Lucia Specia, and
          <string-name>
            <given-names>Shuly</given-names>
            <surname>Wintner</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Personalized machine translation: Preserving original author traits</article-title>
          .
          <source>arXiv preprint arXiv:1610</source>
          .
          <fpage>05461</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <given-names>Rico</given-names>
            <surname>Sennrich</surname>
          </string-name>
          , Barry Haddow, and
          <string-name>
            <given-names>Alexandra</given-names>
            <surname>Birch</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Controlling politeness in neural machine translation via side constraints</article-title>
          .
          <source>In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</source>
          , pages
          <fpage>35</fpage>
          -
          <lpage>40</lpage>
          , San Diego, California, June.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <given-names>Tony</given-names>
            <surname>Sun</surname>
          </string-name>
          , Andrew Gaut, Shirlyn Tang, Yuxin Huang,
          <string-name>
            <surname>Mai</surname>
            <given-names>ElSherief</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jieyu</surname>
            <given-names>Zhao</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Diba</given-names>
            <surname>Mirza</surname>
          </string-name>
          , Elizabeth Belding,
          <string-name>
            <surname>Kai-Wei</surname>
            <given-names>Chang</given-names>
          </string-name>
          , and William Yang Wang.
          <year>2019</year>
          .
          <article-title>Mitigating gender bias in natural language processing: Literature review</article-title>
          .
          <source>In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics</source>
          , pages
          <fpage>1630</fpage>
          -
          <lpage>1640</lpage>
          , Florence, Italy, July.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <given-names>ANNA</given-names>
            <surname>Thornton and M Anna</surname>
          </string-name>
          .
          <year>2004</year>
          . Mozione. La Formazione Delle Parole in Italiano, pages
          <fpage>218</fpage>
          -
          <lpage>225</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <given-names>Eva</given-names>
            <surname>Vanmassenhove</surname>
          </string-name>
          and
          <string-name>
            <given-names>Christian</given-names>
            <surname>Hardmeier</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Europarl datasets with demographic speaker information</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <surname>Adjectives Source</surname>
            <given-names>II</given-names>
          </string-name>
          .
          <year>2019a</year>
          .
          <article-title>Personality adjectives source ii</article-title>
          . https://www. Eva Vanmassenhove,
          <string-name>
            <given-names>Christian</given-names>
            <surname>Hardmeier</surname>
          </string-name>
          , and
          <article-title>Andy esolcourses</article-title>
          .com/content/exercises/ Way.
          <year>2018</year>
          .
          <article-title>Getting gender right in neural machine grammar/adjectives/personality/ translation</article-title>
          .
          <source>In Proceedings of the</source>
          <year>2018</year>
          <article-title>Conference more-words-for-describing-personality. on Empirical Methods in Natural Language Processhtml</article-title>
          . ing, pages
          <fpage>3003</fpage>
          -
          <lpage>3008</lpage>
          , Brussels, Belgium, OctoberNovember.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <surname>Professions Source</surname>
            <given-names>II</given-names>
          </string-name>
          .
          <year>2019b</year>
          .
          <article-title>Professions source ii</article-title>
          . https://www.vocabulary.cl/Basic/ Professions.htm.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <given-names>Johanna</given-names>
            <surname>Monti</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Questioni di genere in traduzione automatica</article-title>
          . In Al femminile.
          <source>Scritti linguistici in onore di Cristina Vallini</source>
          , pages
          <fpage>411</fpage>
          -
          <lpage>431</lpage>
          . Cesati.
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <string-name>
            <given-names>Johanna</given-names>
            <surname>Monti</surname>
          </string-name>
          .
          <year>2020</year>
          .
          <article-title>Gender issues in machine translation: An unsolved problem? In The Routledge Handbook of Translation, Feminism</article-title>
          and Gender, pages
          <fpage>457</fpage>
          -
          <lpage>468</lpage>
          . Routledge.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>