<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Back-translation Approach for Code-switching Machine Translation: A Case Study</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Maraim Masoud[</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Daniel Torregrosa</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Paul Buit</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>l Ar˘</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Insight Centre for Data Analytics Data Science Institute National University of Ireland Galway</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>Recently, machine translation has demonstrated significant progress in terms of translation quality. However, most of the research has focused on translating with pure monolingual texts in the source and the target side of the parallel corpora, when in fact code-switching is very common in communication nowadays. Despite the importance of handling code-switching in the translation task, existing machine translation systems fail to accommodate the code-switching content. In this paper, we examine the phenomenon of code-switching in machine translation for low-resource languages. Through different approaches, we evaluate the performance of our systems and make some observations about the role of code-mixing in the available corpora.</p>
      </abstract>
      <kwd-group>
        <kwd>Machine-translation</kwd>
        <kwd>Code-switching</kwd>
        <kwd>Back-translation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        The popularity of social media platforms creates an opportunity for multi-lingual
speakers and language learners to alternate between one or many languages. This
results in a new form of a hybrid language form called code-mixed language.
Code-mixing1 is defined as "the embedding of linguistic units such as phrases,
words, and morphemes of one language into an utterance of another language"
[
        <xref ref-type="bibr" rid="ref25">25</xref>
        ]. The phenomenon is commonly observed in multilingual communities [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ]
and usually employed for different communication purposes such as asking
questions [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], swearing [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ], expressing emotions [
        <xref ref-type="bibr" rid="ref43">43</xref>
        ], and content clarification [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ].
An example of a code-mixing, as shown in [
        <xref ref-type="bibr" rid="ref34">34</xref>
        ], is presented in Table 1.
      </p>
      <p>
        Studies have linked many triggers for the use of mixed-code in speech and
writing such as metaphorical switching, situational switching and lexical
borrowing [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. The phenomenon presents itself prominently in user-generated contents,
especially with low-resource languages. Consequently, there is a growing need
for translating code-mixed hybrid language into standard languages. Thus,
automatic machine translation has been an important task for this phenomenon.
1 The terms "code-mixing" and "code-switching" are used interchangeably in the
machine translation sub-field.
      </p>
      <p>Source Sentence(ES):
Translation Sentence(EN):</p>
      <p>I put the fork en la mesa
I put the fork on the table</p>
      <p>However, due to the lack of parallel data for low-resourced scenarios where
codeswitching is very common, existing machine translation systems fail to properly
handle code-mixed text.</p>
      <p>
        The current neural machine translation (NMT) using a sequence to sequence
translation framework [
        <xref ref-type="bibr" rid="ref40">40</xref>
        ] has achieved impressive results in recent years [
        <xref ref-type="bibr" rid="ref44">44</xref>
        ].
One of the key innovations that led to this advancement is the introduction
of the attention mechanism [
        <xref ref-type="bibr" rid="ref24 ref3">3, 24</xref>
        ]. While being able to approach human-level
translation [
        <xref ref-type="bibr" rid="ref31">31</xref>
        ], NMT still requires a huge amount of parallel data. Such data
might not always be available for low-resource languages. As monolingual data
is easily available, one way to utilize them for the machine translation task is
following back-translation [
        <xref ref-type="bibr" rid="ref36">36</xref>
        ]. This technique is used to leverage monolingual data
during the training. It is an inverse target-to-source translation approach which
generates synthetic source sentences by translating monolingual sentences of the
target language into the source language with a pre-existing target-to-source
translation model. These pseudo-source sentences together with the original
target sentences are then concatenated to the original parallel corpus to train a new
source-to-target MT system.
      </p>
      <p>
        Current machine translation systems do not support code-mixed text, and
are only designed to work with a monolingual language in both ends of the
translation system [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. This limitation makes it unsuitable to rely on current
NMT systems for daily communications where code-mixed language is prevalent.
      </p>
      <p>This paper presents code-mixed machine translation for Tamil-English
language pair; however, this context is very common with other languages.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        The code-switching behaviour has been investigated from different perspectives
[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and for many languages [
        <xref ref-type="bibr" rid="ref1 ref23">23, 1</xref>
        ]. Early work in this domain focused on exploring
the phenomenon from linguistics and sociolingusitic perspectives, and then move
towards investigating it computationally for NLP applications [
        <xref ref-type="bibr" rid="ref17 ref8">17, 8</xref>
        ]. Recently,
code-mixed languages have seen a lot of interest in downstream NLP tasks such
as part of speech tagging [
        <xref ref-type="bibr" rid="ref14 ref39">39, 14</xref>
        ], named entity recognition [
        <xref ref-type="bibr" rid="ref1 ref46">46, 1</xref>
        ], dependency
parsing [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ]. Additionally, the phenomenon has also been considered for NLP
applications such as sentiment analysis [
        <xref ref-type="bibr" rid="ref16 ref22 ref42">22, 42, 16</xref>
        ], machine translation [
        <xref ref-type="bibr" rid="ref15 ref37">37, 15</xref>
        ],
and question answering [
        <xref ref-type="bibr" rid="ref12 ref9">12, 9</xref>
        ]. Despite all the research attempts, code-mixing
still presents serious challenges for the natural language processing community
[
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. The noticeable lack of resources such as annotated corpora and NLP tools
continue to pose a challenge and reduces the chances of improving [
        <xref ref-type="bibr" rid="ref12 ref7">7, 12</xref>
        ].
      </p>
      <p>
        A lot of work has been done on machine translation for non-code-switching
cases [
        <xref ref-type="bibr" rid="ref24 ref3 ref35">35, 3, 24</xref>
        ]. However, there is relatively little work focus on mixed language
machine translation. Sinha et al. [
        <xref ref-type="bibr" rid="ref37">37</xref>
        ] performed cross morphological analysis
to handle code-mixed translation task from Hinglish into both pure English
and pure Hindi. The work of Johnson et al. [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] on Google’s multilingual
zeroshot translation handles code-switching phenomenon. In their work, they show
that the model can represent multiple languages in the same space. However,
the results are not as good as monolingual inputs. As opposed to monolingual
inputs, the lack of gold standard parallel data has significantly contributed to
the minimum research in code-switch translation.
      </p>
      <p>
        Back-translation has been proposed as a corpus augmentation technique
which has been widely used to expand parallel corpora for machine
translation tasks [
        <xref ref-type="bibr" rid="ref36 ref45">45, 36</xref>
        ]. It is a way to leverage monolingual data without modifying
the translation model. It requires a counter loop of training (target-to-source) to
generate synthetic parallel data from the target data [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. The idea dates back
to statistical machine translation [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Recently, back-translation has been widely
adopted for neural machine translation systems [
        <xref ref-type="bibr" rid="ref36 ref45">36, 45</xref>
        ] and shown to be
beneficial when training data is scarce as in low-resource languages scenarios [
        <xref ref-type="bibr" rid="ref18 ref30 ref41">18, 30,
41</xref>
        ]. Currey et al. [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] and Karakanta et al. [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] show how synthetic data can
improve low-resource language pairs. While the former applies a single round of
back-translation, where the source is a copy of the monolingual target data, the
latter tries multiple rounds of back-translation.
      </p>
      <p>
        A comparative analysis on the effect of synthetic data on NMT is
demonstrated by Park et al. [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ] and Poncelas et al. [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ]. In the work of Park et al. the
models trained only with synthetic data. Then, the performance was evaluated
with models trained with parallel corpora composed of: (i) synthetic data in the
source-side only; (ii) synthetic data in the target side only; and (iii) a mixture of
parallel sentences of which either the source-side or the target-side is synthetic.
In the work of Poncelas et al., the NMT model was trained with three different
parallel corpora: A synthetic (source side only), a synthetic (target side only),
and a mixture in either source or target side.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Methodology</title>
      <p>
        To tackle the code-switching issue in the translation task, we were inspired by
the evaluation pipeline introduced by Poncelas et al. [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ] on testing the impact
of back-translation. We evaluate three different approaches. Each approach is
deployed with a different NMT model and a different dataset variation; a dataset
with original translation, a dataset with hybrid back-translated (synthetic) data,
and only monolingual source and target (no-code-mixing) dataset. The three
approaches are:
– Baseline approach: In this approach, the NMT model is trained using
the original dataset in its base form, without any modification, with the
      </p>
      <p>NMT Sytem
(TA-EN)</p>
      <p>exception to the standard pre-processing steps (tokenization, lowercasing
and cleaning). In this setting, the code-mixed tokens are kept without any
modification. The model in this approach serves as a baseline for comparison.</p>
      <p>Figure 1 shows a diagram for the baseline approach.
– Hybrid approach: The models in this approach are built with original
sentence pairs combined with back-translated code-mixed tokens. In this
setting, the corpus is modified with two variations of the back-translated data.
Firstly, the English tokens (code-mixed tokens) are identified and extracted
from Tamil sentences. These tokens are then translated using different
translation models: (a) the baseline model, (b) Google Translate2. Upon
generating two versions of Tamil translations of these English tokens, these tokens
are injected back to the Tamil sentences to create Tamil only sentences on
the target side of the corpus. Thus, the final resulted corpus containing
original and synthetic (back-translated corpus) is then used to train our model.</p>
      <p>A visualization of the pipeline for this approach is illustrated in Figure 2.
– Monolingual (no-code-mixing) approach: In this approach, the NMT
model is trained on a refined corpus in which the code-mixing tokens (
English tokens in the Tamil side) are identified and removed, creating a
monolingual data for the Tamil side. Figure 3 shows the pipeline for this approach.</p>
      <sec id="sec-3-1">
        <title>2 translate.google.com retrieved February 2019.</title>
        <p>In this section, we describe the dataset as well as the framework used to train
and evaluate the approaches.
For the scope of this work, we use a parallel corpus of code-mixed English-Tamil
and English. The choice of adding this particular dataset was influenced by the
availability of a public dataset for this task. Additionally, as the code-mixing</p>
        <p>Tamil English</p>
        <p>Tokens Lines Tokens Lines</p>
        <p>Train 906,391 159,182 966,911 159,182
Validation 11,446 2,000 11,725 2,000</p>
        <p>Evaluation 12,016 2,000 12,873 2,000
phenomenon is well-noticed among Indian language speakers, the code-mixing
dataset for English-Tamil was selected for this task. The dataset was combined
from OPUS3 and EnTam4. Although OPUS has a large dataset for Tamil ↔
English pairs, we excluded some data due to encoding issues. The final dataset
is cleaned, shuffled, tokenized and lowercased using the OpenNMT toolkit5. In
total, the dataset contains 163,182 sentences and around 129,710 English tokens
in the Tamil side of the corpus. Table 2 shows a breakdown of the number of
tokens and sentences in the English and the Tamil sides of the corpus.</p>
        <p>To study the effects of code-mixing in the translation, different data settings
have been used in the training and the evaluation of the NMT models. The
dataset variations are original data, hybrid (original mixed with synthetic –
backtranslated) data, and monolingual data in both sides. These settings allow us to
reuse the code-mixed tokens as well as observe their role in the demonstrated
corpus.</p>
        <p>An NMT model has been built for each dataset variation as explained in
Section 3. These different configuration scenarios allow us to trace the quality
of the code-switching translations as well as its role in conversation.
4.2</p>
        <sec id="sec-3-1-1">
          <title>NMT Framework</title>
          <p>
            The experiment was performed using the OpenNMT [
            <xref ref-type="bibr" rid="ref19">19</xref>
            ], which is a generic
deep learning framework based on sequence to sequence models. The framework
is used for a variety of NLP tasks including machine translation. We deployed
the framework in its default setting: two hidden layers, 500 hidden LSTM (Long
Short Term Memory) units per layer, 13 epochs, batch size of 64, and 0.3 dropout
probability and word embeddings of 500 dimension. To compensate for the
limited vocabulary issue, the Byte Pair Encoding (BPE) [
            <xref ref-type="bibr" rid="ref35">35</xref>
            ], which is a form of byte
compression, was used with the following parameters: a maximum vocabulary
size of 50,000 subwords and a maximum of 32,000 unique BPE merge
operations. For each approach mentioned above, word and BPE translation models
were trained.
          </p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>3 http://opus.nlpl.eu/ retrieved February 2019</title>
        <p>4 http://ufal.mff.cuni.cz/~ramasamy/parallel/html/#download retrieved
January 2019
5 http://opennmt.net/OpenNMT used on February 2019</p>
        <sec id="sec-3-2-1">
          <title>Evaluation Metrics</title>
          <p>
            The performance of the different approaches was evaluated using different
translation evaluation metrics: BLEU [
            <xref ref-type="bibr" rid="ref26">26</xref>
            ], TER [
            <xref ref-type="bibr" rid="ref38">38</xref>
            ], METEOR [
            <xref ref-type="bibr" rid="ref4">4</xref>
            ] and chrF [
            <xref ref-type="bibr" rid="ref32">32</xref>
            ].
BLEU (Bilingual Evaluation Understudy) is an automatic evaluation that boasts
high correlation with human judgements, and METEOR (Metric for Evaluation
of Translation with Explicit ORdering) is based on the harmonic mean of
precision and recall. ChrF is a character n-gram metric, which has shown very good
correlations with human judgements especially when translating to
morphologically rich(er) languages. Finally, translation error rate (TER) [
            <xref ref-type="bibr" rid="ref38">38</xref>
            ] is a metric
that represents the cost of editing the output of the MT systems to match the
reference. High score of BLEU, METEOR, and Chrf means the system produces
a highly fluent translation, but a high score of TER is a sign of more post-editing
effort and thus the lower the score the better. Additionally, we used bootstrap
resampling [
            <xref ref-type="bibr" rid="ref20">20</xref>
            ] with a sample size of 1,000 and 1,000 iterations, and reported
statistical significance with p &lt; 0.05.
5
          </p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Results and Discussion</title>
      <p>This section describes the quantitative and qualitative results of the four
models; the baseline (Baseline), the hybrid model with baseline back-translated
tokens (Hybrid-Baseline), the hybrid model with Google back-translated tokens
(Hybrid-Goggle), and monolingual model with no-code-mixing (Monolingual).
5.1</p>
      <sec id="sec-4-1">
        <title>Quantitative Results</title>
        <p>We report the performance of the different models using the following metrics:
BLEU, Meteor, TER and ChrF. The quantitative evaluation for Tamil ↔
English is presented in Table 3. All models slightly outperform the no-code-mixing
models, which reports a decrease of ~1 BLEU point for the Tamil → English
translation direction and ~5 BLEU points in English → Tamil direction. This
suggests that by removing the code-switching tokens, the model gets confused
due to the ordering and misalignment; thus the drop in the score. The decline in
the model performance after removing the code-mixed tokens can be related to
the high performance shown in the back-translated code-mixed models. These
models report the best performance in both translation direction. The Hybrid
Google back-translated model for English into Tamil translation reports 24.65
and 25.28 BLEU points for the word and BPE based translation models,
respectively. Our back-translated models reports lower results of 21.96 for the
word-based model and 22.53 for the BPE-based model.</p>
        <p>In the case of translating Tamil text into English, where the code-mixing
takes place, the BPE baseline performed best, followed by close results in terms of
BLEU and METEOR for the back-translated models, whereas our model (21.93)
performed similarly to Google Translate (21.35) for the BPE based model. From
the results, we observed that the approaches with the back-translated models</p>
        <p>Tamil→English</p>
        <p>English→Tamil
Model BLEU METEOR ChrF TER BLEU METEOR ChrF TER
Word-Baseline 20.57 24.69 42.46 0.68 16.14 19.49 60.63 0.74
Word-Monolingual 19.46 23.99 42.28 0.69 16.65 24.10 70.39 0.74
Word-Hybrid-Baseline 21.05 24.60 42.92 0.68 21.96 21.51 70.16 0.86
Word-Hybrid-Google 21.85 24.48 42.73 0.68 24.65 23.35 72.48 0.69
BPE-Baseline 22.46 25.50 44.14 0.66 18.47 20.17 68.33 0.97
BPE-Monolingual 19.99 24.36 42.43 0.68 16.98 25.16 71.15 0.73
BPE-Hybrid-Baseline 21.93 25.39 44.55 0.66 22.53 23.34 71.63 0.82
BPE-Hybrid-Google 21.35 25.25 44.13 0.67 25.28 25.13 73.71 0.65
outperform both baseline and the no-code-mixing models. From this experiment,
we observed that the code-mixing tokens, as in our demonstrated corpus, play
an important role in the meaning of the sentences. Thus, the drop in BLEU
score is observed when code-mixing tokens are eliminated.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusion and Future Work</title>
      <p>In this work, we explored the code-switching phenomenon in machine translation
for a low-resourced scenario considering English-Tamil as our target language
pair. We further investigated how back-translation can be used as a strategy
to handle this phenomenon. The results show that the code-mixing part in this
particular dataset potentially plays a supportive role. This can be observed by
the little impact on the translated sentences when the code-mixing tokens are
removed. This also explained by the slight improvement in the translation score
(~1 BLEU points) when the code-mixing tokens (English tokens) in the Tamil
side are correctly translated before training the models.</p>
      <p>
        One future work will further investigate the role of code-switching in the
available corpora. A second path will experiment with multilingual embedding
as a preprocessing step for the translation of code-switched languages. This
approach has already shown good performance in tasks such as sentiment analysis
[
        <xref ref-type="bibr" rid="ref33">33</xref>
        ].
      </p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>This publication has emanated from research supported in part by a research
grant from Science Foundation Ireland (SFI) under grant agreement number
SFI/12/RC/2289_P2, co-funded by the European Regional Development Fund,
and the Enterprise Ireland (EI) Innovation Partnership Programme under grant
number IP20180729, NURS – Neural Machine Translation for Under-Resourced
Scenarios.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Aguilar</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>AlGhamdi</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Soto</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Diab</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hirschberg</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Solorio</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Named entity recognition on code-switched data: Overview of the CALCS 2018 shared task</article-title>
          .
          <source>In: Proceedings of the Third Workshop on Computational Approaches</source>
          to Linguistic Code-Switching. pp.
          <fpage>138</fpage>
          -
          <lpage>147</lpage>
          . Association for Computational Linguistics, Melbourne,
          <source>Australia (Jul</source>
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Alex</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Automatic detection of english inclusions in mixed-lingual data with an application to parsing</article-title>
          . The University of Edinburgh (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Bahdanau</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cho</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bengio</surname>
            ,
            <given-names>Y.:</given-names>
          </string-name>
          <article-title>Neural machine translation by jointly learning to align and translate</article-title>
          .
          <source>3rd International Conference on Learning Representations, ICLR</source>
          <year>2015</year>
          , San Diego, CA, USA, May 7-
          <issue>9</issue>
          ,
          <year>2015</year>
          , Conference Track Proceedings (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Banerjee</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lavie</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>METEOR: An automatic metric for MT evaluation with improved correlation with human judgments. In: Proceedings of the ACL workshop on intrinsic and extrinsic evaluation measures for machine translation and/or summarization</article-title>
          . pp.
          <fpage>65</fpage>
          -
          <lpage>72</lpage>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Bojar</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tamchyna</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Improving translation model by monolingual data</article-title>
          .
          <source>In: Proceedings of the Sixth Workshop on Statistical Machine Translation</source>
          . pp.
          <fpage>330</fpage>
          -
          <lpage>336</lpage>
          . Association for Computational Linguistics, Edinburgh, Scotland (Jul
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Boztepe</surname>
          </string-name>
          , E.:
          <article-title>Issues in code-switching: Competing theories and models</article-title>
          .
          <source>Working Papers in TESOL and Applied Linguistics</source>
          <volume>3</volume>
          (
          <issue>2</issue>
          ),
          <fpage>1</fpage>
          -
          <lpage>27</lpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Çetinoğlu</surname>
          </string-name>
          , Ö.,
          <string-name>
            <surname>Schulz</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vu</surname>
          </string-name>
          , N.T.:
          <article-title>Challenges of computational processing of codeswitching</article-title>
          . In: Proceedings of the Second Workshop on Computational Approaches to Code Switching. pp.
          <fpage>1</fpage>
          -
          <lpage>11</lpage>
          . Association for Computational Linguistics, Austin, Texas (Nov
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Çetinoğlu</surname>
          </string-name>
          , Ö.,
          <string-name>
            <surname>Schulz</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vu</surname>
          </string-name>
          , N.T.:
          <article-title>Challenges of computational processing of codeswitching</article-title>
          . In: Proceedings of the Second Workshop on Computational Approaches to Code Switching. pp.
          <fpage>1</fpage>
          -
          <lpage>11</lpage>
          . Association for Computational Linguistics, Austin, Texas (Nov
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Chandu</surname>
            ,
            <given-names>K.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chinnakotla</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Black</surname>
            ,
            <given-names>A.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shrivastava</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <string-name>
            <surname>Webshodh</surname>
          </string-name>
          :
          <article-title>A code mixed factoid question answering system for web</article-title>
          .
          <source>In: International Conference of the Cross-Language Evaluation Forum for European Languages</source>
          . pp.
          <fpage>104</fpage>
          -
          <lpage>111</lpage>
          . Springer (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Currey</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Miceli</surname>
            <given-names>Barone</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>A.V.</given-names>
            ,
            <surname>Heafield</surname>
          </string-name>
          ,
          <string-name>
            <surname>K.</surname>
          </string-name>
          :
          <article-title>Copied monolingual data improves low-resource neural machine translation</article-title>
          .
          <source>In: Proceedings of the Second Conference on Machine Translation</source>
          . pp.
          <fpage>148</fpage>
          -
          <lpage>156</lpage>
          . Association for Computational Linguistics, Copenhagen, Denmark (Sep
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Edunov</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ott</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Auli</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grangier</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Understanding back-translation at scale</article-title>
          .
          <source>In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing</source>
          . pp.
          <fpage>489</fpage>
          -
          <lpage>500</lpage>
          . Association for Computational Linguistics, Brussels, Belgium (Oct-Nov
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Gupta</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chinnakotla</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shrivastava</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Transliteration better than translation? answering code-mixed questions over a knowledge base</article-title>
          .
          <source>In: Proceedings of the Third Workshop on Computational Approaches</source>
          to Linguistic Code-Switching. pp.
          <fpage>39</fpage>
          -
          <lpage>50</lpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Hidayat</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>An analysis of code switching used by facebookers (a case study in a social network site) (</article-title>
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Jamatia</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gambäck</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Das</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Part-of-speech tagging for code-mixed EnglishHindi twitter and Facebook chat messages</article-title>
          .
          <source>In: Proceedings of the International Conference Recent Advances in Natural Language Processing</source>
          . pp.
          <fpage>239</fpage>
          -
          <lpage>248</lpage>
          . INCOMA Ltd. Shoumen, Bulgaria, Hissar,
          <source>Bulgaria (Sep</source>
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Johnson</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schuster</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Le</surname>
            ,
            <given-names>Q.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krikun</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thorat</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          , Vié- gas, F.,
          <string-name>
            <surname>Wattenberg</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Corrado</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hughes</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dean</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <source>Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation. Transactions of the Association for Computational Linguistics</source>
          <volume>5</volume>
          ,
          <fpage>339</fpage>
          -
          <lpage>351</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Joshi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Prabhu</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shrivastava</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Varma</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Towards sub-word level compositions for sentiment analysis of Hindi-English code mixed text</article-title>
          .
          <source>In: Proceedings of COLING</source>
          <year>2016</year>
          ,
          <source>the 26th International Conference on Computational Linguistics: Technical Papers</source>
          . pp.
          <fpage>2482</fpage>
          -
          <lpage>2491</lpage>
          .
          <article-title>The COLING 2016 Organizing Committee</article-title>
          , Osaka,
          <source>Japan (Dec</source>
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Joshi</surname>
            ,
            <given-names>A.K.</given-names>
          </string-name>
          :
          <article-title>Processing of sentences with intra-sentential code-switching</article-title>
          .
          <source>In: Coling 1982: Proceedings of the Ninth International Conference on Computational Linguistics</source>
          (
          <year>1982</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Karakanta</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dehdari</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Genabith</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>Neural machine translation for lowresource languages without parallel corpora</article-title>
          .
          <source>Machine Translation</source>
          <volume>32</volume>
          (
          <issue>1-2</issue>
          ),
          <fpage>167</fpage>
          -
          <lpage>189</lpage>
          (
          <year>Jun 2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Klein</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kim</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Deng</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Senellart</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rush</surname>
            ,
            <given-names>A.M.:</given-names>
          </string-name>
          <article-title>OpenNMT: Open-Source Toolkit for Neural Machine Translation</article-title>
          .
          <source>Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics System Demonstrations</source>
          ,
          <fpage>67</fpage>
          -
          <lpage>72</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Koehn</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Statistical significance tests for machine translation evaluation</article-title>
          .
          <source>In: Proceedings of the 2004 conference on empirical methods in natural language processing</source>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Lantto</surname>
          </string-name>
          , H.:
          <article-title>Code-switching, swearing and slang: The colloquial register of basque in greater bilbao</article-title>
          .
          <source>International Journal of Bilingualism</source>
          <volume>18</volume>
          (
          <issue>6</issue>
          ),
          <fpage>633</fpage>
          -
          <lpage>648</lpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          :
          <article-title>Emotion in code-switching texts: Corpus construction and analysis</article-title>
          .
          <source>In: Proceedings of the Eighth SIGHAN Workshop on Chinese Language Processing</source>
          . pp.
          <fpage>91</fpage>
          -
          <lpage>99</lpage>
          . Association for Computational Linguistics, Beijing, China (Jul
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>D.C.</given-names>
          </string-name>
          :
          <string-name>
            <surname>Cantonese-English</surname>
          </string-name>
          code
          <article-title>-switching research in Hong Kong: A Y2K review</article-title>
          .
          <source>World Englishes</source>
          <volume>19</volume>
          (
          <issue>3</issue>
          ),
          <fpage>305</fpage>
          -
          <lpage>322</lpage>
          (
          <year>2000</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Luong</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pham</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Manning</surname>
          </string-name>
          , C.D.:
          <article-title>Effective approaches to attention-based neural machine translation</article-title>
          .
          <source>In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing</source>
          . pp.
          <fpage>1412</fpage>
          -
          <lpage>1421</lpage>
          . Association for Computational Linguistics, Lisbon, Portugal (Sep
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Myers-Scotton</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Common and uncommon ground: Social and structural factors in codeswitching</article-title>
          .
          <source>Language in society 22(4)</source>
          ,
          <fpage>475</fpage>
          -
          <lpage>503</lpage>
          (
          <year>1993</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Papineni</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Roukos</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ward</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhu</surname>
            ,
            <given-names>W.J.:</given-names>
          </string-name>
          <article-title>BLEU: a method for automatic evaluation of machine translation</article-title>
          .
          <source>In: Proceedings of the 40th annual meeting on association for computational linguistics</source>
          . pp.
          <fpage>311</fpage>
          -
          <lpage>318</lpage>
          . Association for Computational Linguistics (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>Park</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Song</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yoon</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Building a neural machine translation system using only synthetic parallel data</article-title>
          .
          <source>CoRR</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <surname>Parshad</surname>
          </string-name>
          , R.D.,
          <string-name>
            <surname>Bhowmick</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chand</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kumari</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sinha</surname>
          </string-name>
          , N.:
          <article-title>What is India speaking? Exploring the “Hinglish” invasion</article-title>
          .
          <source>Physica A: Statistical Mechanics and its Applications</source>
          <volume>449</volume>
          ,
          <fpage>375</fpage>
          -
          <lpage>389</lpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          29.
          <string-name>
            <surname>Partanen</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lim</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rießler</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Poibeau</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Dependency parsing of codeswitching data with cross-lingual feature representations</article-title>
          .
          <source>In: Proceedings of the Fourth International Workshop on Computational Linguistics of Uralic Languages</source>
          . pp.
          <fpage>1</fpage>
          -
          <lpage>17</lpage>
          . Association for Computational Linguistics, Helsinki, Finland (Jan
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          30.
          <string-name>
            <surname>Poncelas</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shterionov</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Way</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , de Buy Wenniger,
          <string-name>
            <given-names>G.M.</given-names>
            ,
            <surname>Passban</surname>
          </string-name>
          ,
          <string-name>
            <surname>P.</surname>
          </string-name>
          :
          <article-title>Investigating backtranslation in neural machine translation</article-title>
          .
          <source>In: 21st Annual Conference of the European Association for Machine Translation</source>
          . p.
          <fpage>249</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          31.
          <string-name>
            <surname>Popel</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>CUNI transformer neural MT system for WMT18</article-title>
          .
          <source>In: Proceedings of the Third Conference on Machine Translation: Shared Task Papers</source>
          . pp.
          <fpage>482</fpage>
          -
          <lpage>487</lpage>
          . Association for Computational Linguistics, Belgium, Brussels (Oct
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          32.
          <string-name>
            <surname>Popović</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>ChrF: character n-gram F-score for automatic MT evaluation</article-title>
          .
          <source>In: Proceedings of the Tenth Workshop on Statistical Machine Translation</source>
          . pp.
          <fpage>392</fpage>
          -
          <lpage>395</lpage>
          . Association for Computational Linguistics, Lisbon, Portugal (Sep
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          33.
          <string-name>
            <surname>Pratapa</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Choudhury</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sitaram</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Word embeddings for code-mixed language processing</article-title>
          .
          <source>In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing</source>
          . pp.
          <fpage>3067</fpage>
          -
          <lpage>3072</lpage>
          . Association for Computational Linguistics, Brussels, Belgium (Oct-Nov
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          34.
          <string-name>
            <surname>Ramirez</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Bilingualism through Schooling: Cross-Cultural Education for Minority and Majority Students</article-title>
          . State University of New York Press (
          <year>1985</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          35.
          <string-name>
            <surname>Sennrich</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Haddow</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Linguistic Input Features Improve Neural Machine Translation</article-title>
          .
          <source>In: Proceedings of the First Conference on Machine Translation</source>
          . pp.
          <fpage>83</fpage>
          -
          <lpage>91</lpage>
          . Association for Computational Linguistics, Berlin, Germany (
          <year>August 2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          36.
          <string-name>
            <surname>Sennrich</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Haddow</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Birch</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Improving neural machine translation models with monolingual data</article-title>
          .
          <source>In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)</source>
          . pp.
          <fpage>86</fpage>
          -
          <lpage>96</lpage>
          . Association for Computational Linguistics, Berlin, Germany (Aug
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          37.
          <string-name>
            <surname>Sinha</surname>
            ,
            <given-names>R.M.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thakur</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Machine translation of bi-lingual Hindi-English (Hinglish) text</article-title>
          .
          <source>In: 10th Machine Translation summit (MT Summit X)</source>
          . pp.
          <fpage>149</fpage>
          -
          <lpage>156</lpage>
          . Phuket,
          <string-name>
            <surname>Thailand</surname>
          </string-name>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          38.
          <string-name>
            <surname>Snover</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dorr</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schwartz</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Micciulla</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Makhoul</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>A study of translation edit rate with targeted human annotation</article-title>
          .
          <source>In: Proceedings of association for machine translation in the Americas</source>
          . vol.
          <volume>200</volume>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          39.
          <string-name>
            <surname>Solorio</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Part-of-Speech tagging for English-Spanish code-switched text</article-title>
          .
          <source>In: Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing</source>
          . pp.
          <fpage>1051</fpage>
          -
          <lpage>1060</lpage>
          . Association for Computational Linguistics, Honolulu, Hawaii (Oct
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          40.
          <string-name>
            <surname>Sutskever</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vinyals</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Le</surname>
            ,
            <given-names>Q.V.</given-names>
          </string-name>
          :
          <article-title>Sequence to sequence learning with neural networks</article-title>
          .
          <source>In: Advances in Neural Information Processing Systems</source>
          . pp.
          <fpage>3104</fpage>
          -
          <lpage>3112</lpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          41.
          <string-name>
            <surname>Torregrosa</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pasricha</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Masoud</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chakravarthi</surname>
            ,
            <given-names>B.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Alonso</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Casas</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Arcan</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Leveraging rule-based machine translation knowledge for underresourced neural machine translation models</article-title>
          .
          <source>In: Proceedings of Machine Translation Summit XVII Volume 2: Translator, Project and User Tracks</source>
          . pp.
          <fpage>125</fpage>
          -
          <lpage>133</lpage>
          . European Association for Machine Translation, Dublin, Ireland (
          <volume>19</volume>
          -
          <fpage>23</fpage>
          Aug
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref42">
        <mixed-citation>
          42.
          <string-name>
            <surname>Vilares</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Alonso</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gómez-Rodríguez</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Sentiment analysis on monolingual, multilingual and code-switching twitter corpora</article-title>
          .
          <source>In: Proceedings of the 6th Workshop on Computational Approaches</source>
          to Subjectivity,
          <article-title>Sentiment and Social Media Analysis</article-title>
          . pp.
          <fpage>2</fpage>
          -
          <lpage>8</lpage>
          . Association for Computational Linguistics, Lisboa,
          <source>Portugal (Sep</source>
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref43">
        <mixed-citation>
          43.
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhou</surname>
          </string-name>
          , G.:
          <article-title>Emotion detection in code-switching texts via bilingual and sentimental information</article-title>
          .
          <source>In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)</source>
          . pp.
          <fpage>763</fpage>
          -
          <lpage>768</lpage>
          . Association for Computational Linguistics, Beijing, China (Jul
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref44">
        <mixed-citation>
          44.
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schuster</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Le</surname>
            ,
            <given-names>Q.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Norouzi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Macherey</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krikun</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cao</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gao</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Macherey</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Klingner</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shah</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Johnson</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kaiser</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gouws</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kato</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kudo</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kazawa</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stevens</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kurian</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Patil</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Young</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Riesa</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rudnick</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vinyals</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Corrado</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hughes</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dean</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>Google's neural machine translation system: Bridging the gap between human and machine translation</article-title>
          .
          <source>CoRR</source>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref45">
        <mixed-citation>
          45.
          <string-name>
            <surname>Zhang</surname>
          </string-name>
          , J.,
          <string-name>
            <surname>Matsumoto</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Corpus augmentation for neural machine translation with chinese-japanese parallel corpora</article-title>
          .
          <source>Applied Sciences</source>
          <volume>9</volume>
          (
          <issue>10</issue>
          ),
          <year>2036</year>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref46">
        <mixed-citation>
          46.
          <string-name>
            <surname>Zirikly</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Diab</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Named entity recognition for Arabic social media</article-title>
          .
          <source>In: Proceedings of the 1st Workshop on Vector Space Modeling for Natural Language Processing</source>
          . pp.
          <fpage>176</fpage>
          -
          <lpage>185</lpage>
          . Association for Computational Linguistics, Denver, Colorado (Jun
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>