<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>IIITG-ADBU@HASOC-Dravidian-CodeMix-FIRE2020: Ofensive Content Detection in Code-Mixed Dravidian Text</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Arup Baruah</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kaushik Amar Das</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ferdous Ahmed Barbhuiya</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kuntal Dey</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Accenture Technology Labs</institution>
          ,
          <addr-line>Bangalore</addr-line>
          ,
          <country country="IN">India</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Indian Institute of Information Technology</institution>
          ,
          <addr-line>Guwahati</addr-line>
          ,
          <country country="IN">India</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This paper presents the results obtained by our SVM and XLM-RoBERTa based classifiers in the shared task “Dravidian-CodeMix-HASOC 2020”. The SVM classifier trained using TF-IDF features of character and word n-grams performed the best on the code-mixed Malayalam text. It obtained a weighted F1 score of 0.95 (1st Rank) and 0.76 (3rd Rank) on the YouTube and Twitter dataset respectively. The XLMRoBERTa based classifier performed the best on the code-mixed Tamil text. It obtained a weighted F1 score of 0.87 (3rd Rank) on the code-mixed Tamil Twitter dataset.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;SVM</kwd>
        <kwd>XLM-RoBERTa</kwd>
        <kwd>Ofensive Language</kwd>
        <kwd>Code-Mixed</kwd>
        <kwd>Dravidian Language</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>We participated in both the tasks. We used SVM and XLM-RoBERTa classifiers in our study.
The SVM classifier was trained using TF-IDF features of character n-grams, word n-grams, and
character and word n-grams combined.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        Ofensive language detection in English has witnessed the use of SVM [
        <xref ref-type="bibr" rid="ref3 ref4 ref5 ref6 ref7">3, 4, 5, 6, 7</xref>
        ], Logistic
Regression [
        <xref ref-type="bibr" rid="ref10 ref11 ref6 ref8 ref9">8, 9, 10, 6, 11</xref>
        ], and deep learning techniques [
        <xref ref-type="bibr" rid="ref12 ref13 ref14 ref15 ref16 ref17">12, 13, 14, 15, 16, 17</xref>
        ]. The main
focus of [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] was to tackle the use of code words for obfuscating the hate words. Traditional
machine learning and deep learning techniques have also been used in the detection of ofensive
language in code-mixed Hindi-English text [
        <xref ref-type="bibr" rid="ref18 ref19 ref20 ref21 ref22 ref23 ref24">18, 19, 20, 21, 22, 23, 24</xref>
        ]. Work performed on
code-mixed Tamil-English and Malayalam-English text includes corpus created for sentiment
analysis for these two languages [
        <xref ref-type="bibr" rid="ref25">25, 26</xref>
        ]. [27] focused on machine translation of code-mixed
text in Dravidian languages. It was found that removal of code-mixing improves the quality of
machine translation.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Dataset</title>
      <p>Table 1 shows the statistics of the dataset provided as part of this shared task. The instances in
the dataset were labeled as “not ofensive” (NOT) or “ofensive” (OFF). Task 1 was conducted for
Malayalam language only. The source of the dataset for this task was YouTube. As can be seen
from the table, this dataset is imbalanced with about 83% labeled as NOT. Task 2 was conducted
for both Tamil and Malayalam languages. The source of the datasets for this task was Twitter.
As can be seen from the tables, the dataset for this task was balanced. Train, development, and
test set was provided for Task 1. For task 2, only train and test set was provided. We created
the development set for Task 2, by doing a stratified split and retaining 85% of the dataset for
training and 15% as development dataset.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Methodology</title>
      <p>In this study we used SVM and XLM-RoBERTa based classifiers. The SVM classifier was trained
using TF-IDF features of character n-grams, word n-grams, and combination of character and
word n-grams. In our study, we used character n-grams of size 1 to 6, and word n-grams of size
1 to 3.</p>
      <p>XLM-RoBERTa model [28] is based on the RoBERTa model [29]. RoBERTa model is based on
the transformer architecture. XLM-RoBERTa is a multi-lingual model trained on 100 diferent
languages including Tamil and Malayalam. In our study, we used the pre-trained base model.
The Adam optimizer with weight decay was used during training. The learning rate and epsilon
parameter for the optimizer were set to 2e-5 and 1e-8 respectively. We used the class provided
by HuggingFace Transformers library 1 for sequence classification in our study. This class
provides a linear layer on top of the pooled output to perform the binary classification.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Results</title>
      <p>1 https://huggingface.co/transformers/
The XLM-RoBERTa classifier obtained a weighted F1 score of 0.8650 and was the second best
performing classifier on the dev set for this task. For code-mixed Malayalam-English text
of the task 2 dev set, the best performing classifier was the SVM classifier trained using the
combination of TF-IDF features of character and word n-grams. It obtained a weighted F1 score
of 0.7610. The XLM-RoBERTa classifier obtained a weighted F1 score of 0.5171 and was the
worst performing classifier for this task.</p>
      <p>Table 3 shows the results that our submitted classifiers obtained on the test set. The SVM
classifiers mentioned in this table are the only one submitted for the tasks. These classifiers
were selected based on their performance on the development set. As can be seen from the table,
the SVM classifier trained on the combination of TF-IDF features of character and word n-grams
performed the best in task 1 with as weighted F1 score of 0.9471. It obtained the 1st rank for
the task. XLM-RoBERTa was the best performing classifier for the Tamil-English dataset of
task 2. It was a weighted F1 score of 0.8669 and obtained the 3rd rank for the task. The SVM
classifier trained on the combination of TF-IDF features of character and word n-grams again
performed the best for the Malayalam-English dataset of task 2 with a weighted F1 score of
0.7623. It obtained the 3rd rank for the task. Table 4 shows the confusion matrices obtained on
the test set by classifiers submitted for the shared task.</p>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusion</title>
      <p>We used the SVM and XLM-RoBERTa based classifiers to detect ofensive language in
codemixed Tamil-English and Malayalam-English text. In our study, the SVM classifier trained
using combination of TF-IDF features of character and word n-grams performed the best
for code-mixed Malayalam-English text (both YouTube and Twitter dataset). This classifier
obtained the weighted F1 score of 0.95 (1st rank) and 0.76 (3rd rank) for Task 1 and Task 2
(Malayalam) respectively. The XLM-RoBERTa based classifier performed the best for the
TamilEnglish dataset of Task 2 and obtained an weighted F1 score of 0.87 (3rd rank) for the task. On
comparing the performance of our SVM models on the YouTube and Twitter data for Malayalam
language, we can observe that the performance of the classifier degraded considerably for
the Twitter dataset. Whether this degradation is due to the type of language used in Twitter
conversation, length of the text etc. can be performed as a future study.</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments</title>
      <p>Supported by Visvesvaraya PhD Scheme, MeitY, Govt. of India, MEITY-PHD-3050.
Workshop on Spoken Language Technologies for Under-resourced languages (SLTU)
and Collaboration and Computing for Under-Resourced Languages (CCURL), European
Language Resources association, Marseille, France, 2020, pp. 202–210.
[26] B. R. Chakravarthi, N. Jose, S. Suryawanshi, E. Sherly, J. P. McCrae, A sentiment analysis
dataset for code-mixed Malayalam-English, in: Proceedings of the 1st Joint Workshop on
Spoken Language Technologies for Under-resourced languages (SLTU) and Collaboration
and Computing for Under-Resourced Languages (CCURL), European Language Resources
association, Marseille, France, 2020, pp. 177–184.
[27] B. R. Chakravarthi, Leveraging orthographic information to improve machine translation
of under-resourced languages, Ph.D. thesis, NUI Galway, 2020.
[28] A. Conneau, K. Khandelwal, N. Goyal, V. Chaudhary, G. Wenzek, F. Guzmán, E. Grave,
M. Ott, L. Zettlemoyer, V. Stoyanov, Unsupervised cross-lingual representation learning
at scale, in: D. Jurafsky, J. Chai, N. Schluter, J. R. Tetreault (Eds.), Proceedings of the
58th Annual Meeting of the Association for Computational Linguistics, ACL 2020, Online,
July 5-10, 2020, Association for Computational Linguistics, 2020, pp. 8440–8451. URL:
https://www.aclweb.org/anthology/2020.acl-main.747/.
[29] Y. Liu, M. Ott, N. Goyal, J. Du, M. Joshi, D. Chen, O. Levy, M. Lewis, L. Zettlemoyer, V.
Stoyanov, Roberta: A robustly optimized BERT pretraining approach, CoRR abs/1907.11692
(2019). URL: http://arxiv.org/abs/1907.11692. arXiv:1907.11692.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>B. R.</given-names>
            <surname>Chakravarthi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>McCrae</surname>
          </string-name>
          , P. B,
          <string-name>
            <surname>S. KP</surname>
          </string-name>
          , T. Mandl,
          <article-title>Overview of the track on "hasoc-ofensive language identification- dravidiancodemix"</article-title>
          ,
          <source>in: Proceedings of the 12th Forum for Information Retrieval Evaluation</source>
          ,
          <source>FIRE '20</source>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>B. R.</given-names>
            <surname>Chakravarthi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>McCrae</surname>
          </string-name>
          , P. B,
          <string-name>
            <surname>S. KP</surname>
          </string-name>
          , T. Mandl,
          <article-title>Overview of the track on "hasoc-ofensive language identification- dravidiancodemix"</article-title>
          ,
          <source>in: Working Notes of the Forum for Information Retrieval Evaluation (FIRE</source>
          <year>2020</year>
          ). CEUR Workshop Proceedings. In: CEUR-WS. org, Hyderabad, India,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>S.</given-names>
            <surname>Malmasi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Zampieri</surname>
          </string-name>
          , Detecting Hate Speech in Social Media,
          <source>in: RANLP</source>
          <year>2017</year>
          , Varna, Bulgaria,
          <year>2017</year>
          , pp.
          <fpage>467</fpage>
          -
          <lpage>472</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>S.</given-names>
            <surname>Malmasi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Zampieri</surname>
          </string-name>
          ,
          <article-title>Challenges in discriminating profanity from hate speech</article-title>
          ,
          <source>Journal of Experimental &amp; Theoretical Artificial Intelligence</source>
          <volume>30</volume>
          (
          <year>2018</year>
          )
          <fpage>187</fpage>
          -
          <lpage>202</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>R.</given-names>
            <surname>Magu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Joshi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. J.</given-names>
            <surname>Luo</surname>
          </string-name>
          ,
          <source>Detecting the Hate Code on Social Media, in: AAAI ICWSM</source>
          <year>2017</year>
          , Montreal,
          <year>2017</year>
          , pp.
          <fpage>608</fpage>
          -
          <lpage>611</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>T.</given-names>
            <surname>Davidson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Warmsley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Macy</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Weber</surname>
          </string-name>
          ,
          <source>Automated Hate Speech Detection and the Problem of Ofensive Language, in: AAAI ICWSM</source>
          <year>2017</year>
          , Montreal,
          <year>2017</year>
          , pp.
          <fpage>512</fpage>
          -
          <lpage>515</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>N.</given-names>
            <surname>Samghabadi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Maharjan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sprague</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Diaz-Sprague</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Solorio</surname>
          </string-name>
          , Detecting Nastiness in Social Media,
          <source>in: ALW1 at ACL</source>
          <year>2017</year>
          , Vancouver,
          <year>2017</year>
          , pp.
          <fpage>63</fpage>
          -
          <lpage>72</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>E.</given-names>
            <surname>Wulczyn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Thain</surname>
          </string-name>
          , L. Dixon, Ex Machina: Personal Attacks Seen at Scale,
          <source>in: WWW</source>
          <year>2017</year>
          , Perth,
          <year>2017</year>
          , pp.
          <fpage>1391</fpage>
          -
          <lpage>1399</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Waseem</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Hovy</surname>
          </string-name>
          ,
          <article-title>Hateful Symbols or Hateful People? Predictive Features for Hate Speech Detection on Twitter, in: NAACL-HLT 2016</article-title>
          , California,
          <year>2016</year>
          , pp.
          <fpage>88</fpage>
          -
          <lpage>93</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>N.</given-names>
            <surname>Djuric</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Morris</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Grbovic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Radosavljevic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Bhamidipati</surname>
          </string-name>
          ,
          <article-title>Hate Speech Detection with Comment Embeddings</article-title>
          ,
          <source>in: WWW</source>
          <year>2015</year>
          , Florence, Italy,
          <year>2015</year>
          , pp.
          <fpage>29</fpage>
          -
          <lpage>30</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>J.</given-names>
            <surname>Risch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Krestel</surname>
          </string-name>
          ,
          <article-title>Delete or not Delete? Semi-Automatic Comment Moderation for the Newsroom</article-title>
          , in: TRAC-1
          <source>at COLING</source>
          <year>2018</year>
          ,
          <string-name>
            <surname>Santa</surname>
            <given-names>Fe</given-names>
          </string-name>
          , USA,
          <year>2018</year>
          , pp.
          <fpage>166</fpage>
          -
          <lpage>176</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>P.</given-names>
            <surname>Badjatiya</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Varma</surname>
          </string-name>
          ,
          <article-title>Deep Learning for Hate Speech Detection in Tweets</article-title>
          ,
          <source>in: WWW</source>
          <year>2017</year>
          , Perth,
          <year>2017</year>
          , pp.
          <fpage>759</fpage>
          -
          <lpage>760</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>B.</given-names>
            <surname>Gamback</surname>
          </string-name>
          , U. Sikdar,
          <article-title>Using Convolutional Neural Networks to Classify Hate-Speech</article-title>
          ,
          <source>in: ALW1 at ACL</source>
          <year>2017</year>
          , Vancouver,
          <year>2017</year>
          , pp.
          <fpage>85</fpage>
          -
          <lpage>90</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>J.</given-names>
            <surname>Park</surname>
          </string-name>
          , P. Fung,
          <article-title>One-step and Two-step Classification fro Abusive Language Detection on Twitter</article-title>
          ,
          <source>in: ALW1 at ACL</source>
          <year>2017</year>
          , Vancouver,
          <year>2017</year>
          , pp.
          <fpage>41</fpage>
          -
          <lpage>45</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>J.</given-names>
            <surname>Pavlopoulos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Malakasiotis</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Androutsopoulos</surname>
          </string-name>
          ,
          <article-title>Deep Learning for User Comment Moderation</article-title>
          ,
          <source>in: ALW1 at ACL</source>
          <year>2017</year>
          , Vancouver, 2017a, pp.
          <fpage>25</fpage>
          -
          <lpage>35</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Mehdad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Tetreault</surname>
          </string-name>
          , Do Characters Abuse More Than Words?,
          <source>in: SIGDIAL</source>
          <year>2016</year>
          , Los Angeles,
          <year>2016</year>
          , pp.
          <fpage>299</fpage>
          -
          <lpage>303</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>A.</given-names>
            <surname>Baruah</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F. A.</given-names>
            <surname>Barbhuiya</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Dey</surname>
          </string-name>
          , ABARUAH at semeval
          <article-title>-2019 task 5 : Bi-directional LSTM for hate speech detection</article-title>
          ,
          <source>in: Proceedings of the 13th International Workshop on Semantic Evaluation, SemEval@NAACL-HLT</source>
          <year>2019</year>
          ,
          <article-title>Minneapolis</article-title>
          , MN, USA, June 6-7,
          <year>2019</year>
          ,
          <year>2019</year>
          , pp.
          <fpage>371</fpage>
          -
          <lpage>376</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>T. Y. S. S.</given-names>
            <surname>Santosh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. V. S.</given-names>
            <surname>Aravind</surname>
          </string-name>
          ,
          <article-title>Hate speech detection in hindi-english code-mixed social media text</article-title>
          ,
          <source>in: Proceedings of the ACM India Joint International Conference on Data Science and Management of Data, COMAD/CODS</source>
          <year>2019</year>
          , Kolkata, India, January 3-
          <issue>5</issue>
          ,
          <year>2019</year>
          , ACM,
          <year>2019</year>
          , pp.
          <fpage>310</fpage>
          -
          <lpage>313</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>A.</given-names>
            <surname>Bohra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Vijay</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. S.</given-names>
            <surname>Akhtar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Shrivastava</surname>
          </string-name>
          ,
          <article-title>A dataset of hindi-english code-mixed social media text for hate speech detection</article-title>
          ,
          <source>in: Proceedings of the Second Workshop on Computational Modeling of People's Opinions</source>
          , Personality, and Emotions in Social Media,
          <source>PEOPLES@NAACL-HTL</source>
          <year>2018</year>
          , New Orleans, Louisiana, USA, June 6,
          <year>2018</year>
          , Association for Computational Linguistics,
          <year>2018</year>
          , pp.
          <fpage>36</fpage>
          -
          <lpage>41</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>S.</given-names>
            <surname>Kamble</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Joshi</surname>
          </string-name>
          ,
          <article-title>Hate speech detection from code-mixed hindi-english tweets using deep learning models</article-title>
          , CoRR abs/
          <year>1811</year>
          .05145 (
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>K.</given-names>
            <surname>Sreelakshmi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Premjith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Soman</surname>
          </string-name>
          ,
          <article-title>Detection of hate speech text in hindi-english code-mixed data</article-title>
          ,
          <source>in: Proceedings of the 3rd International Conference on Computing and Network Communications</source>
          ,
          <year>2019</year>
          , India,
          <source>Dec 18-21</source>
          ,
          <year>2019</year>
          ,
          <string-name>
            <surname>Elsevier</surname>
            <given-names>B.V.</given-names>
          </string-name>
          ,
          <year>2020</year>
          , pp.
          <fpage>737</fpage>
          -
          <lpage>744</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>P.</given-names>
            <surname>Mathur</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. R.</given-names>
            <surname>Shah</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Sawhney</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Mahata</surname>
          </string-name>
          ,
          <article-title>Detecting ofensive tweets in hindi-english code-switched language</article-title>
          ,
          <source>in: Proceedings of the Sixth International Workshop on Natural Language Processing for Social Media</source>
          ,
          <source>SocialNLP@ACL</source>
          <year>2018</year>
          , Melbourne, Australia, July
          <volume>20</volume>
          ,
          <year>2018</year>
          , Association for Computational Linguistics,
          <year>2018</year>
          , pp.
          <fpage>18</fpage>
          -
          <lpage>26</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>A.</given-names>
            <surname>Baruah</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F. A.</given-names>
            <surname>Barbhuiya</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Dey</surname>
          </string-name>
          , IIITG-ADBU at HASOC 2019:
          <article-title>Automated hate speech and ofensive content detection in english and code-mixed hindi text</article-title>
          ,
          <source>in: Working Notes of FIRE 2019 - Forum for Information Retrieval Evaluation</source>
          , Kolkata, India,
          <source>December 12-15</source>
          ,
          <year>2019</year>
          , volume
          <volume>2517</volume>
          <source>of CEUR Workshop Proceedings</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>229</fpage>
          -
          <lpage>236</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>A.</given-names>
            <surname>Baruah</surname>
          </string-name>
          ,
          <string-name>
            <surname>K. A. Das</surname>
            ,
            <given-names>F. A.</given-names>
          </string-name>
          <string-name>
            <surname>Barbhuiya</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Dey</surname>
          </string-name>
          ,
          <article-title>Aggression identification in english, hindi and bangla text using bert, roberta and SVM</article-title>
          , in: Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying,
          <source>TRAC@LREC</source>
          <year>2020</year>
          , Marseille, France, May
          <year>2020</year>
          ,
          <string-name>
            <given-names>European</given-names>
            <surname>Language Resources Association</surname>
          </string-name>
          (ELRA),
          <year>2020</year>
          , pp.
          <fpage>76</fpage>
          -
          <lpage>82</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>B. R.</given-names>
            <surname>Chakravarthi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Muralidaran</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Priyadharshini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>McCrae</surname>
          </string-name>
          ,
          <article-title>Corpus creation for sentiment analysis in code-mixed Tamil-English text</article-title>
          ,
          <source>in: Proceedings of the 1st Joint</source>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>