<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>SVM for Hate Speech and Ofensive Content Detection</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Shyam Ratan</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sonal Sinha</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Siddharth Singh</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Centre for Transdisciplinary Studies, Dr. Bhimrao Ambedkar University</institution>
          ,
          <country country="IN">India</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Linguistics, Dr. Bhimrao Ambedkar University</institution>
          ,
          <country country="IN">India</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This paper presents the system description of S_cube, which was submitted at the FIRE Shared Task 2021 on Hate Speech and Ofensive Content Identification in Indo-European Languages (HASOC). Our team submitted a system for Subtask 1 in two languages - English and Hindi, which has two diferent segments Subtask 1A and 1B for both languages. We experimented with the classic machine learning using Support Vector Machine (SVM). We discuss the system and its results with main findings for hate speech and ofensive content identification in this paper. Our model achieves an F1 Score of 0.7563 at English Subtask 1A while the performance is worse for Hindi Subtask 1B (0.7195 F1).</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;English</kwd>
        <kwd>Hindi</kwd>
        <kwd>SVM</kwd>
        <kwd>Hate Speech</kwd>
        <kwd>Ofensive Language</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Communication on the internet has become a lot faster than anything in the world through
various social media platforms like Facebook, Twitter, Whatsapp, Viber, Telegram and many
more. Now, the concern is to check what kind of information and speech is being spread by
users so that the social media platforms do not work as hotbeds for hate speech and ofensive
content. Therefore, a robust automatic filter system is required to sweep away these malicious
contents.Hate speech and ofensive content ranges over several issues, such as politics, religion,
colour, gender, caste, ethnicity, etc. which holds the potential to polarise the society [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The
benefit of anonymity and fake accounts on social media are major contributing factors for ease
in bullying and the spread of hate speech and ofensive languages at light speed.
      </p>
      <p>
        Prominent eforts have been put to develop systems to secure the platforms (distinctively
[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] ). In addition to it, many shared tasks are being regularly organised
for awareness and to come up with productive outcomes as automatic detection around the
context of hate speech, aggression and ofensive content [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ].
      </p>
      <p>One of its kinds in this horizon is FIRE 2021 shared task on Hate Speech and Ofensive Content
Detection in Indo-European Languages (HASOC 2021). In this paper, as part of the shared task,
we elaborate on automatic hate speech and ofensive content identification using SVM based
system and its development for both segments of sub-task 1 in two languages - Hindi and English.</p>
      <p>Train Sub-task 1B
HATE OFFN PRFN
683 622 1,196
566 654 213</p>
      <p>Test Set
TOTAL
1,281
1,532</p>
      <p>The remaining portion of the paper is divided into four sections. Section 2 discusses the
used corpus size and its types for training and testing. Section 3 gives a detailed sketch of the
conducted experiments for this task. Moreover, section 4 delivers the developed system’s results
and their error analysis with classified types of errors. Eventually, section 5 wraps up with the
concluding notes.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Dataset</title>
      <p>In order to direct the experiments for the identification/classification of hate speech and ofensive
language, we used the annotated twitter corpus for Hindi and English languages which were
shared in the FIRE Shared Task HASOC 2021 [15]. An enumeration of the shared task corpus is
given in Table 1. The corpus is labelled at two levels and they were presented as two segments
in Subtask 1 for Hindi and English [16] given below
1. Subtask 1A: In sub-task 1A, the corpus is annotated as HOF and NOT. HOF stands for hate
speech, ofensive language, and profane words while NOT is non hate and non-ofensive
content. Hence it is a binary classification task.
2. Subtask 1B: In Subtask 1B, fine-grained classification is ofered for the identification of
hate speech and ofensive language. If the content is marked HOF in the Subtask 1A then
it is marked as Hate Speech (HATE), Ofensive (OFFN), and Profanity (PRFN) in this stage.</p>
      <p>Hence it is a three class classification task.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Experiments with SVM</title>
      <p>
        We mainly experimented with SVMs classifier for Subtask 1A and 1B of Hindi and English
corpus. We used the scikit-learn implementation of SVM ([17], [18] as cited in [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]). Support
Vector Machines (SVMs) [19] are one of the most eficient classic machine learning models used
for diferent kinds of text classification tasks. We experimented with binary and three-class
problems with our basic objective of exploring the eficiency and productivity of SVMs for the
detection of hate speech and ofensive content.
      </p>
      <p>In the case of our system, we experimented with SVM for both segments of Subtask 1 with
the consecutive sets of features (given below in list 1, 2, and 3) and diferent C-values (0.001,
0.01, 0.1, 1, 5, 10) for working out the best model. Our classifier’s best performances in both
languages are given in Table 2 with n-gram features. Selection of these combination of word
n-grams and character n-grams is based on best performances of system for Subtask1A and
Subtask 1B.</p>
      <p>1. Character n-grams features (trigrams to five-grams).
2. Word n-grams features (unigrams, bigrams and trigrams).
3. A systematic combination of diverse character n-grams and word n-grams features.</p>
      <p>From the above experiments, we get that given the particular dataset, for English Subtask 1A
feature of character five-gram and word trigram with C-value 10 gives the best performance.
For Hindi Subtask 1A feature of character four-gram and word bigram with C-value 5 gives
the best performance. For English Subtask 1B feature of character four-gram and word trigram
with C-value 5 gives the best performance. For Hindi Subtask 1B feature of character four-gram
and word unigram with C-value 10 gives the best performance. In the overall judgment of both
Subtasks, the combination of character n-grams and word n-grams performed well for Subtask
1A in Hindi and English than Subtask 1B. Though, the score-wise improvement in all sub-tasks
was very low for diferent features. Word n-gram features are widely efective in the case of
Subtask 1A, which is binary classification and on the other side of three-class classification
these are not very helpful for Subtask 1B.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Results and Error Analysis</title>
      <p>In the collective results, our system performed best on the test set in Subtask 1A for English in
comparison to Hindi (also Subtask 1A) and Subtask 1B for both languages. The macro F1 scores
of all segments of Subtask 1 are present in Table 3.</p>
      <p>Our SVM classifier was placed at the 34th position in English Subtask 1A (the macro F1 score
is 8 points below that of the topmost team), while it is placed at the 25th place in English Subtask
1B (macro F1 score being almost 9 points below the best performance team), it is placed at the
29th position in Hindi Subtask 1A (with an overall diference of 7 points in the micro F1 score of
the best team) and finally, it is placed at the 15th position in Hindi Subtask 1B (with a diference
of almost 11 points in macro F1 score in comparison to the topper team). The performance
comparison of our classifier and best classifier in all segments of Subtask 1 are summed up in
the Figure 1.</p>
      <p>Apart from the results of our system for both languages, analysis of predicted errors on test
data and its explanations are also most important. It is quite visible that the system we have
developed has high precision and low recall in all Subtasks for Hindi and English.</p>
      <p>In the comparison of predicted labels in sub-task 1A for both languages. Our system predicted
the values of HOF class in English Subtask 1A (see Figure 2) are much higher in numbers than
the Hindi Subtask 1A (see Figure 3), which is quite opposite for NOT class in both segments of
this Subtask. The performance of the system for diferent classes in diferent Subtasks is due to
the sampling size of training sample data. Here, In Subtask 1A the proportion of both classes
(HOF and NOT) were higher individually in English and Hindi.</p>
      <p>Likewise, in three-class classification Subtask 1B the system performed well for some classes
and predicted PRFN and NONE well in comparison of HATE and OFFN classes in English (see
Figure 4). In this Subtask of Hindi (see Figure 5), NONE class is produced adequately good in
numbers than the other classes (PRFN, OFFN, and HATE). The earlier trend of the proportion
of training sample data is followed here in the case of three-class classification, where the 65%
are PRFN and NONE classes of whole proportion in English, which is opposite in Hindi where
PRFN is much lesser and OFFN, HATE are subsequent in numbers than NONE class. Another
basis of the lower performance of the system in diferent Subtasks for both languages is the
structure and morphological features of both languages, where the structure of Hindi is a little
bit complex with a good number of morphological features.</p>
      <p>In error analysis, some diferent types of errors are classified on the basis of gold labels
and system predicted labels in both languages. These are satire/sarcasm, slogan, coined and
aggressive lexical items, idiomatic expressions, quotes, and code-mixed data, etc. Broadly, these
error types are predicted in the form of lexical features for all segments of Subtask 1 represented
in Table 4.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion</title>
      <p>The paper deals with a detailed description of the S_Cube system which is developed for HASOC
at FIRE 2021. Results of the experiment show that SVM extrapolates a cut above for the binary
classifier task in Subtask 1A, efective in cases of uneven corpus too, which is far opposite in
the case of the three-class classifier. SVM is able to achieve low recall (but high precision) for
all Subtasks in both languages. We also observed that, the lower performance in Subtask 1B
could be broadly ascribed to the uneven corpus and the lack of ample training sample size for
Hindi &amp; English Examples</p>
      <p>Translation &amp; Explaination
1. Vodafone had raised a dog, it
became very famous then Mukesh
Ambani was fond of it.</p>
      <p>The government is silent, the
public is upset
1. People of BJP (Bhartiya Janta
Party)1 are Coward. 2. We want
new government in this may not
like this government who makes
false promises</p>
      <p>Gold Label
Predicted Label
NOT - HOF,
OFFN - HATE
NONE - HATE
HOF - NOT,
HATE - NONE
diferent classes. The lexical features like satire, slogans, idioms, quotes and code-mixed data
are adding to the factor due to which system is producing error. Therefore, a more propped
corpus with a substantial learning sample size for each class could give better results in these
incidents.
english and german (2020).
[15] S. Modha, T. Mandl, G. K. Shahi, H. Madhu, S. Satapara, T. Ranasinghe, M. Zampieri,
Overview of the HASOC Subtrack at FIRE 2021: Hate Speech and Ofensive Content
Identification in English and Indo-Aryan Languages and Conversational Hate Speech, in:
FIRE 2021: Forum for Information Retrieval Evaluation, Virtual Event, 13th-17th December
2021, ACM, 2021.
[16] T. Mandl, S. Modha, G. K. Shahi, H. Madhu, S. Satapara, P. Majumder, J. Schäfer, T.
Ranasinghe, M. Zampieri, D. Nandini, A. K. Jaiswal, Overview of the HASOC subtrack at FIRE
2021: Hate Speech and Ofensive Content Identification in English and Indo-Aryan
Languages, in: Working Notes of FIRE 2021 - Forum for Information Retrieval Evaluation,
CEUR, 2021. URL: http://ceur-ws.org/.
[17] F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel,
P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher,
M. Perrot, E. Duchesnay, Scikit-learn: Machine learning in Python, Journal of Machine
Learning Research 12 (2011) 2825–2830.
[18] L. Buitinck, G. Louppe, M. Blondel, F. Pedregosa, A. Mueller, O. Grisel, V. Niculae, P.
Prettenhofer, A. Gramfort, J. Grobler, R. Layton, J. VanderPlas, A. Joly, B. Holt, G. Varoquaux,
API design for machine learning software: experiences from the scikit-learn project, in:
ECML PKDD Workshop: Languages for Data Mining and Machine Learning, 2013, pp.
108–122.
[19] M. A. Hearst, Support vector machines, IEEE Intelligent Systems 13 (1998) 18–28. URL:
http://dx.doi.org/10.1109/5254.708428. doi:10.1109/5254.708428.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>R.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. K.</given-names>
            <surname>Ojha</surname>
          </string-name>
          , Kmi-panlingua
          <source>at HASOC</source>
          <year>2019</year>
          :
          <article-title>SVM vs BERT for hate speech and ofensive content detection</article-title>
          , in: P. Mehta,
          <string-name>
            <given-names>P.</given-names>
            <surname>Rosso</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Majumder</surname>
          </string-name>
          , M. Mitra (Eds.), Working Notes of FIRE 2019 -
          <article-title>Forum for Information Retrieval Evaluation, Kolkata</article-title>
          , India,
          <source>December 12-15</source>
          ,
          <year>2019</year>
          , volume
          <volume>2517</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>285</fpage>
          -
          <lpage>292</lpage>
          . URL: http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2517</volume>
          /
          <fpage>T3</fpage>
          -14.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M.</given-names>
            <surname>Zampieri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Malmasi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Nakov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Rosenthal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Farra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <article-title>Predicting the type and target of ofensive posts in social media, in: Proceedings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technology (NAACL-HLT</article-title>
          ),
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>T.</given-names>
            <surname>Ranasinghe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Zampieri</surname>
          </string-name>
          ,
          <article-title>An evaluation of multilingual ofensive language identification methods for the languages of india</article-title>
          ,
          <source>Information</source>
          <volume>12</volume>
          (
          <year>2021</year>
          ). URL: https://www.mdpi.com/ 2078-2489/12/8/306. doi:
          <volume>10</volume>
          .3390/info12080306.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>S.</given-names>
            <surname>Malmasi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Zampieri</surname>
          </string-name>
          ,
          <article-title>Challenges in discriminating profanity from hate speech</article-title>
          ,
          <source>Journal of Experimental &amp; Theoretical Artificial Intelligence</source>
          <volume>30</volume>
          (
          <year>2018</year>
          )
          <fpage>1</fpage>
          -
          <lpage>16</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Waseem</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Davidson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Warmsley</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Weber</surname>
          </string-name>
          ,
          <article-title>Understanding abuse: A typology of abusive language detection subtasks</article-title>
          ,
          <source>in: Proceedings of the First Workshop on Abusive Language Online, Association for Computational Linguistics</source>
          ,
          <year>2017</year>
          , pp.
          <fpage>78</fpage>
          -
          <lpage>84</lpage>
          . URL: http: //aclweb.org/anthology/W17-3012.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>T.</given-names>
            <surname>Davidson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Warmsley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Macy</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Weber</surname>
          </string-name>
          ,
          <article-title>Automated hate speech detection and the problem of ofensive language</article-title>
          ,
          <source>in: Proceedings of ICWSM</source>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>R.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. N.</given-names>
            <surname>Reganti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bhatia</surname>
          </string-name>
          , T. Maheshwari,
          <article-title>Aggression-annotated corpus of hindienglish code-mixed data</article-title>
          , in: N. C. C. chair), K. Choukri,
          <string-name>
            <given-names>C.</given-names>
            <surname>Cieri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Declerck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Goggi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Hasida</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Isahara</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Maegaard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Mariani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Mazo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Moreno</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Odijk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Piperidis</surname>
          </string-name>
          , T. Tokunaga (Eds.),
          <source>Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC</source>
          <year>2018</year>
          ),
          <source>European Language Resources Association (ELRA)</source>
          , Paris, France,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>S.</given-names>
            <surname>Bhattacharya</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bansal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bhagat</surname>
          </string-name>
          , Y. Dawer, b. lahiri,
          <string-name>
            <given-names>A. K.</given-names>
            <surname>Ojha</surname>
          </string-name>
          ,
          <article-title>Developing a multilingual annotated corpus of misogyny and aggression</article-title>
          ,
          <source>in: Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying, European Language Resources Association (ELRA)</source>
          , Marseille, France,
          <year>2020</year>
          , pp.
          <fpage>158</fpage>
          -
          <lpage>168</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>R.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. K.</given-names>
            <surname>Ojha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Malmasi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Zampieri</surname>
          </string-name>
          ,
          <article-title>Benchmarking aggression identification in social media</article-title>
          ,
          <source>in: Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-2018)</source>
          ,
          <article-title>Association for Computational Linguistics</article-title>
          , Santa Fe, New Mexico, USA,
          <year>2018</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>11</lpage>
          . URL: https://aclanthology.org/W18-4401.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>R.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. K.</given-names>
            <surname>Ojha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Malmasi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Zampieri</surname>
          </string-name>
          ,
          <article-title>Evaluating aggression identification in social media</article-title>
          ,
          <source>in: Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying, European Language Resources Association (ELRA)</source>
          , Marseille, France,
          <year>2020</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>5</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>M.</given-names>
            <surname>Zampieri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Malmasi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Nakov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Rosenthal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Farra</surname>
          </string-name>
          , R. Kumar, SemEval
          <article-title>-2019 Task 6: Identifying and Categorizing Ofensive Language in Social Media (OfensEval)</article-title>
          ,
          <source>in: Proceedings of The 13th International Workshop on Semantic Evaluation (SemEval)</source>
          ,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>T.</given-names>
            <surname>Mandl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Modha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Patel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dave</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Mandlia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Patel</surname>
          </string-name>
          ,
          <article-title>Overview of the HASOC track at FIRE 2019: Hate Speech and Ofensive Content Identification in Indo-European Languages)</article-title>
          ,
          <source>in: Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation</source>
          ,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>T.</given-names>
            <surname>Mandl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Modha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. K.</given-names>
            <surname>Shahi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Jaiswal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Nandini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Patel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Majumder</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Schäfer</surname>
          </string-name>
          ,
          <article-title>Overview of the hasoc track at fire 2020: Hate speech and ofensive content identification in indo-european languages</article-title>
          ,
          <source>in: FIRE</source>
          <year>2020</year>
          :
          <article-title>Forum for Information Retrieval Evaluation, Virtual Event</article-title>
          ,
          <fpage>16th</fpage>
          -20th
          <source>December</source>
          <year>2020</year>
          , ACM,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>T.</given-names>
            <surname>Mandl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Modha</surname>
          </string-name>
          , M. AnandKumar,
          <string-name>
            <given-names>B. R.</given-names>
            <surname>Chakravarthi</surname>
          </string-name>
          ,
          <article-title>Overview of the hasoc track at ifre 2020: Hate speech and ofensive language identification in tamil, malayalam, hindi,</article-title>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>