<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>FIRE</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>AI ML NIT Patna at HASOC 2019: Deep Learning Approach for Identi cation of Abusive Content ?</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Kirti Kumari</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jyoti Prakash Singh</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>National Institute of Technology Patna</institution>
          ,
          <addr-line>Patna</addr-line>
          ,
          <country country="IN">India</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <volume>12</volume>
      <fpage>12</fpage>
      <lpage>15</lpage>
      <abstract>
        <p>Social media is a globally open place for online users to express their thoughts and opinions. There are numerous advantages of social media but some severe challenges are also associated with it. Antisocial and abusive conduct has become more common due to the emergence of social media. Identi cation of Hate Speech, Cyber-aggression, and O ensive language is a very challenging task. The nature of structures of the natural language makes this task even more tedious. Being a challenging task, we are fascinated to propose a deep learning system based on Convolutional Neural Networks to identify Hate Speech, O ensive language, and Profanity. We have done experiments with three di erent embeddings. These experiments have been associated with comments of code-mixed Hindi-English and multi-domain social media text. We have found that One-hot embedding performed better than pre-trained fastText embedding for the code-mixed Hindi dataset.</p>
      </abstract>
      <kwd-group>
        <kwd>Hate Speech</kwd>
        <kwd>Network</kwd>
        <kwd>GloVe</kwd>
        <kwd>fastText</kwd>
        <kwd>O ensive Language</kwd>
        <kwd>Convolutional Neural</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        In social media, anyone is free to post their ideas and views without declaring
his/her identity. Detection of Cyber-aggression [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], Hate Speech [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], O ensive
language and Profanity used by social media users have become one of the major
challenges of the current scenario. Social media users are being targeted by Hate
Speech and O ensive language such as abusive, hurtful, derogatory or
unlawful user-generated content by some mischievous users. These online platforms
provide an open place to discuss and comment on di erent matters but abusive
comments and online violence on individuals have turned this into a very
important social issue. As a result of the misuse of online interactions, a large number
of people have fallen into depression, anxiety, and other mental health problems.
A survey undertaken by Feminism in India has noted that online abuse has been
faced by more than 50% of females in major cities of India1. During the study2
(July 2017), 66% of online abused people reported that they feel powerlessness
in their capacity to react to Internet violence or harassment. These statistics
emphasize the necessity of an automated system for the detection of abusive
comments as well as the moderation of the system. As a result, several research
e orts across the world have emerged over the past few years to identify abusive
content [
        <xref ref-type="bibr" rid="ref1 ref12 ref15 ref2 ref4">1, 2, 4, 12, 15</xref>
        ] using machine learning and natural language processing.
      </p>
      <p>
        Hate Speech detection becomes a challenging task because it can not be
addressed simply by ltering words. In addition to the meaning of words, a lot of
other factors such as context information, characteristics of the user, the
gender of individual people have to be considered for the detection of Hate Speech.
Abuse is a term that includes many varying forms of ne-grained adverse
expressions in the framework of natural language. For example, Nobata et al. [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]
concentrated on Hate Speech, Derogatory language, and Profanity while Wassem
and Hovy [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] focused on racism and sexism types of abuse. De nitions tend to
be overlapping and ambiguous for di erent types of abuse. The Hate Speech
and O ensive Content Identi cation in Indo-European Languages (HASOC)
organizer team de nes Hate Speech as describing negative attributes or de ciencies
toward groups because of race, political opinion, sexual orientation, gender,
social status, health condition or similar [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. A large number of works [
        <xref ref-type="bibr" rid="ref1 ref15 ref5 ref6">1, 5, 6, 15</xref>
        ]
are reported by the researchers on Hate Speech detection for English language
only and very few works [
        <xref ref-type="bibr" rid="ref2 ref8">2, 8</xref>
        ] have been reported for mixed languages such as
English-Hindi, English-Bengali, and other languages.
      </p>
      <p>
        In this paper, we have used the multi-lingual HASOC Corpus [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] and
proposed a deep learning model based on Convolutional Neural Networks (CNN)
to identify Hate Speech and O ensive content on multi-domain social media
platforms collected from Facebook and Twitter. We have used three types of
embeddings of text and transliteration tools to normalize Devanagari to Roman
script for code-mixed Hindi corpus.
      </p>
      <p>The rest of the paper is structured as follows. Section 2 presents associated
works for the detection of Hate Speech and O ensive Language while Section
3 presents our suggested framework for identi cation of Hate Speech, O ensive
language, and Profanity. Section 4 presents the nding of the suggested scheme.
Finally, in Section 5, we have concluded the paper and have discussed the future
directions for these tasks.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Works</title>
      <p>As social media and online platforms have grown in terms of impact and
acceptance of users, various problems such as Hate Speech, Profanity and O ensive
language on these platforms have increased drastically. Several systems have
been proposed by researchers for automatic detection and classi cation of these
problems.</p>
      <sec id="sec-2-1">
        <title>1 https://blog.ipleaders.in/cyber-stalking 2 https://www.statista.com/statistics/784838/online-harassment-impact-on-women/</title>
        <p>
          Burnap and Williams [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] identi ed Hate Speech on the Twitter network
focusing mainly on racism. Nobata et al. [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ] used character n-gram features and
reported that character n-gram features are the most predictive features for the
detection of Hate Speech. Wassem and Hovy [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] have focused on Hate tweet
detection related to racism and sexism. They have used Logistic Regression as
classi er and character n-gram features to classify the tweets. Davidson et al.
[
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] have found that racist and homophobic tweets are generally Hate Speech and
sexist tweets are in general o ensive. They have used Logistic Regression, Naive
Bayes, Decision Trees, Random Forests and Support Vector Machines (SVM) to
classify the tweets. Mehdat and Tetreault [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] also found that character n-gram
features are more predictive than token n-gram features for Hate Speech
detection. Badjatiya et al. [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ] detected Hate Speech using deep learning models such as
Convolutional Neural Networks (CNN) and Long Short-Term Memory (LSTM).
They have experimented with several embeddings named Random, GloVe and
fastText embeddings and have found that combination of LSTM, Random
embedding and Gradient Boosted Decision Trees (GBDTs) had performed the best
for classifying the Hate tweets. Del Vigna et al. [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] have classi ed the Hate Speech
of Facebook comments into ne-grained classes. They have used two di erent
approaches with SVM and LSTM to identify the Hate comments. Bohra et al.
[
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] have identi ed code-mixed Hate tweets, especially for Hindi and English
language on Twitter. Kamble and Joshi [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] have also focused on Hindi and English
code-mixed tweets and have detected Hate Speech using various deep learning
models such as CNN, LSTM, and Bi-directional LSTM. A lot of research works
have been done for the English language but very few works have been done for
the other languages and code-mixed languages. In this paper, we have focused
on multi-lingual text, especially for Hindi and English languages and used a deep
neural network model to detect Hate Speech, O ensive language, and Profanity.
3
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Methodology</title>
      <p>This section describes the details of datasets and proposed approaches. The
description of the datasets used in the experiments has been given in sub-section
3.1 and the details of the proposed approaches to identify Hate Speech, O ensive
language and Profanity are presented in sub-section 3.2.
3.1</p>
      <p>
        Description of Datasets
In this paper, the multilingual datasets [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] provided by Hate Speech and O
ensive Content Identi cation in Indo-European Languages (HASOC)3 have been
used. The shared tasks of HASOC have been provided for three languages
(English, code-mixed Hindi, and German) and each language, there are three
subtasks (Sub-task1, Sub-task2, and Sub-task3). The provided comments have been
collected from Twitter and Facebook. The details of the sub-tasks are: Sub-task1
      </p>
      <sec id="sec-3-1">
        <title>3 https://hasoc2019.github.io</title>
        <p>is a coarse-grained binary classi cation that needed respondents to classify tweets
into two groups: Hate and O ensive (HOF) and Not Hate-O ensive (NOT). (i)
HOF: This post includes hateful, o ensive or profane contents and (ii) NOT:
This post contains neither Hate Speech nor o ensive content. Sub-task2 is a
ne-grained classi cation. Hate Speech and O ensive posts from the Sub-task1
are further classi ed into three categories: (i) Hate Speech (HATE): Posts
under this class contain Hate Speech contents. (ii) O ensive (OFFN): Posts under
this class contain o ensive contents. (iii) Profane (PRFN): These posts contain
profane words. In Sub-task3, the category of abuse is checked and includes only
the posts marked as HOF in Sub-task1. Sub-task3 is further grouped into two
classes: (i) Targeted Insult (TIN): Such posts contain humiliating/insulting or
threatening content. (ii) Untargeted Insult (UNT): Posts that contain
untargeted swearing and profanity, those posts of particular profanity which are not
targeted at anybody but contain language that is not acceptable. The sizes of
the training datasets for English and Hindi corpus are 5852 and 4665 posts,
respectively. Test data for English and Hindi corpus are 1153 and 1318 posts,
respectively. The detailed description of the datasets used in this work is given
in Table 1.
The proposed methodology is based on the Convolutional Neural Networks
(CNN) model, a block diagram of which is shown in Figure 1. At rst, we
have removed the stopwords from the comments by using Natural Language
Toolkit4. The embedding layer is the representation of inputs in the deep neural</p>
      </sec>
      <sec id="sec-3-2">
        <title>4 www.nltk.org</title>
        <p>
          network models. The embedding layer encodes the word used in the comments.
We have done experiments with three di erent embeddings including One-hot
embedding, GloVe embedding [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ] and fastText embedding [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]. In the case of the
Hindi dataset, we have used only One-hot and fastText embeddings. For One-hot
embedding, we have transliterated Devanagari to Roman script by using
transliteration tools5. These tools identify the Unicode patterns and transliterate the
Devanagari script to Roman script. The dimensions of a word embeddings are
kept 300 for pre-trained (GloVe and fastText) and 100 for One-hot embeddings.
This embedded comment is fed to the CNN layer. In our case, we have used four
layers of convolution and one layer of the max-pooling in between the 3rd and
4th convolution layers. At last, we have used the atten layer followed by a dense
layer. Within the layer, we have used sigmoid and softmax activation function at
the dense layer for binary class and multi-class problems, respectively. In every
hidden layer, we have used the Recti ed Linear Unit (ReLU) activation function.
Number of lters used in 1st, 2nd, 3rd and 4th convolutional layers are 8, 16, 64
and 64, respectively. The lter size and max-pooling size in both cases are used
as 4. We have used 80% of the samples for training and the remaining are used
for validation. The details of the hyper-parameters of our experiments are shown
in Table 2. In all the experiments we have used Keras library6.
        </p>
        <p>Comment</p>
        <p>Preprocessing</p>
        <p>Embedding</p>
        <p>Layer</p>
        <p>CNN</p>
        <p>Predicted Class</p>
      </sec>
      <sec id="sec-3-3">
        <title>5 https://pandey.github.io/posts/transliterate-devanagari-to-latin.html 6 http://keras.io</title>
        <p>4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Results and Discussions</title>
      <p>This section describes the results obtained on the English and code-mixed Hindi
languages for all three Sub-tasks. For both datasets, we have used several
embeddings followed by a Convolutional Neural Networks (CNN) model to classify
the comments into their output classes. Using a macro-averaged F1-score and
weighted F1-score, classi cation models have been evaluated for all the tasks.
Table 3 shows the results obtained by proposed approaches to test samples with
di erent combinations of embeddings, which have been used for training and
testing. Our system achieved an approximate weighted F1-score of 69%, 55%
and 60% for English Sub-task1, Sub-task2, and Sub-task3, respectively with
fastText embedding. For the Hindi Sub-task1, Sub-task2 and Sub-task3, our system
achieved an approximate weighted F1-score of 78%, 52%, and 66%, respectively
with One-hot embedding. It is clear from the Table 3 that fastText embedding
is performing better than GloVe and One-hot embeddings in case of the English
dataset and One-hot embedding is performing better than fastText embedding
for the Hindi dataset. Our results are ranked 17th, 20th, 12th among participants
of shared tasks for Hindi Sub-task1, Sub-task2 and Sub-task3, respectively and
the results on English dataset are positioned 56th, 35th, 30th among participants
of shared tasks for Sub-task1, Sub-task2 and Sub-task3, respectively.</p>
      <p>The proposed model has performed better for Sub-task1 and Sub-task3 which
can be seen in Table 3. Table 3 also shows that misclassi ed instances are more
for Sub-task2 in both datasets. The main reason for the misclassi cation of
Subtask2 is that it is a ne-grained classi cation task. Even for the human being,
it is very di cult to di erentiate among the Hate Speech, O ensive language,
and Profanity; not only due to very ne but also very fade di erences among
these classes. Just ltering the keywords will generally result in many
falsepositive cases because context plays a major role in the detection of the Hate
Speech, O ensive language, and Profanity. Another important reason for the
misclassi cation of classi ers is that the datasets are very unbalanced which can
be seen in Table 1.
5</p>
    </sec>
    <sec id="sec-5">
      <title>Conclusion and Future Work</title>
      <p>Hate Speech and O ensive language identi cation is a challenging task. Many
numbers of research have been carried out in the domain of Hate Speech
detection for the English language but very few researches are reported for the other
languages and multi-lingual text. This research work has been focused on
multilingual text classi cation, especially for Hindi and English code-mixed text. In
this paper, a deep learning model for the identi cation of Hate Speech, O ensive
contents, and Profanity on multi-domain platforms have been proposed. Three
types of embeddings: One-hot, pre-trained GloVe and fastText embeddings have
been used in the experiments. It has been found that fastText embedding has
performed better than the other two embeddings for the English dataset and
One-hot has performed better for the Hindi dataset.</p>
      <p>Hate Speech detection is an open challenge for the research community. The
social media post contains not only text but also image followed by text and
even in the case of text, code-mixed languages are used. Therefore, future works
on Hate Speech detection might address multi-lingual cases of several languages
and consideration of multi-modal forms of social media posts to make the system
more robust.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgements</title>
      <p>The rst author would want to acknowledge the Ministry of Electronics and
Information Technology (MeitY), Government of India for the nancial support
during the research work through the Visvesvaraya Ph.D Scheme for Electronics
and IT.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Badjatiya</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gupta</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gupta</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Varma</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Deep learning for Hate Speech detection in tweets</article-title>
          .
          <source>In: Proceedings of the 26th International Conference on World Wide Web Companion</source>
          . pp.
          <volume>759</volume>
          {
          <fpage>760</fpage>
          .
          <string-name>
            <surname>International World Wide Web Conferences Steering Committee</surname>
          </string-name>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Bohra</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vijay</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Singh</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Akhtar</surname>
            ,
            <given-names>S.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shrivastava</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>A dataset of HindiEnglish code-mixed social media text for Hate Speech detection</article-title>
          .
          <source>In: Proceedings of the Second Workshop on Computational Modeling of People's Opinions</source>
          , Personality, and Emotions in Social Media. pp.
          <volume>36</volume>
          {
          <issue>41</issue>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Burnap</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Williams</surname>
            ,
            <given-names>M.L.</given-names>
          </string-name>
          :
          <article-title>Cyber Hate Speech on twitter: An application of machine classi cation and statistical modeling for policy and decision making</article-title>
          .
          <source>Policy &amp; Internet</source>
          <volume>7</volume>
          (
          <issue>2</issue>
          ),
          <volume>223</volume>
          {
          <fpage>242</fpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yan</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wong</surname>
            ,
            <given-names>K.C.</given-names>
          </string-name>
          :
          <article-title>Verbal aggression detection on Twitter comments: convolutional neural network for short-text sentiment analysis</article-title>
          .
          <source>Neural Computing and Applications</source>
          pp.
          <volume>1</volume>
          {
          <fpage>10</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Davidson</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Warmsley</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Macy</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weber</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>Automated Hate Speech detection and the problem of O ensive language</article-title>
          .
          <source>arXiv preprint arXiv:1703.04009</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Del Vigna12</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cimino23</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dell'Orletta</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Petrocchi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tesconi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Hate me, hate me not: Hate Speech detection on facebook</article-title>
          .
          <source>In: Proceedings of the First Italian Conference on Cybersecurity (ITASEC17)</source>
          . pp.
          <volume>86</volume>
          {
          <issue>95</issue>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Joulin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grave</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bojanowski</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Douze</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jegou</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mikolov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Fasttext.zip: Compressing text classi cation models</article-title>
          .
          <source>arXiv preprint arXiv:1612.03651</source>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Kamble</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Joshi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Hate Speech detection from code-mixed Hindi-English tweets using deep learning models</article-title>
          . arXiv preprint arXiv:
          <year>1811</year>
          .
          <volume>05145</volume>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Kumari</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Singh</surname>
            ,
            <given-names>J.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dwivedi</surname>
            ,
            <given-names>Y.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rana</surname>
            ,
            <given-names>N.P.</given-names>
          </string-name>
          :
          <article-title>Aggressive social media post detection system containing symbolic images</article-title>
          .
          <source>In: Conference on e-Business, eServices and e-Society</source>
          . pp.
          <volume>415</volume>
          {
          <fpage>424</fpage>
          . Springer (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Modha</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mandl</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Majumder</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Patel</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Overview of the HASOC track at FIRE 2019: Hate Speech and O ensive Content Identi cation in Indo-European Languages</article-title>
          .
          <source>In: Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation</source>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Mehdad</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tetreault</surname>
          </string-name>
          , J.:
          <article-title>Do characters abuse more than words?</article-title>
          <source>In: Proceedings of the 17th Annual Meeting of the Special Interest Group on Discourse and Dialogue</source>
          . pp.
          <volume>299</volume>
          {
          <issue>303</issue>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Nobata</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tetreault</surname>
          </string-name>
          , J.,
          <string-name>
            <surname>Thomas</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mehdad</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chang</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Abusive language detection in online user content</article-title>
          .
          <source>In: Proceedings of the 25th international conference on world wide web</source>
          . pp.
          <volume>145</volume>
          {
          <fpage>153</fpage>
          .
          <string-name>
            <surname>International World Wide Web Conferences Steering Committee</surname>
          </string-name>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Pennington</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Socher</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Manning</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          : Glove:
          <article-title>Global vectors for word representation</article-title>
          .
          <source>In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP)</source>
          . pp.
          <volume>1532</volume>
          {
          <issue>1543</issue>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Schmidt</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wiegand</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>A survey on Hate Speech detection using natural language processing</article-title>
          .
          <source>In: Proceedings of the Fifth International Workshop on Natural Language Processing for Social Media</source>
          . pp.
          <volume>1</volume>
          {
          <issue>10</issue>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Waseem</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hovy</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Hateful symbols or hateful people? predictive features for Hate Speech detection on Twitter</article-title>
          .
          <source>In: Proceedings of the NAACL student research workshop</source>
          . pp.
          <volume>88</volume>
          {
          <issue>93</issue>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>