<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Forum for Information Retrieval Evaluation, December</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Leveraging Transformers for Hate Speech Detection in Conversational Code-Mixed Tweets</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Zaki Mustafa Farooqi</string-name>
          <email>zaki19048@iiitd.ac.in</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sreyan Ghosh</string-name>
          <email>gsreyan@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rajiv Ratn Shah</string-name>
          <email>rajivratn@iiitd.ac.in</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Code-Mixed Languages, Hindi-English, Hate Speech, Transformers, Ofensive Tweets</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Multimodal Digital Media Analysis Lab, Indraprastha Institute of Information Technology Delhi</institution>
          ,
          <country country="IN">India</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2021</year>
      </pub-date>
      <volume>1</volume>
      <fpage>3</fpage>
      <lpage>17</lpage>
      <abstract>
        <p>In the current era of the internet, where social media platforms are easily accessible for everyone, people often have to deal with threats, identity attacks, hate, and bullying due to their association with a cast, creed, gender, religion, or even acceptance or rejection of a notion. Existing works in hate speech detection primarily focus on individual comment classification as a sequence labelling task and often fail to consider the context of the conversation. The context of a conversation often plays a substantial role when determining the author's intent and sentiment behind the tweet. This paper describes the system proposed by team MIDAS-IIITD for HASOC 2021 subtask 2, one of the first shared tasks focusing on detecting hate speech from Hindi-English code-mixed conversations on Twitter. We approach this problem using neural networks, leveraging the transformer's cross-lingual embeddings and further finetuning them for low-resource hate-speech classification in transliterated Hindi text. Our best performing system, a hard voting ensemble of Indic-BERT, XLM-RoBERTa, and Multilingual BERT, achieved a macro F1 score of 0.7253, placing us 1 on the overall leaderboard standings.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>(R. R. Shah)
https://github.com/zmf0507 (Z. M. Farooqi); https://github.com/Sreyan88 (S. Ghosh); http://midas.iiitd.edu.in/
© 2021 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
influence. Randomized spelling variations and multiple possible interpretations of Hinglish
words in diferent contextual situations make it extremely dificult to deal with for automated
classification. Another challenge worth considering in dealing with Hinglish is the demographic
divide between Hinglish users relative to total active users globally. This poses a severe limitation
as the tweet data in Hinglish language is a small fraction of the large pool of tweets generated,
necessitating the use of selective methods to process such tweets in an automated fashion.</p>
      <p>Parent Tweet
भारतीय रेलवे के ऑ ीजन ए ेस अिभयान की 200वीं रेल ने अपनी या ापूरी
कर ली है। 10 अ रेलगािड़यां 784 मीिटक टन ऑ ीजन लेकर गितमान ह।देश
भर म अभी तक 775 से अिधक टकरों के मा म से 12,630 मीिटक टन मेिडकल
ऑ ीजन प ंचाई गई है
Comment
@user Aaap sach me doctor ho ki aaise hi farji degree li</p>
      <p>hai?
Reply</p>
      <p>
        The context of the conversation plays a very crucial role in hate speech identification. We
acknowledge the fact that a comment categorized as hate speech may not always contain the
subject of hate on its own. As shown in figure 1, agreement or disagreement with a previous
comment or the overall ideology in the chain of the conversation might also induce hate towards
a particular target group. In addition to this, systems that can eficiently utilize the entire context
of the comment chain with a holistic understanding of the entire discussion may also help in
detecting “trigger comments” i.e., non-toxic comments in online discussions which lead to toxic
replies and implicitly help in mitigating bias in hate speech identification which remains a
long-standing problem in this domain [
        <xref ref-type="bibr" rid="ref2 ref3">2, 3</xref>
        ].
      </p>
      <p>
        Hate Speech and Ofensive Content Identification in English and Indo-Aryan Languages
(HASOC) 2021 [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] proposes two subtasks where subtask 1 aims towards identifying and
discriminating hate and profane tweets in English, Hindi, and Marathi, and subtask 2 aims towards
detecting hate tweets in conversations primarily in Hindi-English code-mixed texts. This paper
illustrates our key contribution to subtask 2 [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] of HASOC 2021 [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. We present our system
based on ensemble of transformers to detect hate speech in code-mixed Hindi-English
conversations, which helps us achieve the first place in the HASOC Subtask 2 final leaderboard
standings.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        Hate speech detection is a challenging task with literature including techniques such as
dictionary-based [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], distributional semantics [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] and recent literature exploring the power
of neural network architectures for the same [8]. However, a majority of the work done
on hate speech detection is constrained to the English language [9, 10, 11, 12, 13, 14, 15, 16]
with very limited work on other foreign languages [17, 18, 19, 20, 21] and code-switched text
[22, 23, 24, 25, 26]. Although Hinglish has been a major contributor to hate speech online, this
area has seen very little work with recent work exploring transformers [27] and author profiling
using graph neural networks [25]. Works like [22] uses a Convolutional Neural Network (CNN)
architecture togther with Glove embeddings and transfer learning, in one of the first attempts
to detect hate-speech online in the Hinglish language. Similar to our work, [28] explored
XLMRoBERTa and achieved competitive results on the task of detecting hate-speech in Dravidian
languages. In HAOSC 2020 shared task, [29] also used XLM-RoBERTa to achieve the third rank
on the overall task of detecting ofensive content in code-mixed Dravidian text. However, we
acknowledge the fact that XLM-RoBERTa was pre-trained on Hindi only text and our work
difers from theirs in which we also transliterate words in a diferent code to Hindi to solve the
problem of hate-speech detection.
      </p>
      <p>
        Hate speech detection has branched into several sub-tasks like toxic span extraction [30, 31],
rationale identification [ 32] and hate target identification [ 20]. Though recent advancement
in the field of NLP has pushed the limits of hate speech identification, like transformers [ 25]
and graph neural networks [33, 25, 34] with people attempting to induce external knowledge
leveraging author profiling [ 25] or ideology [35] but using context of the conversation is still
a challenge with very little work exploring this problem. Context of the conversation plays a
huge role in hate speech identification with recent literature exploring both the structure and
efect of context [ 36, 37] for the same. One interesting and related direction of work described
in [38] relates to building systems that generate text which acts as hate speech interventions
in online discussion. Context can both help in detecting trigger comments [39] and implicitly
handle bias in hate speech identification which remains a long-standing problem in this domain
[
        <xref ref-type="bibr" rid="ref2 ref3">2, 3</xref>
        ]. To the best of our knowledge, all these works are constrained to the English language
with very little work on code-mixed Hindi-English and Hinglish text, considering the context
of the conversation.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Dataset</title>
      <p>The dataset provided for this task has code-switched Hindi-English as well as Hinglish
conversation chains taken from twitter as shown in Fig 1. This is a binary classification dataset
having two classes HOF and NOT. HOF denotes the tweet, comment, or reply which contains
hate, ofensive, and profane content in itself or is supporting hate, whereas NOT denotes the
tweet, comment, or reply which does not contain any hate speech, profane, or ofensive content.
More details about the dataset provided to us can be found in table 1. We have also provided
Avg. Comments which tells us about the average number of comments to a Parent Tweet in the
dataset since all Parent Tweets had at least one comment. We do not provide the average number
of replies for comments since all comments do not have replies.</p>
      <sec id="sec-3-1">
        <title>Total Conversations</title>
      </sec>
      <sec id="sec-3-2">
        <title>HOF(Hateful/Ofensive) NOT (neither Hateful nor Ofensive)</title>
        <p>For feeding data into our model, we flatten the given conversations into individual
parentcomment-reply unique conversation chains. As mentioned earlier, all parent tweets have atleast
one comment, but all comments do not have replies, so, a training instance might end with
a comment. For each instance, the final label assigned to the instance was the label of the
ifnal comment in the conversation chain. Table 2 describes the dataset distribution with the
validation split used in our experiments in section 5.2 .</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Methodology</title>
      <p>Our methodology primarily involves fine-tuning the transformer models pre-trained on a
massive multilingual corpus. The following sections further describe the end-to-end approach
used in our experiments.</p>
      <sec id="sec-4-1">
        <title>4.1. Data Pre-processing</title>
        <p>We first start by concatenating the tweet and its comments and replies, if they are present, to
form the final text sequence. Our intuition behind this process is that this concatenation will
help the model understand the context better, especially in those cases where the comment or
reply may not be hateful but shows support for the hateful parent tweet. While concatenating
the tweets, we insert a new separator token “[SENSEP]” between the tweet , comment, and the
reply, to diferentiate between one tweet and another. Post concatenation, we perform data
cleaning by removing hashtags, emojis, URLs and mentions from the tweets. However, we do
not remove punctuation and numbers to preserve the syntactic and semantic coherence of the
tweets.</p>
        <p>Although the data is code-mixed, having both Hindi and English text, there were several
instances where Hindi text is present in Roman script. Since our models are pretrained on a
code-mixed corpus, it is essential to deal with Hindi text in the Roman script to make the whole
dataset consistent for training purposes. Therefore, we perform transliteration using AI4Bharat
library1 to convert the Hindi text in Roman script to Devanagari script.</p>
        <p>Tweet</p>
        <p>Reply</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Baseline Approach : Fine-tuning Transformer Models</title>
        <p>We perform experiments with three diferent types of transformer models, which are described
below.</p>
        <p>• Indic-BERT [40] is an ALBERT model pre-trained on a massive multilingual corpus
having 12 Indic languages such as Assamese, Bengali, English, Gujarati, Hindi, Kannada,
Malayalam, Marathi, Oriya, Punjabi, Tamil, Telugu [40] . The multilingual corpus has
about 9 billion tokens.
• Multilingual BERT (mBERT) is a BERT [41] model pretrained on Wikipedia data having
over 100 languages with a masked language modeling objective.
• XLM-Roberta (XLM-R) [42] is a transformer-based masked language model pretrained
on Common Crawl data having about 100 languages. It was proposed by Facebook and
happened to be one of the best-performing transformer models for multilingual tasks.</p>
        <p>On top of these pre-trained transformer models, we add a dropout followed by a fully
connected layer of size two which takes in the transformer’s CLS token’s representation of size
768. The fully connected layer returns logits for the two classes, i.e., HOF and NOT, which are
then passed to a softmax layer to predict the class of the input text. This is explained as a part
of the ensemble model in figure 2 .</p>
        <p>1https://pypi.org/project/ai4bharat-transliteration/</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.3. Our Approach : Models Ensembling</title>
        <p>We perform model ensembling on the top of our three fine-tuned models as explained in the
section 4.2. Since we had three diferent transformer models fine-tuned for our task, we decided
to combine the output of the three models using the majority voting system, i.e., Hard Voting
and Soft Voting. We denote our two ensemble models as Hard Voting Ensemble and Soft
Voting Ensemble. Hard Voting Ensemble model takes in the class predictions from each of
the fine-tuned transformer models and selects the class with maximum votes. Similarly, Soft
Voting Ensemble model takes in the class probabilities from each of the fine-tuned transformer
models and sum the same class probabilities and selects the class having higher probabilities
sum. Figure 2 shows the end-to-end model pipeline used in our experiment.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Experiments and Results</title>
      <sec id="sec-5-1">
        <title>5.1. Experimental Setup</title>
        <p>For fine-tuning our transformer models, we set a learning rate of 2e-5 for all our experiments
with AdamW [43] optimizer and a linear learning rate scheduler. We train our models with a
batch size of 8. Our experiments use the Hugging-Face Transformers library [44] for fine-tuning
all the pre-trained transformer models.</p>
      </sec>
      <sec id="sec-5-2">
        <title>5.2. Evaluation with Validation data</title>
        <p>In order to evaluate our models, we split the train dataset in an 80:20 ratio where 80% of the
data is the new train data, and the rest of 20% becomes the validation data as shown in table
2. The split is done in random order while maintaining the same class ratio in both train and
validation data. We train our models for ten epochs and keep track of the validation loss at each
epoch. For reporting results on validation data, We use the model checkpoint corresponding to
the epoch with minimum validation loss. At the same time, we make a note of that epoch for
which we later train our model on the whole train dataset in 5.3.</p>
        <p>From Table 3 , we can see that our model ensembling approaches Soft Voting Ensemble
and Hard Voting Ensemble outperform rest of the baseline transformer models with Macro
F1 score of .7682 and .7621 respectively.
Note : F1, Precision and Recall are Macro Scores</p>
      </sec>
      <sec id="sec-5-3">
        <title>5.3. Evaluation with Test data</title>
        <p>Following the results obtained in section 5.2 , we train our models on the whole train dataset for
the best number of epochs identified through the minimum validation loss in section 5.2 . These
epochs vary from 3 to 6 for each of the three transformer models discussed in 4.2. We submit the
test results for three of our models as marked in table 4 which reports the results obtained on the
test data. It can be inferred that the test results follow the same trend as was with the validation
data. However, Hard Voting Ensemble with Macro F1 of 0.7253 is slightly better than Soft
Voting Ensemble with Macro F1 score of 0.7223. However, the overall performance of the
model ensembling technique is better than the Indic-BERT, XLM-RoBERTa, and Multilingual
BERT by a significant margin where the highest possible Macro F1 score for baseline models is
0.7031.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6. Analysis</title>
      <p>Macro F1 Score for Models</p>
      <p>Indic-BERT Multilingual BERT XLM-RoBERTSaoft Voting EnseHmabrdleVoting Ensemble</p>
      <p>Figure 3 compares the results in table 4. It can be noted that the Indic-BERT model has
the lowest F1 score among all the baseline and ensemble models. Soft Voting Ensemble and
Hard Voting Ensemble models yield better results than all the baseline transformer models.
However, merely having a better F1 score is not enough since it is crucial to understand where
our approach fails and where it performs better than the baselines.</p>
      <p>We start our error analysis with table 5 where we have shown the total number of misclassified
samples from each class. In addition to it, it has the percentage of total samples misclassified
from each class, which helps us develop a better understanding of the direction where models
are making more mistakes. Figure 4 shows the detailed results of the performance of each
model for samples of both classes. A closer look reveals that Multilingual BERT and
XLMRoBERTa misclassify almost equal number of samples from both classes HOF and NOT where
the percentage ranges from 29.35% to 31.24%. However, the case is quite diferent for Indic-BERT,
where the misclassification rate is very high for the NOT class, which is 40.27% compared to
the misclassification rate of 23.30% for the HOF class. This could also be because Indic-BERT is
trained on 12 Indian languages and has a less diverse pre-training corpus than the other two
transformer models.
Note : % MR (HOF) and % MR (NOT) refer to percent misclassification rate for HOF and NOT classes
respectively. % MR (class) is the percentage of total misclassified samples of the class out of the total
samples of the class. These are calculated with total HOF samples count of 695 and total NOT samples
count of 653 in test data.</p>
      <p>NOT 390
l
rLTaeeubHOF 162
263
533
Indic-BERT</p>
      <p>XLM-RoBERTa</p>
      <p>As far as our ensemble models are concerned, we observe that the misclassification rate is
very balanced for both classes. To compare it with the baseline transformer models, we can see
that the ensemble models have the lowest misclassification rate for HOF class ranging between
23% to 24%, which was also the case with Indic-BERT, and similarly, the misclassification rate
for NOT class is also close to the lowest among the baseline models. This suggests that model
ensembling minimized the misclassification rate for both classes, and we can further conclude
that a single model’s mistake is likely to be corrected by the other two models in an ensemble
model. However, the ensemble models still make 7-8% more mistakes in identifying NOT class
compared to HOF class.</p>
    </sec>
    <sec id="sec-7">
      <title>7. Conclusion</title>
      <p>In this paper, we deal with a novel problem of detecting hateful tweets in twitter conversations
where the comment or reply might not be ofensive and toxic but contributes to the hate
associated with the parent tweet. We performed a thorough experimental analysis with
stateof-the-art models such as XLM-RoBERTa, Indic-BERT, and Multilingual BERT to show that
pre-trained multi-lingual transformer models can achieve decent performance on this task. We
further demonstrate that this performance can be improved with model ensemble techniques
such as Soft Voting and Hard Voting. As this problem is dealt with for the first time, there could
be many ways to improve these numbers and build a more robust system by incorporating
other factors such as emojis and hashtags, which may equally or partially contribute to the
hatefulness of a tweet. Additionally, we aim to explore better architectures for taking into
account the context of a comment like parent comment and replies to judge the nature of the
comment. Author profiling also would be a potential area of research to detect implicit hate in
conversations.
detection with comment embeddings, in: Proceedings of the 24th International Conference
on World Wide Web, WWW ’15 Companion, Association for Computing Machinery, New
York, NY, USA, 2015, p. 29–30. URL: https://doi.org/10.1145/2740908.2742760. doi:1 0 . 1 1 4 5 /
2 7 4 0 9 0 8 . 2 7 4 2 7 6 0 .
[8] P. Badjatiya, S. Gupta, M. Gupta, V. Varma, Deep learning for hate speech detection
in tweets, in: Proceedings of the 26th international conference on World Wide Web
companion, 2017, pp. 759–760.
[9] A.-M. Founta, C. Djouvas, D. Chatzakou, I. Leontiadis, J. Blackburn, G. Stringhini, A. Vakali,
M. Sirivianos, N. Kourtellis, Large scale crowdsourcing and characterization of twitter
abusive behavior, 2018. a r X i v : 1 8 0 2 . 0 0 3 9 3 .
[10] S. Carta, A. Corriga, R. Mulas, D. R. Recupero, R. Saia, A supervised multi-class multi-label
word embeddings approach for toxic comment classification., in: KDIR, 2019, pp. 105–112.
[11] H. H. Saeed, K. Shahzad, F. Kamiran, Overlapping toxic sentiment classification using deep
neural architectures, in: 2018 IEEE International Conference on Data Mining Workshops
(ICDMW), IEEE, 2018, pp. 1361–1366.
[12] A. Vaidya, F. Mai, Y. Ning, Empirical analysis of multi-task learning for reducing identity
bias in toxic comment detection, in: Proceedings of the International AAAI Conference
on Web and Social Media, volume 14, 2020, pp. 683–693.
[13] T. Tran, Y. Hu, C. Hu, K. Yen, F. Tan, K. Lee, S. Park, Habertor: An eficient and efective
deep hatespeech detector, 2020. a r X i v : 2 0 1 0 . 0 8 8 6 5 .
[14] H. Hitkul, R. R. Shah, P. Kumaraguru, S. Satoh, Maybe look closer? detecting trolling
prone images on instagram, in: 2019 IEEE Fifth International Conference on Multimedia
Big Data (BigMM), IEEE, 2019, pp. 448–456.
[15] Hitkul, K. Aggarwal, P. Bamdev, D. Mahata, R. R. Shah, P. Kumaraguru, Trawling for
trolling: A dataset, 2020. a r X i v : 2 0 0 8 . 0 0 5 2 5 .
[16] S. Ghosh, S. Lepcha, S. Sakshi, R. R. Shah, Speech toxicity analysis: A new spoken language
processing task, arXiv preprint arXiv:2110.07592 (2021).
[17] O. Kamal, A. Kumar, T. Vaidhya, Hostility detection in hindi leveraging pre-trained
language models, 2021. a r X i v : 2 1 0 1 . 0 5 4 9 4 .
[18] J. A. Leite, D. F. Silva, K. Bontcheva, C. Scarton, Toxic language detection in social
media for brazilian portuguese: New dataset and multilingual analysis, arXiv preprint
arXiv:2010.04543 (2020).
[19] A. Saroj, S. Pal, An indian language social media collection for hate and ofensive speech, in:
Proceedings of the Workshop on Resources and Techniques for User and Author Profiling
in Abusive Language, 2020, pp. 2–8.
[20] V. Basile, C. Bosco, E. Fersini, D. Nozza, V. Patti, F. M. Rangel Pardo, P. Rosso, M. Sanguinetti,
SemEval-2019 task 5: Multilingual detection of hate speech against immigrants and women
in Twitter, in: Proceedings of the 13th International Workshop on Semantic Evaluation,
Association for Computational Linguistics, Minneapolis, Minnesota, USA, 2019, pp. 54–63.</p>
      <p>URL: https://aclanthology.org/S19-2007. doi:1 0 . 1 8 6 5 3 / v 1 / S 1 9 - 2 0 0 7 .
[21] A. Ghosh Chowdhury, A. Didolkar, R. Sawhney, R. R. Shah, ARHNet - leveraging
community interaction for detection of religious hate speech in Arabic, in: Proceedings of the
57th Annual Meeting of the Association for Computational Linguistics: Student Research
Workshop, Association for Computational Linguistics, Florence, Italy, 2019, pp. 273–280.</p>
      <p>URL: https://aclanthology.org/P19-2038. doi:1 0 . 1 8 6 5 3 / v 1 / P 1 9 - 2 0 3 8 .
[22] P. Mathur, R. Shah, R. Sawhney, D. Mahata, Detecting ofensive tweets in hindi-english
code-switched language, in: Proceedings of the Sixth International Workshop on Natural
Language Processing for Social Media, 2018, pp. 18–26.
[23] R. Kapoor, Y. Kumar, K. Rajput, R. R. Shah, P. Kumaraguru, R. Zimmermann, Mind your
language: Abuse and ofense detection for code-switched languages, in: Proceedings of
the AAAI Conference on Artificial Intelligence, volume 33, 2019, pp. 9951–9952.
[24] P. Mathur, R. Sawhney, M. Ayyar, R. Shah, Did you ofend me? classification of ofensive
tweets in hinglish language, in: Proceedings of the 2nd Workshop on Abusive Language
Online (ALW2), 2018, pp. 138–148.
[25] S. Chopra, R. Sawhney, P. Mathur, R. R. Shah, Hindi-english hate speech detection: Author
profiling, debiasing, and practical perspectives, in: Proceedings of the AAAI Conference
on Artificial Intelligence, volume 34, 2020, pp. 386–393.
[26] S. Kamble, A. Joshi, Hate speech detection from code-mixed hindi-english tweets using
deep learning models, 2018. a r X i v : 1 8 1 1 . 0 5 1 4 5 .
[27] T. Ranasinghe, S. Gupte, M. Zampieri, I. Nwogu, Wlv-rit at
hasoc-dravidian-codemixifre2020: Ofensive language identification in code-switched youtube comments, 2020.
a r X i v : 2 0 1 1 . 0 0 5 5 9 .
[28] K. Yasaswini, K. Puranik, A. Hande, R. Priyadharshini, S. Thavareesan, B. R. Chakravarthi,
IIITT@DravidianLangTech-EACL2021: Transfer learning for ofensive language detection
in Dravidian languages, in: Proceedings of the First Workshop on Speech and Language
Technologies for Dravidian Languages, Association for Computational Linguistics, Kyiv,
2021, pp. 187–194. URL: https://aclanthology.org/2021.dravidianlangtech-1.25.
[29] A. Baruah, K. A. Das, F. A. Barbhuiya, K. Dey,
Iiitg-adbu@hasoc-dravidian-codemixifre2020: Ofensive content detection in code-mixed dravidian text, 2021. a r X i v : 2 1 0 7 . 1 4 3 3 6 .
[30] J. Pavlopoulos, L. Laugier, J. Sorensen, I. Androutsopoulos, Semeval-2021 task 5: Toxic
spans detection, in: Proceedings of the 15th International Workshop on Semantic
Evaluation, 2021.
[31] S. Ghosh, S. Kumar, Cisco at semeval-2021 task 5: What’s toxic?: Leveraging transformers
for multiple toxic span extraction from online comments, 2021. a r X i v : 2 1 0 5 . 1 3 9 5 9 .
[32] B. Mathew, P. Saha, S. M. Yimam, C. Biemann, P. Goyal, A. Mukherjee, Hatexplain: A
benchmark dataset for explainable hate speech detection, 2020. a r X i v : 2 0 1 2 . 1 0 2 8 9 .
[33] P. Mishra, M. D. Tredici, H. Yannakoudakis, E. Shutova, Abusive language detection with
graph convolutional networks, 2019. a r X i v : 1 9 0 4 . 0 4 0 7 3 .
[34] M. Das, P. Saha, R. Dutt, P. Goyal, A. Mukherjee, B. Mathew, You too brutus! trapping
hateful users in social media: Challenges, solutions insights, 2021. a r X i v : 2 1 0 8 . 0 0 5 2 4 .
[35] J. Qian, M. ElSherief, E. Belding, W. Y. Wang, Hierarchical cvae for fine-grained hate speech
classification, 2018. a r X i v : 1 8 0 9 . 0 0 0 8 8 .
[36] J. Pavlopoulos, J. Sorensen, L. Dixon, N. Thain, I. Androutsopoulos, Toxicity detection:</p>
      <p>Does context really matter?, 2020. a r X i v : 2 0 0 6 . 0 0 9 9 8 .
[37] M. Saveski, B. Roy, D. Roy, The structure of toxic conversations on twitter, 2021.</p>
      <p>a r X i v : 2 1 0 5 . 1 1 5 9 6 .
[38] J. Qian, A. Bethke, Y. Liu, E. Belding, W. Y. Wang, A benchmark dataset for learning to
intervene in online hate speech, arXiv preprint arXiv:1909.04251 (2019).
[39] H. Almerekhi, H. Kwak, J. Salminen, B. J. Jansen, Are these comments triggering?
predicting triggers of toxicity in online discussions, WWW ’20, Association for
Computing Machinery, New York, NY, USA, 2020. URL: https://doi.org/10.1145/3366423.3380074.
doi:1 0 . 1 1 4 5 / 3 3 6 6 4 2 3 . 3 3 8 0 0 7 4 .
[40] D. Kakwani, A. Kunchukuttan, S. Golla, G. N.C., A. Bhattacharyya, M. M. Khapra, P.
Kumar, IndicNLPSuite: Monolingual Corpora, Evaluation Benchmarks and Pre-trained
Multilingual Language Models for Indian Languages, in: Findings of EMNLP, 2020.
[41] J. Devlin, M.-W. Chang, K. Lee, K. Toutanova, Bert: Pre-training of deep bidirectional
transformers for language understanding, 2019. a r X i v : 1 8 1 0 . 0 4 8 0 5 .
[42] A. Conneau, K. Khandelwal, N. Goyal, V. Chaudhary, G. Wenzek, F. Guzmán, E. Grave,
M. Ott, L. Zettlemoyer, V. Stoyanov, Unsupervised cross-lingual representation learning at
scale, 2020. a r X i v : 1 9 1 1 . 0 2 1 1 6 .
[43] I. Loshchilov, F. Hutter, Decoupled weight decay regularization, 2019. a r X i v : 1 7 1 1 . 0 5 1 0 1 .
[44] T. Wolf, L. Debut, V. Sanh, J. Chaumond, C. Delangue, A. Moi, P. Cistac, T. Rault, R. Louf,
M. Funtowicz, J. Davison, S. Shleifer, P. von Platen, C. Ma, Y. Jernite, J. Plu, C. Xu, T. L. Scao,
S. Gugger, M. Drame, Q. Lhoest, A. M. Rush, Transformers: State-of-the-art natural
language processing, in: Proceedings of the 2020 Conference on Empirical Methods in Natural
Language Processing: System Demonstrations, Association for Computational Linguistics,
Online, 2020, pp. 38–45. URL: https://www.aclweb.org/anthology/2020.emnlp-demos.6.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M. L.</given-names>
            <surname>Williams</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Burnap</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Javed</surname>
          </string-name>
          , H. Liu,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ozalp</surname>
          </string-name>
          ,
          <article-title>Hate in the machine: anti-black and anti-muslim social media posts as predictors of ofline racially and religiously aggravated crime</article-title>
          ,
          <source>The British Journal of Criminology</source>
          <volume>60</volume>
          (
          <year>2020</year>
          )
          <fpage>93</fpage>
          -
          <lpage>117</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>L.</given-names>
            <surname>Dixon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Sorensen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Thain</surname>
          </string-name>
          , L. Vasserman,
          <article-title>Measuring and mitigating unintended bias in text classification</article-title>
          ,
          <source>in: Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society</source>
          ,
          <year>2018</year>
          , pp.
          <fpage>67</fpage>
          -
          <lpage>73</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>D.</given-names>
            <surname>Borkan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Dixon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Sorensen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Thain</surname>
          </string-name>
          , L. Vasserman,
          <article-title>Nuanced metrics for measuring unintended bias with real data for text classification</article-title>
          ,
          <source>in: Companion Proceedings of The 2019 World Wide Web Conference</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>491</fpage>
          -
          <lpage>500</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>S.</given-names>
            <surname>Modha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Mandl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. K.</given-names>
            <surname>Shahi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Madhu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Satapara</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Ranasinghe</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Zampieri, Overview of the HASOC Subtrack at FIRE 2021: Hate Speech and Ofensive Content Identification in English and Indo-Aryan Languages and Conversational Hate Speech</article-title>
          , in: FIRE 2021:
          <article-title>Forum for Information Retrieval Evaluation, Virtual Event</article-title>
          ,
          <fpage>13th</fpage>
          -17th
          <source>December</source>
          <year>2021</year>
          , ACM,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>S.</given-names>
            <surname>Satapara</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Modha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Mandl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Madhu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Majumder</surname>
          </string-name>
          ,
          <article-title>Overview of the HASOC Subtrack at FIRE 2021: Conversational Hate Speech Detection in Code-mixed language</article-title>
          , in: Working Notes of FIRE 2021 -
          <article-title>Forum for Information Retrieval Evaluation</article-title>
          ,
          <string-name>
            <surname>CEUR</surname>
          </string-name>
          ,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>R.</given-names>
            <surname>Guermazi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hammami</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. B.</given-names>
            <surname>Hamadou</surname>
          </string-name>
          ,
          <article-title>Using a semi-automatic keyword dictionary for improving violent web site filtering</article-title>
          ,
          <source>in: 2007 Third International IEEE Conference on Signal-Image Technologies and Internet-Based System</source>
          ,
          <year>2007</year>
          , pp.
          <fpage>337</fpage>
          -
          <lpage>344</lpage>
          .
          <source>doi:1 0 . 1 1 0 9 / S I T I S . 2 0</source>
          <volume>0 7 . 1 3</volume>
          <fpage>7</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>N.</given-names>
            <surname>Djuric</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Morris</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Grbovic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Radosavljevic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Bhamidipati</surname>
          </string-name>
          , Hate speech
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>