<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Hate Speech and Ofensive Content Identification in English Language Based on BERT Model</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jing Li</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Xutao Yang</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>School of information Science and Engineering, Yunnan University</institution>
          ,
          <addr-line>Kunming, 650500, Yunnan</addr-line>
          ,
          <country country="CN">P.R. China</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>With the development of the Internet, people are becoming increasingly interconnected through the network. More and more people interact through social media such as Weibo, YouTube, Facebook, etc. While this kind of interaction makes people's connection closer, it also brings problems. Some individuals post hate and ofensive language online to attack others, which not only damages the good online atmosphere but also harms the physical and mental health of the victims. Therefore, it is crucial to prohibit the occurrence of hate and ofensive language on these social media platforms. Based on the BERT model, this paper explores three ways to detect hate speech and ofensive content in English at FIRE 2024 task-1. The BERT and Convolutional Neural Network (CNN) method achieves macro F1 scores of 0.80049 on the HASOC 2024 English test set. The BERT and Recurrent Neural Network (RNN) method achieves macro F1 scores of 0.79075. For the improved Recurrent Neural network, it achieves macro F1 scores of 0.78065 on the same test set. Among the three methods, the BERT and Convolutional Neural Network combination model achieves the highest score, performs the best, and ranks 2nd in Task-1.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;hate speech</kwd>
        <kwd>ofensive content</kwd>
        <kwd>English</kwd>
        <kwd>BERT</kwd>
        <kwd>HASOC 2024</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        With the development of the Internet era, more and more people are using social media platforms.
Through this medium, people worldwide can chat, make friends, share experiences, express opinions,
and even brainstorm. These are the positive aspects that social media platforms bring to us. However,
social media platforms also have negative impacts. They often contain hate and ofensive content, such
as pejorative[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], cyber-bullying [
        <xref ref-type="bibr" rid="ref2 ref3">2, 3</xref>
        ], online extremism [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], and racism [
        <xref ref-type="bibr" rid="ref5 ref6">5, 6</xref>
        ]. Such content may cause
anxiety in those who are targeted, leading to psychological issues and subjecting others to violence[
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
It has been recognized that online hate speech is a social issue that harms our society[
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. It is essential
to detect such hate and ofensive content early and prevent its spread.
      </p>
      <p>
        Since 2019, HASOC has been dedicated to sharing tasks for the detection of hate speech and ofensive
content, as well as researching languages, including English and some low-resource languages such as
Marathi. In 2024, HASOC shared this task: Hate Speech and Ofensive Content Identification in English
and Bangla[
        <xref ref-type="bibr" rid="ref10 ref9">9, 10</xref>
        ]. The task has two sub-tasks, and we participated in Task 1.
      </p>
      <p>The main task of Task 1 is to classify the given English Twitter content into two classes:
• Hate and Ofensive (HOF): post contains hate, ofensive, and profane content.</p>
      <p>• None Hate-Ofensive (NOT): post contains no Hate speech, profane, ofensive content.</p>
      <p>For this task, we use the BERT model to encode English sentences. Then we use three types of
networks for further classification: Convolutional Neural Network (CNN), Recurrent Neural Network
(RNN), and an improved version of Recurrent Neural Network. Ultimately, among these three methods,
the combination of the BERT model with Convolutional Neural Network performs the best, achieving a
macro F1 score of 0.80049, while RNN and an improved version of RNN have macro F1 scores of 0.79075
and 0.78065 respectively.</p>
      <p>Section 2 introduces the methods used in recent years to detect hate and ofensive content in English.
Section 3 describes the source of the dataset used in this task, the division of the dataset, and the test
set. In Section 4, we carefully describe the dataset’s preprocessing, the model’s workflow, and some
parameter settings during the model training process. Section 5 presents the macro F1 scores achieved
by the three methods on the HASOC 2024 test set. In the final section, we summarize this task’s work,
point out our experiment’s shortcomings, and look forward to the future.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>The detection of hate and ofensive content has been researched for a long time. HASOC has been
dedicated to detecting hate and ofensive content from 2019 to 2024, involving languages including
English and some low-resource languages such as Marathi. An increasing number of papers are getting
engaged in detecting hate and ofensive content. With their eforts, the classification performance of
models detecting hate and ofensive content has been continuously improving.</p>
      <p>
        Typically, the detection of hate and ofensive content is approached as a supervised classification
task[
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], such as a binary classification task, to determine whether a sentence belongs to hate and
ofensive content. In the early days, some researchers used lexicon-based feature methods to detect
hate and ofensive content[
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. However, their proposed models could not distinguish between hate
content and ofensive content. At present, the machine learning methods are widely used. Traditional
machine learning methods often require manual feature engineering to extract text features, while deep
learning-based methods mainly use neural networks. Risch and Krestel[
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] used the traditional machine
learning method, logistic regression classifier, to detect ofensive content. Waseem[
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] used SVM
and logistic regression classifier to detect racist or sexist content. In 2018, Pitsilis et al.[ 15] used deep
learning models, such as Recurrent Neural Networks, to detect ofensive content in English. Recurrent
Neural Networks consider the inputs from previous time steps when processing the current input,
which helps them understand contextual information. So, it achieves good results in the field of natural
language. As a variant of recurrent neural networks, long short-term memory networks have also
achieved good results in detecting hate and ofensive content[ 16]. Since 2020, language models based
on transformers, such as the BERT, m-BERT, and XLM-RoBERTa models[17], have become increasingly
popular in classification tasks. In the HASOC 2020 task, the YNU_OXZ team[ 18] proposed a model
based on XLM-RoBERTa and LSTMs for hate speech detection in English. In the HASOC 2021, the
team using a Graph Convolutional Network approach achieved the best results in detecting hate and
ofensive content in English[19].
      </p>
      <p>Due to the excellent sentence representation capability of transformer-based models and the ability
of Convolutional Neural Networks to capture local features, it may be able to detect insulting words
within sentences. So, we use the combination of the BERT model and CNN to detect hate and ofensive
content. At the same time, because a RNN can detect hate and ofensive content efectively, so we also
use the BERT model to encode English sentences and feed the last_hidden_state of the model into the
RNN for classification. In this way, we compare the results of these three methods to obtain a more
efective approach.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Datasets</title>
      <p>This task focuses on the binary classification of hate and ofensive speech in English. The organizers
didn’t ofer the training data for the datasets but provided the test data collected from Twitter. Therefore,
we downloaded the English datasets from HASOC 2019[20] and HASOC 2020[21], which have 5852 and
3792 tweets, respectively. Then, we divided the validation set and training set in a ratio of 2:8. So, we
have 7715 training data in total, 1929 validation data, and 885 test data. The data are all stored in CSV
ifle format. The training set includes sentence id, text content, and labels (HOF or NOT), while the test
set is without labels. Table 1 shows examples of hate and ofensive content in the training set. In Figure
1, we have presented the data distribution in the training and validation set for binary classification.
#DhoniKeepsTheGlove | WATCH: Sports Minister Kiren Rijiju
hasoc_en_1 itseslluseBs CstCatIetmoetanktebuacpktihneg mMaSttDerhownitihovICerC’BaanldidkaeaenpBgaodvgeer’n, ment
in the know as nation’s pride is involved https://t.co/zuo5335Rjr.</p>
      <p>@politico No. We should remember very clearly that #Individual1
hasoc_en_2 just admitted to treason . #TrumpIsATraitor#McCainsAHero
#JohnMcCainDay"</p>
    </sec>
    <sec id="sec-4">
      <title>4. Experiment</title>
      <p>In our experiment, we first preprocess the datasets to remove special characters from English sentences.
Then, we use three methods based on the BERT model for classification. We use the BERT model to
represent the vector representation of English sentences and pass the vector representation of the
sentences through three types of networks for classification: Convolutional Neural Network, Recurrent
Neural Network, and an improved version of Recurrent Neural Network. We search for the most
efective model among them. Below, we describe our experiment in detail.</p>
      <sec id="sec-4-1">
        <title>4.1. Preprocessing</title>
        <p>After reviewing the datasets, we find many special characters in English sentences, such as emoji
emoticons, URLs, mentions (@someone), and numbers. These special characters can afect the encoding
efectiveness of the model, so we take some measures to handle these special characters. Our steps are
as follows:
• Replace all numbers with the word "number".
• Replace emojis with corresponding text descriptions because some emotion may be included in
emojis.</p>
        <p>• Replace all website links with "URL".
• Remove all the tagged user name.
• Remove all the hashtags.</p>
        <p>• Remove all punctuation marks.</p>
        <p>We have also processed the labels. We replace HOF with 1 and NOT with 0 in the labels, which makes
it more convenient for training classification tasks.</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Modeling</title>
        <p>As we all know, the BERT model is based on a bidirectional transformer structure, which can capture the
contextual information on both sides of English words and understand complex semantic relationships
efectively. Therefore, we use the BERT model for the vector representation of English sentences, and
then send the vector representation to three types of networks for further classification. Figure 2 is the
overall structure of the model using our methods.</p>
        <p>The first method involves feeding the sentence embedding represented by the BERT model into a
Convolutional Neural Network. First, we expand the dimension of the sentence vectors obtained from
BERT, and then input them into a two-dimensional convolutional neural network. The vector is passed
sequentially through convolutional layers, pooling layers, and finally fully connected feed-forward
network layer. In this method, we use convolutional kernels of diferent sizes to increase the diversity
of feature extraction. The kernel sizes we use are 2×768, 3×768, 4×768.</p>
        <p>The second method involves feeding the sentence embedding represented by the BERT model into a
Recurrent Neural Network. We implement the RNN using Long Short-Term Memory (LSTM) networks.
We take the hidden state of the last time step of the LSTM layer and pass it into a fully connected
feed-forward network layer.</p>
        <p>The third method is based on the second method. We modify the second method slightly. We perform
a max-pooling operation on the output of the LSTM layer, which can extract the maximum values along
the last dimension, and then the result is passed into a fully connected feed-forward network layer.</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.3. Training</title>
        <p>During the model training process, we use the training set to train the model and the validation set
to find the optimal model parameters, which are then applied to the test set for prediction. Since this
task is a binary classification task, we use the cross-entropy function to calculate the loss and macro F1
score as the evaluation metric to assess the model’s classification performance. Additionally, since the
amount of training data is insuficient, we employ early stopping and learning rate scheduling strategies
to prevent the model from overfitting. We use the training sets of HASOC 2019 and HASOC 2020 as
the dataset and train on three methods. The hyperparameter values and number of epochs remain the
same across all three methods in our experiment. Ultimately, we find that the combination of the BERT
model and CNN method performs best on the test set of HASOC 2024. This method uses AdamW as
the optimizer with a learning rate of 1e-5 and a batch size of 32.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Results</title>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusion</title>
      <p>The competition for Task 1 is evaluated on macro-f1 metrics. There are a total of 8 teams participating
in Task 1. And our team ranks 2nd in Task-1. The final results are shown in Table 2.</p>
      <p>Table 2 shows that the combination of the BERT model and CNN performs slightly better in the
binary classification of hate and ofensive content in English, which gets the macro F1 score of 0.80049.
This paper describes our methods for detecting hate and ofensive content in English. We conduct
experiments using three BERT-based methods, in which the BERT and CNN combination model performs
best. It may be because CNN captures local features efectively, which helps the model capture the
specific patterns of hate or ofensive language in sentences. So, the method based on BERT and CNN
performs a little better than the method based on BERT and RNN. In the future, we plan to research the
detection of hate and ofensive content in low-resource languages. At the same time, due to the limited
data resources available for low-resource languages, we may explore research in few-shot learning. We
hope that in the future, we can have a clean and righteous online space.</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments</title>
      <p>Thanks to the organizers of the competition. And this work is supported by Scientific Research and
Innovation Project of Postgraduates Students in the Academic Degree of YunNan University
(KC242410495).</p>
    </sec>
    <sec id="sec-8">
      <title>Declaration on Generative AI</title>
      <p>We have not employed any Generative AI tools.
A. Oh, O. Tsur, S. Volkova (Eds.), Proceedings of the First Workshop on NLP and Computational
Social Science, NLP+CSS@EMNLP 2016, Austin, TX, USA, November 5, 2016, Association for
Computational Linguistics, 2016, pp. 138–142. doi:10.18653/V1/W16-5618.
[15] G. K. Pitsilis, H. Ramampiaro, H. Langseth, Detecting ofensive language in tweets using deep
learning, CoRR abs/1801.04433 (2018). URL: http://arxiv.org/abs/1801.04433. arXiv:1801.04433.
[16] G. L. D. la Peña Sarracén, R. G. Pons, C. E. Muñiz-Cuza, P. Rosso, Hate speech detection using
attention-based LSTM, in: T. Caselli, N. Novielli, V. Patti, P. Rosso (Eds.), Proceedings of the Sixth
Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop
(EVALITA 2018) co-located with the Fifth Italian Conference on Computational Linguistics (CLiC-it
2018), Turin, Italy, December 12-13, 2018, volume 2263 of CEUR Workshop Proceedings,
CEURWS.org, 2018. URL: https://ceur-ws.org/Vol-2263/paper040.pdf.
[17] J. Devlin, M. Chang, K. Lee, K. Toutanova, BERT: pre-training of deep bidirectional transformers
for language understanding, in: J. Burstein, C. Doran, T. Solorio (Eds.), Proceedings of the 2019
Conference of the North American Chapter of the Association for Computational Linguistics:
Human Language Technologies, NAACL-HLT 2019, Minneapolis, MN, USA, June 2-7, 2019, Volume
1 (Long and Short Papers), Association for Computational Linguistics, 2019, pp. 4171–4186. doi:10.
18653/V1/N19-1423.
[18] X. Ou, H. Li, Ynu_oxz at hasoc 2020: Multilingual hate speech and ofensive content identification
based on xlm-roberta, in: P. Mehta, T. Mandl, P. Majumder, M. Mitra (Eds.), Working Notes of FIRE
2020 - Forum for Information Retrieval Evaluation, volume 2826 of CEUR Workshop Proceedings,
CEUR-WS.org, Hyderabad, India, 2020, pp. 121–127. URL: http://ceur-ws.org/Vol-2826/T2-3.pdf,
december 16–20, 2020.
[19] T. Mandl, S. Modha, G. K. Shahi, et al., Overview of the hasoc subtrack at fire 2021: Hatespeech and
ofensive content identification in english and indo-aryan languages, in: Working Notes of FIRE
2021 - Forum for Information Retrieval Evaluation, volume 3159 of CEUR Workshop Proceedings,
CEUR-WS.org, Gandhinagar, India, 2021, pp. 1–19. URL: https://ceur-ws.org/Vol-3159/T1-1.pdf,
december 13–17, 2021.
[20] T. Mandl, S. Modha, P. Majumder, D. Patel, M. Dave, C. Mandalia, A. Patel, Overview of the
hasoc track at fire 2019: Hate speech and ofensive content identification in indo-european
languages, in: FIRE ’19: Forum for Information Retrieval Evaluation, Association for Computing
Machinery, Kolkata, India, 2019, pp. 14–17. URL: https://doi.org/10.1145/3368567.3368584. doi:10.
1145/3368567.3368584, december 2019.
[21] T. Mandl, S. Modha, G. K. Shahi, A. K. Jaiswal, D. Nandini, D. Patel, P. Majumder, J. Schäfer,
Overview of the HASOC track at FIRE 2020: Hate speech and ofensive content identification
in indo-european languages, CoRR abs/2108.05927 (2021). URL: https://arxiv.org/abs/2108.05927.
arXiv:2108.05927.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>L. P.</given-names>
            <surname>Dinu</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Iordache</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. S.</given-names>
            <surname>Uban</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Zampieri</surname>
          </string-name>
          ,
          <article-title>A computational exploration of pejorative language in social media</article-title>
          , in: M.
          <string-name>
            <surname>Moens</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Specia</surname>
          </string-name>
          , S. W. Yih (Eds.),
          <source>Findings of the Association for Computational Linguistics: EMNLP</source>
          <year>2021</year>
          , Virtual Event / Punta Cana, Dominican Republic,
          <fpage>16</fpage>
          -
          <lpage>20</lpage>
          November,
          <year>2021</year>
          , Association for Computational Linguistics,
          <year>2021</year>
          , pp.
          <fpage>3493</fpage>
          -
          <lpage>3498</lpage>
          . doi:
          <volume>10</volume>
          .18653/V1/
          <year>2021</year>
          .FINDINGS-EMNLP.
          <year>296</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>H.</given-names>
            <surname>Rosa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. S.</given-names>
            <surname>Pereira</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Ribeiro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. C.</given-names>
            <surname>Ferreira</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Carvalho</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Oliveira</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Coheur</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Paulino</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. M. V.</given-names>
            <surname>Simão</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Trancoso</surname>
          </string-name>
          ,
          <article-title>Automatic cyberbullying detection: A systematic review</article-title>
          ,
          <source>Computers in Human Behavior</source>
          <volume>93</volume>
          (
          <year>2019</year>
          )
          <fpage>333</fpage>
          -
          <lpage>345</lpage>
          . doi:
          <volume>10</volume>
          .1016/J.CHB.
          <year>2018</year>
          .
          <volume>12</volume>
          .021.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>J.</given-names>
            <surname>Shetty</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. N.</given-names>
            <surname>Chaithali</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. M.</given-names>
            <surname>Shetty</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Varsha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Puthran</surname>
          </string-name>
          ,
          <article-title>Cyber-bullying detection: A comparative analysis of twitter data</article-title>
          , in: N. N. Chiplunkar, T. Fukao (Eds.),
          <source>Advances in Artificial Intelligence and Data Engineering</source>
          , Springer Singapore, Singapore,
          <year>2021</year>
          , pp.
          <fpage>841</fpage>
          -
          <lpage>855</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>S.</given-names>
            <surname>Aldera</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. Z.</given-names>
            <surname>Emam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Al-Qurishi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>AlRubaian</surname>
          </string-name>
          , A. Alothaim,
          <article-title>Online extremism detection in textual content: A systematic literature review</article-title>
          ,
          <source>IEEE Access 9</source>
          (
          <year>2021</year>
          )
          <fpage>42384</fpage>
          -
          <lpage>42396</lpage>
          . doi:
          <volume>10</volume>
          . 1109/ACCESS.
          <year>2021</year>
          .
          <volume>3064178</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>I.</given-names>
            <surname>Kwok</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <article-title>Locate the hate: Detecting tweets against blacks</article-title>
          , in: M. desJardins,
          <string-name>
            <given-names>M. L.</given-names>
            <surname>Littman</surname>
          </string-name>
          (Eds.),
          <source>Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, July 14-18</source>
          ,
          <year>2013</year>
          , Bellevue, Washington, USA, AAAI Press,
          <year>2013</year>
          , pp.
          <fpage>1621</fpage>
          -
          <lpage>1622</lpage>
          . doi:
          <volume>10</volume>
          .1609/AAAI.V27I1. 8539.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>S.</given-names>
            <surname>Tulkens</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Hilte</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Lodewyckx</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Verhoeven</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Daelemans</surname>
          </string-name>
          ,
          <article-title>A dictionary-based approach to racism detection in dutch social media</article-title>
          ,
          <source>CoRR abs/1608</source>
          .08738 (
          <year>2016</year>
          ). URL: http://arxiv.org/abs/ 1608.08738. arXiv:
          <volume>1608</volume>
          .
          <fpage>08738</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>R.</given-names>
            <surname>Bannink</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Broeren</surname>
          </string-name>
          ,
          <string-name>
            <surname>P. M. van de Looij-Jansen</surname>
          </string-name>
          , F. G. de Waart, H. Raat,
          <article-title>Cyber and traditional bullying victimization as a risk factor for mental health problems and suicidal ideation in adolescents</article-title>
          ,
          <source>PLoS ONE 9</source>
          (
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M. L.</given-names>
            <surname>Williams</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Burnap</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Javed</surname>
          </string-name>
          , H. Liu,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ozalp</surname>
          </string-name>
          ,
          <article-title>Hate in the machine: Anti-black and antimuslim social media posts as predictors of ofline racially and religiously aggravated crime</article-title>
          ,
          <source>The British Journal of Criminology</source>
          <volume>60</volume>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>N.</given-names>
            <surname>Raihan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Ghosh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Modha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Satapara</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Gaur</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Dave</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Zampieri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Jaki</surname>
          </string-name>
          , T. Mandl,
          <article-title>Overview of the hasoc track at fire 2024: Hate-speech identification in english and bengali, in: Forum for Information Retrieval Evaluation (FIRE</article-title>
          <year>2024</year>
          )
          <article-title>Working Notes</article-title>
          , CEUR-WS.org, Gandhinagar, India,
          <year>2024</year>
          . December 9-13, Gandhinagar, India.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>K.</given-names>
            <surname>Ghosh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Raihan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Modha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Satapara</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Gaur</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Dave</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Zampieri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Jaki</surname>
          </string-name>
          , T. Mandl,
          <article-title>Overview of the hasoc track at fire 2024: Hate-speech identification in english and bengali, in: Proceedings of the 16th Annual Meeting of the Forum for Information Retrieval Evaluation (FIRE '24), Association for Computing Machinery</article-title>
          , New York, NY, USA,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>A.</given-names>
            <surname>Schmidt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Wiegand</surname>
          </string-name>
          ,
          <article-title>A survey on hate speech detection using natural language processing</article-title>
          , in: L.
          <string-name>
            <surname>Ku</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          Li (Eds.),
          <source>Proceedings of the Fifth International Workshop on Natural Language Processing for Social Media</source>
          ,
          <source>SocialNLP@EACL</source>
          <year>2017</year>
          , Valencia, Spain, April 3,
          <year>2017</year>
          , Association for Computational Linguistics,
          <year>2017</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>10</lpage>
          . doi:
          <volume>10</volume>
          .18653/V1/W17-1101.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Zhu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <article-title>Detecting ofensive language in social media to protect adolescent online safety</article-title>
          , in: 2012 International Conference on Privacy, Security, Risk and Trust,
          <string-name>
            <surname>PASSAT</surname>
          </string-name>
          <year>2012</year>
          ,
          <article-title>and</article-title>
          2012 International Confernece on Social Computing,
          <source>SocialCom</source>
          <year>2012</year>
          , Amsterdam, Netherlands, September 3-
          <issue>5</issue>
          ,
          <year>2012</year>
          , IEEE Computer Society,
          <year>2012</year>
          , pp.
          <fpage>71</fpage>
          -
          <lpage>80</lpage>
          . doi:
          <volume>10</volume>
          .1109/SOCIALCOM-PASSAT.
          <year>2012</year>
          .
          <volume>55</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>C.</given-names>
            <surname>Nobata</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. R.</given-names>
            <surname>Tetreault</surname>
          </string-name>
          , A. Thomas,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Mehdad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <article-title>Abusive language detection in online user content</article-title>
          , in: J.
          <string-name>
            <surname>Bourdeau</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Hendler</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Nkambou</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          <string-name>
            <surname>Horrocks</surname>
            ,
            <given-names>B. Y.</given-names>
          </string-name>
          <string-name>
            <surname>Zhao</surname>
          </string-name>
          (Eds.),
          <source>Proceedings of the 25th International Conference on World Wide Web, WWW</source>
          <year>2016</year>
          , Montreal, Canada,
          <source>April 11 - 15</source>
          ,
          <year>2016</year>
          , ACM,
          <year>2016</year>
          , pp.
          <fpage>145</fpage>
          -
          <lpage>153</lpage>
          . doi:
          <volume>10</volume>
          .1145/2872427.2883062.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Waseem</surname>
          </string-name>
          ,
          <article-title>Are you a racist or am I seeing things? annotator influence on hate speech detection on twitter</article-title>
          , in: D.
          <string-name>
            <surname>Bamman</surname>
            ,
            <given-names>A. S.</given-names>
          </string-name>
          <string-name>
            <surname>Dogruöz</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Eisenstein</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Hovy</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Jurgens</surname>
            ,
            <given-names>B. O</given-names>
          </string-name>
          <string-name>
            <surname>'Connor</surname>
          </string-name>
          ,
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>