
Application of XLM-RoBERTa for Multi-Class Classification of Conversational Hate Speech

Tebo Leburu-Dingalo

Karabo Johannes Ntwaagae

Nkwebi Peace Motlogelwa

Edwin Thuma

Monkgogi Mudongo

Department of Computer Science, University of Botswana

In this paper, team University of Botswana Computer Science (UB-CS) investigates the use of XLM-RoBERTa, a multilingual model trained on 100 different languages, for transfer learning in the identification of conversational hate-speech in code-mixed languages. We also investigate whether enriching the tweets with textual sentiments from emojis can help improve the classification performance. Our proposed solution outperformed the other teams that participated in HASOC (2022) Task 2, with a macro F1 score of 0.4939. The results suggest that enriching the tweets with textual sentiments and using a pre-trained multilingual model for transfer learning can help in the identification of conversational hate-speech in code-mixed languages.

Keywords: Hate Speech, XLM-RoBERTa, Transfer Learning

1. Introduction

it is supporting an offensive preceding or parent message. Furthermore, messages are often expressed using a mix of languages, a property that needs to be factored into the development of hate and offensive content detection systems [2]. To address this challenge, the HASOC 2022 Task 2: Identification of Conversational Hate-Speech in Code-Mixed Languages (ICHCL) - Multiclass Classification encourages the development of systems capable of detecting offensive or hateful content in tweets by looking at the context of the parent content [3, 4]. In particular, systems should be able to identify posts that are hateful or offensive as well as those that support the dissemination of hateful and offensive content. In this paper we attempt to address the problem through the use of the transformer model XLM-RoBERTa [5], which has proved effective in multilingual text classification tasks. We fine-tune the model on the provided dataset. In an attempt to improve the model performance for the task, we focus on enhancing the tweets through data cleaning and text augmentation. To this end, we pre-process the tweets and convert emojis, which make up a sizeable part of the tweets, to text. Our approach is based on the intuition that emojis can express the actual emotion felt by the user when typing a post, regardless of the rhetoric expressed in the tweet. We therefore hypothesize that augmenting tweets with emoji descriptions will enhance model performance, as they give a better reflection of the sentiment and type of language used in the tweet.

2. Evolution of the HASOC Shared Task

The Hate Speech and Offensive Content Identification in Indo-European Languages (HASOC (2019))¹ shared task started in 2019 [6], inspired by two evaluation forums, OffensEval² [7] and GermEval³ [8]. In particular, the objective of the HASOC task was to develop data, hate speech detection technology and evaluation resources for several Indo-European languages. The HASOC (2019) shared task offered three tasks. The first task (Sub-task A), offered in three languages (English, German and Hindi), was a binary classification task in which participants were required to classify tweets into Hate and Offensive (HOF) and Non-Hate and Offensive (NOT) classes [6]. In Sub-task B, posts labelled HOF in Sub-task A were further classified into three classes, namely Hate speech (HATE), Offensive (OFFN) and Profane (PRFN). In Sub-task C, only posts labelled as HOF were included and participants were required to identify the type of offence [6]. The two types of offence were Targeted Insult (TIN) and Untargeted (UNT). The HASOC (2020) shared task did not differ much from that of the preceding year. In particular, Sub-tasks A and B were made multilingual by joining the English, German and Hindi datasets in order to promote research on multilingual techniques [9].

A new task was introduced in HASOC (2021) [2] and HASOC (2022)⁴, where participants were required to identify, from a conversational thread, whether a parent tweet, comment or reply was Standalone Hate (SHOF), Contextual Hate (CHOF) or Non-Hate (NONE). This was motivated by the fact that the majority of messages on social networking sites form part of a conversational thread. Such conversational threads can contain hate and offensive content which may not be visible from a single comment or reply but can be determined if the parent content is considered. The aim of the task is thus to detect posts that are hateful or offensive on their own, and those that support hate or offensive content of their parent posts. Hence the task defines three classes for the identification of hate and offensive language in posts as follows:

• (SHOF) Standalone Hate - This tweet, comment, or reply contains hate, offensive, or profane content in itself.
• (CHOF) Contextual Hate - The comment or reply supports the hate, offence and profanity expressed in its parent. This includes affirming the hate with positive sentiment and having apparent hate.
• (NONE) Non-Hate - This tweet, comment, or reply does not contain hate, offensive, or profane content in itself.

¹ https://hasocfire.github.io/hasoc/2019/call_for_participation.html
² https://competitions.codalab.org/competitions/20011
³ https://projects.fzai.h-da.de/iggsa/
⁴ https://hasocfire.github.io/hasoc/2022/index.html

3. Experimental Setup

3.1. Training and Validation Dataset

The dataset for Task 2: Identification of Conversational Hate-Speech in Code-Mixed Languages (ICHCL) comprises Twitter postings, comments and replies to each comment, based on controversial stories from different topics including the Temple-Mosque Controversy, the Taliban and the Covid Controversy. The tweets use a mix of English and Hindi, referred to as Hinglish. The statistics of the dataset are shown in Table 1. The data was randomly split into 80% training data and 20% validation data.
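The random 80/20 split described above can be illustrated with a minimal sketch. The function name `split_threads`, the fixed seed and the use of Python's standard `random` module are assumptions made for illustration; the paper does not state how the split was implemented.

```python
import random


def split_threads(threads, train_fraction=0.8, seed=42):
    """Randomly split a list of examples into training and validation parts.

    The seed is a hypothetical choice so that the split is reproducible.
    """
    shuffled = list(threads)               # copy so the caller's list is untouched
    random.Random(seed).shuffle(shuffled)  # seeded, in-place shuffle of the copy
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


if __name__ == "__main__":
    examples = [f"thread-{i}" for i in range(10)]  # placeholder data
    train, validation = split_threads(examples)
    print(len(train), len(validation))
```

Fixing the seed keeps the validation set identical across runs, so that different parameter settings are compared on the same held-out examples.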

3.2. Pre-Processing

The tweets were first concatenated to create conversational threads comprising parent tweets and comments as well as, where available, parent tweets, comments and replies. A manual exploration of the training data indicated that the tweets contained many special characters, URLs and emojis. We performed data cleaning by removing URLs, stopwords, extra spaces and newlines. However, we retained the emojis, which we expanded to text using the emoji library⁵ to augment the tweets.
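The cleaning and augmentation steps can be sketched as follows. This is an illustrative sketch only: the stopword list, the two-entry emoji table and all function names are hypothetical stand-ins, and the table replaces the emoji library so that the example runs without extra dependencies (in that library, the `emoji.demojize` function performs the emoji-to-text conversion).

```python
import re

# Hypothetical stand-ins for a real stopword list and for the emoji library.
STOPWORDS = {"the", "a", "an", "is", "to", "of", "and"}
EMOJI_TEXT = {"😂": "face with tears of joy", "🔥": "fire"}

URL_PATTERN = re.compile(r"https?://\S+|www\.\S+")


def clean_text(text, expand_emojis=True):
    """Remove URLs, stopwords, extra spaces and newlines; expand or drop emojis."""
    text = URL_PATTERN.sub(" ", text)
    # split() also discards newlines and repeated spaces
    tokens = [t for t in text.split() if t.lower() not in STOPWORDS]
    text = " ".join(tokens)
    # Emojis are handled after stopword removal so that words inside the
    # emoji descriptions (such as "of") are not filtered out.
    for symbol, description in EMOJI_TEXT.items():
        text = text.replace(symbol, f" {description} " if expand_emojis else " ")
    return " ".join(text.split())


def build_thread(parent, comment=None, reply=None, expand_emojis=True):
    """Concatenate a parent tweet with its comment and reply, then clean the result."""
    parts = [p for p in (parent, comment, reply) if p]
    return clean_text(" ".join(parts), expand_emojis=expand_emojis)


if __name__ == "__main__":
    print(build_thread("This is the worst 😂 https://t.co/abc", "the reply 🔥"))
```

Calling `build_thread` with `expand_emojis=False` gives the emoji-free variant of the same thread, which corresponds to the baseline setting in which emojis are omitted.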

3.3. Selection of Model Parameters

In our empirical investigation we use the SimpleTransformers⁶ library, which is built on the HuggingFace⁷ Transformers library and provides task-specific models. In particular, we use its classification model, ClassificationModel, which fine-tunes a pre-trained model for binary and multi-class classification. The model used is based on the HuggingFace implementation of XLM-RoBERTa, a transformer-based multilingual model pre-trained on CommonCrawl data covering 100 languages. XLM-RoBERTa is based on the BERT architecture and has a total of 12 layers for learning different semantic information, with a classification layer built on top. Since we consider the influence of emojis in our experiments, we first trained the model with emojis omitted from the tweets, using a learning rate of 1e-5 for 3 and 5 epochs respectively. We then experimented with the augmented tweets, again at a learning rate of 1e-5 for 3 and 5 epochs. All models used the AdamW optimizer. Based on the results in Table 2, we chose the parameters used in Run 4 (enhanced tweets) for our submission to Task 2: Identification of Conversational Hate-Speech in Code-Mixed Languages (ICHCL).
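The four settings described above could be expressed with SimpleTransformers roughly as in the sketch below. It is not the authors' script: the label ordering, the function names and the `overwrite_output_dir` option are assumptions. The training function expects pandas DataFrames with `text` and `labels` columns, as SimpleTransformers requires, and imports the library lazily so that the settings can be inspected without it installed.

```python
LABELS = ["NONE", "SHOF", "CHOF"]  # assumed ordering of the three classes


def experiment_grid():
    """Return the four settings: emojis omitted or expanded, 3 or 5 epochs."""
    grid = []
    for emojis_expanded in (False, True):
        for epochs in (3, 5):
            grid.append({
                "emojis_expanded": emojis_expanded,
                "args": {
                    "learning_rate": 1e-5,
                    "num_train_epochs": epochs,
                    "optimizer": "AdamW",
                    "overwrite_output_dir": True,  # assumption, for repeated runs
                },
            })
    return grid


def fine_tune(train_df, eval_df, args, use_cuda=False):
    """Fine-tune xlm-roberta-base on DataFrames with 'text' and 'labels' columns."""
    # Lazy import: requires the simpletransformers package and a model download.
    from simpletransformers.classification import ClassificationModel

    model = ClassificationModel(
        "xlmroberta",
        "xlm-roberta-base",
        num_labels=len(LABELS),
        args=args,
        use_cuda=use_cuda,
    )
    model.train_model(train_df)
    result, model_outputs, wrong_predictions = model.eval_model(eval_df)
    return model, result


if __name__ == "__main__":
    for setting in experiment_grid():
        print(setting)
```

Each entry's `args` dictionary can be passed to `fine_tune` together with the matching (emoji-free or emoji-expanded) version of the training and validation data.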

⁶ https://github.com/ThilinaRajapakse/simpletransformers
⁷ https://huggingface.co/xlm-roberta-base

4. Results and Analysis

Table 3 shows the leaderboard of the HASOC (2022) Task 2: Identification of Conversational Hate-Speech in Code-Mixed Languages (ICHCL). Our team, UB-CS, denoted by †, outperformed the other teams. The results suggest that using a multilingual model trained on several languages can improve the identification of conversational hate speech in code-mixed languages (Hinglish, i.e. Hindi-English). In addition, the results suggest that performance can be further improved by enriching the tweets with textual sentiments generated from emojis.
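Our score is reported as a macro F1, which is the unweighted mean of the per-class F1 scores, so the three classes count equally regardless of their frequency. The sketch below shows one way to compute it in plain Python; the function name and the toy labels are illustrative and do not come from the task data or the official evaluation script.

```python
def macro_f1(y_true, y_pred, labels=("NONE", "SHOF", "CHOF")):
    """Unweighted mean of per-class F1 scores over the given labels."""
    scores = []
    for label in labels:
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == label and p == label)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t != label and p == label)
        fn = sum(1 for t, p in zip(y_true, y_pred) if t == label and p != label)
        denominator = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined here as 0 when the class never occurs
        scores.append(2 * tp / denominator if denominator else 0.0)
    return sum(scores) / len(scores)


if __name__ == "__main__":
    gold = ["NONE", "NONE", "SHOF", "CHOF"]       # toy example
    predicted = ["NONE", "SHOF", "SHOF", "CHOF"]  # toy example
    print(round(macro_f1(gold, predicted), 4))
```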

5. Discussion and Conclusion

The results of our investigation suggest that enriching the tweets with textual sentiments and using a pre-trained multilingual model for transfer learning can help in the identification of conversational hate-speech in code-mixed languages. A natural progression of this work is to analyse whether state-of-the-art performance can be attained by using an ensemble of several pre-trained multilingual models for transfer learning.

[1] C. O'Regan, Hate Speech Online: an (Intractable) Contemporary Challenge?, Current Legal Problems 71 (2018) 403–429. URL: https://doi.org/10.1093/clp/cuy012.

[2] S. Modha, T. Mandl, G. K. Shahi, H. Madhu, S. Satapara, T. Ranasinghe, M. Zampieri, Overview of the hasoc subtrack at fire 2021: Hate speech and offensive content identification in english and indo-aryan languages and conversational hate speech, in: Forum for Information Retrieval Evaluation, FIRE 2021, Association for Computing Machinery, New York, NY, USA, 2021, p. 1–3. URL: https://doi.org/10.1145/3503162.3503176.

[3] S. Satapara, P. Majumder, T. Mandl, S. Modha, H. Madhu, T. Ranasinghe, M. Zampieri, K. North, D. Premasiri, Overview of the hasoc subtrack at fire 2022: Hate speech and offensive content identification in english and indo-aryan languages, in: FIRE 2022: Forum for Information Retrieval Evaluation, Virtual Event, 9th-13th December 2022, ACM, 2022.

[4] S. Modha, T. Mandl, P. Majumder, S. Satapara, T. Patel, H. Madhu, Overview of the hasoc subtrack at fire 2022: Identification of conversational hate-speech in hindi-english code-mixed and german language, in: Working Notes of FIRE 2022 - Forum for Information Retrieval Evaluation, CEUR, 2022.

[5] A. Conneau, K. Khandelwal, N. Goyal, V. Chaudhary, G. Wenzek, F. Guzmán, E. Grave, M. Ott, L. Zettlemoyer, V. Stoyanov, Unsupervised cross-lingual representation learning at scale, in: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Online, 2020, pp. 8440–8451. URL: https://aclanthology.org/2020.acl-main.747. doi:10.18653/v1/2020.acl-main.747.

[6] T. Mandl, S. Modha, P. Majumder, D. Patel, M. Dave, C. Mandlia, A. Patel, Overview of the hasoc track at fire 2019: Hate speech and offensive content identification in indo-european languages, in: Proceedings of the 11th Forum for Information Retrieval Evaluation, FIRE ’19, Association for Computing Machinery, New York, NY, USA, 2019, p. 14–17. URL: https://doi.org/10.1145/3368567.3368584.

[7] M. Zampieri, S. Malmasi, P. Nakov, S. Rosenthal, N. Farra, R. Kumar, SemEval-2019 task 6: Identifying and categorizing offensive language in social media (OffensEval), in: Proceedings of the 13th International Workshop on Semantic Evaluation, Association for Computational Linguistics, Minneapolis, Minnesota, USA, 2019, pp. 75–86. URL: https://aclanthology.org/S19-2010. doi:10.18653/v1/S19-2010.

[8] M. Wiegand, M. Siegel, Overview of the germeval 2018 shared task on the identification of offensive language, 2018.

[9] T. Mandl, S. Modha, A. Kumar M, B. R. Chakravarthi, Overview of the hasoc track at fire 2020: Hate speech and offensive language identification in tamil, malayalam, hindi, english and german, in: Forum for Information Retrieval Evaluation, FIRE 2020, Association for Computing Machinery, New York, NY, USA, 2020, p. 29–32. URL: https://doi.org/10.1145/3441501.3441517. doi:10.1145/3441501.3441517.