
Leveraging Text Generated from Emojis for Hate Speech and Offensive Content Identification

Nkwebi Peace Motlogelwa

Edwin Thuma

Monkgigi Mudongo

Tebo Leburu-Dingalo

Gontlafetse Mosweunyane

Department of Computer Science, University of Botswana

In this paper, team University of Botswana Computer Science (UBCS) investigates whether enriching social media data with text generated from emojis can help in the identification of Hate Speech and Offensive Content. In particular, we build three different binary text classifiers that label data sampled from Twitter as Hate and Offensive content (HOF) or Not Hate-Offensive content (NOT). The first classifier uses only pre-processed text from Twitter, without emojis. The second classifier enriches the pre-processed text with text generated from the emojis within the Tweets. The third classifier applies the same enrichment and additionally fine-tunes the parameters of the classifier. Our results suggest that enriching Tweets with text generated from the emojis they contain improves the classification accuracy of our hate and offensive content classifier.

Keywords: Hate Speech, Binary Classification, fastText, Emojis

1. Introduction

pre-trained on relevant social media corpus. Their experimental results suggest that transfer learning of word embeddings can significantly improve the classification accuracy of hate speech and offensive content. Mishra et al. [7] also fine-tuned pre-trained, transformer-based BERT neural network models. In their work, they used the BERT implementation in the pytorch-transformers library. Their proposed solution outperformed other participants in the HASOC 2019 shared task [1]. In this paper, we present our proposed solution to the HASOC 2021 shared task English Sub-task A, which is a binary classification task [8, 9]. In this task, participating systems are required to classify Tweets into two classes, namely Hate and Offensive (HOF) and Non-Hate and Offensive (NOT). In our participation, we investigate whether enriching social media text with text generated from emojis can improve the classification accuracy of our binary classifier. Our proposed solution is motivated by the fact that people usually include emojis alongside text in order to fill in emotional cues that are missing from the typed message. For example, one may send an angry face emoji on its own to show that they are disgusted and outraged, or use the emoji to accompany the typed conversation.

2. Methodology

In this section, we present our binary text classification approaches for classifying tweets into two classes, namely Hate and Offensive (HOF) and Non-Hate and Offensive (NOT). The HOF class signifies that the tweet contains hate, offensive or profane content. The NOT class signifies that the tweet does not contain any hate speech, profane or offensive content. Our proposed binary text classifier uses fastText [10]. fastText¹, contributed by Facebook AI Research (FAIR), is an open-source library for efficient text classification and word representation.

2.1. Training Dataset

The training dataset was pre-processed to make it compatible with fastText by moving the labels (HOF or NOT) to the beginning of each sentence and adding __label__ as a prefix to each label. Additional pre-processing was then performed on the dataset. In particular, we used the Natural Language Toolkit (NLTK)², a suite of libraries and programs for symbolic and statistical natural language processing, to stem the text and to remove stop words. The Porter stemming algorithm was used for stemming [11]. In addition, the following pre-processing steps were applied to the dataset, mainly to clean the text:

• Removing HTML tags
• Removing URLs
• Converting all text to lower case
• Keeping hashtags, mentions and punctuation (these were not removed)
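The cleaning and labelling steps above can be sketched as follows. This is a minimal illustration under stated assumptions, not our actual pre-processing script: the function names clean_text and to_fasttext_line and the regular expressions are hypothetical, and the NLTK stemming and stop word removal are left out so that the sketch needs only the Python standard library.

```python
import re

# Assumed patterns; the exact rules used for the submitted runs may differ.
TAG_RE = re.compile(r"<[^>]+>")                 # HTML tags such as <p> or <br/>
URL_RE = re.compile(r"(?:https?://|www\.)\S+")  # http(s) and www-style URLs


def clean_text(text: str) -> str:
    """Remove HTML tags and URLs, lower-case the text, collapse whitespace.

    Hashtags, mentions and punctuation are deliberately left untouched.
    """
    text = TAG_RE.sub(" ", text)
    text = URL_RE.sub(" ", text)
    text = text.lower()
    return " ".join(text.split())


def to_fasttext_line(label: str, text: str) -> str:
    """Place the label first, prefixed with __label__, followed by the cleaned text."""
    return f"__label__{label} {clean_text(text)}"


print(to_fasttext_line("HOF", "<p>Check THIS out https://t.co/abc @user #Tag!</p>"))
# prints: __label__HOF check this out @user #tag!
```

Each output line starts with the prefixed label, which is the layout fastText's supervised mode reads.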

¹ https://fasttext.cc/
² http://www.nltk.org/

The training dataset contains 3843 tweets. Of these, 1342 are labelled as not hate speech and 2501 as hate speech. During training, the dataset was subdivided so that 3043 tweets train our classification model and 800 tweets are used for validation. The first 3043 tweets are used for training the model and the last 800 tweets for validation. The split was made with the standard Linux head and tail commands:

• head -n 3043 en_hasoc_clean.csv > en_hasoc_clean.train
• tail -n 800 en_hasoc_clean.csv > en_hasoc_clean.valid
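The same positional split can be sketched in Python. This is an illustrative equivalent of the two commands, not what was run; head_tail_split is a hypothetical helper and the lines are stand-in data. Since 3043 + 800 = 3843, the two parts cover the dataset without overlapping. The split is positional, so no shuffling takes place.

```python
def head_tail_split(lines, n_train, n_valid):
    """Return the first n_train lines and the last n_valid lines (like head and tail)."""
    train = lines[:n_train]
    valid = lines[-n_valid:] if n_valid > 0 else []
    return train, valid


# Stand-in for the 3843 pre-processed, labelled tweets.
lines = [f"__label__NOT tweet {i}" for i in range(3843)]
train, valid = head_tail_split(lines, 3043, 800)

print(len(train), len(valid))  # prints: 3043 800
```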

2.2. Testing Dataset

The same pre-processing applied to the training dataset was performed on the test dataset, except for the step that labels the tweets as hate speech or non-hate speech.

3. Description of Runs

We submitted 3 runs for Subtask 1A: identifying hate, offensive and profane content from the post. Below is a brief description of each run.

3.1. Run 1 - UBCS

This is our baseline run. We used fastText to build a binary classifier for the identification of Hate and Offensive (HOF) and Non-Hate and Offensive (NOT) content. When building our binary classifier, fastText automatically generated a Tweet vector for each tweet in the pre-processed training set by averaging its word embeddings, and used these vectors as features. To train and test our classification model, fastText used multinomial logistic regression [12], which is a linear learner. Before predicting the labels for the test dataset with the trained model, fastText also generates feature vectors for the Tweets in the test set using the same technique used for the training set. Both the training and test datasets underwent the pre-processing steps described in Section 2.1.

3.2. Run 2 - UBCS

In this run, our aim is to improve the classification accuracy of the binary classifier from our baseline run (Run 1 - UBCS) by replacing emojis with text. For the emoji replacement, we used emoji 1.6.1, a Python package for converting emojis to words and vice versa. In particular, we used the demojize() function to convert the emojis to text. The pre-processing and emoji replacement were applied to both the training dataset and the test dataset, with pre-processing as described in Section 2.1. Figure 1 shows emoji replacement.

3.3. Run 3 - UBCS

In this run, our aim was to improve the classification accuracy of our binary classifier over Run 2 - UBCS, where both the training data and test data were pre-processed and emojis were replaced with the corresponding text. In particular, we fine-tuned the parameters of our classifier in order to improve the performance of our model. Specifically, we explored the following: learning rate (-lr), number of epochs (-epoch), and maximum length of word n-grams (-wordNgrams). The model that improved on performance was then used to predict the labels of the pre-processed test data. This was achieved using this command:

• ./fasttext supervised -input en_hasoc_clean.train -output model_hasoc_clean_epoch -lr 0.5 -epoch 50 -wordNgrams 2
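The idea behind the emoji replacement in Run 2 can be illustrated with a rough stand-in that needs only the Python standard library. This sketch does not use the package from our runs: it takes names from unicodedata instead of calling demojize(), so its output format differs (demojize() by default wraps names in colons). The helper emoji_to_text is hypothetical, and the "So" category test is an assumption that also matches non-emoji symbols such as the copyright sign.

```python
import unicodedata

# Invisible characters that often accompany emojis
# (variation selector-16 and the zero-width joiner); they are dropped.
_INVISIBLE = {"\ufe0f", "\u200d"}


def emoji_to_text(text: str) -> str:
    """Replace characters of Unicode category 'So' with their lower-cased names."""
    pieces = []
    for ch in text:
        if ch in _INVISIBLE:
            continue
        if unicodedata.category(ch) == "So":
            name = unicodedata.name(ch, "")
            if name:  # symbols without a name are dropped
                pieces.append(" " + name.lower().replace(" ", "_") + " ")
        else:
            pieces.append(ch)
    # Collapse the padding spaces introduced around replaced symbols.
    return " ".join("".join(pieces).split())


print(emoji_to_text("so done with you \U0001F620"))
# prints: so done with you angry_face
```

The replaced tweet then carries the emotional cue as an ordinary token, which the classifier can embed like any other word.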

4. Results and Analysis

In this paper, we investigate whether enriching social media tweets with text generated from the emojis that accompany them can improve the classification accuracy of our classifier. Table 1 presents the results of our investigation. Run 1 - UBCS is our baseline run, which does not include text generated from emojis. This baseline run performed poorly compared to the other runs in terms of Macro F1, which was the official evaluation measure for the HASOC 2021 binary classification task. Run 2 - UBCS is our best run, with a Micro F1 score of 0.7070. For this run, we kept all the parameters used in our baseline run (Run 1 - UBCS) fixed and enriched the tweets in the training and test sets with text generated from emojis. The results suggest that incorporating emotions as text from emojis can improve the classification accuracy of hate speech or offensive content on social media. In our third run, we attempted to improve the classification accuracy of our second run (Run 2 - UBCS) using the parameters that gave the best classification accuracy on our training set. In particular, we varied the number of epochs and the learning rate. However, this degraded the classification accuracy on the test set.
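For reference, Macro F1 is the unweighted mean of the per-class F1 scores, so the HOF and NOT classes count equally regardless of their sizes. The sketch below shows the computation on a made-up toy example; the labels are illustrative only and are not taken from our runs or from Table 1, and the function names are hypothetical.

```python
def f1_for_class(gold, pred, label):
    """F1 score of one class, treating that class as the positive one."""
    tp = sum(g == label and p == label for g, p in zip(gold, pred))
    fp = sum(g != label and p == label for g, p in zip(gold, pred))
    fn = sum(g == label and p != label for g, p in zip(gold, pred))
    if tp == 0:
        return 0.0  # no true positives: precision or recall is zero or undefined
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def macro_f1(gold, pred, labels=("HOF", "NOT")):
    """Unweighted mean of the per-class F1 scores."""
    return sum(f1_for_class(gold, pred, label) for label in labels) / len(labels)


# Toy labels, invented for illustration.
gold = ["HOF", "HOF", "HOF", "NOT"]
pred = ["HOF", "HOF", "NOT", "NOT"]
print(round(macro_f1(gold, pred), 4))  # prints: 0.7333
```

In the toy example the HOF class scores 0.8 and the NOT class about 0.667, giving a Macro F1 of about 0.733.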

5. Discussion and Conclusion

The most obvious finding to emerge from this study is that we can improve the classification accuracy of our binary classifier for identifying hate speech or offensive content in social media tweets by enriching the tweets with text generated from emojis. Evidence from previous studies suggests that BERT-based models produce better performance. This is also evidenced by the overall performance of teams that participated in this year's task. Further studies need to be carried out to validate whether emojis can significantly improve the classification accuracy of a BERT-based binary classifier built to identify hate speech or offensive content in social media tweets.

[1] Modha, Mandl, Majumder, Patel, Overview of the HASOC track at FIRE 2019: Hate speech and offensive content identification in Indo-European languages, in: P. Mehta, Rosso, Majumder, M. Mitra (Eds.), Working Notes of FIRE 2019 - Forum for Information Retrieval Evaluation, Kolkata, India, December 12-15, 2019, volume 2517 of CEUR Workshop Proceedings, CEUR-WS.org, 2019, pp. 167-190. URL: http://ceur-ws.org/Vol-2517/T3-1.pdf.

[2] J. M. Struß, Siegel, Ruppenhofer, Wiegand, M. Klenner, Overview of GermEval task 2, 2019 shared task on the identification of offensive language, in: Proceedings of the 15th Conference on Natural Language Processing (KONVENS 2019), German Society for Computational Linguistics & Language Technology, Erlangen, Germany, 2019, pp. 354-365.

[3] Zampieri, Malmasi, Nakov, Rosenthal, Farra, R. Kumar, SemEval-2019 task 6: Identifying and categorizing offensive language in social media (OffensEval), in: Proceedings of the 13th International Workshop on Semantic Evaluation, Association for Computational Linguistics, Minneapolis, Minnesota, USA, 2019, pp. 75-86. URL: https://aclanthology.org/S19-2010. doi:10.18653/v1/S19-2010.

[4] Devlin, M.-W. Chang, Lee, Toutanova, BERT: Pre-training of deep bidirectional transformers for language understanding, in: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Association for Computational Linguistics, Minneapolis, Minnesota, 2019, pp. 4171-4186. URL: https://aclanthology.org/N19-1423. doi:10.18653/v1/N19-1423.

[5] Paraschiv, D.-C. Cercel, UPB at GermEval-2019 task 2: BERT-based offensive language classification of German tweets, in: Proceedings of the 15th Conference on Natural Language Processing (KONVENS 2019), German Society for Computational Linguistics & Language Technology, Erlangen, Germany, 2019, pp. 398-404.

[6] M. A. Bashar, Nayak, QutNocturnal@HASOC'19: CNN for hate speech and offensive content identification in Hindi language, in: P. Mehta, Rosso, Majumder, M. Mitra (Eds.), Working Notes of FIRE 2019 - Forum for Information Retrieval Evaluation, Kolkata, India, December 12-15, 2019, volume 2517 of CEUR Workshop Proceedings, CEUR-WS.org, 2019, pp. 237-245. URL: http://ceur-ws.org/Vol-2517/T3-8.pdf.

[7] Mishra, Mishra, 3Idiots at HASOC 2019: Fine-tuning transformer neural networks for hate speech identification in Indo-European languages, in: P. Mehta, Rosso, Majumder, M. Mitra (Eds.), Working Notes of FIRE 2019 - Forum for Information Retrieval Evaluation, Kolkata, India, December 12-15, 2019, volume 2517 of CEUR Workshop Proceedings, CEUR-WS.org, 2019, pp. 208-213. URL: http://ceur-ws.org/Vol-2517/T3-4.pdf.

[8] Modha, Mandl, G. K. Shahi, Madhu, Satapara, Ranasinghe, M. Zampieri, Overview of the HASOC Subtrack at FIRE 2021: Hate Speech and Offensive Content Identification in English and Indo-Aryan Languages and Conversational Hate Speech, in: FIRE 2021: Forum for Information Retrieval Evaluation, Virtual Event, 13th-17th December 2021, ACM, 2021.

[9] Mandl, Modha, G. K. Shahi, Madhu, Satapara, Majumder, Schäfer, Ranasinghe, Zampieri, Nandini, A. K. Jaiswal, Overview of the HASOC subtrack at FIRE 2021: Hate Speech and Offensive Content Identification in English and Indo-Aryan Languages, in: Working Notes of FIRE 2021 - Forum for Information Retrieval Evaluation, CEUR, 2021. URL: http://ceur-ws.org/.

[10] Joulin, E. Grave, Bojanowski, T. Mikolov, Bag of tricks for efficient text classification, arXiv preprint arXiv:1607.01759 (2016).

[11] M. F. Porter, An algorithm for suffix stripping, Program 14 (1980) 130-137.

[12] Böhning, Multinomial logistic regression algorithm, Annals of the Institute of Statistical Mathematics 44 (1992) 197-200. URL: https://ideas.repec.org/a/spr/aistmt/v44y1992i1p197-200.html. doi:10.1007/BF00048682.