<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>IRIT-PREVISION AT HASOC 2020: Fine-tuning BERT for Hate Speech and Offensive Content Identification</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Josiane Mothe</string-name>
          <email>Josiane.Mothe@irit.fr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Pratik Parikh</string-name>
          <email>parikh.er@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Faneva Ramiandrisoa</string-name>
          <email>faneva.ramiandrisoa@irit.fr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>IRIT, Université de Toulouse</institution>
          ,
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This paper describes the participation of the IRIT-PREVISION team in the HASOC (Hate Speech and Offensive Content Identification in Indo-European Languages) 2020 shared task. Our approach is based on fine-tuning a pre-trained transformer-based language model, BERT (Bidirectional Encoder Representations from Transformers) [1]. We participated in the English sub-task A and obtained a macro average F1 of 0.497 (self-computed).</p>
      </abstract>
      <kwd-group>
        <kwd>Information system</kwd>
        <kwd>Hate Speech Detection</kwd>
        <kwd>Social Media</kwd>
        <kwd>BERT</kwd>
        <kwd>Deep Learning</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>The rest of this paper is organized as follows: Section 2 presents related work in the area
of hate speech detection; Section 3 describes the HASOC 2020 shared task as well as the data
sets provided; Section 4 describes the methodology we proposed to answer the HASOC 2020
challenge as well as the preprocessing we developed; Section 5 presents the results we obtained;
finally, Section 6 concludes this paper and presents some future work.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        Online detection of abusive language, hate speech, aggression, offensive content, etc. is an important topic. Indeed, the number of users who are subject to or have witnessed hate speech or offensive language online is increasing [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and can concern citizens, public figures, or organizations. Detecting this type of content on social media platforms such as Facebook and Twitter is an important challenge that authorities care about, as illustrated for example by recent French government decisions1 to force social media to act.
      </p>
      <p>
        In the past few years, a lot of research has been conducted to detect hate speech [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], offensive
language [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], and aggression [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Supervised learning approaches are predominantly used to solve this problem, ranging from deep learning-based methods [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] such as convolutional neural networks (CNNs) to traditional machine learning methods [10] such as support vector machines (SVMs).
      </p>
      <p>
        During the first edition of HASOC (2019) [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], deep learning-based methods were widely used and achieved the top performances. However, some of the non-deep-learning systems obtained performances quite close to the top ones; the team from IIIT-Hyderabad even obtained the best result on one Hindi sub-task with traditional machine learning [10].
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Task Description and Data</title>
      <p>HASOC is a shared task whose goal is to detect hateful content in textual posts published on social media, namely Twitter. The shared task is a multilingual track covering English, German, and Hindi, and consists of two main sub-tasks:
1. Sub-task A (Identifying Hate, Offensive and Profane content): it focuses on hate speech and offensive language identification, offered for English, German, and Hindi. This sub-task is a coarse-grained binary classification in which the objective is to classify tweets into two classes: HOF (Hate and Offensive) and NOT (Non-Hate and Offensive).
2. Sub-task B (Discrimination between Hate, Profane and Offensive content): it is a fine-grained classification for English, German, and Hindi. Here, the posts labeled as HOF in sub-task A are further classified into three categories: HATE (Hate speech), OFFN (Offensive) and PRFN (Profane).</p>
      <p>
        No context or meta-data like the users’ network are provided [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
      <p>1https://www.theguardian.com/world/2019/jul/09/france-online-hate-speech-law-social-media, accessed on October 15, 2020.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Methodology</title>
      <p>Before building our model, we pre-processed the data set: we converted all texts to lowercase, replaced all URLs with the token "http", and substituted emoticons with their text equivalents using the online emoji project on GitHub2. Finally, we removed all non-UTF-8 words.</p>
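As an illustration, these pre-processing steps can be sketched as follows (a minimal sketch: EMOJI_MAP is a tiny hypothetical stand-in for the full alias table of the emoji project, and the final filter is only an approximation of the non-UTF-8 cleaning step):

```python
import re

# Hypothetical two-entry stand-in; the emoji project provides the full alias table.
EMOJI_MAP = {"🙂": ":slightly_smiling_face:", "😡": ":angry_face:"}

def preprocess(text):
    """Lowercase, replace URLs with 'http', spell out emojis, drop unencodable words."""
    text = text.lower()
    # Replace every URL with the placeholder token "http".
    text = re.sub(r"https?://\S+", "http", text)
    # Substitute emojis with their textual aliases.
    for symbol, alias in EMOJI_MAP.items():
        text = text.replace(symbol, alias)
    # Keep only words that encode cleanly as UTF-8 (e.g. drops stray surrogates);
    # this is an approximation of the paper's "non UTF-8 words" cleaning step.
    words = []
    for word in text.split():
        try:
            word.encode("utf-8")
            words.append(word)
        except UnicodeEncodeError:
            pass
    return " ".join(words)
```

For example, `preprocess("Check THIS https://example.com/x 🙂")` yields `"check this http :slightly_smiling_face:"`.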
      <p>
        During the HASOC 2020 shared task, we submitted only one model. Our model is based on BERT (Bidirectional Encoder Representations from Transformers) [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. More precisely, we fine-tuned a pre-trained BERT model, BERT-Large, which contains 24 layers of size 1024, 16 self-attention heads, and 340M parameters. Fine-tuning a pre-trained BERT model is far less expensive than training a BERT model from scratch, and it works well on small data sets such as the HASOC data set. The pre-trained model we used was trained at Google on a corpus composed of English Wikipedia (2,500M words) and BooksCorpus (800M words) [11]. This pre-trained model is publicly available on GitHub3.
      </p>
      <p>During the fine-tuning, we used a batch size of 16 and the Adam optimizer with a learning rate of 5e-5 for 10 epochs. Each sequence was truncated to the maximum allowed sequence length of 40. We used the pytorch-pretrained-bert4 library for the implementation. Training was carried out on an Nvidia GeForce GTX 1080 Ti GPU and took about 14 minutes in total.</p>
      <p>2https://github.com/carpedm20/emoji, accessed on February 4, 2020.
3https://github.com/google-research/bert, accessed on February 4, 2020.</p>
      <p>4https://github.com/shehzaadzd/pytorch-pretrained-BERT, accessed on February 4, 2020.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Results</title>
      <p>This section reports the result our team obtained on the English sub-task A of HASOC 2020. Table 2 reports our result as well as the three best teams’ results. We obtained a macro average F1 of 0.4969, which places our team in twenty-first place (out of thirty-five participants). The difference between our result and the best one is 0.0183. In general, the differences between participants’ results are very small: the difference between the best team and the thirty-third team is 0.05815. Only two of the thirty-five teams obtained much lower results. The data set used for evaluation was not shared by the organizers.</p>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusion and Future Work</title>
      <p>In this paper, we presented our participation in the second edition of the HASOC shared task, in English, for sub-task A: identifying hate, offensive, and profane content. We used a model that relies on BERT to tackle the problem. Our model achieved a macro average F1 of 0.4969, which ranked our team in twenty-first place out of thirty-five participants. However, the difference with the best team is very small (0.0183).</p>
      <p>For the short term, we plan to apply our model to the other HASOC 2020 sub-tasks and to the other two languages. For the longer term, we want to integrate or combine a keyphrase representation in our model [12]. We already created some models based on keyphrase lexicons but did not submit them due to lack of time. We would also like to analyse the cross-domain transfer of some models that we developed to detect weak signals in social media [13].</p>
      <p>Ethical issues. Working on online data raises ethical issues which are beyond the scope of this paper. Training was done on a publicly available data set, while testing was run by the organizers themselves.</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments</title>
      <p>This work is partially supported by the PREVISION project, which has received funding from the European Union’s Horizon 2020 research and innovation programme under GA No 833115 (https://cordis.europa.eu/project/id/833115). The paper reflects the authors’ view and the Commission is not responsible for any use that may be made of the information it contains.</p>
      <p>5https://competitions.codalab.org/competitions/26027#learn_the_details-results, accessed on October 14, 2020.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>J.</given-names>
            <surname>Devlin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Toutanova</surname>
          </string-name>
          ,
          <article-title>BERT: pre-training of deep bidirectional transformers for language understanding, in: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019, Minneapolis</article-title>
          , MN, USA, June 2-7,
          <year>2019</year>
          , Volume
          <volume>1</volume>
          (Long and Short Papers),
          <year>2019</year>
          , pp.
          <fpage>4171</fpage>
          -
          <lpage>4186</lpage>
          . URL: https://doi.org/10.18653/v1/n19-1423. doi:10.18653/v1/n19-1423.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Lu</surname>
          </string-name>
          , J. Nie, RALIGRAPH at HASOC 2019:
          <article-title>VGCN-BERT: augmenting BERT with graph embedding for offensive language detection</article-title>
          , in: Working Notes of FIRE 2019 -
          <article-title>Forum for Information Retrieval Evaluation, Kolkata</article-title>
          , India,
          <source>December 12-15</source>
          ,
          <year>2019</year>
          ,
          <year>2019</year>
          , pp.
          <fpage>221</fpage>
          -
          <lpage>228</lpage>
          . URL: http://ceur-ws.org/Vol-2517/T3-6.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>S. T.</given-names>
            <surname>Roberts</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Tetreault</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Prabhakaran</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Waseem</surname>
          </string-name>
          (Eds.),
          <source>Proceedings of the Third Workshop on Abusive Language Online</source>
          , Association for Computational Linguistics, Florence, Italy,
          <year>2019</year>
          . URL: https://www.aclweb.org/anthology/W19-3500.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>R.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. K.</given-names>
            <surname>Ojha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Malmasi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Zampieri</surname>
          </string-name>
          ,
          <article-title>Evaluating aggression identification in social media</article-title>
          ,
          <source>in: Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying</source>
          ,
          <source>TRAC@LREC</source>
          <year>2020</year>
          , Marseille, France, May
          <year>2020</year>
          ,
          <year>2020</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>5</lpage>
          . URL: https://www.aclweb.org/anthology/2020.trac-1.1/.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>M.</given-names>
            <surname>Zampieri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Malmasi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Nakov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Rosenthal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Farra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <article-title>Semeval2019 task 6: Identifying and categorizing offensive language in social media (offenseval)</article-title>
          ,
          <source>in: Proceedings of the 13th International Workshop on Semantic Evaluation, SemEval@NAACL-HLT</source>
          <year>2019</year>
          ,
          <article-title>Minneapolis</article-title>
          , MN, USA, June 6-7,
          <year>2019</year>
          ,
          <year>2019</year>
          , pp.
          <fpage>75</fpage>
          -
          <lpage>86</lpage>
          . URL: https://doi.org/10.18653/v1/s19-2010. doi:10.18653/v1/s19-2010.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>T.</given-names>
            <surname>Mandl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Modha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. K.</given-names>
            <surname>Shahi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. K.</given-names>
            <surname>Jaiswal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Nandini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Patel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Majumder</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Schäfer</surname>
          </string-name>
          ,
          <article-title>Overview of the HASOC track at FIRE 2020: Hate Speech and Offensive Content Identification in Indo-European Languages</article-title>
          , in: Working Notes of FIRE 2020 -
          <article-title>Forum for Information Retrieval Evaluation</article-title>
          , CEUR,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>S.</given-names>
            <surname>Modha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Mandl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Majumder</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Patel</surname>
          </string-name>
          ,
          <source>Overview of the HASOC track at FIRE</source>
          <year>2019</year>
          :
          <article-title>Hate speech and offensive content identification in indo-european languages</article-title>
          ,
          <source>in: Working Notes of FIRE 2019 - Forum for Information Retrieval Evaluation</source>
          , Kolkata, India,
          <source>December 12-15</source>
          ,
          <year>2019</year>
          ,
          <year>2019</year>
          , pp.
          <fpage>167</fpage>
          -
          <lpage>190</lpage>
          . URL: http://ceur-ws.org/Vol-2517/T3-1.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>F.</given-names>
            <surname>Ramiandrisoa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Mothe</surname>
          </string-name>
          ,
          <article-title>Aggression identification in social media: a transfer learning based approach</article-title>
          , in: Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying,
          <source>TRAC@LREC</source>
          <year>2020</year>
          , Marseille, France, May
          <year>2020</year>
          ,
          <year>2020</year>
          , pp.
          <fpage>26</fpage>
          -
          <lpage>31</lpage>
          . URL: https://www.aclweb.org/anthology/2020.trac-1.5/.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>S.</given-names>
            <surname>Agrawal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Awekar</surname>
          </string-name>
          ,
          <article-title>Deep learning for detecting cyberbullying across multiple social media platforms</article-title>
          ,
          <source>in: Advances in Information Retrieval - 40th European Conference on IR Research</source>
          , ECIR
          <year>2018</year>
          , Grenoble, France, March 26-29,
          <year>2018</year>
          , Proceedings,
          <year>2018</year>
          , pp.
          <fpage>141</fpage>
          -
          <lpage>153</lpage>
          . URL: https://doi.org/10.1007/978-3-319-76941-7_11. doi:10.1007/978-3-319-76941-7_11.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>[10] V. Mujadia, P. Mishra, D. M. Sharma, IIIT-Hyderabad at HASOC 2019: Hate speech detection, in: Working Notes of FIRE 2019 - Forum for Information Retrieval Evaluation, Kolkata, India, December 12-15, 2019, 2019, pp. 271-278. URL: http://ceur-ws.org/Vol-2517/T3-12.pdf.</mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>[11] Y. Zhu, R. Kiros, R. S. Zemel, R. Salakhutdinov, R. Urtasun, A. Torralba, S. Fidler, Aligning books and movies: Towards story-like visual explanations by watching movies and reading books, in: 2015 IEEE International Conference on Computer Vision, ICCV 2015, Santiago, Chile, December 7-13, 2015, 2015, pp. 19-27. URL: https://doi.org/10.1109/ICCV.2015.11. doi:10.1109/ICCV.2015.11.</mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>[12] J. Mothe, F. Ramiandrisoa, M. Rasolomanana, Automatic keyphrase extraction using graph-based methods, in: Proceedings of the 33rd Annual ACM Symposium on Applied Computing, SAC 2018, Pau, France, April 09-13, 2018, 2018, pp. 728-730. URL: https://doi.org/10.1145/3167132.3167392. doi:10.1145/3167132.3167392.</mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>[13] F. Ramiandrisoa, J. Mothe, F. Benamara, V. Moriceau, IRIT at e-risk 2018, in: Working Notes of CLEF 2018 - Conference and Labs of the Evaluation Forum, Avignon, France, September 10-14, 2018, 2018. URL: http://ceur-ws.org/Vol-2125/paper_102.pdf.</mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>