1. Introduction

Ofensive Content Identification in English Tweets

Ritesh Kumar

ritesh.cse@nitjsr.ac.in 1

Vishesh Gupta

me.guptavishesh@gmail.com 0

Rajendra Pamula

rajendrapamula@gmail.com 0 0 Department of Computer Science and Engineering, Indian Institute of Technology (Indian School of Mines) , Dhanbad 1 Department of Computer Science and Engineering, National Institute of Technology Jamshedpur , India

Hate speech is a prevalent practice that society has to struggle with everyday. The freedom of speech and ease of anonymity granted by social media has also resulted in incitement to hatred. This presents the need for automatic detection of hate speeches or tweets on social media. In this paper, we have presented the machine learning models that can detect hate Speech and ofensive content. Specifically, we described the model submitted for the shared task on hate Speech and ofensive content identification in English Tweets at HASOC 2021 and our team name is Vishesh Gupta.The problem concentrates on hate speech detection in English language. The challenge is divided into two tasks of diferent granularity: (1) coarse-grained binary classification in which participating system are required to classify tweets into two class, namely: Hate and Ofensive (HOF) and Non-Hate and ofensive (NOT). (2) to predict one of the three types of hate speeches present. Overall, our performance is good but it needs some improvement, our scores are encouraging enough to work for better results in future.

1. Introduction

Nowadays, social media has become a significant part of our lives and just like everything it has its pros and cons. Various benefits of social media come with several challenges including hate speech, ofensive and profane content getting published targeting an individual, a group or a society. Hate speech and other ofensive content in online socialization have seriously afected daily life of people. Social media companies such as, YouTube, Facebook, and Twitter have their own approaches to eliminate the hate speech content or anything which negatively afects the society. However, detecting such objectionable content at the earliest to curb the menace of spreading such news online is still a major challenge faced by social media companies and researchers. It is very essential to detect such behaviour. The amount of data generated on social media sites can be estimated from the fact that, every second,on average, around 6,000 tweets are generated. Content moderation of such a huge data is dificult to achieve exclusively through man power. Social networking sites are struggling with content moderation. Artficial Intelligence and diferent Machine Learning techniques can be exploited for hate speech Detection.

In this paper, we have explored various Machine Learning (ML) algorithms for hate speech and ofensive content identification in English language in a shared task called HASOC 2021 [ 1 ] of Forum for Information Retrieval Evaluation (FIRE) 2021 and our team name is Vishesh Gupta. As per requirement of HASOC 2021 Subtask-1 [ 2 ], we have submitted five runs for Subtask-1A and four runs for Subtask-1B. We have extracted dferent lexical and non lexical features from the text for the classification. Our best run in subtask-1A has achieved Macro-F1 score of 0.7680. For subtask-1B, our best run was with an F1-score of 0.5871.

2. Related Work

Several works have been proposed to detect hate speech and ofensive content across social platforms. Hajim et. al. [3] proposed a approach to collect hateful and ofensive expressions and perform Hate Speech Detection. Muhammad Okky Ibrohim et. al. [4] proposed a Multi-label Hate Speech and Abusive Language Detection in Twitter. M. Ali Fauzi [5] used Ensemble Method for Indonesian Twitter Hate Speech Detection.Anusha M D et. al. [6] proposed an Ensemble Model for Hate Speech and Ofensive Content Identification in Indo-European Languages in HASOC 2020 in which they combine CountVectorizer and TF-IDF transformer with additional text-based features to build an ensemble of Gradient Boosting, Random Forest and XGBoost classifiers, with soft voting.

Tharindu Ranasinghe et. al. [7] submitted thier model in HASOC-2019 in which they evaluated six diferent neural network architectures for the classification tasks: pooled Gated Recurrent Unit (GRU), Long Short-Term Memory (LSTM) and GRU with Attention , 2D Convolution with Pooling, GRU with Capsule and LSTM with Capsule and Attention.

Urmi Saha . [8] submitted her model in HASOC 2019 in which she used a list of hate words for feature engineering to build ML approaches for English and their approach on the test set provided by HASOC 2019 achieved accuracies of 0.68, 0.65 and 0.66 for English language subtasks 1, 2 and 3 respectively.

Sarthak Gupte et. al. [9] participated in HASOC-2020 task 1 i.e. Ofensive comment identification in Code-mixed Malayalam Youtube comments in which they used cross-lingual contextual word embeddings and transfer learning to make predictions to Malayalam data. P.Karthikeyan et. al. [10] proposed a research paper on Hate Speech Detection with Hateful and Ofensive Expressions on Twitter using various Machine Learning Techniques where they show various concepts of sentiment analysis.

3. Task and Dataset Description

In this section, we have described the hate speech and ofensive content identification shared task and the dataset provided to the participants.

HASOC 2021 shared task is divided into two subtasks(Subtask-1 and Subtask-2). Subtask-1 is further divided into two subtasks (Subtask-1A and Subtask-1B).

SubTask-1A basically aims to classify tweets into two class, namely: Hate and Ofensive (HOF) and Non-Hate and ofensive (NOT). (shown in Table 1): 1. (NOT) Non-Hate-Ofensive - This post does not contain any Hate speech, profane, ofensive content.

2. (HOF) Hate and Ofensive - This post contains Hate, ofensive, and profane content.

In Subtask-1B, hate-speech and ofensive posts from the Subtask-1A are further classified into four categories. (shown in Table 2): 1. (HATE) Hate speech :- Posts under this class contain Hate speech content. 2. (OFFN) Ofensive :- Posts under this class contain ofensive content. 3. (PRFN) Profane :- These posts contain profane words. 4. (NONE) Not Hate-ofensive:- This post does not contain any Hate speech, profane, ofensive content. We have used the dataset available at HASOC 2021. The dataset consists of 3,843 tweets for training and 1,281 tweets for testing with balanced distribution of each classes. Data was interpreted at two diferent levels of granularity. First, each text was labelled as ‘HOF’ or ‘NOT’. Secondly, levels are further divided as ‘NONE’, ’HATE’, ’OFFN’ and ’PRFN’.

4. System Description 4.1. Text Preprocessing

We have removed all the links, punctuations, numbers and stop words. We have used lemmatization for grouping together the diferent forms of a word into a single word. NLTK wordnet [ 11] is used for lemmatization. Both Subkask-1A and Subtask-1B uses same preprocessing.

4.2. Feature Extraction

TfidfVectorizer [ 12] is used for converting the text into numerical features.Pipeline1 is used for doing TfidfVectorizer and classification in pipelined manner.Tokenizer by keras library is used for LSTM. For Logistic regression , Random Forest and XGBoost, we have used TfidfVectorizer from scikit-learn library. Glove [13] is used to create word embeddings and GRU model is used with this glove for classification.

4.3. Machine Learning Models

For Subtask-1A, we have submitted five runs based on five diferent algorithms, namely- Logistic Regression [14], LSTM [15], ktrain [16], XGBoost [17] and Glove+GRU [18]. We have used the scikit-learn library for logistic regression based models and Keras for LSTM. GloVe is an unsupervised learning algorithm for getting vector representations for words. Training is performed on aggregated global word-word co-occurrence statistics from a corpus. We have used pre-trained word vectors of twitter data for training in Glove+GRU model. We scored maximum Macro F1 score of 0.7680 using GRU for subtask-1A. We have used the following values of the parameter : 1. For TfidfVectorizer, we have used mindf=20, maxfeatures=2000 and maxdf=0.6. 2. In XGBoost, we have used learning rate=0.1, max depth=7 and n estimators=150. 3. For LSTM and GRU, we have used batch size of 256 and 10 epochs for training data .

For Subtask-1B, we have submitted four runs based on four diferent algorithms, namely Logistic regression,Random Forest, Ktrain and XGBoost.We have treated subtask 1B as a multiclassification problem with 4 categories.The parameter values were the same as mentioned above. And Random Forest was implemented using n estimators=1000. We have scored maximum Macro F1 score of 0.5871 using XGBoost for subtask-1B.

5. Results and Discussion

The results of Subtask-1A are represented in terms of Macro-F1,Macro Precision, Macro Recall and Accuracy (shown in Table 3), and the results of Subtask-1B are also in terms of MacroF1,Macro Precision, Macro Recall and Accuracy (shown in Table 4). The best score as Macro-F1, we get from Subtask-1A is 0.7680. For Subtask-1B we get best score as Macro-F1 is 0.5871. Table 3 and 4 shows the score of our submissions based on HASOC oficial ranking. Our best system was ranked 27 in Subtask-1A and 22 for Subtask-1B.

For Subtask-1A, the Glove + GRU system have performed better than all other models.For Subtask-1B, the XGBoost system have performed better than all other models.The accuracy and Macro-F1 score obtained in subtask-1B was lower than that of subtask-1A due to more number of categories of classifiction in subtask-1B. The same situation could be found in results of all the teams. This can be due to the fact that classification of the hate speeches text into ifner granularity is a much more dificult task than detecting instances of hate speech. Also, as 1https://chrisfotache.medium.com/text-classification-in-python-pipelines-nlp-nltk-tf-idf-xgboost-and-more-b83451a327e0 subtask-1B was multi-class classification problem , it’s macro F1 score was lower as compared to the binary classification problem in subtask-1A.

6. Conclusion and Future Work

We have completed the task using various classification algorithms and evaluated the performance of diferent classification algorithms for Hate Speech and Ofensive Content in English Tweets this year's shared task.Our overall rank is 27 for subtask-1A and 22 for subtask-1B which were average as compared to other submissions obtained in the HASOC 2021 shared task. We look forward to experimenting with diferent advance algorithm or neural network models. Also, fine tuning the parameters of the algorithm can help in improvement of the overall performance. And the results of more than one classification algorithm can be combined to generate an overall better score. We shall be exploring these tasks in the coming days. 2021: Hate Speech and Ofensive Content Identification in English and Indo-Aryan Languages, in: Working Notes of FIRE 2021 - Forum for Information Retrieval Evaluation, CEUR, 2021. URL: http://ceur-ws.org/. [3] H. Watanabe, M. Bouazizi, T. Ohtsuki, Hate speech on twitter: A pragmatic approach to collect hateful and ofensive expressions and perform hate speech detection, IEEE Access 6 (2018) 13825–13835. doi:10.1109/ACCESS.2018.2806394. [4] M. O. Ibrohim, I. Budi, Multi-label hate speech and abusive language detection in indonesian twitter (2019) 46–57. [5] M. A. Fauzi, A. Yuniarti, Ensemble method for indonesian twitter hate speech detection,

Indonesian Journal of Electrical Engineering and Computer Science 11 (2018) 294–299. [6] M. Anusha, H. Shashirekha, An ensemble model for hate speech and ofensive content identification in indo-european languages. (2020) 253–259. [7] T. Ranasinghe, M. Zampieri, H. Hettiarachchi, Brums at hasoc 2019: Deep learning models for multilingual hate speech and ofensive language identification. (2019) 199–207. [8] U. Saha, A. Dubey, P. Bhattacharyya, Iit bombay at hasoc 2019: Supervised hate speech and ofensive content detection in indo-european languages. (2019). [9] T. Ranasinghe, S. Gupte, M. Zampieri, I. Nwogu, Wlv-rit at hasoc-dravidian-codemixifre2020: Ofensive language identification in code-switched youtube comments, arXiv preprint arXiv:2011.00559 (2020). [10] P. KARTHIKEYAN, B. JYOTHI, Hate speech detection with hateful and ofensive expressions on twitter using various machine learning techniques (????). [11] E. Loper, S. Bird, Nltk: The natural language toolkit, arXiv preprint cs/0205028 (2002). [12] V. Kumar, B. Subba, A tfidfvectorizer and svm based sentiment analysis framework for text data corpus, in: 2020 National Conference on Communications (NCC), 2020, pp. 1–6. doi:10.1109/NCC48643.2020.9056085. [13] J. Pennington, R. Socher, C. D. Manning, Glove: Global vectors for word representation, in: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), 2014, pp. 1532–1543. [14] Logistic regression (2010) 631–631. URL: https://doi.org/10.1007/978-0-387-30164-8_493.

doi:10.1007/978-0-387-30164-8_493. [15] S. Hochreiter, J. Schmidhuber, Long short-term memory, Neural computation 9 (1997) 1735–1780. [16] A. S. Maiya, ktrain: A low-code library for augmented machine learning, arXiv preprint arXiv:2004.10703 (2020). [17] T. Chen, C. Guestrin, Xgboost: A scalable tree boosting system, in: Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining, 2016, pp. 785–794. [18] R. Ni, H. Cao, Sentiment analysis based on glove and lstm-gru, in: 2020 39th Chinese Control Conference (CCC), 2020, pp. 7492–7497. doi:10.23919/CCC50068.2020.9188578.

[1]

Modha ,

Mandl ,

G. K.

Shahi ,

Madhu ,

Satapara ,

Ranasinghe , M. Zampieri, Overview of the HASOC Subtrack at FIRE 2021: Hate Speech and Ofensive Content Identification in English and Indo-Aryan Languages and Conversational Hate Speech , in: FIRE 2021: Forum for Information Retrieval Evaluation, Virtual Event , 13th -17th December 2021 , ACM, 2021 .

[2]

Mandl ,

Modha ,

G. K.

Shahi ,

Madhu ,

Satapara ,

Majumder ,

Schäfer ,

Ranasinghe ,

Zampieri ,

Nandini ,

A. K.

Jaiswal , Overview of the HASOC subtrack at FIRE