Aggression Identification in Posts -two machine learning approaches

Aggression Identification in Posts -two machine learning approaches FanevaRamiandrisoa faneva.ramiandrisoa@irit.fr IRIT Université de Toulouse

France

Université d'Antananarivo Aggression Identification in Posts -two machine learning approaches F43A59024EC2462FA8BBEE5045952E62 GROBID - A machine learning software for extracting information from scholarly documents Social media Social media analysis Cyber-agression TRAC Trolling, Aggression and Cyberbulling Machine learning based model

Social media have changed the way people communicate. One of the aspects is cyber-aggression and interpersonal aggression that can be catalyzed by perceived anonymity. Automatically monitoring user-generated content in order to help moderating it is thus a hot topic. In this paper, we present and evaluate two supervised machine learning models to identify aggressive content and the level of aggressiveness. The first model uses random forest and linear regression while the second model uses deep learning techniques.

Introduction

Social media have changed the way people communicate [3,13,14,5]. One of these aspects is cyber-aggression and interpersonal aggression that can be catalyzed by perceived anonymity [16]. Automatically monitoring user-generated content in order to help moderating social media is thus an important although difficult topic [4,17].

In 2018, the Shared Task on Aggression Identification was organised as part of the First Workshop on Trolling, Aggression and Cyberbullying (TRAC -1) at COL-ING 2018 [9]. The objective of this task is to detect aggressive content and the level of aggressiveness. Thirty teams submitted their test runs. The best system obtained a weighted F-score of 0.64 on a data set composed of annotated Facebook comments.

In this paper, we report two models we developed in order to answer the aggression identification task. The first model uses random forest and linear regression which can be considered as relatively mature approaches while the second model combines CNN and LSTM recent deep learning techniques. No strong conclusion could be made on the superiority of one or the other model since it depends on the collection. This paper is organized as follows: Section 2 reports related works, Section 3 describes our two approaches, Section 4 describes the dataset used in this work, reports the results and discuss them while Section 5 concludes this paper and presents future works.

Related work

Approaches based on features and supervised classifiers such as Support Vector Machines (SVM) are often used in order to learn to detect whether a text contains aggressiveness [24]; in recent years, deep learning has been also employed for this task [19,2].

Deep learning has also been used by TRAC challenge participants. TRAC [9] challenge is the first that focuses on detecting aggressive text. The task training set is composed of Facebook posts/comments; there is also two kinds of test sets: one from Facebook and another from Twitter.

Among the thirty participants, Saroyehun [2] obtained the best results. The authors investigated the efficacy of deep neural network by experimenting different models : CNN, LSTM, BiLSTM, and combinations thereof. In their experiments they used translation technique to enlarge the training set and added an external dataset on hate speech 3 . The LSTM model which was trained on the augmented training set only, achieved the best weighted F1 score of 0.6425 on Facebook test set ; it is the first ranked system on TRAC challenge ; the same system does not performed as well on the Twitter data set. The other system of the same team which implements a combination of CNN and LSTM and which was trained on the augmented training set and the additional dataset, achieved a weighted F1 score of 0.5920 and the third rank on the twitter test set.

Raiyani et. al. [20], meanwhile, tested different models for text classification in TRAC, from classic machine learning model to deep learning models. At the end, they kept three models: FastText model, Dense neural networks, and Voting of the two. The Dense neural networks gives better performance than the two others and achieved a weighted F1 score of 0.5813 on Facebook test set; it is the fourteenth rank on TRAC challenge. While it achieved the best weighted F1 score of 0.6009 and the first rank on the twitter test set, although it was trained on a Facebook dataset.

Machine learning based models

We developed two supervised machine learning based models that we evaluated in this paper. The first method combines random forest and logistic regression while the second approach is deep learning based. We also developed a model based on CNN only for which results can be found in [21]; it performs in between the two models reported in this paper.

Trac-RF LR: combination of two classifiers

In this model we combined random forest (RF) based on surface features and linguistic features with logistic regression (LR) based on document vectorization. We chose this combination because a combination of multiple machine learning models placed first in many prestigious machine learning competitions [18], such as Netflix Competition, Kaggle,... Moreover, when using non-combined models on the training dataset, the results were lower in the case of TRAC as well and this was confirmed on the test set (see section 4.3).

RF Classifier. The random forest model uses different features extracted from the comments as presented in Table 1. Some are adapted from [1,22] where the authors tried to detect depression from texts; another source of inspiration is [7] where the authors suggested an information nutritional label for describing text qualities.

Emotions

Frequency of emotions from specific categories: anger, fear, surprise, sadness and disgust. The idea behind is to check the categories related to aggressiveness.

Gunning Fog Index Estimate of the years of education that a person needs to understand the text at first reading.

Flesch Reading Ease

Measure how difficult to understand a text is.

Linsear Write Formula Developed for the U.S. Air Force to calculate the readability of their technical manuals 5 .

New Dale-Chall Readability

Measure the difficulty of comprehension that persons encounter when reading a text. It is inspired from Flesch Reading Ease measure.

Swear words

The intuition behind is that the texts containing insults are often aggressive.

Lexical analysis with python library empath

Empath is a tool for analyzing text across lexical categories. By default, it has 194 lexical categories and each category is considered as feature. Some of these features are used for abusive language detection, hate speech, cyberbullying and the others are used for sentiment or personality analysis that we judged useful for aggression detection.

A RF classifier was trained on train and validation sets by representing each text (Facebook comment or tweet) with a vector composed by the features we mentioned in Table 1.

The following parameters were used during the training: class weight="balanced", max features="sqrt", n estimators=60, min weight fraction leaf=0.0, criterion='entropy', random state=2.

At prediction time, a text from the test set is represented with features and then run the trained model. The output is the estimated probabilities for the three classes (overtly aggressive, covertly aggressive and non-aggressive).

LR Classifier. This model is based on document vectorization using Doc2vec [12]. Doc2vec is used to represent sentences, paragraphs, or whole documents as vectors and it can be trained on small corpora, which is case of the task datasets.

Before building the LR Classifier, we first trained two separate Doc2vec models: a Distributed Bag of Words and a Distributed Memory model [12]. For the training, we used the same configuration as in [25] for representing user's text. The two Doc2vec models were trained on the train and validation sets. We used the Python package gensim 6 [23]. We also concatenated the output vectors of these two models, as done in [25], resulting in a representation by a 200-dimension vector per text.

Then a logistic regression classifier was trained on the vectors for both the train and validation sets with the following parameters : class weight="balanced", random state=1, max iter=100, solver="liblinear".

At prediction time, the texts from the test set were vectorized by using the two Doc2vec models and the 200-dimension vectors were given as input of trained classifier. The output is also a set of class probabilities.

Combination of two classifiers. The class probabilities obtained from RF classifier and LR Classifier were averaged and finally the class with the highest probability was considered as the class the text belongs to. We also tested different ways to combine the output probabilities obtained from the two classifiers RF and LR, such as maximum, minimum, etc., but the average method gave the best results.

Trac-CNN LSTM: Combination of CNN and LSTM

This model combines two deep learning techniques: CNN and LSTM. The main idea is to pass input representation (sentence matrix in Figure 1) to the CNN and pass the local features learnt by the CNN (concatenated vectors in Figure 1) to the LSTM. Indeed, CNN and LSTM are complementary due to the fact that each of them captures information at different scales [2].

The architecture of our combined model is illustrated in Figure 1. It is as follows: first, we convert sentences/texts into sentences matrix 7 where each row is a vector representation 8 of each word in the sentences/texts. Then, convolutions are applied on the sentences matrix where we used three filter region sizes: bigrams (height = 2), trigrams (height = 3) and fourgrams (height = 4). Each region has 100 filters; thus, in total there are 300 filters. The result of convolutions is called feature maps; vectors with variablelength according to the region filter and each filter region has 100 feature maps. Afterwards, a 1-max pooling is performed over feature maps. More precisely, for each region the largest number from each feature map is kept and then concatenated to form a vector. As a result, we obtain one vector of size 1009 per region filter. Then, these three vectors are concatenated to form a feature vector and a dropout is applied on this feature vector. The concatenated feature vector is passed to the LSTM layer. Then, we added one fully connected hidden layer to reduce the dimension of the concatenated vector, followed by a dropout. Finally, an output layer, which is also a fully connected layer with three possible output states, is added. On the output layer, the activation function used is the softmax function.

The architecture of our model is inspired from the CNN architecture Zhang et. al. [26] proposed and which is used for sentences classification. In that task, their CNN architecture outperforms baseline methods which use SVM as well as the one that used CNN in [8].

Evaluation

Data set

The evaluation is based on the TRAC 2018 shared task [9]. The task dataset is a subset of Kumar et al' [10] and consists in English and Hindi randomly sampled Facebook comments. In this study, we focused on the English part of the dataset which is detailed in Table 2. It is composed of (a) 11,999 Facebook comments for training and 3,001 comments for validation. It is annotated with 3 levels of aggression -Overtly Aggressive (OAG), Covertly Aggressive (CAG) and Non-Aggressive (NAG), (b) 916 English comments for test. Additionally, 1,257 English tweets were given as a second test set.

Evaluation measure

The evaluation metric used in this paper is the weighted F1 which was also used in the TRAC shared task. The weighted F1 is equal to the average, weighted by the number of instances for each label, of the F1 (given by equation 1) of each class label.

F1 = 2 R * P R + P(1)

Fig. 1: Illustration of a CNN + LSTM architecture for aggression detection inspired from [26]. where P = t p t p+ f p is the precision, R = t p t p+ f n is the recall, t p denotes the true positives, f p the false positives, and f n the false negatives.

Results

Table 3 reports the results we obtained with the two models presented above. For comparison, we report also results obtained with the RF classifier only and with the LR classifier only. The baseline mentioned in the first row was given by the TRAC shared task organizers while the second row is the best result from participants in the TRAC workshop. We can see that our two models outperform the baseline on both Facebook and Twitter subsets. Trac-RF-LR is better than Trac-CNN-LSTM on the Facebook collection while it is the opposite on the Twitter collection. This could be due to the train dataset which is only composed of texts crawled from Facebook. Indeed, we can observe the same behaviour for the other systems that participated to the challenge [9]. The only exception is for Saroyehun [2] system which performs better on the Twitter dataset.

System

Conclusion

In this paper, we presented two different supervised machine learning approaches for aggression identification on TRAC 2018 English collections (Facebook and Twitter based). The combination of random forest and linear regression classifiers based on a set of surface features and document vectorization leaded to the sixteenth ranked system out of thirty on the Facebook collection. The combination of CNN and Long Short-Term Memory was ranked fifteenth out of thirty systems.

To extend this work, we plan to update our models by adding new features such as bag of words or features more specific to the aggression. We also plan to apply feature engineering on the features we used in this paper in order to see which one are the most useful. On the other hand, feature selection could also be applied to build models that use features as less as possible [11,6]. Finally, an investigation on deep learning models will be conducted by using different architectures such as hierarchical attention network. We do believe that these tracks can help designing more performing models.

Ethical issue. While TRAC challenge has its proper ethical policies, detecting aggressive content from user's posts raises ethical issues that are beyond the scope of the paper.

Table 1 :1List of features used in RF to represent texts (Facebook comments or tweets).4 http://saifmohammad.com/WebPages/NRC-Emotion-Lexicon.htm, accessed on 2017-02-23

Table 2 :2Distribution of training, validation and testing data on TRAC 2018 data collection.Number ofTrainValidationFacebookTestTwittertexts (=posts+comments)11,9993,0019161,257Overt aggression2,708711144361Covert aggression4,2401,057142413No aggression5,0511,233630483

Table 3 :3Results for the English (Facebook and Twitter) task. Bold value is the best performance for our approaches.Weighted F1FacebookTwitterRandom Baseline0.3540.348Saroyehun [2]0.6420.592Trac-RF LR0.5810.409Trac-CNN LSTM0.5590.511Trac-RF only0.5730.397Trac-LR only0.5690.452

https://github.com/ZeerakW/hatespeech, accessed on January 10, 2020 http://www.streetdirectory.com/travel_guide/15675/writing/how_to_choose_ the_best_readability_formula_for_your_document.html, accessed on 2018-02-25 The dimension of a sentence matrix is l × d, where l is the length of the longest text/sentence in the dataset and d is the dimension of word vector representation. The word vector representation is obtained with word2vec model[15] trained on the training and validation sets. Because there is feature maps.

Acknowledgement. This work has been partially funded by the European Union's Horizon 2020 H2020-SU-SEC-2018 under the Grant Agreement n°833115 (PREVI-SION project). This work has also been partially supported by the Ministère des Affaires étrangères et du Développement international under the scholarship EIFFEL-DOCTORAT 2017/ n°P707544H for Faneva Ramiandrisoa's PhD thesis.

IRIT at e-Risk (regular paper) IAbdou Malam MArziki MNezar Bellazrak FBenamara AEl Kaidi BEs-Saghir ZHe MHousni VMoriceau JMothe FRamiandrisoa International Conference of the CLEF Association, CLEF 2017 Labs Working Notes 2017 1866 CEUR Workshop Proceedings Aggression detection in social media: Using deep neural networks, data augmentation, and pseudo labeling STAroyehun AGelbukh Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying the First Workshop on Trolling, Aggression and Cyberbullying

TRAC-

2018. 2018 social media has opened a world of 'open communication:'" experiences of adults with cerebral palsy who use augmentative and alternative communication and social media JCaron JLight Augmentative and Alternative Communication 32 1 2016 Moderated online communities and quality of usergenerated content JChen HXu ABWhinston Journal of Management Information Systems 28 2 2011 Social media and its role in friendship-driven interactions among young people: A mixed methods study JPDécieux AHeinen HWillems YOUNG 27 1 2019 Forward and Backward Feature Selection for Query Performance Prediction SDéjean RTIonescu JMothe MZUllah ACM Symposium on Applied Computing (SAC) Association for Computing Machinery 2020 An information nutritional label for online documents NFuhr AGiachanou GGrefenstette IGurevych AHanselowski KJarvelin RJones YLiu JMothe WNejdl ACM SIGIR Forum ACM 2018 51 Convolutional neural networks for sentence classification YKim CoRR abs/1408.5882 2014 Benchmarking Aggression Identification in Social Media RKumar AKOjha SMalmasi MZampieri Proceedings of the First Workshop on Trolling, Aggression and Cyberbulling (TRAC) the First Workshop on Trolling, Aggression and Cyberbulling (TRAC)

Santa Fe, USA

2018 Aggression-annotated corpus of hindienglish code-mixed data RKumar ANReganti ABhatia TMaheshwari arXiv:1803.09402 2018 arXiv preprint Non-convex Regularizations for Feature Selection in Ranking with Sparse SVM LLaporte RFlamary SCanu SDéjean JMothe IEEE Transactions on Neural Networks and Learning Systems 25 6 june 2014 Distributed representations of sentences and documents QVLe TMikolov Proceedings of the 31th International Conference on Machine Learning, ICML 2014 the 31th International Conference on Machine Learning, ICML 2014

Beijing, China

June 2014. 2014 JHLipschultz Social media communication: Concepts, practices, data, law and ethics Routledge 2017 Intimate partner violence victimization in the cyber and real world: Examining the extent of cyber aggression experiences and its association with inperson dating violence AMarganski LMelander Journal of interpersonal violence 33 7 2018 Distributed representations of words and phrases and their compositionality TMikolov ISutskever KChen GSCorrado JDean Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013

Lake Tahoe, Nevada, United States

December 5-8, 2013. 2013 Proceedings of a meeting held Social media, cyber-aggression and student mental health on a university campus FMishna CRegehr ALacombe-Duncan JDaciuk GFearing MVan Wert Journal of mental health 27 3 2018 Censored, suspended, shadowbanned: User interpretations of content moderation on social media platforms SMyers West New Media & Society 20 11 2018 A transfer learning approach for emotion intensity prediction in microblog text MOsama SREl-Beltagy 10.1007/978-3-030-31129-2_47 030-31129-2 47 Proceedings of the International Conference on Advanced Intelligent Systems and Informatics 2019 the International Conference on Advanced Intelligent Systems and Informatics 2019

AISI; Cairo, Egypt

2019. October 2019. 2019 A pragmatic supervised learning methodology of hate speech detection in social media GPriyadharshini 2019 Fully connected neural network with advance preprocessor to identify aggression over facebook and twitter KRaiyani TGonc ¸alves PQuaresma VBNogueira Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying, TRAC@COLING the First Workshop on Trolling, Aggression and Cyberbullying, TRAC@COLING

Santa Fe, New Mexico, USA

2018 Irit at trac FRamiandrisoa JMothe Proceedings of the First Workshop on Trolling, Aggression and Cyberbulling, TRAC@COLING the First Workshop on Trolling, Aggression and Cyberbulling, TRAC@COLING

Santa Fe, New Mexico, USA

2018. 2018 IRIT at e-Risk FRamiandrisoa JMothe FBenamara VMoriceau Conference and Labs of the Evaluation Forum, Living Labs (CLEF 2018)

Avignon, France

2018. 10/09/2018-14/09/2018. 2018 regular paper. on line). CEUR-WS : Workshop proceedings Software framework for topic modelling with large corpora RRehurek PSojka Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks the LREC 2010 Workshop on New Challenges for NLP Frameworks Citeseer 2010 A Survey on Hate Speech Detection Using Natural Language Processing ASchmidt MWiegand Proceedings of the Fifth International Workshop on Natural Language Processing for Social Media. SocialNLP@EACL 2017 the Fifth International Workshop on Natural Language Processing for Social Media. SocialNLP@EACL 2017

Valencia, Spain

2017 Linguistic metadata augmented classifiers at the CLEF 2017 task for early detection of depression MTrotzek SKoitka CMFriedrich Working Notes of CLEF 2017 -Conference and Labs of the Evaluation Forum

Dublin, Ireland

September 11-14, 2017. 2017 A sensitivity analysis of (and practitioners' guide to) convolutional neural networks for sentence classification YZhang BCWallace CoRR abs/1510.03820 2015