<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>UACh at MEX-A3T 2019: Preliminary Results on Detecting Aggressive Tweets by Adding Author Information Via an Unsupervised Strategy</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Marco Casavantes</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Roberto Lopez</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Luis Carlos Gonzalez</string-name>
          <email>lcgonzalezg@uach.mx</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Universidad Autónoma de Chihuahua, Facultad de Ingeniería, Chihuahua</institution>
          ,
          <addr-line>Chih.</addr-line>
          ,
          <country country="MX">Mexico</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <fpage>537</fpage>
      <lpage>543</lpage>
      <abstract>
        <p>In this paper we describe our participation in the Aggressiveness Detection track of the second edition of MEX-A3T. We evaluate different strategies for text classification, including classifiers such as Support Vector Machines and a Multilayer Perceptron trained on n-grams (words and characters) and word embeddings. We also study the inclusion of features that try to give context to the text messages, exploring whether people verbally attack differently depending on their traits and overall environment. Preliminary results show that our strategy is competitive for detecting aggression in tweets, ranking in 2nd place with respect to the participants of 2018 and 2019.</p>
      </abstract>
      <kwd-group>
        <kwd>Spanish text classification</kwd>
        <kwd>Aggressiveness Detection</kwd>
        <kwd>Multilayer Perceptron</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Technology has changed the way in which people communicate with each other,
giving rise to new services such as social networks, where an informal style of
communication is used. Such social networks, though, present several challenges in
keeping communication channels open to the free sharing of ideas. The
intolerance and aggressiveness of certain users affects the experience of other consumers
or people interested in being part of the communities and their conversations.
Not being face to face in the communication channel, and even being able to
preserve anonymity, encourages these individuals to express themselves offensively.
However, the volume of messages sent daily, the growth of online
communities, and the ease of access to these social networks make the
moderation of communication channels a difficult task to handle by
conventional means; and as people increasingly communicate online, the need for
high-quality automated abusive language classifiers becomes much more
profound [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>
        One of the goals of the second edition of MEX-A3T [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] is to tackle this
problem and further the research on this important NLP task: the detection
of aggressive tweets in Mexican Spanish. In this work we evaluate previously
proposed strategies, such as the use of lexical features through TF-IDF representations,
and different approaches to adding features that try to give context to each
text. Surprisingly, even tackling the task with such a basic approach, our proposal
is able to offer competitive results, just slightly behind the top performer of this
competition in 2018 and 2019, INGEOTEC. Furthermore, we also investigate
how to incorporate authors' traits using unsupervised methods,
attempting to include this information as possible features, based on the hypothesis that
aggression takes different forms depending on the author's context.
      </p>
    </sec>
    <sec id="sec-2">
      <title>Proposed Method</title>
      <sec id="sec-2-1">
        <title>Data Pre-processing</title>
        <p>After loading the train and test sets, we strip the tweets of non-alphanumeric
characters, keeping only some relevant Spanish characters (á, é, í, ó, ú, ñ, and ü),
and all words are then lowercased. We noticed that in both sets
there exist many different terms expressing laughter (mainly due to how many
times "ja" is repeated when the word "jaja" appears, and because of typos), which
led us to replace every word containing "jaja" with "risa" (laugh), with the
purpose of decreasing the number of terms that represent this emotion.
It is worth mentioning that we also created and conducted experiments on a
version of the datasets where emojis were converted to text and hashtags were
split into words (e.g., ":)" would turn into "smiling face", and "#FelizMiercoles"
would become "feliz miercoles"); however, most hashtags were wrongly split,
the performance of the classifiers decreased when these steps were incorporated,
and they were therefore discarded.</p>
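        <p>A minimal sketch of the pre-processing steps above (the function and the kept-character set are our own naming, not the authors' code):</p>

```python
import re

# Characters kept after stripping: alphanumerics plus accented Spanish letters.
KEPT = "a-z0-9áéíóúñü"

def clean_tweet(text):
    """Lowercase, drop non-kept characters, and collapse any laughter
    variant containing 'jaja' into the single token 'risa'."""
    text = text.lower()
    # Replace every character outside the kept set (and whitespace) by a space.
    text = re.sub("[^" + KEPT + r"\s]", " ", text)
    words = ["risa" if "jaja" in w else w for w in text.split()]
    return " ".join(words)
```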
      </sec>
      <sec id="sec-2-2">
        <title>Features</title>
        <p>We conducted our research using the following features:</p>
        <p>Lexical: We use word n-grams (n = 1, 2) and char n-grams (n = 3, 4) as features;
this collection of terms is weighted with its term frequency-inverse document
frequency (TF-IDF).</p>
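        <p>A sketch of these lexical features with scikit-learn (which the paper reports using for its TF-IDF matrices); the two-vectorizer union is our reading of "word and char n-grams", not the authors' exact configuration:</p>

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import FeatureUnion

# Word n-grams (n = 1, 2) and character n-grams (n = 3, 4), each TF-IDF
# weighted, concatenated into a single sparse feature matrix.
lexical = FeatureUnion([
    ("word", TfidfVectorizer(analyzer="word", ngram_range=(1, 2))),
    ("char", TfidfVectorizer(analyzer="char", ngram_range=(3, 4))),
])

docs = ["risa no puede ser", "vamos muy bien"]
X = lexical.fit_transform(docs)  # one row per tweet
```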
        <p>
          Document Embeddings: The objective was to represent the tweets through
word embeddings [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] and try different classifiers with these new features. Each
text message was converted to a vector of size 300 (the mean of the vectors of
its words). The Spanish word model was computed with fastText [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] and
downloaded from [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ].
        </p>
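        <p>The averaging step can be sketched as follows (toy word vectors; the paper uses 300-dimensional fastText vectors for Spanish):</p>

```python
def doc_vector(text, word_vectors, dim=300):
    """Mean of the vectors of the words of the text that exist in the model;
    returns the zero vector when no word is covered."""
    words = [w for w in text.split() if w in word_vectors]
    if not words:
        return [0.0] * dim
    sums = [0.0] * dim
    for w in words:
        for i, v in enumerate(word_vectors[w]):
            sums[i] += v
    return [s / len(words) for s in sums]
```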
      </sec>
      <sec id="sec-2-3">
        <title>User Occupation and Location Predictions</title>
        <p>
          Although we attempted several strategies to obtain unsupervised author profiles for each document [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ], we
ended up using the output of the system developed by [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] as predictions of
occupation and location values, to explore possible differences in vocabulary
according to the profile of the author of the message.
        </p>
        <p>
          Grouping tweets by theme: An implementation of Self-Organizing Maps
(SOM) called MiniSom [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] was used as a clustering strategy, aiming to find
groups in the collection of texts based on underlying, non-explicit features.
The clustering was done both including all words and ignoring swear words (to
reduce the noise and focus on thematic terms). After training the network, we
were able to compute the coordinates assigned to a tweet on the map and use
these as new features.
        </p>
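        <p>After training, a SOM maps each tweet vector to the grid coordinates of its best-matching unit (MiniSom exposes this as <monospace>winner</monospace>); a minimal, library-free sketch of that lookup over a rows × cols × dim weights grid:</p>

```python
def winner(x, weights):
    """Return the (row, col) of the grid unit whose weight vector is
    closest (squared Euclidean distance) to input vector x."""
    dists = {
        (i, j): sum((a - b) ** 2 for a, b in zip(x, w))
        for i, row in enumerate(weights)
        for j, w in enumerate(row)
    }
    return min(dists, key=dists.get)
```

        <p>The pair returned by this lookup is what we append to the feature vector of each tweet.</p>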
        <p>
          Perspicuity score / Inflesz scale: Based on [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ], we adapted the idea of
capturing the quality of each tweet by using a modified Flesch Reading Ease score
(since that test only applies to text written in English), called the Perspicuity score,
and its equivalence on the Inflesz scale, following the equation described in [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ],
where the number of sentences is also fixed at one.
        </p>
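        <p>A sketch of this score under our reading of the cited formulation (the Szigriszt-Pazos perspicuity index, with syllables crudely approximated as vowel groups and the sentence count fixed at one); the constants are an assumption to be checked against [10]:</p>

```python
import re

def syllables(word):
    """Crude Spanish syllable count: number of vowel groups, at least 1."""
    return max(1, len(re.findall("[aeiouáéíóúü]+", word)))

def perspicuity(text):
    """Perspicuity score with the number of sentences fixed at one."""
    words = text.lower().split()
    syl = sum(syllables(w) for w in words)
    return 206.835 - 62.3 * (syl / len(words)) - len(words)
```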
        <p>All the extra categorical features mentioned above were concatenated
following a One-Hot Encoding scheme.</p>
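        <p>One-hot concatenation in its simplest form (the category labels here are hypothetical placeholders, not the task's actual occupation/location sets):</p>

```python
def one_hot(value, categories):
    """Binary indicator vector over a fixed category list."""
    return [1.0 if value == c else 0.0 for c in categories]

# Hypothetical label sets for illustration only.
occupations = ["student", "professional", "other"]
locations = ["north", "center", "south"]

# Extra features for one tweet: concatenated one-hot blocks.
extra = one_hot("student", occupations) + one_hot("north", locations)
```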
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Experiments and Results</title>
      <p>The datasets were provided by the MEX-A3T team. Table 1 shows the distribution
of training and test partitions for Spanish tweets.</p>
      <p>
        We split the training set into 67% for training and 33% for validation
to evaluate our experiments with the different combinations of features discussed
in Section 2.2. We started our research by recreating the baselines described in
the overview of the first edition of MEX-A3T [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], particularly focusing on the
character trigrams baseline, as it holds the best performance in comparison to
the BoW baseline.
      </p>
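      <p>The 67/33 internal split can be reproduced with scikit-learn's <monospace>train_test_split</monospace> (placeholder data below; the random seed and stratification are our assumptions, not reported by the paper):</p>

```python
from sklearn.model_selection import train_test_split

# Placeholder tweets and binary labels standing in for the official train set.
texts = ["tweet %d" % i for i in range(100)]
labels = [i % 2 for i in range(100)]

# 67% training / 33% validation, keeping the class ratio in both parts.
X_tr, X_val, y_tr, y_val = train_test_split(
    texts, labels, test_size=0.33, random_state=0, stratify=labels)
```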
      <p>
        We trained Linear Support Vector Machines and a Multilayer Perceptron as
classifiers for this task, and we chose the perceptron as the final system
for submitting our predictions, since it exhibited the best results in the validation
stage, as shown in Table 2, where we report the macro F1-score and the
F-measure over the aggressive class. We performed all modeling regarding the
creation of TF-IDF feature matrices and SVM classifiers using scikit-learn [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], and
for the Multilayer Perceptron we used the implementation described in [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ].
There was only one instance where this perceptron could not be trained with word
embeddings, so we tried another configuration of the MLPClassifier from
scikit-learn, getting low scores similar to the ones obtained using LinearSVM, and
we therefore cast this approach aside.
      </p>
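      <p>The two metrics reported in Table 2 correspond to scikit-learn's <monospace>f1_score</monospace> with macro averaging and with the aggressive class as the positive label (toy predictions below, for illustration only):</p>

```python
from sklearn.metrics import f1_score

# Toy labels: 1 = aggressive, 0 = non-aggressive.
y_true = [1, 0, 1, 1, 0]
y_pred = [1, 0, 0, 1, 0]

macro = f1_score(y_true, y_pred, average="macro")  # macro F1 over both classes
f_aggr = f1_score(y_true, y_pred, pos_label=1)     # F-measure, aggressive class
```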
      <sec id="sec-3-1">
        <title>Results</title>
        <p>
          As stated before, the Multilayer Perceptron was chosen as the final system; however,
because of time and memory constraints we had to train this model using only
character n-grams of range [3, 4] for this task, even though later results showed
better performance using n-grams of range [3, 5]. Table 3 lists the top five
final rankings for the aggressiveness detection task in 2019; further details on all
results of the contest are given in [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]. It is interesting to observe that even
though our system relied on such a basic approach, it is able to compete
face-to-face against INGEOTEC, a model based on an ensemble of classifiers which
specially tailors discriminative features for aggressiveness detection via a Genetic
Programming strategy.
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>Analysis</title>
        <p>To break down our results, we started by obtaining the 10 most valuable
character-level n-grams, separated by length, as shown in Table 4. With respect to
the aggressive class, our final configuration had more false positives than false
negatives, meaning that it was easier for a non-aggressive tweet to be misclassified
as aggressive than the other way around. Despite running several
experiments and adding new features trying to give context to the tweets, in hopes
of improving classification in this task, these strategies unfortunately showed,
at best, almost unnoticeable changes in the results, and hindered classification
at worst. After manual inspection, we observed that this could have happened
because:
– Occupation and Location predictions did not group the messages in a
balanced way; in fact, most tweets would fall under only one out of eight
available categories for occupation and six categories for location.
– SOM coordinates did not enhance the classification scores, as the
clusters were capturing word repetition instead of thematic aspects of each
tweet. Later experiments (after submission of results) showed that this
behaviour was caused by clustering over n-grams; training
the SOM with word embeddings created from the train set of this task
(without external resources) solved this issue and did a better job at grouping the
tweets by subject.
– There was no relevant pattern from applying a perspicuity score to each tweet,
as there were multiple cases of similar scores assigned to both aggressive and
non-aggressive messages.
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Conclusions</title>
      <p>In this paper we described our strategy to classify aggressive and non-aggressive
tweets in Mexican Spanish. Our best-performing system uses only lexical
features, and our results show a better performance than those of most
participants. This outcome, and the fact that the F-measure for the aggressive class
is still low compared to the score on the non-aggressive class, motivates
future work focusing on feature analysis for aggressiveness detection:
exploring which representations are truly relevant, including word embeddings and bags
of words and characters with different n-gram ranges, and seeing whether these complement each
other and, if so, how to combine them. We analyzed our clustering strategies, and
after changing the way they were trained we observed a slight improvement
in classification results, motivating us to keep experimenting with ways to
add context to the text messages. We also believe in the potential that neural
networks display for this task, and that more research on how to build and train
them properly will certainly improve the state of this task.
As future work, we look forward to developing new strategies based on deep neural
networks, such as Recurrent Neural Networks, which are tools aimed at working
with sequential data similar in nature to time series.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>Chikashi</given-names>
            <surname>Nobata</surname>
          </string-name>
          , Joel Tetreault, Achint Thomas,
          <string-name>
            <given-names>Yashar</given-names>
            <surname>Mehdad</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Yi</given-names>
            <surname>Chang</surname>
          </string-name>
          .
          <article-title>Abusive language detection in online user content</article-title>
          .
          <source>In Proceedings of the 25th International Conference on World Wide Web, WWW '16</source>
          , pages
          <fpage>145</fpage>
          –
          <lpage>153</lpage>
          , Republic and Canton of Geneva, Switzerland,
          <year>2016</year>
          . International World Wide Web Conferences Steering Committee.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>Mario Ezra</given-names>
            <surname>Aragon</surname>
          </string-name>
          , Miguel A. Alvarez-Carmona, Manuel Montes-y-Gomez, Hugo Jair Escalante, Luis Villaseñor-Pineda, and Daniela Moctezuma.
          <article-title>Overview of MEX-A3T at IberLEF 2019: Authorship and aggressiveness analysis in Mexican Spanish tweets</article-title>
          .
          <source>In Notebook Papers of 1st SEPLN Workshop on Iberian Languages Evaluation Forum (IberLEF)</source>
          , Bilbao, Spain, September,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>Quoc</given-names>
            <surname>Le</surname>
          </string-name>
          and
          <string-name>
            <given-names>Tomas</given-names>
            <surname>Mikolov</surname>
          </string-name>
          .
          <article-title>Distributed representations of sentences and documents</article-title>
          .
          <source>In Proceedings of the 31st International Conference on International Conference on Machine Learning - Volume 32, ICML'14</source>
          , pages II-1188–II-1196. JMLR.org,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>Piotr</given-names>
            <surname>Bojanowski</surname>
          </string-name>
          , Edouard Grave, Armand Joulin, and
          <string-name>
            <given-names>Tomas</given-names>
            <surname>Mikolov</surname>
          </string-name>
          .
          <article-title>Enriching word vectors with subword information</article-title>
          .
          <source>Transactions of the Association for Computational Linguistics</source>
          ,
          <volume>5</volume>
          :
          <fpage>135</fpage>
          –
          <lpage>146</lpage>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5. GitHub - mquezada/starsconf2018-word-embeddings:
          <article-title>Material for the workshop "Word vector representations based on neural networks" at StarsConf 2018</article-title>
          . https://github.com/mquezada/starsconf2018-word-embeddings.
          <source>(Accessed on 06/02/</source>
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>Roberto</given-names>
            <surname>Lopez Santillan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.C.</given-names>
            <surname>Gonzalez-Gurrola</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Graciela</given-names>
            <surname>Ramírez-Alonso</surname>
          </string-name>
          .
          <article-title>Custom document embeddings via the centroids method: Gender classification in an author profiling task</article-title>
          .
          In Linda Cappellato, Nicola Ferro, Jian-Yun Nie, and Laure Soulier, editors,
          <source>CLEF 2018 Evaluation Labs and Workshop – Working Notes Papers</source>
          , 10-14 September, Avignon, France. CEUR-WS.org,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>Rosa María</given-names>
            <surname>Ortega-Mendoza</surname>
          </string-name>
          and A. Pastor Lopez-Monroy.
          <article-title>The winning approach for author profiling of Mexican users in Twitter at MEX-A3T@IberEval 2018</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8. Github - justglowing/minisom:
          <article-title>Minisom is a minimalistic implementation of the self organizing maps</article-title>
          . https://github.com/JustGlowing/minisom. (Accessed on 06/03/
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>Thomas</given-names>
            <surname>Davidson</surname>
          </string-name>
          , Dana Warmsley,
          <string-name>
            <given-names>Michael W.</given-names>
            <surname>Macy</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Ingmar</given-names>
            <surname>Weber</surname>
          </string-name>
          .
          <article-title>Automated hate speech detection and the problem of offensive language</article-title>
          .
          <source>CoRR, abs/1703.04009</source>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10. Escala Inflesz | legible. https://legible.es/blog/escala-inflesz/. (Accessed on 06/02/
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <given-names>Miguel</given-names>
            <surname>Alvarez-Carmona</surname>
          </string-name>
          , Estefanía Guzman-Falcon, Manuel Montes-y-Gomez, Hugo Jair Escalante, Luis Villaseñor-Pineda, Veronica Reyes-Meza, and Antonio Rico-Sulayes.
          <article-title>Overview of MEX-A3T at IberEval 2018: Authorship and aggressiveness analysis in Mexican Spanish tweets</article-title>
          .
          <source>CEUR Workshop Proceedings</source>
          ,
          <volume>2150</volume>
          :
          <fpage>74</fpage>
          –
          <lpage>96</lpage>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <given-names>F.</given-names>
            <surname>Pedregosa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Varoquaux</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gramfort</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Michel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Thirion</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Grisel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Blondel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Prettenhofer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Weiss</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Dubourg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Vanderplas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Passos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Cournapeau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Brucher</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Perrot</surname>
          </string-name>
          , and
          <string-name>
            <given-names>E.</given-names>
            <surname>Duchesnay</surname>
          </string-name>
          .
          <article-title>Scikit-learn: Machine learning in Python</article-title>
          .
          <source>Journal of Machine Learning Research</source>
          ,
          <volume>12</volume>
          :
          <fpage>2825</fpage>
          –
          <lpage>2830</lpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13. GitHub - afshinrahimi/sparsemultilayerperceptron:
          <article-title>Lasagne/Theano based multilayer perceptron (MLP) which accepts both sparse and dense matrices and is very easy to use, with scikit-learn API similarity</article-title>
          . https://github.com/afshinrahimi/sparsemultilayerperceptron. (Accessed on 06/03/
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>