<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">AMI at IberEval2018 Automatic Misogyny Identification in Spanish and English Tweets</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Victor</forename><surname>Nina-Alcocer</surname></persName>
							<email>vicnial@inf.upv.es</email>
							<affiliation key="aff0">
								<orgName type="department">Department of Computer Systems and Computation</orgName>
								<orgName type="institution">Universitat Politècnica de València</orgName>
								<address>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">AMI at IberEval2018 Automatic Misogyny Identification in Spanish and English Tweets</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">7820A1FB0EA10A0DDD80755C321955C1</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T20:26+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>In this paper we describe our submission to the Automatic Misogyny Identification in Spanish and English Tweets shared task organized at IberEval. This work proposes an approach based on weights of n-grams, word categories, structural information and lexical analysis, to discover whether these components allow us to discriminate between misogynous and non-misogynous tweets and, for misogynous tweets, their respective categories and targets. Moreover, we analyze the use of some features created by these components to investigate their impact.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>AMI is the first shared task on automatic misogyny identification <ref type="bibr" target="#b1">[2]</ref>. Its aim is to identify cases of aggressiveness and hate speech towards women in social media <ref type="bibr" target="#b0">[1]</ref>. Poland's work <ref type="bibr" target="#b2">[3]</ref> was the first attempt to manually classify misogynous tweets. The shared task considers two subtasks for this classification, the second of which has two parts:</p><p>-subtask1: Misogyny Identification.</p><p>-subtask2a: Misogynistic Behaviour.</p><p>-subtask2b: Target Classification.</p><p>The aim of subtask1 is to identify whether a tweet is misogynous or not, while subtask2a aims to identify the category a misogynous tweet belongs to: discredit, dominance, sexual harassment, stereotype, or derailing. Finally, subtask2b identifies whether a misogynous tweet is active or passive, i.e. whether its target is generic (women in general) or an individual. In this work, each of these tasks is approached as a classification task, using natural language processing (NLP), feature engineering and machine learning to identify patterns and learn classification models respectively.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Approach</head><p>This section describes the main approaches used. Generally, misogyny can be expressed in writing or orally, in a subtle or an explicit way, and addressed to someone either directly or indirectly. In order to investigate how people may express misogyny in tweets, we propose an approach that allows us to discover some aspects of how misogyny is expressed in the corpus provided by the organizers. Hence, this approach takes into account some features that we considered important in order to understand whether they contribute to recognizing misogynous content and its respective category.</p><p>Structure (str): Knowing how many words a tweet uses, whether most of those words are written in capital letters, or whether punctuation marks are used excessively could reveal important information. A tweet is composed of words, punctuation, mentions, URLs, etc. In this approach, we pay attention to these aspects to see whether they help to better discriminate between misogynous and non-misogynous tweets. A summary of these features is given below:</p><p>-The number of symbols or punctuation marks (!' ?,.").</p><p>-The number of words written in capital letters.</p><p>-The number of words and characters, including stop words.</p><p>-The mean number of words and characters.</p><p>-The number of mentions, URLs, and hashtags.</p></div>
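The structural counts listed above can be sketched with a few regular expressions. This is an illustrative sketch only; the function name and exact patterns are our own choices, not the authors' code:

```python
import re

def structural_features(tweet):
    """Toy version of the str component: counts of punctuation,
    capitalized words, words/characters, mentions, URLs, hashtags."""
    words = tweet.split()
    return {
        "n_punct": len(re.findall(r"[!?,.'\"]", tweet)),
        "n_caps_words": sum(1 for w in words if w.isupper() and w.isalpha()),
        "n_words": len(words),
        "n_chars": len(tweet),
        "mean_word_len": len(tweet) / max(len(words), 1),
        "n_mentions": len(re.findall(r"@\w+", tweet)),
        "n_urls": len(re.findall(r"https?://\S+", tweet)),
        "n_hashtags": len(re.findall(r"#\w+", tweet)),
    }
```

Each count then becomes one numeric feature appended to the tweet's feature vector.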
<div xmlns="http://www.tei-c.org/ns/1.0"><head>LIWC categories (lc):</head><p>Another component that we consider important is the possibility of deriving features from Linguistic Inquiry and Word Count (LIWC)<ref type="foot" target="#foot_1">2</ref>. We took into account only some categories related to misogynous emotions, such as anger, sexual, swear, positive, and negative <ref type="bibr" target="#b3">[4]</ref>. The idea behind this component is to calculate, for instance, the percentage of positive or negative emotions, or whether a tweet has sexual content, as we can see in Figure <ref type="figure" target="#fig_0">1</ref>.</p></div>
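Since the LIWC lexicon itself is licensed, the sketch below uses an invented toy lexicon purely to illustrate how per-category percentages of this kind could be computed; the category word lists are made up for the example:

```python
# Toy stand-in for LIWC-style category counting; the word lists
# below are invented for illustration, not the real LIWC lexicon.
TOY_LEXICON = {
    "anger": {"hate", "angry", "kill"},
    "sexual": {"sexy", "slut"},
    "swear": {"damn", "bitch"},
    "negemo": {"hate", "ugly", "bad"},
    "posemo": {"love", "nice", "good"},
}

def category_percentages(tweet):
    """Percentage of tokens in the tweet that fall in each category."""
    tokens = tweet.lower().split()
    total = max(len(tokens), 1)
    return {cat: 100.0 * sum(t in words for t in tokens) / total
            for cat, words in TOY_LEXICON.items()}
```

The resulting percentages (one per category) are used as additional numeric features.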
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Ngrams (ng):</head><p>In this component, Term Frequency-Inverse Document Frequency schemes based on words (TFIDFW) or characters (TFIDFC) are used. For instance, in the misogynous TFIDFW (see Table <ref type="table" target="#tab_0">1</ref>) the term bitch (first place) is used more among misogynous tweets than among non-misogynous ones (fourth place); i.e., the unigram bitch has one weight in misogynous tweets and a different weight in non-misogynous ones. The same logic is followed for subtask2a and subtask2b, using the TFIDFW of their categories and targets respectively.</p></div>
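A minimal sketch of the two weighting schemes using scikit-learn's TfidfVectorizer; the four-tweet mini-corpus is invented for illustration, and the paper does not specify which library was used:

```python
from sklearn.feature_extraction.text import TfidfVectorizer

# Hypothetical mini-corpus standing in for the training tweets
tweets = [
    "stupid bitch shut up",
    "what a nice day",
    "bitch please go away",
    "good morning everyone",
]

# TFIDFW: word n-grams (unigrams to trigrams)
tfidf_w = TfidfVectorizer(analyzer="word", ngram_range=(1, 3))
Xw = tfidf_w.fit_transform(tweets)

# TFIDFC: character n-grams, which can capture obfuscated spellings
tfidf_c = TfidfVectorizer(analyzer="char", ngram_range=(2, 4))
Xc = tfidf_c.fit_transform(tweets)
```

Fitting one scheme per class (misogynous vs. non-misogynous) then yields the class-specific term weights shown in Table 1.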
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Part of Speech (pos):</head><p>The last component of our approach takes into account part-of-speech information, i.e. tagging each word in a sentence with its appropriate part of speech: noun, adjective, verb, etc. Using this component we can identify some patterns; for instance, in our corpus some nouns are followed by punctuation marks, e.g. bitch!!!!!!.</p></div>
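The noun-plus-punctuation pattern can be illustrated with a small sketch. A real system would use an off-the-shelf tagger (e.g. NLTK's pos_tag); the tiny tag map here is invented so the example is self-contained:

```python
import re

# Invented toy tag map standing in for a real POS tagger's output
TOY_TAGS = {"bitch": "NOUN", "go": "VERB", "away": "ADV", "stupid": "ADJ"}

def noun_plus_punct(tweet):
    """Return tokens tagged NOUN that are followed by two or more
    punctuation marks, e.g. 'bitch!!!!!!'."""
    hits = []
    for m in re.finditer(r"([a-z]+)([!?.]{2,})", tweet.lower()):
        if TOY_TAGS.get(m.group(1)) == "NOUN":
            hits.append(m.group(1) + m.group(2))
    return hits
```

The presence (or count) of such matches can then be added as one more feature per tweet.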
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Experiments and Results</head><p>The organizers provided us with datasets of 3307 Spanish and 3251 English tweets respectively. Each tweet is labeled as misogynous (1) or non-misogynous (0), and both datasets are balanced. Regarding the type of misogyny and the target, each misogynous tweet is labeled as discredit, dominance, sexual harassment, stereotype, or derailing, and as active or passive in the case of the target. With respect to category and target information, the corpus is unbalanced: categories are biased in favor of discredit (60%), and targets are biased in favor of active (almost 75%). Moreover, to evaluate our system, test datasets with 831 and 726 unlabeled tweets in Spanish and English respectively were provided.</p><p>For the experiments, we employed a set of feature combinations which were used to feed several classifiers: Support Vector Machine (SVM), Multi-layer Perceptron (MLP) and Multinomial Naive Bayes (MNB). SVM and 10-fold cross-validation were used: the former was chosen because its performance was good enough with thousands of features, and the latter allows us to avoid over-fitting in all the experiments. The main goal was first to face the classification of misogynous tweets in Spanish, in order to apply the best performing approach to the rest of the subtasks in English or Spanish. Table <ref type="table" target="#tab_1">2</ref> shows how the experiments were set up. Approaches ap1 and ap2 had the aim of finding out whether features created by TFIDFW, TFIDFC, Bag of word-grams (BOW) or Bag of char-grams (BOC) are useful. 
ap1 uses the whole group of features (thousands of them) created by TFIDFW or TFIDFC, while ap2 obtains the 60 best features using truncated singular value decomposition (SVD) on TFIDFW and TFIDFC and then combines them with BOW. Although interesting, these approaches did not give us results above our baselines with any of the classifiers (MLP, SVM, MNB). ap3 tries to reduce the number of features: first we classified a tweet using MNB and took the resulting class probabilities as features (2); additionally we obtained the best 20 features using SVD on TFIDFW; lastly, we added the str (5) and lc (10) features. Unfortunately, with these 37 features we did not achieve results above our baselines on subtask1 and subtask2ab either. We now proceed to analyze the results obtained with the approach proposed in Section 2. ap4 and ap5 follow the same logic, but ap5 obtains better results than ap4 because it uses TFIDFW. Tables <ref type="table" target="#tab_3">3 and 4</ref> show the best values that we achieved: run4 in Table <ref type="table" target="#tab_2">3</ref> uses TFIDFW plus structure, category and weight of ngrams (unigrams+bigrams+trigrams) as features, and we obtained an accuracy of 0.782 applying a linear SVM on subtask1; for subtask2a, we added part of speech as a feature and obtained an F1-macro of 0.370. Looking at Table <ref type="table" target="#tab_3">4</ref>, we may observe that in run2 we obtained an F1-macro of 0.780 on subtask2b using just lc as a feature, and that using just str and ng(bigram) we obtained an F1-macro of 0.503 on subtask2a.</p></div>
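The evaluation setup described above (a linear SVM over TFIDFW features with 10-fold cross-validation) can be sketched as follows; the toy corpus is invented, not the task data, and the library choice is our assumption:

```python
from sklearn.pipeline import make_pipeline
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.svm import LinearSVC
from sklearn.model_selection import cross_val_score

# Tiny synthetic stand-in for the training corpus (invented examples);
# the real task used 3307 Spanish / 3251 English labelled tweets.
pos = ["stupid bitch shut up", "go back to the kitchen"] * 10
neg = ["what a lovely day", "great match last night"] * 10
tweets = pos + neg
labels = [1] * len(pos) + [0] * len(neg)

# Linear SVM over TFIDFW features (unigrams to trigrams),
# evaluated with 10-fold cross-validation as in the experiments
clf = make_pipeline(TfidfVectorizer(ngram_range=(1, 3)), LinearSVC())
scores = cross_val_score(clf, tweets, labels, cv=10, scoring="accuracy")
print(round(scores.mean(), 3))
```

Swapping the vectorizer or appending the str/lc/pos feature columns reproduces the ap1-ap5 configurations of Table 2.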
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Official ranking</head><p>We did not expect good results in English (see Table <ref type="table" target="#tab_4">5</ref>), but we obtained scores slightly above the average F1 baseline (0.3374) on subtask2a and subtask2b (see run3 and run4), while on subtask1 we were below the accuracy baseline (0.7837). These results may be due to a poor combination of our features. Table <ref type="table" target="#tab_5">6</ref> shows the better results we obtained in Spanish (among the top five teams). However, we think that classifying misogynous tweets in this corpus was quite difficult, because the performance of the teams was approximately 80% in terms of accuracy. Similarly, on subtask2a and subtask2b, most teams were not far from the baseline.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1 .</head><label>1</label><figDesc>Fig. 1. The content (words) of a tweet belongs to some category: death, anger, etc.</figDesc><graphic coords="2,221.22,449.71,172.91,115.27" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 .</head><label>1</label><figDesc>Weight of uni-grams and bi-grams</figDesc><table><row><cell></cell><cell>uni-grams</cell><cell></cell><cell></cell><cell></cell><cell>bi-grams</cell><cell></cell></row><row><cell></cell><cell cols="2">misogynous no-misogynous</cell><cell></cell><cell></cell><cell cols="2">misogynous no-misogynous</cell></row><row><cell cols="2">N term weights</cell><cell>N term</cell><cell>weights</cell><cell>N term</cell><cell>weights</cell><cell>N term</cell><cell>weights</cell></row><row><cell cols="2">1 bitch 0.054913</cell><cell>1 rape</cell><cell cols="3">0.021782 1 stupid bitch 0.010204</cell><cell>1 stupid cunt</cell><cell>0.006159</cell></row><row><cell>2 dick</cell><cell>0.027398</cell><cell>2 dick</cell><cell cols="2">0.019902 2 ass bitch</cell><cell>0.006658</cell><cell>2 son bitch</cell><cell>0.002429</cell></row><row><cell cols="2">3 stupid 0.024436</cell><cell>3 cunt</cell><cell cols="2">0.019422 3 suck dick</cell><cell>0.004807</cell><cell>3 men rights</cell><cell>0.002079</cell></row><row><cell>4 like</cell><cell>0.024388</cell><cell>4 bitch</cell><cell>0.018755</cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="2">5 woman 0.023752</cell><cell>5 hoe</cell><cell>0.017120</cell><cell></cell><cell></cell><cell></cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2 .</head><label>2</label><figDesc>Configuration of the main experiments</figDesc><table><row><cell>Name</cell><cell>Set up</cell></row><row><cell>ap1 .</cell><cell>TFIDFW + TFIDFC + BOW + BOC</cell></row><row><cell>ap2 .</cell><cell>SVD30(TFIDFW) + SVD30(TFIDFC) + BOW</cell></row><row><cell cols="2">ap3 . MNB(PREDICTED) + SVD20(TFIDFW) + str + lc</cell></row><row><cell>ap4 .</cell><cell>BOW + str + lc + ng + pos</cell></row><row><cell>ap5 .</cell><cell>TFIDFW + str + lc + ng + pos</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 3 .</head><label>3</label><figDesc>Results with ap5 on English training tweets</figDesc><table><row><cell>run</cell><cell>subtask1</cell><cell></cell><cell cols="2">subtask2a subtask2b</cell></row><row><cell></cell><cell></cell><cell>Accuracy</cell><cell cols="2">F1-macro F1-macro</cell></row><row><cell>run1SVM on TFIDFW</cell><cell>+str+lc</cell><cell>0.733 +pos</cell><cell>0.299</cell><cell>0.721</cell></row><row><cell>run2SVM on TFIDFW</cell><cell>+str+lc+ng(u)</cell><cell>0.781 +pos</cell><cell>0.302</cell><cell>0.762</cell></row><row><cell>run3SVM on TFIDFW</cell><cell>+str+lc+ng(u+b)</cell><cell>0.781 +pos</cell><cell>0.343</cell><cell>0.763</cell></row><row><cell cols="4">run4SVM on TFIDFW +str+lc+ng(u+b+t) 0.782 +pos 0.370</cell><cell>0.764</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 4 .</head><label>4</label><figDesc>Results with ap5 on Spanish training tweets</figDesc><table><row><cell>run</cell><cell>subtask1</cell><cell></cell><cell cols="2">subtask2a subtask2b</cell><cell></cell></row><row><cell></cell><cell></cell><cell>Accuracy</cell><cell>F1-macro</cell><cell></cell><cell>F1-macro</cell></row><row><cell>run1SVM on TFIDFW</cell><cell>+str+lc</cell><cell>0.804</cell><cell>0.472</cell><cell>-str</cell><cell>0.781</cell></row><row><cell cols="2">run2SVM on TFIDFW +str+lc+ng(b)</cell><cell>0.860 -lc</cell><cell cols="2">0.503 -str-ng(b)</cell><cell>0.780</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_4"><head>Table 5 .</head><label>5</label><figDesc>Official results for English subtask1, subtask2a and subtask2b</figDesc><table><row><cell></cell><cell>subtask1</cell><cell></cell><cell></cell><cell>subtask2ab</cell></row><row><cell>Rank</cell><cell>Run</cell><cell cols="3">Accuracy Rank Average F1-macro</cell></row><row><cell>16</cell><cell cols="2">Our approach.run2 0.7809</cell><cell>17</cell><cell>0.336433966</cell></row><row><cell>17</cell><cell cols="2">Our approach.run3 0.7809</cell><cell>14</cell><cell>0.33914113</cell></row><row><cell>18</cell><cell cols="2">Our approach.run4 0.7809</cell><cell>13</cell><cell>0.339590051</cell></row><row><cell>26</cell><cell cols="2">Our approach.run1 0.7094</cell><cell>23</cell><cell>0.316368399</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_5"><head>Table 6 .</head><label>6</label><figDesc>Official results for Spanish subtask1, subtask2a and subtask2b</figDesc><table><row><cell></cell><cell>subtask1</cell><cell></cell><cell>subtask2ab</cell></row><row><cell>Rank</cell><cell>Run</cell><cell cols="2">Accuracy Rank Average Macro F1</cell></row><row><cell>9</cell><cell cols="2">Our approach.run1 0.805054152 8</cell><cell>0.42722476</cell></row><row><cell>20</cell><cell cols="2">Our approach.run2 0.76654633 13</cell><cell>0.41174962</cell></row><row><cell>22</cell><cell cols="2">Our approach.run3 0.65944645 21</cell><cell>0.27271983</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">https://sites.google.com/view/ibereval-2018</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">https://www.receptiviti.ai/liwc-api-get-started</note>
		</body>
		<back>
			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Conclusions</head><p>In this work, we proposed an approach that takes into account several aspects: weights of n-grams, LIWC categories, structural information and lexical analysis. We observed that each aspect contributes in some way to the different subtasks. Moreover, we noticed that all four aspects contributed to obtaining a better accuracy and F1-macro on the corpus of English tweets, whereas only the first three were useful for the Spanish tweets. As future work, it would be interesting to use techniques for handling unbalanced datasets and to explore other features. Moreover, we plan to use deep learning to see what performance this technique could achieve.</p></div>			</div>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Haters: Harassment, Abuse, and Violence Online by Bailey Poland</title>
		<author>
			<persName><forename type="first">M</forename><surname>Bailey</surname></persName>
		</author>
		<idno type="DOI">10.1086/693771</idno>
		<ptr target="https://doi.org/10.1086/693771" />
	</analytic>
	<monogr>
		<title level="j">Signs: Journal of Women in Culture and Society</title>
		<imprint>
			<biblScope unit="volume">43</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="495" to="497" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
	<note>Signs</note>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Overview of the task on automatic misogyny identification at ibereval</title>
		<author>
			<persName><forename type="first">E</forename><surname>Fersini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Anzovino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Rosso</surname></persName>
		</author>
		<ptr target="CEUR-WS.org" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Third Workshop on Evaluation of Human Language Technologies for Iberian Languages (IberEval 2018), co-located with 34th Conference of the Spanish Society for Natural Language Processing (SEPLN 2018) CEUR Workshop Proceedings</title>
				<meeting>the Third Workshop on Evaluation of Human Language Technologies for Iberian Languages (IberEval 2018), co-located with 34th Conference of the Spanish Society for Natural Language Processing (SEPLN 2018) CEUR Workshop Proceedings<address><addrLine>Seville, Spain</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2018-09-18">September 18, 2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">The problem of identifying misogynist language on twitter (and other online social spaces)</title>
		<author>
			<persName><forename type="first">S</forename><surname>Hewitt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Tiropanis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Bokhove</surname></persName>
		</author>
		<idno type="DOI">10.1145/2908131.2908183</idno>
		<ptr target="http://doi.acm.org/10.1145/2908131.2908183" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 8th ACM Conference on Web Science</title>
				<meeting>the 8th ACM Conference on Web Science</meeting>
		<imprint>
			<date type="published" when="2016">2016</date>
			<biblScope unit="volume">16</biblScope>
			<biblScope unit="page" from="333" to="335" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<author>
			<persName><forename type="first">B</forename><surname>Poland</surname></persName>
		</author>
		<ptr target="http://www.jstor.org/stable/j.ctt1fq9wdp" />
		<title level="m">Haters: Harassment, Abuse, and Violence Online</title>
				<imprint>
			<publisher>University of Nebraska Press</publisher>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
