<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>HaMor at the Profiling Hate Speech Spreaders on Twitter</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Mirko Lai</string-name>
          <email>mirko.lai@unito.it</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marco Antonio Stranisci</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Cristina Bosco</string-name>
          <email>cristina.bosco@unito.it</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rossana Damiano</string-name>
          <email>rossana.damiano@unito.it</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Viviana Patti</string-name>
          <email>viviana.patti@unito.it</email>
        </contrib>
      </contrib-group>
      <pub-date>
        <year>2021</year>
      </pub-date>
      <abstract>
        <p>In this paper we describe the Hate and Morality (HaMor) submission for the Profiling Hate Speech Spreaders on Twitter task at PAN 2021. We ranked as the 19th position - over 66 participating teams according to the averaged accuracy value of 73% reached by our proposed models over the two languages. We obtained the 43th higher accuracy for English (62%) and the 2nd higher accuracy for Spanish (84%). We proposed four types of features for inferring users attitudes just from the text in their messages: HS detection, users morality, named entities, and communicative behaviour. The results of our experiments are promising and will lead to future investigations of these features in a finer grained perspective.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Hate Speech</kwd>
        <kwd>Moral Values</kwd>
        <kwd>Communicative Behaviour</kwd>
        <kwd>Named Entities</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        The Profiling Hate Speech (HS) Spreaders on Twitter is an Author Profiling task organized at
PAN [
        <xref ref-type="bibr" rid="ref1 ref2 ref3">1, 2, 3</xref>
        ]. Teams are invited to develop a model that, given a Twitter feed of 200 messages,
determines whether its author spreads hatred contents. The task is multilingual, and covers
Spanish and English languages. The training set is composed of 200 users per language, 100 of
them annotated as haters by having posted at least one HS in their feeds; the annotation of single
tweets is not available, though. All the information about users, mentions, hashtags, and urls are
anonymized, making not replicable in this context approaches based on demographic features [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ],
or community detection [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
      <p>
        The Hate and Morality (HaMor) team participates to the task with a system that combines
HS, and moral values detection [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] in a feed of tweets, in order to infer a general attitude of a
user towards people vulnerable to discrimination. Our approach relies on the moral pluralistic
hypothesis (Cfr [
        <xref ref-type="bibr" rid="ref7 ref8 ref9">7, 8, 9</xref>
        ]), according to which moral foundations are many and people prioritize
some values than other ones. This can lead to divergent and often conflicting points of view on
debated facts, and might also be a factor in HS spreading [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. More specifically, we considered
a group-bound moral judgement as the signal of a potential negative stance against minorities,
and used it as a feature to classify HS spreaders together with a HS detection model. The paper is
structured as follow: in Section 2 features selection stage is described, and Section 3 is devoted
to present the experimental results of our model. In Conclusions (Section 4) some limitations of
our approach are discussed.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Feature Selection</title>
      <p>Four types of features for inferring users attitudes just from the text in their messages have been
selected to train our model: HS detection (Section 2.1), users morality (Section 2.2), Named
Entities (Section 2.3), Communicative behavior (Section 2.4).</p>
      <sec id="sec-2-1">
        <title>2.1. Hate Speech Detection</title>
        <p>
          Hate Speech Detection (HSD) is the automated task of detecting whether a piece of text contains
hate speech. Several shared tasks on HSD have taken place and large annotated corpora are
available in different languages. For example, the HatEval dataset for hate speech detection
against immigrants and women in Spanish and English tweets have been released to be used at
the Task 5 of the SemEval-2019 workshop [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ]. We decided to use the entire HatEval dataset for
training three models and we proposed the following features:
−
−
−
        </p>
        <p>SemEvalSVM (SESVM): 1-dimensional feature that counts - for each user - the number
of hateful tweets predicted by a linear SVM trained using a text 1-3grams bag-of-words
representation.</p>
        <p>Atalaya (ATA) [12]: 1-dimensional feature that counts - for each user - the number of
hateful tweets predicted by a linear-kernel SVM trained on a text representation composed
of bag-of-words, bag-of-characters and tweet embeddings, computed from fastText word
vectors. We have taken inspiration from the system proposed by the Atalaya team that
obtained the best result at the HatEval Spanish sub-task.</p>
        <p>Fermi (FER) [13]: a 1-dimensional feature that counts - for each user - the number of
hateful tweets predicted by SVM with the RBF kernel trained on tweet embeddings from
Universal Sentence Encoder. We based the system on that proposed by the Fermi team that
obtained the best result at the HatEval English sub-task.</p>
        <p>Furthermore, the growing interest on this topic leads the research community (and not only)
to develop some lexica of hateful words such as HurtLex [14], NoSwearing1, and The Racial
Slur Database2. HurtLex is a lexicon of offensive, aggressive, and hateful words in over 50
1https://www.noswearing.com/
2http://www.rsdb.org/full
languages (including English and Spanish). The words are divided into 17 different categories.
Then, NoSwearing is a list of English swear words, bad words, and curse words. The Spanish
translation was made by Pamungkas et. al [15]. Finally, the Racial Slur Database is a list of
words that could be used against someone - of a specific race, sexuality, gender etc. - divided into
more then 150 categories. The list is only available in English, we thus computed the Spanish
translation using Babelnet’s API [16]. We also take advantage of spaCy3 models en_core_web_lg,
and es_core_news_lg for expanding the three lexica. Indeed, we used the tok2vec embedding
representation for including in the three lists the 10 most similar tokens of each word in order to
also capture inflection forms and synonyms. We have thus proposed the following features:
−
−</p>
        <p>HurtLex (HL): a 18-dimensional feature that evaluates the number of hateful words used
by each user, the mean of hateful words in each tweet, and the standard deviation. We
exploited the following 6 categories: negative stereotypes ethnic slurs, moral and behavioral
defects, words related to prostitution, words related to homosexuality, words related to
the seven deadly sins of the Christian tradition, felonies and words related to crime and
immoral behavior (we exclusively considered the conservative level).</p>
        <p>No Swearing (NoS): a 3-dimensional feature that evaluates the number of swearing words
used by each user, the mean of swearing words in each tweet, and the standard deviation.
The Racial Slur Database (RSdb): a 27-dimensional feature that evaluates the number of
swearing words used by each user, the mean of swearing words in each tweet, and the
standard deviation for each of the following 9 categories: Asians, Arabs, Black people,
Chinese, Hispanics, Jews, Mexicans, Mixed Races, Muslims.</p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Moral Values Detection</title>
        <p>
          According to many scholars, morality is a pluralistic rather than an universal concept.
Several configuration of values are possible, and some of them are in conflict, such as autonomy
versus community [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ], or conservation versus openness to change [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]. The Moral
Foundation Theory (MFT) [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] shares this approach since it distinguishes vfie dyads leading to people
morality: care/harm, fairness/cheating, which relies on individualization, and loyalty/betrayal,
authority/subversion and purity/degradation, which are binding foundations. Some of these
combinations may correlate with specific political positions, as emerges from experimental results
[17]: liberals seem to agree on individualization values, whereas conservatives could be more
likely to follow binding dyads.
        </p>
        <p>
          In building our model, we considered binding moral dyads as a potential feature characterizing
a HS spreader. More specifically, we claimed that users who rely on loyalty/betrayal and
authority/subversion might be inclined to post hatred contents online. Hence, we referred to
two existing resources: the extended Moral Foundations Dictionary (eMFD) [18], and the Moral
Foundations Twitter Corpus (MFTC) [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ].
        </p>
        <p>The eMFD is a dictionary of terms categorized by a specific moral foundation. We chose all
those related to loyalty/betrayal and authority/subversion moral concerns, and translated them in
Spanish with the BabelNet’s API. Finally, we expanded the words list using the same methodology
explained in Section 2.1. The result is the following feature:
−
extended Moral Foundations Dictionary (eMFD): a 12-dimensional feature that comprises
the mean, the standard deviation, and the total amount of terms occurring in her/his tweets
for the four categories loyalty/betrayal and authority/subversion.</p>
        <p>The MFTC is a collection of 35, 000 tweets annotated for their moral domains, and organized
in 7 subcorpora, each focusing on a specific discourse domain (eg: the Black Lives Matters, and
#metoo movements, and the US 2016 presidential elections). Using transfer learning as a label
assignment method, we converted the original multi-label annotation schema in a binary-label one:
9, 000 texts annotated as loyalty, betrayal, authority or subversion were considered as potentially
correlated with HS (true), while the other not (false). Using the resulting corpora as training set,
we thus proposed the following feature.</p>
        <p>−</p>
        <p>Moral Foundations Twitter Corpus (MFTC): a 1-dimensional feature that counts - for each
user - the number of hateful tweets predicted by a linear SVM trained using a text 1-3grams
bag-of-words representation.</p>
      </sec>
      <sec id="sec-2-3">
        <title>2.3. Named Entity Recognition of HS target</title>
        <p>In a message, the mention of a person belonging to a group vulnerable to discrimination might be
seen as a signal of hatred contents, since the clear presence of a target in this kind of expressions
allows discriminating between what is HS and what is not. Thereby, we implemented a feature
aimed at detecting the presence of a potential HS target within a tweet.</p>
        <p>We first collected all the entities of type PERSON in the whole training set detected by the
transition-based named entity recognition component of spaCy. Then, we searched the retrieved
entities on Wikipedia through the Opensearch API4. The example below shows the Wikipedia
pages returned by the Opensearch API when the entity Kamala is requested.</p>
        <p>[‘Kamala’,‘Kamala Harris’,‘Kamal Haasan’,
‘Kamala (wrestler)’,‘Kamala Khan’,‘Kamala Surayya’,
‘Kamala Harris 2020 presidential campaign’,
‘Kamaladevi Chattopadhyay’,‘Kamala Mills fire’,
‘Kamalani Dung’]</p>
        <p>However, this operation is revealed to be not accurate. In fact, it does not return a unique result
for each entity detected by spaCy, but a set of 10 potential candidates. Therefore, we decided
to create two lists - one for each language - of HS targets including only persons that belong to
categories that could be subject to discrimination.</p>
        <p>With the aim of detecting the relevant categories, we scraped the category box from the
Wikipedia pages of all entities of type PEOPLE detected by spaCy (3, 996 English, and 5, 089
Spanish). The result is a list of Wikipedia’s categories per language, which needed to be filtered
to avoid not relevant results.</p>
        <p>4https://www.mediawiki.org/wiki/API:Opensearch</p>
        <p>The Figure 1 shows a partial selection of Kamala Harris category box, which contains several
references to unnecessary information, such as ‘1964 births’, or ‘Writers from Oakland,
California’, but also usefully ones, such us ‘African-American candidates for President of the United
States’ or ‘Women vice presidents’.</p>
        <p>After a manual analysis of the two lists, we thus narrowed them by a regex filtering, in order
to obtain only a set of relevant categories: 279 for English, and 415 for Spanish. Finally, we
collected all the individuals who are their members. As final result, we obtained two gazetteers
of potential HS targets (7, 5890 entities for English, and 31, 235 for Spanish) in the following
format.</p>
        <p>{Margaret Skirving Gibb : Scottish feminists,</p>
        <p>Melih Abdulhayog˘lu : Turkish emigrants to the USA,</p>
        <p>James Adomian : LGBT people from Nebraska [...]}
We thus proposed a feature that counts the mentions towards persons belonging to a group
vulnerable to discrimination.</p>
        <p>−</p>
        <p>Named Entity Recognition of HS target (NER): a 5-dimensional feature expressing the
total number of potential HS targets mentioned in her/his tweets, the mean, the standard
deviation, and the ratio between the number of HS target, and all the HS targets mentioned
by the user.</p>
      </sec>
      <sec id="sec-2-4">
        <title>2.4. Communicative Behavior</title>
        <p>Under the label ‘Communicative Behavior’ a set of features related to the structure of the tweet
and to the user’s style has been grouped. The total number, the mean, and the standard deviation
have been computed for each feature over all users feeds.</p>
        <p>−
−</p>
        <p>Uppercase Words (UpW): this feature refers to the amount of words starting with a capital
letter and the number of words containing at least two uppercase characters.</p>
        <p>Punctuation Marks (PM): a 6-dimensional feature that includes the frequency of
exclamation marks, question marks, periods, commas, semicolons, and finally the sum of all the
punctuation marks mentioned before.</p>
        <p>Length (Len): 3 different features were considered to build a vector: number of words,
number of characters, and the average of the length of the words in each tweet.
Communicative Styles (CoSty): a 3-dimensional feature that computes the fraction of
retweets, of replies, and of original tweets over all user’s feed.</p>
        <p>Emoji Profile (EPro): this feature tries to distinguish some user’s traits from the emoji
her/his used. We implemented a one-hot encoding representation of the modifiers used
in the emoji ZWJ sequences (e.g. man: medium skin tone, beard) that includes the 5
different skin tone modifiers and the gender modifiers, in addition to the religious emojis
(e.g. Christian Cross) and the national flags.</p>
        <sec id="sec-2-4-1">
          <title>We finally employed bag-of-words models as feature:</title>
        </sec>
        <sec id="sec-2-4-2">
          <title>Bag of Words (BoW): binary 1-3grams of all user’s tweets. Bag of Emojis (BoE): binary 1-2grams of all user’s tweets only including emojis.</title>
        </sec>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Experiments and Results</title>
      <p>The organizers provided a train dataset of 400 Twitter’s feeds - 200 written in English and 200 in
Spanish - binary labelled with HS Spreader. The distribution is perfectly balanced among the true
and false labels. In order to assess the performance of the participating systems, a test set of 200
unlabelled Twitter’s feeds - 100 for each language - was provided. The accuracy was used for
evaluating the performance in terms of HS Spreader identification for each language. The final
ranking will be the average of the accuracy values per language.</p>
      <sec id="sec-3-1">
        <title>3.1. HaMor experiments</title>
        <p>In our experiments, we addressed HS Spreader identification employing a 5-fold validation over
the train set with the aim of maximizing the predictive accuracy. The code is available on GitHub
for further exploration and for allowing reproducibility of our experiments5. We carried out
several experiments by combining all the features introduced in Section 2 for training a linear
Support Vector Machine (SVM). Table 1 shows some of the performances we obtained in this
experimental setting for each languages. The values for the accuracy are the average over three
5-fold validations using different random shufefls of the train set. The runs we submitted have
been highlighted in bold print.</p>
        <p>The first row contains the higher result that we obtained in the experimental setting for the
two languages. Despite this, we have chosen to submit a Spanish run that reached a lower result.
We experimented in other shared tasks that the results obtained in the experimental setting is
often much lower than the ofcfiial ones [ 19]. It was precisely for this reason that we chose
features - that reached good results, though not the best - based on external sources such as
other annotated corpora (FER, ATA, HatEval), lexica (HL, eMFD, NoS), and semi-structured
contents (NER). Indeed, if, on one hand, the submitted model for English does not reached the
5https://github.com/mirkolai/PAN2021_HaMor
higher level of accuracy reached in the experimental setting, on the other hand, the submitted
model for Spanish reached a better result than experimental ones. The omission of BoW and BoE
features adversely affects the experimental runs for Spanish, but not for English. However, our
choice of not include the features BoW and BoE in the submitted run for English involves the
creation of a little feature-space representation of each instance (vectors length of 52 variables).
This representation could indeed include too few variables for effectively detecting a so complex
phenomena such as HS. Then, we did not investigated cross-cultural differences among English
and Spanish speaking countries although it is well known that historical and political factors are
relevant on the orientation of moral values [20]. Maybe also for that reason, the features based on
the moral values we investigated performs better in Spanish then in English tweets. However, the
introduction of the external-sources-based features positively affects the runs for both languages.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Official results</title>
        <p>Our models obtained 84% (2nd higher result) and 62% (43th higher result) in terms of accuracy
on HS Spreader identification respectively for Spanish and English. The final score, used in
determining the the final ranking, is the averaged accuracy values per language which corresponds
to 73% (19th position - over 66 participating teams).</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Conclusions</title>
      <p>In this paper we presented an overview of the HAMOR submission for the Profiling Hate Speech
Spreaders on Twitter task at PAN-2021. We participated by submitting one run for both tweets
written in English and Spanish. Our approach, chiefly based on external resources such as other
annotated corpora, lexica, and semi-structured content, proved to be highly successful concerning
the task of HS Spreader identification in both languages. The results show that the use of external
resources preserves stable values of accuracy between the experimental setting and the prevision
of the test set on Spanish sub-task. The proposed lexica gave a considerable contribution for
obtaining these results and the use of named entity recognition for detection potential target of HS
looks promising. In the future, we plan to employ the features discarded from the submitted run
for a prediction on the test set. We also aim to explore a finer grained approach to MFT detection
features, considering different combination of moral values, and analyzing how moral attitudes
may vary across different countries. Finally, the Named Entity Recognition feature needs to
be improved through testing different NER tools, and referring to other semantic resources, in
addition to Wikipedia.
[12] J. M. Pérez, F. M. Luque, Atalaya at SemEval 2019 task 5: Robust embeddings for tweet
classification, in: Proceedings of the 13th International Workshop on Semantic Evaluation,
Association for Computational Linguistics, 2019, pp. 64–69.
[13] V. Indurthi, B. Syed, M. Shrivastava, N. Chakravartula, M. Gupta, V. Varma, FERMI
at SemEval-2019 task 5: Using sentence embeddings to identify hate speech against
immigrants and women in Twitter, in: Proceedings of the 13th International Workshop on
Semantic Evaluation, Association for Computational Linguistics, 2019, pp. 70–74.
[14] E. Bassignana, V. Basile, V. Patti, Hurtlex: A multilingual lexicon of words to hurt, in: 5th
Italian Conference on Computational Linguistics, CLiC-it 2018, volume 2253, CEUR-WS,
2018, pp. 1–6.
[15] E. W. Pamungkas, A. T. Cignarella, V. Basile, V. Patti, et al., 14-exlab@ unito for ami at
ibereval2018: Exploiting lexical knowledge for detecting misogyny in english and spanish
tweets, in: 3rd Workshop on Evaluation of Human Language Technologies for Iberian
Languages, IberEval 2018, volume 2150, CEUR-WS, 2018, pp. 234–241.
[16] R. Navigli, S. P. Ponzetto, Babelnet: The automatic construction, evaluation and application
of a wide-coverage multilingual semantic network, Artificial intelligence 193 (2012)
217–250.
[17] J. Graham, J. Haidt, B. A. Nosek, Liberals and conservatives rely on different sets of moral
foundations., Journal of personality and social psychology 96 (2009) 1029.
[18] F. R. Hopp, J. T. Fisher, D. Cornell, R. Huskey, R. Weber, The extended moral
foundations dictionary (eMFD): Development and applications of a crowd-sourced approach to
extracting moral intuitions from text, Behavior Research Methods 53 (2021) 232–246.
[19] M. Lai, A. T. Cignarella, D. I. Hernández Farías, iTACOS at IberEval2017: Detecting stance
in Catalan and Spanish tweets, in: Proceedings of the Second Workshop on Evaluation
of Human Language Technologies for Iberian Languages (IberEval 2017) co-located with
33th Conference of the Spanish Society for Natural Language Processing (SEPLN 2017),
volume 1881 of CEUR Workshop Proceedings, CEUR-WS.org, 2017, pp. 185–192.
[20] L. Myyry, K. Helkama, M. Silfver-Kuhalampi, K. Petkova, J. P. Valentim, K. Liik,
Explorations in reported moral behaviors, values, and moral emotions in four countries, Frontiers
in Psychology 12 (2021) 1468.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>J.</given-names>
            <surname>Bevendorff</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Chulvi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. L. D. L. P.</given-names>
            <surname>Sarracén</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kestemont</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Manjavacas</surname>
          </string-name>
          , I. Markov,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mayerl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Potthast</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Rangel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Rosso</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Stamatatos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Stein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Wiegmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Wolska</surname>
          </string-name>
          , , E. Zangerle, Overview of PAN 2021:
          <article-title>Authorship Verification,Profiling Hate Speech Spreaders on Twitter,and Style Change Detection</article-title>
          ,
          <source>in: 12th International Conference of the CLEF Association (CLEF</source>
          <year>2021</year>
          ), Springer,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>F.</given-names>
            <surname>Rangel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. L. D. L. P.</given-names>
            <surname>Sarracén</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Chulvi</surname>
          </string-name>
          , E. Fersini,
          <string-name>
            <given-names>P.</given-names>
            <surname>Rosso</surname>
          </string-name>
          ,
          <source>Profiling Hate Speech Spreaders on Twitter Task at PAN</source>
          <year>2021</year>
          ,
          <article-title>in: CLEF 2021 Labs and Workshops, Notebook Papers, CEUR-WS</article-title>
          .org,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>M.</given-names>
            <surname>Potthast</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Gollub</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Wiegmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Stein</surname>
          </string-name>
          , TIRA Integrated Research Architecture, in: N.
          <string-name>
            <surname>Ferro</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          Peters (Eds.),
          <source>Information Retrieval Evaluation in a Changing World, The Information Retrieval Series</source>
          , Springer, Berlin Heidelberg New York,
          <year>2019</year>
          . doi:
          <volume>10</volume>
          .1007/ 978-3-
          <fpage>030</fpage>
          -22948-1\_5.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Waseem</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Hovy</surname>
          </string-name>
          ,
          <article-title>Hateful symbols or hateful people? predictive features for hate speech detection on Twitter</article-title>
          ,
          <source>in: Proceedings of the NAACL Student Research Workshop</source>
          , Association for Computational Linguistics,
          <year>2016</year>
          , pp.
          <fpage>88</fpage>
          --
          <lpage>93</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>P.</given-names>
            <surname>Mishra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Del Tredici</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Yannakoudakis</surname>
          </string-name>
          , E. Shutova,
          <article-title>Author profiling for abuse detection</article-title>
          ,
          <source>in: Proceedings of the 27th international conference on computational linguistics</source>
          ,
          <year>2018</year>
          , pp.
          <fpage>1088</fpage>
          -
          <lpage>1098</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>J.</given-names>
            <surname>Hoover</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Portillo-Wightman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Yeh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Havaldar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. M.</given-names>
            <surname>Davani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Kennedy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Atari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Kamel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mendlen</surname>
          </string-name>
          , et al.,
          <article-title>Moral Foundations Twitter Corpus: A collection of 35k tweets annotated for moral sentiment</article-title>
          ,
          <source>Social Psychological and Personality Science</source>
          <volume>11</volume>
          (
          <year>2020</year>
          )
          <fpage>1057</fpage>
          -
          <lpage>1071</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>R. A.</given-names>
            <surname>Shweder</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. C.</given-names>
            <surname>Much</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mahapatra</surname>
          </string-name>
          ,
          <string-name>
            <surname>L. Park,</surname>
          </string-name>
          <article-title>The" big three" of morality (autonomy, community, divinity) and the" big three" explanations of suffering</article-title>
          .,
          <source>Morality and Health</source>
          (
          <year>1997</year>
          )
          <fpage>119</fpage>
          -
          <lpage>169</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>S. H.</given-names>
            <surname>Schwartz</surname>
          </string-name>
          ,
          <article-title>An overview of the schwartz theory of basic values</article-title>
          ,
          <source>Online readings in Psychology and Culture</source>
          <volume>2</volume>
          (
          <year>2012</year>
          )
          <fpage>2307</fpage>
          -
          <lpage>0919</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>J.</given-names>
            <surname>Haidt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Joseph</surname>
          </string-name>
          , et al.,
          <article-title>The moral mind: How vfie sets of innate intuitions guide the development of many culture-specific virtues, and perhaps even modules</article-title>
          ,
          <source>The innate mind 3</source>
          (
          <year>2007</year>
          )
          <fpage>367</fpage>
          -
          <lpage>391</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>J.</given-names>
            <surname>Hoover</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Atari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. M.</given-names>
            <surname>Davani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Kennedy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Portillo-Wightman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Yeh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Kogon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dehghani</surname>
          </string-name>
          ,
          <article-title>Bound in hatred: The role of group-based morality in acts of hate</article-title>
          ,
          <source>PsyArXiv</source>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>V.</given-names>
            <surname>Basile</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Bosco</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Fersini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Nozza</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Patti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F. M.</given-names>
            <surname>Rangel Pardo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Rosso</surname>
          </string-name>
          , M. Sanguinetti, SemEval
          <article-title>-2019 Task 5: Multilingual Detection of Hate Speech Against Immigrants and Women in Twitter</article-title>
          ,
          <source>in: Proceedings of the 13th International Workshop on Semantic Evaluation, Association for Computational Linguistics</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>54</fpage>
          -
          <lpage>63</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>