<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Source-driven Representations for Hate Speech Detection</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Flavio Merenda</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Claudia Zaghi</string-name>
          <email>c.zaghi@student.rug.nl</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tommaso Caselli</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Malvina Nissim</string-name>
          <email>m.nissim@rug.nl</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Rikjuniversiteit Groningen</institution>
          ,
          <addr-line>Groningen</addr-line>
          ,
          <institution>The Netherlands Universita` degli Studi di Salerno</institution>
          ,
          <addr-line>Salerno</addr-line>
          ,
          <country>Italy f.merenda</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>English. Sources, in the form of selected Facebook pages, can be used as indicators of hate-rich content. Polarized distributed representations created over such content prove superior to generic embeddings in the task of hate speech detection. The same content seems to carry a too weak signal to proxy silver labels in a distant supervised setting. However, this signal is stronger than gold labels which come from a different distribution, leading to re-think the process of annotation in the context of highly subjective judgments.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>
        Hate speech is “the use of aggressive, hatred or
offensive language, targeting a specific group of
people sharing a common trait: their gender,
ethnic group, race, religion, sexual orientation, or
disability” (Merriam-Webster’s collegiate dictionary,
1999). The phenomenon is widely spread on-line,
and Italian Social Media is definitely not an
exception
        <xref ref-type="bibr" rid="ref7">(Gagliardone et al., 2015)</xref>
        . To monitor the
problem, social networks and websites have
introduced a stricter code of conduct and regularly
remove hateful content flagged by users
        <xref ref-type="bibr" rid="ref3">(Bleich,
2014)</xref>
        . However, the volume of data requires that
ways are found to classify on-line content
automatically
        <xref ref-type="bibr" rid="ref13 ref9">(Nobata et al., 2016; Kennedy et al.,
2017)</xref>
        .
      </p>
      <p>
        The Italian NLP community is active on this
front
        <xref ref-type="bibr" rid="ref16 ref5">(Poletto et al., 2017; Del Vigna et al., 2017)</xref>
        ,
with the development of labeled data, including
the organization of a dedicated shared task at the
EVALITA 2018 campaign1. Relying on manually
labeled data has limitations, though: i.)
annotation is time and resource consuming; ii.)
portability to new domains is scarce2; iii.) biases are
unavoidable in annotated data, especially in the form
of annotation decisions. This is both due to the
intrinsic subjectivity of the task itself, and to the
fact that there is not, as yet, a shared set of
definitions and guidelines across the different projects
that yield annotated datasets.
      </p>
      <p>
        Introduced as a new take on data annotation
        <xref ref-type="bibr" rid="ref12 ref8">(Mintz et al., 2009; Go et al., 2009)</xref>
        , distant
supervision is used to automatically assign (silver)
labels based on the presence or absence of
specific hints, such as happy/sad emoticons
        <xref ref-type="bibr" rid="ref8">(Go et al.,
2009)</xref>
        to proxy positive/negative labels for
sentiment analysis, Facebook reactions
        <xref ref-type="bibr" rid="ref1 ref18">(Pool and
Nissim, 2016; Basile et al., 2017)</xref>
        for emotion
detection, or specific strings to assign gender
        <xref ref-type="bibr" rid="ref6">(Emmery
et al., 2017)</xref>
        . Such an approach has the
advantage of being more scalable (portability to
different languages or domains) and versatile (time and
resources needed to train), than pure supervised
learning algorithms, while preserving competitive
performance. Apart from the ease of generating
labeled data, distant supervision has a valuable
ecological aspect in not relying on third-party
annotators to interpret the data (Purver and Battersby,
1http://www.di.unito.it/˜tutreeb/
haspeede-evalita18/index.html
      </p>
      <p>2The EVALITA 2018 haspeede task addresses this
issue by setting the task in a cross-genre fashion.
2012). This reduces the risk of adding extra bias
(see also point (iii) about limitation in the previous
paragraph), modulo the choices related to which
proxies should be considered.</p>
    </sec>
    <sec id="sec-2">
      <title>Novelty and Contribution We promote a spe</title>
      <p>cial take on distant supervision where we use as
proxies the sources where the content is published
on-line rather than any hint in the content itself.
Through a battery of experiments on hate speech
detection in Italian we show that this approach
yields meaningful representations and an increase
in performance over the use of generic
representations. Contextually, we show the limitations of
silver labels, but also of gold labels that come from a
different dataset with respect to the evaluation set.
2</p>
    </sec>
    <sec id="sec-3">
      <title>Source-driven Representations</title>
      <p>
        Our approach is based on previous studies on
on-line communities showing that communities
tend to reinforce themselves, enhancing “filter
bubbles” effects, decreasing diversity, distorting
information, and polarizing socio-political
opinions
        <xref ref-type="bibr" rid="ref14 ref17 ref2 ref20 ref4 ref7">(Pariser, 2011; Bozdag and van den Hoven,
2015; Seargeant and Tagg, 2018)</xref>
        . Each
community in the social media sphere thus represents a
somewhat different source of data. Our hypothesis
is that the contents generated by each community
(source) can thus be used as proxies for
specialized information or even labeled data.
      </p>
      <p>Building on this principle, we scraped data from
social media communities on Facebook, acquiring
what we call source-driven representations. The
data is indeed used in two ways in the context
of Hate Speech detection, namely: i.) to
generate (potentially) polarized word embeddings to be
used in a variety of models, comparing it to more
standard generic embeddings (Section 3); and ii.)
as training data for a supervised machine learning
classifier, combining and comparing it with
manually labeled data (Section 4).
3</p>
    </sec>
    <sec id="sec-4">
      <title>Polarized Embeddings</title>
      <p>Polarized embeddings are representations built on
a corpus which is not randomly representative of
the Italian language, rather collected with a
specific bias. In this context, we use data scraped
from Facebook pages (communities) in order to
create hate-rich embeddings.</p>
      <p>Data acquisition We selected a set of publicly
available Facebook pages that may promote or be
the target of hate speech, such as pages known for
promoting nationalism (Italia Patria Mia),
controversies (Dagospia, La Zanzara - Radio 24), hate
against migrants and other minorities (La
Fabbrica Del Degrado, Il Redpillatore, Cloroformio),
support for women and LGBT rights (NON UNA
DI MENO, LGBT News Italia). Using the
Facebook API, we downloaded the comments to posts
as they are the text portions most likely to express
hate, collecting a total of over 1M comments for
almost 13M tokens (Table 1).</p>
      <p>Page Name
Matteo Salvini
NON UNA DI MENO
LGBT News Italia
Italia Patria Mia
Dagospia
La Fabbrica Del Degrado
Boom. Friendzoned.</p>
      <p>
        Cloroformio
Il Redpillatore
Sesso Droga e Pastorizia
PSDM
Cara, sei femminista - Returned
Se solo avrei studiato
La Zanzara - Radio 24
Total
Making Embeddings We built distributed
representations over the acquired data. The
embeddings have been generated with the word2vec 3
skip-gram model
        <xref ref-type="bibr" rid="ref11">(Mikolov et al., 2013)</xref>
        using 300
dimensions, a context window of 5, and
minimum frequency 1. The final vocabulary amounts
to 381,697 words.
      </p>
      <p>
        These hate-rich embeddings are used in
models for hate speech detection. For comparison,
we also use larger, generic embeddings that were
trained on the Italian Wikipedia (more than 300M
tokens)4 using GloVe
        <xref ref-type="bibr" rid="ref2">(Berardi et al., 2015)</xref>
        5; the
vocabulary amounts to 730,613 words. As a
sanity check, and a sort of qualitative intrinsic
evaluation, we probed our embeddings with a few
keywords, reporting in Table 2 the top three nearest
neighbors for the words “immigrati” [migrants]
3https://radimrehurek.com/gensim/
;https://github.com/RaRe-Technologies/
gensim
      </p>
      <p>4http://hlt.isti.cnr.it/
wordembeddings/</p>
      <p>5https://nlp.stanford.edu/projects/
glove/
and “trans”. For the former, it is interesting to see
how the polarized embeddings return more
hateleaning words compared to the generic
embeddings. For the latter, in addition to hateful epithets,
we also see how these embeddings capture the
correct semantic field, while the generic ones do not.
Classification To test the contribution of our
embeddings, we used them in two different
classifiers, comparing them to alternative distributed
representations.</p>
      <p>
        First, we built a Convolutional Neural
Network (CNN), using the implementation of
        <xref ref-type="bibr" rid="ref10">(Kim,
2014)</xref>
        . This is a simple architecture with one
convolutional layer built on top of a word
embeddings layer (hyperparameters: Number of
filters: 6; Filter sizes: 3, 5, 8;
Strides: 1; Activation function:
Rectifier). We experimented with three different
activation strategies for the CNN model: i.)
random initialization, by generating word
embeddings from the training data itself, i.e.
“on-thefly”; ii.) pre-trained 300 dimension general word
embeddings; iii.) our own polarised embeddings.
      </p>
      <p>
        Second, and for further comparison, we also
built a simple Linear Support Vector Machine
(SVM), using the LinearSVC scikit learn
implementation
        <xref ref-type="bibr" rid="ref15">(Pedregosa et al., 2011)</xref>
        . In one setting,
we used only information coming from the two
different sets of pre-trained embeddings (GloVe
generic vs our polarized ones) to observe their
contribution alone, in the same fashion as the
CNN. To use these word vectors in the SVM
model, we mapped the content words in each
sentence and we replaced them with the
corresponding word embeddings values; afterwards, we
computed the average value for each word embedding,
in order to achieve a unique one-dimensional
sentence vector with each word replaced with the
corresponding embedding average. In further
settings, we combined this information with a more
standard n-gram-based tf-idf model. Specifically,
we use 1-3 word and 2-4 character n-grams, with
default parameter values for the SVM.
      </p>
      <p>We train and test our models using the
manually labelled data provided in the context of
the EVALITA 2018 task on Hate Speech
Detection (haspeede) 6. The released
training/development set comprises 3000 Facebook
comments and 3000 tweets. The proportion of
hateful content in this dataset is 39%, with 46%
in the Facebook portion, and 32% in Twitter. We
train on 80% of haspeede (4800 instances), and
test on the remaining 20%. We report precision,
recall, and F-score per class, averaged over ten
random train/test splits. To assess general
performance, we use macro F-score rather than micro
F-score as the classifier’s accuracy on the
minority class is particularly important. This is also
reported as the average of the ten different runs.
Results The results in Table 3 show that despite
our embeddings being almost 25 times smaller
than the generic ones, they yield a substantially
better performance both in the CNN model and
in the SVM classifier. In the former, they are
also more informative than the representations
obtained on-the-fly from the training data. In the
latter, the contribution of embeddings in general
appears though rather marginal on top of a more
standard SVM model based on n-gram tf-idf
information, and the difference according to which
representation is used is not significant. Finally,
it is interesting to note that the polarized
embeddings cover 55% of the tokens in the training data
(vs. only 45% of the generic ones, in spite of the
substantial size difference between the two.
4</p>
    </sec>
    <sec id="sec-5">
      <title>Silver labels</title>
      <p>In a more standard distantly supervised setting,
modulo proxing labels via sources rather than
specific keywords/emojis, we also used the scraped
text as training data directly. Because we
approximate labels with sources, and we had collected
data from supposedly hate-rich pages, for the
current experimental settings we balanced the data by
6http://www.di.unito.it/˜tutreeb/
haspeede-evalita18/index.html
F MACRO F
.749
.760
.786
.728
.750
.806
.807
.802
evaluation, we use the same settings as the
experiments in Section 3, by picking a random test set
out of the haspeede dataset ten times, and
reporting averaged results.</p>
      <p>Results From Table 4 we can make the
following observations: (i) training on silver labels lets
us detect hate speech better than a
most-frequentlabel baseline (macro F=:383); (ii) however, in
this context, training on small amounts of gold
data is substantially more accurate than training on
large amounts of distantly supervised data (:807
vs :464); (iii) adding even small amounts of
silver data to gold decreases performance (:792 vs
:807)8; (iv) also adding more gold data decreases
performance, even more so than adding an equal
amount of silver data, if the manually labeled data
comes from a different dataset (thus created with
different guidelines, and in this case with a
different hate/non-hate distribution). Performance goes
up as expected when adding more data from the
same dataset (:814 vs :807).
5</p>
    </sec>
    <sec id="sec-6">
      <title>Conclusions</title>
      <p>We exploited distant supervision to automatically
obtain representations from Facebook-scraped
content in two forms. First, we generated
polarized, hate-rich distributed representations which
proved superior to larger, generic embeddings
when used both in a CNN and an SVM model
for hate speech detection. Second, we used the
scraped data as training material directly, proxing
scraping Facebook comments from an Italian news
agency (i.e. ANSA), assuming it conveys neutral
content rather than polarized.</p>
      <p>As for the distribution of labels, we followed
the proportion of the Facebook portion of the
haspeede dataset (46% of hateful content, and
the rest non-polarized). We proxy labels according
to sources, and under the above presumed
proportions, we selected a total of 100,000 comments.</p>
      <p>
        For comparison, and in combination, we also
used gold data. In addition to the previously
mentioned 6000 instances from the haspeede task,
we used the Turin dataset, a collection of
990 manually labelled tweets concerning the topic
of immigration, religion and Roma7
        <xref ref-type="bibr" rid="ref16 ref17">(Poletto et al.,
2017; Poletto et al., 2018)</xref>
        . The distribution of
labels in this dataset differs from the EVALITA
dataset, with only 160 (16%) hateful instances.
      </p>
      <p>We trained an SVM classifier with the best
settings as observed in Section 3 (tf-idf and and
polarised embeddings) using different training sets,
combining gold and silver data (see Table 4). For
7The Romani, Romany, or Roma are an ethnic group of
traditionally itinerant people who originated in northern India
and are nowadays subject to ethnic discrimination.
8We also experimented with adding progressively larger
batches of silver data to gold (2K, 3K, 5K, etc.), but this
yielded a steady decrease in performance.
labels (hate vs non-hate) with the sources where
the data was coming from (Facebook pages). This
did not prove as a successful alternative nor
complementary strategy to using gold data, though
performance above baseline indicates some signal is
present. Importantly, though, our experiments also
suggest that gold data is not better than silver data
if it comes from a different dataset. This highlights
a crucial aspect related to the creation of manually
labeled datasets, especially in the highly
subjective area of hate speech and affective computing
in general, where different guidelines and
different annotators clearly introduce large biases and
discrepancies across datasets.</p>
      <p>All considered, we believe that obtaining data
in a distant, more ecological way should be further
pursued and refined. How to better exploit the
information that comes from polarized embeddings
in combination with other features is also left to
future work.</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments</title>
      <p>The authors want to thank the EVALITA 2018
Hate Speech Detection (HaSpeeDe) task
organizers for allowing us to use their datasets.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <given-names>Angelo</given-names>
            <surname>Basile</surname>
          </string-name>
          , Tommaso Caselli, and
          <string-name>
            <given-names>Malvina</given-names>
            <surname>Nissim</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Predicting Controversial News Using Facebook Reactions</article-title>
          .
          <source>In Proceedings of the Fourth Italian Conference on Computational Linguistics</source>
          (CLiC-it
          <year>2017</year>
          ), Rome, Italy.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>Giacomo</given-names>
            <surname>Berardi</surname>
          </string-name>
          , Andrea Esuli, and Diego Marcheggiani.
          <year>2015</year>
          .
          <article-title>Word embeddings go to italy: A comparison of models and training datasets</article-title>
          .
          <source>In IIR.</source>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <given-names>Erik</given-names>
            <surname>Bleich</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Freedom of expression versus racist hate speech: Explaining differences between high court regulations in the usa and europe</article-title>
          .
          <source>Journal of Ethnic and Migration Studies</source>
          ,
          <volume>40</volume>
          (
          <issue>2</issue>
          ):
          <fpage>283</fpage>
          -
          <lpage>300</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <given-names>Engin</given-names>
            <surname>Bozdag</surname>
          </string-name>
          and Jeroen van den Hoven.
          <year>2015</year>
          .
          <article-title>Breaking the filter bubble: democracy and design</article-title>
          .
          <source>Ethics and Information Technology</source>
          ,
          <volume>17</volume>
          (
          <issue>4</issue>
          ):
          <fpage>249</fpage>
          -
          <lpage>265</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Fabio Del Vigna</surname>
            ,
            <given-names>Andrea</given-names>
          </string-name>
          <string-name>
            <surname>Cimino</surname>
            , Felice Dell'Orletta,
            <given-names>Marinella</given-names>
          </string-name>
          <string-name>
            <surname>Petrocchi</surname>
            , and
            <given-names>Maurizio</given-names>
          </string-name>
          <string-name>
            <surname>Tesconi</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Hate me, hate me not: Hate speech detection on facebook</article-title>
          .
          <source>In Proceedings of the First Italian Conference on Cybersecurity (ITASEC17)</source>
          , Venice, Italy, January
          <volume>17</volume>
          -
          <issue>20</issue>
          ,
          <year>2017</year>
          , pages
          <fpage>86</fpage>
          -
          <lpage>95</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <given-names>Chris</given-names>
            <surname>Emmery</surname>
          </string-name>
          , Grzegorz Chrupała, and
          <string-name>
            <given-names>Walter</given-names>
            <surname>Daelemans</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Simple queries as distant labels for predicting gender on twitter</article-title>
          .
          <source>In Proceedings of the 3rd Workshop on Noisy User-generated Text</source>
          , pages
          <fpage>50</fpage>
          -
          <lpage>55</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <given-names>Iginio</given-names>
            <surname>Gagliardone</surname>
          </string-name>
          , Danit Gal, Thiago Alves, and
          <string-name>
            <given-names>Gabriela</given-names>
            <surname>Martinez</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Countering online hate speech</article-title>
          .
          <source>Unesco Publishing.</source>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <given-names>Alec</given-names>
            <surname>Go</surname>
          </string-name>
          , Richa Bhayani, and
          <string-name>
            <given-names>Lei</given-names>
            <surname>Huang</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Twitter sentiment classification using distant supervision</article-title>
          .
          <source>CS224N Project Report</source>
          , Stanford,
          <volume>1</volume>
          (
          <issue>12</issue>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <given-names>George</given-names>
            <surname>Kennedy</surname>
          </string-name>
          ,
          <string-name>
            <surname>Andrew McCollough</surname>
            ,
            <given-names>Edward</given-names>
          </string-name>
          <string-name>
            <surname>Dixon</surname>
            , Alexei Bastidas, John Ryan, Chris Loo, and
            <given-names>Saurav</given-names>
          </string-name>
          <string-name>
            <surname>Sahay</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Technology solutions to combat online harassment</article-title>
          .
          <source>In Proceedings of the First Workshop on Abusive Language Online</source>
          , pages
          <fpage>73</fpage>
          -
          <lpage>77</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <given-names>Yoon</given-names>
            <surname>Kim</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Convolutional neural networks for sentence classification</article-title>
          .
          <source>arXiv preprint arXiv:1408</source>
          .
          <fpage>5882</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <given-names>Tomas</given-names>
            <surname>Mikolov</surname>
          </string-name>
          , Kai Chen, Greg Corrado, and
          <string-name>
            <given-names>Jeffrey</given-names>
            <surname>Dean</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Efficient estimation of word representations in vector space</article-title>
          .
          <source>arXiv preprint arXiv:1301</source>
          .
          <fpage>3781</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <given-names>Mike</given-names>
            <surname>Mintz</surname>
          </string-name>
          , Steven Bills, Rion Snow, and
          <string-name>
            <given-names>Dan</given-names>
            <surname>Jurafsky</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Distant supervision for relation extraction without labeled data</article-title>
          .
          <source>In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2-</source>
          Volume
          <volume>2</volume>
          , pages
          <fpage>1003</fpage>
          -
          <lpage>1011</lpage>
          . Association for Computational Linguistics.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <given-names>Chikashi</given-names>
            <surname>Nobata</surname>
          </string-name>
          , Joel Tetreault, Achint Thomas,
          <string-name>
            <given-names>Yashar</given-names>
            <surname>Mehdad</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Yi</given-names>
            <surname>Chang</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Abusive language detection in online user content</article-title>
          .
          <source>In Proceedings of the 25th International Conference on World Wide Web</source>
          , pages
          <fpage>145</fpage>
          -
          <lpage>153</lpage>
          . International World Wide Web Conferences Steering Committee.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <given-names>Eli</given-names>
            <surname>Pariser</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>The filter bubble: What the Internet is hiding from you</article-title>
          .
          <source>Penguin UK.</source>
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <given-names>F.</given-names>
            <surname>Pedregosa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Varoquaux</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gramfort</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Michel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Thirion</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Grisel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Blondel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Prettenhofer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Weiss</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Dubourg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Vanderplas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Passos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Cournapeau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Brucher</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Perrot</surname>
          </string-name>
          , and
          <string-name>
            <given-names>E.</given-names>
            <surname>Duchesnay</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>Scikit-learn: Machine learning in Python</article-title>
          .
          <source>Journal of Machine Learning Research</source>
          ,
          <volume>12</volume>
          :
          <fpage>2825</fpage>
          -
          <lpage>2830</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <given-names>Fabio</given-names>
            <surname>Poletto</surname>
          </string-name>
          , Marco Stranisci, Manuela Sanguinetti, Viviana Patti, and
          <string-name>
            <given-names>Cristina</given-names>
            <surname>Bosco</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Hate speech annotation: Analysis of an italian twitter corpus</article-title>
          .
          <source>In CEUR WORKSHOP PROCEEDINGS</source>
          , volume
          <year>2006</year>
          , pages
          <fpage>1</fpage>
          -
          <lpage>6</lpage>
          . CEUR-WS.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <given-names>Fabio</given-names>
            <surname>Poletto</surname>
          </string-name>
          , Cristina Bosco, Viviana Patti, and
          <string-name>
            <given-names>Marco</given-names>
            <surname>Stranisci</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>An italian twitter corpus of hate speech against immigrants</article-title>
          .
          <source>In Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC</source>
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <given-names>Chris</given-names>
            <surname>Pool</surname>
          </string-name>
          and
          <string-name>
            <given-names>Malvina</given-names>
            <surname>Nissim</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Distant supervision for emotion detection using facebook reactions</article-title>
          .
          <source>In Proceedings of the Workshop on Computational Modeling of People's Opinions, Personality, and Emotions in Social Media (PEOPLES)</source>
          , pages
          <fpage>30</fpage>
          -
          <lpage>39</lpage>
          , Osaka, Japan,
          <string-name>
            <surname>December. COLING</surname>
          </string-name>
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <given-names>Matthew</given-names>
            <surname>Purver</surname>
          </string-name>
          and
          <string-name>
            <given-names>Stuart</given-names>
            <surname>Battersby</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Experimenting with distant supervision for emotion classification</article-title>
          .
          <source>In Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics</source>
          , pages
          <fpage>482</fpage>
          -
          <lpage>491</lpage>
          . Association for Computational Linguistics.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <given-names>Philip</given-names>
            <surname>Seargeant</surname>
          </string-name>
          and
          <string-name>
            <given-names>Caroline</given-names>
            <surname>Tagg</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Social media and the future of open debate: A user-oriented approach to Facebook's filter bubble conundrum</article-title>
          . Discourse, Context &amp; Media.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>