<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Towards an Italian Lexicon for Polarity Classification (polarITA): a Comparative Analysis of Lexical Resources for Sentiment Analysis</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Cristina Bosco</string-name>
          <email>boscog@di.unito.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>PRHLT Research Center Dipartimento di Dipartimento di Informatica Universitat Polite`cnica de Vale`ncia Studi Umanistici Universita` di Torino</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>English. The paper describes a preliminary study for the development of a novel lexicon for Italian sentiment analysis, i.e. where words are associated with polarity values. Given the influence of sentiment lexica on the performance of sentiment analysis systems, a methodology based on the detection and classification of errors in existing lexical resources is proposed and an extrinsic evaluation of the impact of such errors is applied. The final aim is to build a novel resource from the filtering applied to the existing lexical resources, which can integrate them with missing lexical entries and more reliable associations of polarity with entries.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Italiano. L’articolo descrive uno studio
preliminare per lo sviluppo di una nuova
risorsa lessicale per la sentiment analysis
in italiano, i.e. dove alle parole sono
associati valori di polarita`. Data l’influenza
dei lessici di sentiment sulle performance
dei sistemi di sentiment analysis, viene
proposta una metodologia basata sulla
rilevazione e classificazione degli errori
presenti nei lessici attualmente disponibili ed
una valutazione estrinseca dell’impatto di
tali errori sui sistemi. L’obiettivo finale
e` ottenere un nuovo lessico grazie ad un
filtraggio applicato alle risorse lessicali
disponibili, e a un’integrazione con le voci
lessicali mancanti, ottenendo una
maggiore affidabilita` nell’associazione delle
polarita` alle voci.
piece of text
        <xref ref-type="bibr" rid="ref14">(Mohammad, 2016)</xref>
        , is currently
among the most widely investigated topics within
NLP. Overall, the approaches for addressing such
task are mainly based on techniques ranging from
traditional machine learning to novel deep
learning ones, as it can be seen also in the context of
shared tasks on sentiment polarity classification in
Twitter recently proposed, respectively for English
        <xref ref-type="bibr" rid="ref15">(Nakov et al., 2016)</xref>
        and Italian
        <xref ref-type="bibr" rid="ref3">(Barbieri et al.,
2016)</xref>
        , within the SemEval and Evalita periodical
evaluation campaigns. Moreover, the detection of
specific words associated with polarity values or
emotions has been considered as a powerful
information source for identifying the sentiment
behind a text. Among the resources which are more
commonly exploited by SA systems for
performing their task there are therefore sentiment lexica,
i.e., lists of words with associated polarity values
or emotions.
      </p>
      <p>
        Several techniques have been applied for the
development of lexical resources for SA: they can be
built from scratch, manually or automatically, or
extracted from corpora
        <xref ref-type="bibr" rid="ref18 ref6">(Nissim and Patti, 2017)</xref>
        .
Nevertheless, the vast majority of these resources
are written in English, and a lack of resources
currently features several other languages. One of
the most commonly applied alternatives for
having resources in language other than English is
to automatically translate some available English
lexicon via tools such as Google translate1. But
there are many constraints involved in this kind
of process, such as handling synonyms and
polysemous words, multi-word expressions, but also
to deal with cultural differences between source
and target language. Apart from this, possible
variations of polarity across different contexts and
languages should be carefully taken into account,
while such approaches rely somehow on the
assumption that affective norms related to sentiment
are stable across languages.
      </p>
      <p>In this paper we are interested into evaluate the
reliability of the lexical resources currently
available for Italian SA and, providing that the most of
them are obtained by translation, we will mainly
focus on the reliability of automatically translating
English resources to Italian language. For doing
so, we carried out a methodology involving
different facets. Our final aim is to develop a new SA
resource for Italian, which comprises pre-existing
translated lexical entries enriched with the
manual correction of the polarity assigned, as resulting
from our analysis, but also includes entries which
are featured by a polarity but are missing in the
available lexica.</p>
      <p>
        The paper is organized as follows. In the
next section, we describe our methodology which
mainly consists in three steps: the selection of a
sample of tweets from an Italian sentiment
corpus and exploited as part of the gold standard in
the Sentipolc@Evalita2016 shared task
        <xref ref-type="bibr" rid="ref22 ref3">(Stranisci
et al., 2016; Barbieri et al., 2016)</xref>
        ; automatic
extraction of the lexical entries polarized according
to a set of benchmark sentiment lexica for Italian;
the analysis of these entries and the comparison
with those expected by a human judge. Section
three shows instead an extrinsic evaluation of the
impact of the detected errors on the results of the
SA system. Some hints about future development
of this research are given in the conclusion.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Our Methodology</title>
      <p>
        Given the relevance of affective lexica in SA and
related tasks, our major aims in the current
research are to detect the limits of the currently
available lexical resources for Italian and to
explore the possibility to develop a novel resource
by correcting and extending them. In this paper
we focus in particular on the detection of the
deficiencies of existing resources and on their
motivations. Our methodology consists therefore in:
(i) selecting of a sample of tweets from an
Italian sentiment corpus featured by political contents
        <xref ref-type="bibr" rid="ref22">(Stranisci et al., 2016)</xref>
        and exploited as part of
the gold standard in the Sentipolc@Evalita2016
shared task
        <xref ref-type="bibr" rid="ref3">(Barbieri et al., 2016)</xref>
        , with sentiment
polarity annotation at the tweet level; (ii)
automatically extracting the lexical entries polarized
according to a set of benchmark sentiment lexica for
Italian and (iii) manually checking the results for
each expected lexical entry in the context of the
whole tweet (i.e. if the polarity of the entry is that
expected by a human annotator or also if there are
other entries in the tweet that should appear as
polarized but are not in the lexicons).
      </p>
      <p>
        We take as starting point the SA lexica
exploited by
        <xref ref-type="bibr" rid="ref11">(Herna´ndez Far´ıas et al., 2014)</xref>
        in the
IRADABE system at Evalita2014’s SENTIPOLC
        <xref ref-type="bibr" rid="ref5">(Basile et al., 2014)</xref>
        . The same resources where
used also in the upgraded system that participated
at the same task in Evalita2016
        <xref ref-type="bibr" rid="ref12 ref22 ref8">(Buscaldi and
Herna´ndez Far´ıas, 2016)</xref>
        .
      </p>
      <p>
        In those works the lexicon AFINN,
        <xref ref-type="bibr" rid="ref17">(Nielsen,
2011)</xref>
        , the one developed by Hu and Liu
(henceforth HaL)
        <xref ref-type="bibr" rid="ref13">(Hu and Liu, 2004)</xref>
        , and
SentiWordNet (SWN)
        <xref ref-type="bibr" rid="ref2">(Baccianella et al., 2010)</xref>
        were indeed
automatically translated to Italian, to exploit
obtained information as features in their supervised
system, but no specific evaluation or refining of
them was performed. In the present paper we
extend our selection by considering, beyond these
three, a further resource, i.e. Sentix
        <xref ref-type="bibr" rid="ref4">(Basile and
Nissim, 2013)</xref>
        (see Sec. 2.1) which has been
developed following a semantics oriented strategy
(see Sec. 2.1). Henceforth, we will use the
expression benchmark lexica) for referring to the four
resources. As reference corpus, we considered,
instead, TwBuonaScuola
        <xref ref-type="bibr" rid="ref22">(Stranisci et al., 2016)</xref>
        , an
Italian dataset manually annotated for sentiment
polarity and irony, focused on the on-line debate
regarding a controversial Italian political reform,
which is part of the gold standard provided for
the Sentipolc shared task
        <xref ref-type="bibr" rid="ref3">(Barbieri et al., 2016)</xref>
        at
Evalita 2016
        <xref ref-type="bibr" rid="ref6">(Basile et al., 2017)</xref>
        .
      </p>
      <p>Our methodology, whose results are shown in
Sec. 2.2, includes the steps described below.
Given a random selection of 500 tweets from
TwBuonaScuola (henceforth ItalianTweets)
including 2,706 different words, we manually
evaluated the coverage of the benchmark lexica for
the words included in these tweets. In particular,
for each tweet we extracted automatically all the
words which are included in each of the
benchmark lexica and its associated polarity.</p>
      <p>Then, for each tweets belonging to ItalianTweets,
we manually checked the obtained lists of words,
considered in the context of the tweet, with a
twofold objective:
(i) To deduce which words in the benchmark
lexica have a wrong polarity associated;
(ii) To identify those words that express certain
polarity in the corpus but are not included in
the benchmark lexica.
2.1</p>
      <sec id="sec-2-1">
        <title>Sentiment Analysis Resources</title>
        <p>In this section we describe the benchmark lexica.</p>
        <p>
          AFINN
          <xref ref-type="bibr" rid="ref17">(Nielsen, 2011)</xref>
          is an English lexicon
composed of 2,477 words and 15 multi-word
expressions. Each entry is associated with a score
which varies from -5 to +5 in order to respectively
introduce negative and positive polarity. The
starting point for the development of this resource is
a list of obscene words and some positive words;
then the lexicon has been extended with words
from a corpus of tweets and other lists of words
from Urban Dictionary2 for representing entries
typical of Internet language (e.g. “WTF” and
“LOL”). After the manual annotation of the
entries the lexicon has been evaluated based on a
corpus of tweets manually annotated for SA.
        </p>
        <p>
          HaL,
          <xref ref-type="bibr" rid="ref13">(Hu and Liu, 2004)</xref>
          , has been built within
a project for developing methods to deal with
opinions expressed in reviews about various kinds
of goods. A group of 30 adjectives featured by a
single and stable polarity and manually annotated
has been expanded by including the words which
in WordNet’s synsets are synonyms or antonyms
of these seeds, providing that synonyms are
featured by the same polarity and antonyms by the
opposite one. The lexicon currently includes 6,800
entries classified as positive or negative.
        </p>
        <p>
          SentiWordNet 3.0
          <xref ref-type="bibr" rid="ref2">(Baccianella et al., 2010)</xref>
          is
among the larger and more used resources
exploited for SA. The main goal of the
SentiWordNet project is the fully automated annotation of
the polarity of the WordNet’s synsets using scores
that vary from 0.0 to 1.0 to each of the three
basic polarity values (positive, negative, neutral) in
order to obtain 1 as the sum of them. By contrast
with the other resources, SentiWordNet takes into
account different possible senses for each word.
        </p>
        <p>
          As far as Italian is concerned, only a few
resources exist, such as Sentix
          <xref ref-type="bibr" rid="ref4">(Basile and Nissim,
2013)</xref>
          and SABRINA
          <xref ref-type="bibr" rid="ref7">(Borz`ı et al., 2015)</xref>
          .
Sentix is the result of the alignment of four
semantic database, namely WordNet
          <xref ref-type="bibr" rid="ref10">(Fellbaum, 1998)</xref>
          ,
SentiWordNet, MultiWordNet
          <xref ref-type="bibr" rid="ref20">(Pianta et al., 2002)</xref>
          and Babelnet
          <xref ref-type="bibr" rid="ref16">(Navigli and Ponzetto, 2012)</xref>
          . The
methodology consists in transferring to the Italian
section of WordNet the information about polarity
encoded in the English SentiWordNet’s synsets,
thus aligning Italian and English synsets.
The development of SABRINA instead is based
on the application of a prior polarity method on
2http://www.urbandictionary.com
two sets of Italian words, the first composed of
277,000 entries with associated inflexion.
However the lexicon is not publicly available.
Finally let us mention ItEM
          <xref ref-type="bibr" rid="ref19">(Passaro et al., 2015)</xref>
          ,
an Italian emotive lexicon which aims at offering
information about affect expressed in text
according to finer levels of granularity, i.e. referring
not simply to positive or negative sentiment
polarity but to emotional categories. In ItEM each
word is tagged with an emotional label from the
height basic emotions of the Plutchik’s
psychological model
          <xref ref-type="bibr" rid="ref21">(Plutchik, 1980)</xref>
          .
        </p>
        <p>
          Several scholars are devoting their efforts to the
development of resources for other languages, by
applying translation or other methodologies. Let
us cite e.g. FEEL
          <xref ref-type="bibr" rid="ref1">(Abdaoui et al., 2017)</xref>
          , a French
lexicon where words are associated with polarity
and emotions obtained thanks to the application of
translation tools to NRC-EmoLEx3 and a manual
validation of results.
2.2
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Qualitative Analysis of Benchmark Lexica</title>
        <p>In order to detect the coverage and correctness of
each benchmark lexicon, we selected from our
reference sample corpus the list of words that
according to a human judge are featured by some
affective value in the context of the tweet where they
appear. Then, for each entry of this list and for
each benchmark lexicon, we observed if the word
is represented in the resource and featured by the
same polarity.</p>
        <p>Given the preliminary nature of this investigation
only a couple of researchers have been involved
in the task. Moreover, a further limit of our
current research approach depends on the reference
to a given context (that determined by our sample
corpus); issues related to the context will be
accounted for in future investigations.</p>
        <p>We observed different coverages of the
benchmark lexica on our Twitter corpus, first of all in
terms of numbers of affective words occurring in
the tweets for each lexicon. The full vocabulary of
the tweets is composed of 2,706 different words.
Only some of these words are featured by some
affective value, and focusing on them only we
observed the following occurrences: 160 words in
AFINN, 190 words in HaL, 302 words in SWN
and 551 in Sentix. These word sets are partially
overlapped, since 69 words are included in all the
3http://www.saifmohammad.com/WebPages/
lexicons.html</p>
        <sec id="sec-2-2-1">
          <title>Resource</title>
        </sec>
        <sec id="sec-2-2-2">
          <title>AFINN</title>
          <p>HaL
SWN
Sentix</p>
          <p>The total amount of words missing or with an
attributed erroneous polarity in the benchmark
lexica is 388. As far as the erroneous polarization
concerns, as summarized in Table 1, these words
are featured by four different kinds of errors: (i) a
positive word is annotated as negative; (ii) a
negative word is annotated as positive; (iii) a neutral4
word is annotated as positive; and (iv) a neutral
word is annotated as negative. The values are
expressed in percentage with respect to the coverage
of the lexica. As far as the distribution of errors
in the four classes, they are for all lexica
prevailingly distributed in the last two classes, i.e. iii and
iv, laying foundation for the hypothesis that in the
automatic transition between English and Italian
several non (clearly) polarized Italian words were
instead polarized.</p>
          <p>Nevertheless, observing Table 1, we can see
also that all the lexica are featured by very
similar amounts of errors, regardless of the
methodology applied for their development (i.e. translation
or extraction from semantic databases). Several
errors, in particular for what concerns the
polarity associated to specific words, can be generated
during translation, and a portion of them is
therefore motivated by the application of translation
tools mainly because they do not consider context
where each word occurs. But observing the results
extracted from Sentix, which is not obtained
simply by translation, and weighting the larger
coverage that features this resource, we can see that
errors occurs in a percentage that positively
compares with that of the other resources. In this case
the problem probably depends on misalignment
of synsets for different languages. For example,
the Italian word “istituto”, whose meaning can be
4We considered neutral a word which is featured by a
polarity which may vary across contexts, indicated by None in
Table 2.
“school” or “institution”, is aligned with “prison”
and “house/prison”, with a negative polarity which
is not appropriate for the Italian word.</p>
          <p>Several errors could be probably avoided in
the transition among languages by applying a
pre-processing including Part of Speech tagging
and considering the grammatical category of the
source and target terms. See for instance, the
word tagliando (cutting) that occurs in the
corpus as a Verb and in the benchmark lexica is
instead aligned with the corresponding noun with
the meaning of voucher/coupon. This motivates
our decision about the attribution of PoS tags to
the words in the first nucleus of a novel resource
obtained by extending and correcting the existing
ones. The overall impression is that, a manual
check, even is a very time-consuming task, is
always necessary and unavoidable, both when the
new lexicon is obtained by translation, and when
it is obtained relying on synset alignment.
3</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Lost in Translation: Impact of the</title>
    </sec>
    <sec id="sec-4">
      <title>Errors</title>
      <p>The methodology even if applied on a small set of
tweets and based on a manual check of the
benchmark lexica, confirms the hypothesis that many
directions can be followed to improve the quality of
existing lexical resources. The first result of this
preliminary analysis is the collection of a list of
words with associated polarity which will be the
nucleus of the novel resource, i.e. polarITA. Each
of the words in polarITA has been annotated with
an overall polarity value (i.e., positive, negative,
or none), and its corresponding Part-Of-Speech
(POS) label. Table 2 summarizes the distribution
of the words in polarITA in terms of polarity and
POS labels.</p>
      <p>
        Experiments on a larger corpus and a
quantitative analysis based on a more formal
classification of errors is needed for the development of a
fully developed reliable lexical resource, together
with an in-depth investigation of the relevance of
context in the attribution of polarity, which is a
very important issue. A comparison of the
results that a given SA engine exploiting features
extracted from sentiment lexica, for instance
IRADABE
        <xref ref-type="bibr" rid="ref11 ref12 ref22 ref8">(Herna´ndez Far´ıas et al., 2014; Buscaldi and
Herna´ndez Far´ıas, 2016)</xref>
        , obtains using each of the
benchmark lexica and using polarITA is planned
as future work for the evaluation of the novel
lexicon, which is not currently suitable because the
limited size of our reference corpus and the
consequent partial coverage of errors.
      </p>
      <p>
        Considering the current preliminary stage of
development of polarITA, we tried an extrinsic
evaluation for detecting the impact on the performance
of SA systems of the errors currently featuring the
benchmark lexica and corrected in the novel
lexicon. We compared the words which are
missing or assigned to erroneous polarity in the
benchmark lexica with the Italian words more
commonly used and understood by native speakers,
whose collection is available in the Vocabolario
di base della lingua italiana (vocItalian)5 recently
newly released. Like the first version of this
resource, published in 1980,
        <xref ref-type="bibr" rid="ref9">(De Mauro, 1980)</xref>
        , it
includes three word classes: 2,999 High Usage
words (HU), 2,231 High Availability words (HA)
and 1,979 Foundational words (FO).
      </p>
      <p>In polarITA we collected until now 284 words of
the vocItalian, whose distribution across the three
classes is shown in Table 2. Among the words in
the FO category we found “bene” (good),
“mentire” (lie), and “giustizia” (justice). While words
like “assassino” (killer), “preoccupato” (worried),
and “entusiasta” (enthusiastic) are part of the HU
category. Finally, in the HA category it is
possible to find words such as “dannoso” (harmful) and
“emozionante” (exciting).</p>
      <p>This analysis suggests some hints for further
investigation, showing that the failures of lexica
currently available for Italian SA affect words very
commonly used in communication and therefore
the improvement of these resources may hopefully
result in an advancement for SA and related tasks.
4</p>
    </sec>
    <sec id="sec-5">
      <title>Conclusions and Future Work</title>
      <p>In this paper we propose the preliminary
investigation about a methodology for the development of
a novel lexical resource for Italian SA, namely
polarITA, which takes advantage of the analysis and
filtering of errors occurring in the available
lexical resources. We carried out a manual analysis
of a set of tweets for determining the reliability of
sentiment-related lexica, showing that, even if the
transfer of lexical information between two
different languages is a common practice to address the
lack of resources, information related to sentiment
is lost during it. The identified errors are then
ex5https://www.internazionale.it/
opinione/tullio-de-mauro/2016/12/23/ilnuovo-vocabolario-di-base-della-linguaitaliana
ploited as a starting point for developing the novel
resource.</p>
      <p>
        As future work, we are planning to extend the
resource in several directions: by investigating
multi-word expressions, extending the coverage to
a larger corpus, exploring the impact of figurative
language devices such as irony and sarcasm in the
use of certain polarized words
        <xref ref-type="bibr" rid="ref12 ref22 ref8">(Herna´ndez Far´ıas
et al., 2016)</xref>
        . Moreover, our future effort will be
oriented to the automatization of a larger part of
the methodology and its application to other
languages currently under resourced.
      </p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgements</title>
      <p>
        C. Bosco and V. Patti were partially funded by
Progetto di Ateneo/CSP 2016 (Immigrants, Hate and
Prejudice in Social Media, S1618 L2 BOSC 01)
and by Fondazione CRT
        <xref ref-type="bibr" rid="ref8">(Hate Speech and Social
Media, 2016.0688)</xref>
        .
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <given-names>Amine</given-names>
            <surname>Abdaoui</surname>
          </string-name>
          , Je´roˆme Aze´,
          <string-name>
            <given-names>Sandra</given-names>
            <surname>Bringay</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Pascal</given-names>
            <surname>Poncelet</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>FEEL: a French Expanded Emotion Lexicon</article-title>
          .
          <source>Language Resources and Evaluation</source>
          ,
          <volume>51</volume>
          :
          <fpage>833</fpage>
          -
          <lpage>855</lpage>
          , September.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>Stefano</given-names>
            <surname>Baccianella</surname>
          </string-name>
          , Andrea Esuli, and
          <string-name>
            <given-names>Fabrizio</given-names>
            <surname>Sebastiani</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining</article-title>
          .
          <source>In Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)</source>
          , pages
          <fpage>2200</fpage>
          -
          <lpage>2204</lpage>
          , Valletta,
          <string-name>
            <given-names>Malta. European</given-names>
            <surname>Language Resources Association (ELRA).</surname>
          </string-name>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <given-names>Francesco</given-names>
            <surname>Barbieri</surname>
          </string-name>
          , Valerio Basile, Danilo Croce, Malvina Nissim, Nicole Novielli, and
          <string-name>
            <given-names>Viviana</given-names>
            <surname>Patti</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Overview of the EVALITA 2016 SENTiment POLarity Classification Task</article-title>
          . In Basile, Cutugno, Nissim, Patti, and Sprugnoli, editors,
          <source>Proceedings of Third Italian Conference on Computational Linguistics</source>
          (CLiC-it
          <year>2016</year>
          ) &amp;
          <article-title>Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian</article-title>
          .
          <source>Final Workshop (EVALITA</source>
          <year>2016</year>
          ).
          <source>CEUR Workshop Proceedings.</source>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <given-names>Valerio</given-names>
            <surname>Basile</surname>
          </string-name>
          and
          <string-name>
            <given-names>Malvina</given-names>
            <surname>Nissim</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Sentiment Analysis on Italian Tweets</article-title>
          .
          <source>In Proceedings of the 4th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis</source>
          , pages
          <fpage>100</fpage>
          -
          <lpage>107</lpage>
          , Atlanta, USA. Association for Computational Linguistics.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <given-names>Valerio</given-names>
            <surname>Basile</surname>
          </string-name>
          , Andrea Bolioli, Malvina Nissim, Viviana Patti, and
          <string-name>
            <given-names>Paolo</given-names>
            <surname>Rosso</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Overview of the Evalita 2014 SENTIment POLarity Classification Task</article-title>
          .
          <source>In Proceedings of the 4th evaluation campaign of Natural Language Processing and Speech tools for Italian (EVALITA</source>
          <year>2014</year>
          ), Pisa, Italy.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <given-names>Pierpaolo</given-names>
            <surname>Basile</surname>
          </string-name>
          , Francesco Cutugno, Malvina Nissim, Viviana Patti, and
          <string-name>
            <given-names>Rachele</given-names>
            <surname>Sprugnoli</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Evalita goes social: Tasks, data, and community at the 2016 edition</article-title>
          . IJCoL - Italian
          <source>Journal of Computational Linguistics</source>
          ,
          <volume>3</volume>
          (
          <issue>1</issue>
          ):
          <fpage>93</fpage>
          -
          <lpage>127</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <given-names>Valeria</given-names>
            <surname>Borz</surname>
          </string-name>
          `ı, Simone Faro, Arianna Pavone, and
          <string-name>
            <given-names>Sabrina</given-names>
            <surname>Sansone</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Prior Polarity Lexical Resources for the Italian Language</article-title>
          . CoRR, abs/1507.00133.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <given-names>Davide</given-names>
            <surname>Buscaldi</surname>
          </string-name>
          and Delia Irazu´ Herna´ndez Far´ıas.
          <year>2016</year>
          .
          <article-title>IRADABE2: Lexicon Merging and Positional Features for Sentiment Analysis in Italian</article-title>
          .
          <source>In Proceedings of the 5th Evaluation Campaign of Natural Language Processing and Speech Tools for Italian (EVALITA</source>
          <year>2016</year>
          ). aAcademia University Press.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Tullio De Mauro</surname>
          </string-name>
          .
          <year>1980</year>
          .
          <article-title>Guida all'uso delle parole Num. 3 dei Libri di base</article-title>
          .
          <source>Editori Riuniti</source>
          , Roma.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <given-names>Christiane</given-names>
            <surname>Fellbaum</surname>
          </string-name>
          .
          <year>1998</year>
          .
          <article-title>WordNet: An Electronic Lexical Database</article-title>
          . Bradford Books.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <given-names>Delia</given-names>
            <surname>Irazu</surname>
          </string-name>
          ´
          <article-title>Herna´ndez Far´ıas, Davide Buscaldi, and Bele´m Priego-Sa´nchez</article-title>
          .
          <year>2014</year>
          .
          <article-title>IRADABE: Adapting English Lexicons to the Italian Sentiment Polarity Classification task</article-title>
          .
          <source>In First Italian Conference on Computational Linguistics</source>
          (CLiC-it
          <year>2014</year>
          )
          <article-title>and the fourth</article-title>
          <source>International Workshop EVALITA</source>
          <year>2014</year>
          , pages
          <fpage>75</fpage>
          -
          <lpage>81</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <given-names>Delia</given-names>
            <surname>Irazu</surname>
          </string-name>
          ´
          <article-title>Herna´ndez Far´ıas, Viviana Patti</article-title>
          , and
          <string-name>
            <given-names>Paolo</given-names>
            <surname>Rosso</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Irony Detection in Twitter: The Role of Affective Content</article-title>
          .
          <source>ACM Trans. Internet Technol</source>
          .,
          <volume>16</volume>
          (
          <issue>3</issue>
          ):
          <volume>19</volume>
          :
          <fpage>1</fpage>
          -
          <lpage>19</lpage>
          :
          <fpage>24</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <given-names>Minqing</given-names>
            <surname>Hu</surname>
          </string-name>
          and
          <string-name>
            <given-names>Bing</given-names>
            <surname>Liu</surname>
          </string-name>
          .
          <year>2004</year>
          .
          <article-title>Mining and summarizing customer reviews</article-title>
          .
          <source>In Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '04</source>
          , pages
          <fpage>168</fpage>
          -
          <lpage>177</lpage>
          , New York, NY, USA. ACM.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <surname>Saif</surname>
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Mohammad</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Sentiment Analysis: Detecting Valence, Emotions, and Other Affectual States from Text</article-title>
          . In Herb Meiselman, editor,
          <source>Emotion Measurement. Elsevier.</source>
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <given-names>Preslav</given-names>
            <surname>Nakov</surname>
          </string-name>
          , Alan Ritter, Sara Rosenthal, Fabrizio Sebastiani, and
          <string-name>
            <given-names>Veselin</given-names>
            <surname>Stoyanov</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>SemEval2016 Task 4: Sentiment Analysis in Twitter</article-title>
          .
          <source>In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016)</source>
          , pages
          <fpage>1</fpage>
          -
          <lpage>18</lpage>
          , San Diego, California.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <given-names>Roberto</given-names>
            <surname>Navigli</surname>
          </string-name>
          and Simone Paolo Ponzetto.
          <year>2012</year>
          .
          <article-title>BabelNet: The Automatic Construction, Evaluation and Application of a Wide-Coverage Multilingual Semantic Network</article-title>
          .
          <source>Artificial Intelligence</source>
          ,
          <volume>193</volume>
          :
          <fpage>217</fpage>
          -
          <lpage>250</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <surname>Finn</surname>
            <given-names>A</given-names>
          </string-name>
          ˚rup Nielsen.
          <year>2011</year>
          .
          <article-title>A new ANEW: evaluation of a word list for sentiment analysis in microblogs</article-title>
          .
          <source>In Proceedings of the ESWC2011 Workshop on 'Making Sense of Microposts': Big things come in small packages</source>
          , volume
          <volume>718</volume>
          <source>of CEUR Workshop Proceedings</source>
          , pages
          <fpage>93</fpage>
          -
          <lpage>98</lpage>
          , Heraklion, Crete, Greece.
          <source>CEUR-WS.org.</source>
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <given-names>Malvina</given-names>
            <surname>Nissim</surname>
          </string-name>
          and
          <string-name>
            <given-names>Viviana</given-names>
            <surname>Patti</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Semantic aspects in sentiment analysis</article-title>
          .
          <source>In Federico Alberto Pozzi</source>
          , Elisabetta Fersini, Enza Messina, and Bing Liu, editors,
          <source>Sentiment Analysis in Social Networks</source>
          , pages
          <fpage>31</fpage>
          -
          <lpage>48</lpage>
          . Morgan Kaufmann, Boston.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <given-names>Lucia</given-names>
            <surname>Passaro</surname>
          </string-name>
          , Laura Pollacci, and
          <string-name>
            <given-names>Alessandro</given-names>
            <surname>Lenci</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>ItEM: A Vector Space Model to Bootstrap an Italian Emotive Lexicon</article-title>
          . volume II.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <given-names>E.</given-names>
            <surname>Pianta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Bentivogli</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Girardi</surname>
          </string-name>
          .
          <year>2002</year>
          .
          <article-title>MultiWordNet: Developing an Aligned Multilingual Database</article-title>
          .
          <source>In Proceedings of International Conference on Global WordNet.</source>
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <given-names>Robert</given-names>
            <surname>Plutchik</surname>
          </string-name>
          .
          <year>1980</year>
          .
          <article-title>A general psychoevolutionary theory of emotion</article-title>
          . In R. Plutchik and H. Kellerman, editors,
          <source>Emotion: Theory</source>
          , research, and experience: Vol.
          <volume>1</volume>
          . Theories of emotion, pages
          <fpage>3</fpage>
          -
          <lpage>33</lpage>
          . Academic press, New York.
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <string-name>
            <given-names>Marco</given-names>
            <surname>Stranisci</surname>
          </string-name>
          , Cristina Bosco, Delia Irazu´
          <article-title>Herna´ndez Far´ıas, and</article-title>
          <string-name>
            <given-names>Viviana</given-names>
            <surname>Patti</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Annotating Sentiment and Irony in the Online Italian Political Debate on #labuonascuola</article-title>
          .
          <source>In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC</source>
          <year>2016</year>
          ).
          <article-title>European Language Resources Association (ELRA).</article-title>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>