<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Towards an Italian Lexicon for Polarity Classification (polarITA): a Comparative Analysis of Lexical Resources for Sentiment Analysis</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Delia</forename><surname>Iraz</surname></persName>
						</author>
						<author>
							<persName><forename type="first">Hernández</forename><surname>Farías</surname></persName>
							<email>dhernandez1@dsic.upv.es</email>
						</author>
						<author>
							<persName><forename type="first">Irene</forename><surname>Laganà</surname></persName>
							<email>irene.lagana01@universitadipavia.it</email>
						</author>
						<author>
							<persName><forename type="first">Viviana</forename><surname>Patti</surname></persName>
							<email>patti@di.unito.it</email>
						</author>
						<author>
							<persName><forename type="first">Cristina</forename><surname>Bosco</surname></persName>
							<email>bosco@di.unito.it</email>
						</author>
						<author>
							<affiliation key="aff0">
								<orgName type="institution">PRHLT Research Center</orgName>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff1">
								<orgName type="institution">Universitat Politècnica de València</orgName>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff2">
								<orgName type="department">Dipartimento di Studi Umanistici</orgName>
								<orgName type="institution">Università di Pavia</orgName>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff3">
								<orgName type="department">Dipartimento di Informatica Università di Torino</orgName>
							</affiliation>
						</author>
						<title level="a" type="main">Towards an Italian Lexicon for Polarity Classification (polarITA): a Comparative Analysis of Lexical Resources for Sentiment Analysis</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">F1BAEF06A083764312C6E9EF2D40373C</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T09:57+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract/>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>English. The paper describes a preliminary study for the development of a novel lexicon for Italian sentiment analysis, i.e. where words are associated with polarity values. Given the influence of sentiment lexica on the performance of sentiment analysis systems, a methodology based on the detection and classification of errors in existing lexical resources is proposed and an extrinsic evaluation of the impact of such errors is applied. The final aim is to build a novel resource from the filtering applied to the existing lexical resources, which can integrate them with missing lexical entries and more reliable associations of polarity with entries.</p><p>Italiano. L'articolo descrive uno studio preliminare per lo sviluppo di una nuova risorsa lessicale per la sentiment analysis in italiano, i.e. dove alle parole sono associati valori di polarità. Data l'influenza dei lessici di sentiment sulle performance dei sistemi di sentiment analysis, viene proposta una metodologia basata sulla rilevazione e classificazione degli errori presenti nei lessici attualmente disponibili ed una valutazione estrinseca dell'impatto di tali errori sui sistemi. L'obiettivo finale è ottenere un nuovo lessico grazie ad un filtraggio applicato alle risorse lessicali disponibili, e a un'integrazione con le voci lessicali mancanti, ottenendo una maggiore affidabilità nell'associazione delle polarità alle voci.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Sentiment Analysis (SA), described as the task of automatically determine the polarity in a given piece of text <ref type="bibr" target="#b13">(Mohammad, 2016)</ref>, is currently among the most widely investigated topics within NLP. Overall, the approaches for addressing such task are mainly based on techniques ranging from traditional machine learning to novel deep learning ones, as it can be seen also in the context of shared tasks on sentiment polarity classification in Twitter recently proposed, respectively for English <ref type="bibr" target="#b14">(Nakov et al., 2016)</ref> and Italian <ref type="bibr" target="#b2">(Barbieri et al., 2016)</ref>, within the SemEval and Evalita periodical evaluation campaigns. Moreover, the detection of specific words associated with polarity values or emotions has been considered as a powerful information source for identifying the sentiment behind a text. Among the resources which are more commonly exploited by SA systems for performing their task there are therefore sentiment lexica, i.e., lists of words with associated polarity values or emotions.</p><p>Several techniques have been applied for the development of lexical resources for SA: they can be built from scratch, manually or automatically, or extracted from corpora <ref type="bibr" target="#b17">(Nissim and Patti, 2017)</ref>. Nevertheless, the vast majority of these resources are written in English, and a lack of resources currently features several other languages. One of the most commonly applied alternatives for having resources in language other than English is to automatically translate some available English lexicon via tools such as Google translate<ref type="foot" target="#foot_0">1</ref> . But there are many constraints involved in this kind of process, such as handling synonyms and polysemous words, multi-word expressions, but also to deal with cultural differences between source and target language. Apart from this, possible variations of polarity across different contexts and languages should be carefully taken into account, while such approaches rely somehow on the assumption that affective norms related to sentiment are stable across languages.</p><p>In this paper we are interested into evaluate the reliability of the lexical resources currently available for Italian SA and, providing that the most of them are obtained by translation, we will mainly focus on the reliability of automatically translating English resources to Italian language. For doing so, we carried out a methodology involving different facets. Our final aim is to develop a new SA resource for Italian, which comprises pre-existing translated lexical entries enriched with the manual correction of the polarity assigned, as resulting from our analysis, but also includes entries which are featured by a polarity but are missing in the available lexica.</p><p>The paper is organized as follows. In the next section, we describe our methodology which mainly consists in three steps: the selection of a sample of tweets from an Italian sentiment corpus and exploited as part of the gold standard in the Sentipolc@Evalita2016 shared task <ref type="bibr" target="#b21">(Stranisci et al., 2016;</ref><ref type="bibr" target="#b2">Barbieri et al., 2016)</ref>; automatic extraction of the lexical entries polarized according to a set of benchmark sentiment lexica for Italian; the analysis of these entries and the comparison with those expected by a human judge. Section three shows instead an extrinsic evaluation of the impact of the detected errors on the results of the SA system. Some hints about future development of this research are given in the conclusion.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Our Methodology</head><p>Given the relevance of affective lexica in SA and related tasks, our major aims in the current research are to detect the limits of the currently available lexical resources for Italian and to explore the possibility to develop a novel resource by correcting and extending them. In this paper we focus in particular on the detection of the deficiencies of existing resources and on their motivations. Our methodology consists therefore in: (i) selecting of a sample of tweets from an Italian sentiment corpus featured by political contents <ref type="bibr" target="#b21">(Stranisci et al., 2016)</ref> and exploited as part of the gold standard in the Sentipolc@Evalita2016 shared task <ref type="bibr" target="#b2">(Barbieri et al., 2016)</ref>, with sentiment polarity annotation at the tweet level; (ii) automatically extracting the lexical entries polarized according to a set of benchmark sentiment lexica for Italian and (iii) manually checking the results for each expected lexical entry in the context of the whole tweet (i.e. if the polarity of the entry is that expected by a human annotator or also if there are other entries in the tweet that should appear as polarized but are not in the lexicons). We take as starting point the SA lexica exploited by <ref type="bibr" target="#b10">(Hernández Farías et al., 2014)</ref> in the IRADABE system at Evalita2014's SENTIPOLC <ref type="bibr" target="#b4">(Basile et al., 2014)</ref>. The same resources where used also in the upgraded system that participated at the same task in Evalita2016 <ref type="bibr" target="#b7">(Buscaldi and Hernández Farías, 2016)</ref>. In those works the lexicon AFINN, <ref type="bibr" target="#b16">(Nielsen, 2011)</ref>, the one developed by Hu and Liu (henceforth HaL) <ref type="bibr" target="#b12">(Hu and Liu, 2004)</ref>, and SentiWord-Net (SWN) <ref type="bibr" target="#b1">(Baccianella et al., 2010)</ref> were indeed automatically translated to Italian, to exploit obtained information as features in their supervised system, but no specific evaluation or refining of them was performed. In the present paper we extend our selection by considering, beyond these three, a further resource, i.e. Sentix <ref type="bibr" target="#b3">(Basile and Nissim, 2013</ref>) (see Sec. 2.1) which has been developed following a semantics oriented strategy (see Sec. 2.1). Henceforth, we will use the expression benchmark lexica) for referring to the four resources. As reference corpus, we considered, instead, TwBuonaScuola <ref type="bibr" target="#b21">(Stranisci et al., 2016)</ref>, an Italian dataset manually annotated for sentiment polarity and irony, focused on the on-line debate regarding a controversial Italian political reform, which is part of the gold standard provided for the Sentipolc shared task <ref type="bibr" target="#b2">(Barbieri et al., 2016)</ref> at Evalita 2016 <ref type="bibr" target="#b5">(Basile et al., 2017)</ref>. Our methodology, whose results are shown in Sec. 2.2, includes the steps described below. Given a random selection of 500 tweets from TwBuonaScuola (henceforth ItalianTweets) including 2,706 different words, we manually evaluated the coverage of the benchmark lexica for the words included in these tweets. In particular, for each tweet we extracted automatically all the words which are included in each of the benchmark lexica and its associated polarity. Then, for each tweets belonging to ItalianTweets, we manually checked the obtained lists of words, considered in the context of the tweet, with a twofold objective: (i) To deduce which words in the benchmark lexica have a wrong polarity associated;</p><p>(ii) To identify those words that express certain polarity in the corpus but are not included in the benchmark lexica.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Sentiment Analysis Resources</head><p>In this section we describe the benchmark lexica. AFINN (Nielsen, 2011) is an English lexicon composed of 2,477 words and 15 multi-word expressions. Each entry is associated with a score which varies from -5 to +5 in order to respectively introduce negative and positive polarity. The starting point for the development of this resource is a list of obscene words and some positive words; then the lexicon has been extended with words from a corpus of tweets and other lists of words from Urban Dictionary<ref type="foot" target="#foot_1">2</ref> for representing entries typical of Internet language (e.g. "WTF" and "LOL"). After the manual annotation of the entries the lexicon has been evaluated based on a corpus of tweets manually annotated for SA.</p><p>HaL, <ref type="bibr" target="#b12">(Hu and Liu, 2004)</ref>, has been built within a project for developing methods to deal with opinions expressed in reviews about various kinds of goods. A group of 30 adjectives featured by a single and stable polarity and manually annotated has been expanded by including the words which in WordNet's synsets are synonyms or antonyms of these seeds, providing that synonyms are featured by the same polarity and antonyms by the opposite one. The lexicon currently includes 6,800 entries classified as positive or negative.</p><p>SentiWordNet 3.0 <ref type="bibr" target="#b1">(Baccianella et al., 2010</ref>) is among the larger and more used resources exploited for SA. The main goal of the SentiWord-Net project is the fully automated annotation of the polarity of the WordNet's synsets using scores that vary from 0.0 to 1.0 to each of the three basic polarity values (positive, negative, neutral) in order to obtain 1 as the sum of them. By contrast with the other resources, SentiWordNet takes into account different possible senses for each word.</p><p>As far as Italian is concerned, only a few resources exist, such as Sentix <ref type="bibr" target="#b3">(Basile and Nissim, 2013)</ref> and SABRINA <ref type="bibr" target="#b6">(Borzì et al., 2015)</ref>. Sentix is the result of the alignment of four semantic database, namely WordNet <ref type="bibr" target="#b9">(Fellbaum, 1998)</ref>, SentiWordNet, MultiWordNet <ref type="bibr" target="#b19">(Pianta et al., 2002)</ref> and Babelnet <ref type="bibr" target="#b15">(Navigli and Ponzetto, 2012)</ref>. The methodology consists in transferring to the Italian section of WordNet the information about polarity encoded in the English SentiWordNet's synsets, thus aligning Italian and English synsets. The development of SABRINA instead is based on the application of a prior polarity method on two sets of Italian words, the first composed of 277,000 entries with associated inflexion. However the lexicon is not publicly available. Finally let us mention ItEM <ref type="bibr" target="#b18">(Passaro et al., 2015)</ref>, an Italian emotive lexicon which aims at offering information about affect expressed in text according to finer levels of granularity, i.e. referring not simply to positive or negative sentiment polarity but to emotional categories. In ItEM each word is tagged with an emotional label from the height basic emotions of the Plutchik's psychological model <ref type="bibr" target="#b20">(Plutchik, 1980)</ref>.</p><p>Several scholars are devoting their efforts to the development of resources for other languages, by applying translation or other methodologies. Let us cite e.g. FEEL <ref type="bibr" target="#b0">(Abdaoui et al., 2017)</ref>, a French lexicon where words are associated with polarity and emotions obtained thanks to the application of translation tools to NRC-EmoLEx<ref type="foot" target="#foot_2">3</ref> and a manual validation of results.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Qualitative Analysis of Benchmark Lexica</head><p>In order to detect the coverage and correctness of each benchmark lexicon, we selected from our reference sample corpus the list of words that according to a human judge are featured by some affective value in the context of the tweet where they appear. Then, for each entry of this list and for each benchmark lexicon, we observed if the word is represented in the resource and featured by the same polarity. Given the preliminary nature of this investigation only a couple of researchers have been involved in the task. Moreover, a further limit of our current research approach depends on the reference to a given context (that determined by our sample corpus); issues related to the context will be accounted for in future investigations.</p><p>We observed different coverages of the benchmark lexica on our Twitter corpus, first of all in terms of numbers of affective words occurring in the tweets for each lexicon. The full vocabulary of the tweets is composed of 2,706 different words. Only some of these words are featured by some affective value, and focusing on them only we observed the following occurrences: 160 words in <ref type="bibr">AFINN,</ref><ref type="bibr">190 words in HaL,</ref><ref type="bibr">302</ref> words in SWN and 551 in Sentix. These word sets are partially overlapped, since 69 words are included in all the lexica.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Resource</head><p>Error (i) (ii) (iii) (iv) AFINN 1.2 2.5 16.8 8.7 HaL 1.5 1.0 12.6 12.6 SWN 5.9 1.6 15.5 13.2 Sentix 5.9 2.1 15.2 16.6</p><p>Table <ref type="table">1</ref>: Distribution of different errors in the benchmark lexica (percentage wrt the coverage of the lexicon).</p><p>The total amount of words missing or with an attributed erroneous polarity in the benchmark lexica is 388. As far as the erroneous polarization concerns, as summarized in Table <ref type="table">1</ref>, these words are featured by four different kinds of errors: (i) a positive word is annotated as negative; (ii) a negative word is annotated as positive; (iii) a neutral<ref type="foot" target="#foot_3">4</ref> word is annotated as positive; and (iv) a neutral word is annotated as negative. The values are expressed in percentage with respect to the coverage of the lexica. As far as the distribution of errors in the four classes, they are for all lexica prevailingly distributed in the last two classes, i.e. iii and iv, laying foundation for the hypothesis that in the automatic transition between English and Italian several non (clearly) polarized Italian words were instead polarized.</p><p>Nevertheless, observing Table <ref type="table">1</ref>, we can see also that all the lexica are featured by very similar amounts of errors, regardless of the methodology applied for their development (i.e. translation or extraction from semantic databases). Several errors, in particular for what concerns the polarity associated to specific words, can be generated during translation, and a portion of them is therefore motivated by the application of translation tools mainly because they do not consider context where each word occurs. But observing the results extracted from Sentix, which is not obtained simply by translation, and weighting the larger coverage that features this resource, we can see that errors occurs in a percentage that positively compares with that of the other resources. In this case the problem probably depends on misalignment of synsets for different languages. For example, the Italian word "istituto", whose meaning can be "school" or "institution", is aligned with "prison" and "house/prison", with a negative polarity which is not appropriate for the Italian word.</p><p>Several errors could be probably avoided in the transition among languages by applying a pre-processing including Part of Speech tagging and considering the grammatical category of the source and target terms. See for instance, the word tagliando (cutting) that occurs in the corpus as a Verb and in the benchmark lexica is instead aligned with the corresponding noun with the meaning of voucher/coupon. This motivates our decision about the attribution of PoS tags to the words in the first nucleus of a novel resource obtained by extending and correcting the existing ones. The overall impression is that, a manual check, even is a very time-consuming task, is always necessary and unavoidable, both when the new lexicon is obtained by translation, and when it is obtained relying on synset alignment.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Lost in Translation: Impact of the Errors</head><p>The methodology even if applied on a small set of tweets and based on a manual check of the benchmark lexica, confirms the hypothesis that many directions can be followed to improve the quality of existing lexical resources. The first result of this preliminary analysis is the collection of a list of words with associated polarity which will be the nucleus of the novel resource, i.e. polarITA. Each of the words in polarITA has been annotated with an overall polarity value (i.e., positive, negative, or none), and its corresponding Part-Of-Speech (POS) label. Table <ref type="table" target="#tab_0">2</ref> summarizes the distribution of the words in polarITA in terms of polarity and POS labels.</p><p>Experiments on a larger corpus and a quantitative analysis based on a more formal classification of errors is needed for the development of a fully developed reliable lexical resource, together with an in-depth investigation of the relevance of context in the attribution of polarity, which is a very important issue. A comparison of the results that a given SA engine exploiting features extracted from sentiment lexica, for instance IRAD-ABE <ref type="bibr" target="#b10">(Hernández Farías et al., 2014;</ref><ref type="bibr" target="#b7">Buscaldi and Hernández Farías, 2016)</ref>, obtains using each of the benchmark lexica and using polarITA is planned as future work for the evaluation of the novel lexicon, which is not currently suitable because the limited size of our reference corpus and the consequent partial coverage of errors.</p><p>Considering the current preliminary stage of development of polarITA, we tried an extrinsic evaluation for detecting the impact on the performance of SA systems of the errors currently featuring the benchmark lexica and corrected in the novel lexicon. We compared the words which are missing or assigned to erroneous polarity in the benchmark lexica with the Italian words more commonly used and understood by native speakers, whose collection is available in the Vocabolario di base della lingua italiana (vocItalian) 5 recently newly released. Like the first version of this resource, published in 1980, <ref type="bibr" target="#b8">(De Mauro, 1980)</ref>, it includes three word classes: 2,999 High Usage words (HU), 2,231 High Availability words (HA) and 1,979 Foundational words (FO). In polarITA we collected until now 284 words of the vocItalian, whose distribution across the three classes is shown in Table <ref type="table" target="#tab_0">2</ref>. Among the words in the FO category we found "bene" (good), "mentire" (lie), and "giustizia" (justice). While words like "assassino" (killer), "preoccupato" (worried), and "entusiasta" (enthusiastic) are part of the HU category. Finally, in the HA category it is possible to find words such as "dannoso" (harmful) and "emozionante" (exciting).</p><p>This analysis suggests some hints for further investigation, showing that the failures of lexica currently available for Italian SA affect words very commonly used in communication and therefore the improvement of these resources may hopefully result in an advancement for SA and related tasks.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Conclusions and Future Work</head><p>In this paper we propose the preliminary investigation about a methodology for the development of a novel lexical resource for Italian SA, namely po-larITA, which takes advantage of the analysis and filtering of errors occurring in the available lexical resources. We carried out a manual analysis of a set of tweets for determining the reliability of sentiment-related lexica, showing that, even if the transfer of lexical information between two different languages is a common practice to address the lack of resources, information related to sentiment is lost during it. The identified errors are then ex- ploited as a starting point for developing the novel resource.</p><p>As future work, we are planning to extend the resource in several directions: by investigating multi-word expressions, extending the coverage to a larger corpus, exploring the impact of figurative language devices such as irony and sarcasm in the use of certain polarized words <ref type="bibr">(Hernández Farías et al., 2016)</ref>. Moreover, our future effort will be oriented to the automatization of a larger part of the methodology and its application to other languages currently under resourced.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 2 :</head><label>2</label><figDesc>5 https://www.internazionale.it/ opinione/tullio-de-mauro/2016/12/23/ilnuovo-vocabolario-di-base-della-linguaitaliana Distribution of the words in polarITA in terms of polarity, POS labels, and vocItalian.</figDesc><table><row><cell cols="2">Total words</cell><cell>388</cell></row><row><cell cols="2">Polarity</cell><cell></cell></row><row><cell cols="3">Positive Negative None</cell></row><row><cell>225</cell><cell>140</cell><cell>23</cell></row><row><cell cols="3">Part-of-speech labels</cell></row><row><cell cols="2">Adjective</cell><cell>84</cell></row><row><cell cols="2">Adjective/Noun</cell><cell>1</cell></row><row><cell cols="2">Adjective/Pronoun</cell><cell>2</cell></row><row><cell cols="2">Adverb</cell><cell>16</cell></row><row><cell cols="2">Interjection</cell><cell>3</cell></row><row><cell>Noun</cell><cell></cell><cell>187</cell></row><row><cell cols="2">Noun/Adverb</cell><cell>1</cell></row><row><cell cols="2">Preposition</cell><cell>1</cell></row><row><cell cols="2">Pronoun</cell><cell>1</cell></row><row><cell>Verb</cell><cell></cell><cell>92</cell></row><row><cell cols="2">vocItalian</cell><cell></cell></row><row><cell>FO</cell><cell></cell><cell>187</cell></row><row><cell>HU</cell><cell></cell><cell>86</cell></row><row><cell>HA</cell><cell></cell><cell>11</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">https://translate.google.com/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">http://www.urbandictionary.com</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_2">http://www.saifmohammad.com/WebPages/ lexicons.html</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_3">We considered neutral a word which is featured by a polarity which may vary across contexts, indicated by None in Table2.</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Acknowledgements C. Bosco and V. Patti were partially funded by Progetto di Ateneo/CSP 2016 (Immigrants, Hate and Prejudice in Social Media, S1618 L2 BOSC 01) and by Fondazione CRT (Hate Speech and Social Media, 2016.0688).</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">FEEL: a French Expanded Emotion Lexicon</title>
		<author>
			<persName><forename type="first">Amine</forename><surname>Abdaoui</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jérôme</forename><surname>Azé</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sandra</forename><surname>Bringay</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Pascal</forename><surname>Poncelet</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Language Resources and Evaluation</title>
		<imprint>
			<biblScope unit="volume">51</biblScope>
			<biblScope unit="page" from="833" to="855" />
			<date type="published" when="2017-09">2017. September</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining</title>
		<author>
			<persName><forename type="first">Stefano</forename><surname>Baccianella</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Andrea</forename><surname>Esuli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Fabrizio</forename><surname>Sebastiani</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC&apos;10)</title>
				<meeting>the Seventh International Conference on Language Resources and Evaluation (LREC&apos;10)<address><addrLine>Valletta, Malta</addrLine></address></meeting>
		<imprint>
			<publisher>ELRA</publisher>
			<date type="published" when="2010">2010</date>
			<biblScope unit="page" from="2200" to="2204" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Overview of the EVALITA 2016 SENTiment POLarity Classification Task</title>
		<author>
			<persName><forename type="first">Francesco</forename><surname>Barbieri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Valerio</forename><surname>Basile</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Danilo</forename><surname>Croce</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Malvina</forename><surname>Nissim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Nicole</forename><surname>Novielli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Viviana</forename><surname>Patti</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of Third Italian Conference on Computational Linguistics (CLiC-it 2016) &amp; Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop</title>
				<editor>
			<persName><forename type="first">Cutugno</forename><surname>Basile</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Patti</forename><surname>Nissim</surname></persName>
		</editor>
		<editor>
			<persName><surname>Sprugnoli</surname></persName>
		</editor>
		<meeting>Third Italian Conference on Computational Linguistics (CLiC-it 2016) &amp; Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop<address><addrLine>EVALITA</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2016">2016. 2016</date>
		</imprint>
	</monogr>
	<note>CEUR Workshop Proceedings</note>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Sentiment Analysis on Italian Tweets</title>
		<author>
			<persName><forename type="first">Valerio</forename><surname>Basile</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Malvina</forename><surname>Nissim</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 4th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis</title>
				<meeting>the 4th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis<address><addrLine>Atlanta, USA</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2013">2013</date>
			<biblScope unit="page" from="100" to="107" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Overview of the Evalita 2014 SENTIment POLarity Classification Task</title>
		<author>
			<persName><forename type="first">Andrea</forename><surname>Valerio Basile</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Malvina</forename><surname>Bolioli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Viviana</forename><surname>Nissim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Paolo</forename><surname>Patti</surname></persName>
		</author>
		<author>
			<persName><surname>Rosso</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 4th evaluation campaign of Natural Language Processing and Speech tools for Italian (EVALITA 2014)</title>
				<meeting>the 4th evaluation campaign of Natural Language Processing and Speech tools for Italian (EVALITA 2014)<address><addrLine>Pisa, Italy</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Evalita goes social: Tasks, data, and community at the 2016 edition</title>
		<author>
			<persName><forename type="first">Pierpaolo</forename><surname>Basile</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Francesco</forename><surname>Cutugno</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Malvina</forename><surname>Nissim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Viviana</forename><surname>Patti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Rachele</forename><surname>Sprugnoli</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IJCoL -Italian Journal of Computational Linguistics</title>
		<imprint>
			<biblScope unit="volume">3</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="93" to="127" />
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<title level="m" type="main">Prior Polarity Lexical Resources for the Italian Language</title>
		<author>
			<persName><forename type="first">Valeria</forename><surname>Borzì</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Simone</forename><surname>Faro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Arianna</forename><surname>Pavone</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sabrina</forename><surname>Sansone</surname></persName>
		</author>
		<idno>CoRR, abs/1507.00133</idno>
		<imprint>
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">IRADABE2: Lexicon Merging and Positional Features for Sentiment Analysis in Italian</title>
		<author>
			<persName><forename type="first">Davide</forename><surname>Buscaldi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Delia</forename><surname>Irazú</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Hernández</forename><surname>Farías</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 5th Evaluation Campaign of Natural Language Processing and Speech Tools for Italian</title>
				<meeting>the 5th Evaluation Campaign of Natural Language Processing and Speech Tools for Italian<address><addrLine>EVALITA</addrLine></address></meeting>
		<imprint>
			<publisher>aAcademia University Press</publisher>
			<date type="published" when="2016">2016. 2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<author>
			<persName><forename type="first">Tullio</forename><surname>De</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Mauro</forename></persName>
		</author>
		<title level="m">Guida all&apos;uso delle parole Num. 3 dei Libri di base</title>
				<meeting><address><addrLine>Roma</addrLine></address></meeting>
		<imprint>
			<publisher>Editori Riuniti</publisher>
			<date type="published" when="1980">1980</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<monogr>
		<title level="m" type="main">WordNet: An Electronic Lexical Database</title>
		<author>
			<persName><forename type="first">Christiane</forename><surname>Fellbaum</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1998">1998</date>
			<publisher>Bradford Books</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">IRADABE: Adapting English Lexicons to the Italian Sentiment Polarity Classification task</title>
		<author>
			<persName><forename type="first">Delia</forename><surname>Irazú</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Hernández</forename><surname>Farías</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Davide</forename><surname>Buscaldi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Belém</forename><surname>Priego-Sánchez</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">First Italian Conference on Computational Linguistics (CLiC-it 2014) and the fourth International Workshop EVALITA 2014</title>
				<imprint>
			<date type="published" when="2014">2014</date>
			<biblScope unit="page" from="75" to="81" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Irony Detection in Twitter: The Role of Affective Content</title>
		<author>
			<persName><forename type="first">Delia</forename><surname>Irazú</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Hernández</forename><surname>Farías</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Viviana</forename><surname>Patti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Paolo</forename><surname>Rosso</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ACM Trans. Internet Technol</title>
		<imprint>
			<biblScope unit="volume">16</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page">24</biblScope>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Mining and summarizing customer reviews</title>
		<author>
			<persName><forename type="first">Minqing</forename><surname>Hu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Bing</forename><surname>Liu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD &apos;04</title>
				<meeting>the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD &apos;04<address><addrLine>New York, NY, USA</addrLine></address></meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2004">2004</date>
			<biblScope unit="page" from="168" to="177" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Sentiment Analysis: Detecting Valence, Emotions, and Other Affectual States from Text</title>
		<author>
			<persName><forename type="first">M</forename><surname>Saif</surname></persName>
		</author>
		<author>
			<persName><surname>Mohammad</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Emotion Measurement</title>
				<editor>
			<persName><forename type="first">Herb</forename><surname>Meiselman</surname></persName>
		</editor>
		<imprint>
			<publisher>Elsevier</publisher>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">SemEval-2016 Task 4: Sentiment Analysis in Twitter</title>
		<author>
			<persName><forename type="first">Preslav</forename><surname>Nakov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Alan</forename><surname>Ritter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sara</forename><surname>Rosenthal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Fabrizio</forename><surname>Sebastiani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Veselin</forename><surname>Stoyanov</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016)</title>
				<meeting>the 10th International Workshop on Semantic Evaluation (SemEval-2016)<address><addrLine>San Diego, California</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2016">2016</date>
			<biblScope unit="page" from="1" to="18" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">BabelNet: The Automatic Construction, Evaluation and Application of a Wide-Coverage Multilingual Semantic Network</title>
		<author>
			<persName><forename type="first">Roberto</forename><surname>Navigli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Simone</forename><surname>Paolo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ponzetto</forename></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Artificial Intelligence</title>
		<imprint>
			<biblScope unit="volume">193</biblScope>
			<biblScope unit="page" from="217" to="250" />
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">A new ANEW: evaluation of a word list for sentiment analysis in microblogs</title>
		<author>
			<persName><surname>Finn Årup Nielsen</surname></persName>
		</author>
		<ptr target="CEUR-WS.org" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the ESWC2011 Workshop on &apos;Making Sense of Microposts&apos;: Big things come in small packages</title>
		<title level="s">CEUR Workshop Proceedings</title>
		<meeting>the ESWC2011 Workshop on &apos;Making Sense of Microposts&apos;: Big things come in small packages<address><addrLine>Heraklion, Crete, Greece</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2011">2011</date>
			<biblScope unit="volume">718</biblScope>
			<biblScope unit="page" from="93" to="98" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Semantic aspects in sentiment analysis</title>
		<author>
			<persName><forename type="first">Malvina</forename><surname>Nissim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Viviana</forename><surname>Patti</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Sentiment Analysis in Social Networks</title>
				<editor>
			<persName><forename type="first">Alberto</forename><surname>Federico</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Elisabetta</forename><surname>Pozzi</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Enza</forename><surname>Fersini</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Bing</forename><surname>Messina</surname></persName>
		</editor>
		<editor>
			<persName><surname>Liu</surname></persName>
		</editor>
		<meeting><address><addrLine>Boston</addrLine></address></meeting>
		<imprint>
			<publisher>Morgan Kaufmann</publisher>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="31" to="48" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<monogr>
		<title level="m" type="main">ItEM: A Vector Space Model to Bootstrap an Italian Emotive Lexicon</title>
		<author>
			<persName><forename type="first">Lucia</forename><surname>Passaro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Laura</forename><surname>Pollacci</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Alessandro</forename><surname>Lenci</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2015">2015</date>
			<biblScope unit="volume">II</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Mul-tiWordNet: Developing an Aligned Multilingual Database</title>
		<author>
			<persName><forename type="first">E</forename><surname>Pianta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Bentivogli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Girardi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of International Conference on Global WordNet</title>
				<meeting>International Conference on Global WordNet</meeting>
		<imprint>
			<date type="published" when="2002">2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">A general psychoevolutionary theory of emotion</title>
		<author>
			<persName><forename type="first">Robert</forename><surname>Plutchik</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Emotion: Theory, research, and experience</title>
				<editor>
			<persName><forename type="first">R</forename><surname>Plutchik</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">H</forename><surname>Kellerman</surname></persName>
		</editor>
		<meeting><address><addrLine>New York</addrLine></address></meeting>
		<imprint>
			<publisher>Academic press</publisher>
			<date type="published" when="1980">1980</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="3" to="33" />
		</imprint>
	</monogr>
	<note>Theories of emotion</note>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Annotating Sentiment and Irony in the Online Italian Political Debate on #labuonascuola</title>
		<author>
			<persName><forename type="first">Marco</forename><surname>Stranisci</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Cristina</forename><surname>Bosco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Delia Irazú Hernández</forename><surname>Farías</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Viviana</forename><surname>Patti</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)</title>
				<meeting>the Tenth International Conference on Language Resources and Evaluation (LREC 2016)</meeting>
		<imprint>
			<publisher>ELRA</publisher>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
	<note>European Language Resources Association</note>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
