<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>OEG at TASS 2017: Spanish Sentiment Analysis of tweets at document level</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Mar a Navas-Loro</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Universidad Politecnica de Madrid Ontology Engineering Group Campus de Montegancedo 28660 Boadilla del Monte</institution>
          ,
          <addr-line>Madrid</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>V ctor Rodr guez-Doncel Universidad Politecnica de Madrid Ontology Engineering Group Campus de Montegancedo 28660 Boadilla del Monte</institution>
          ,
          <addr-line>Madrid</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2017</year>
      </pub-date>
      <fpage>43</fpage>
      <lpage>49</lpage>
      <abstract>
        <p>This paper describes the `oeg' submission to task 1 of the TASS 2017 workshop, focusing on Sentiment Analysis at tweet level. Di erent parameters and systems were tested in each one of the three corpora released for the task, including di erent Machine Learning algorithms and morphosyntactic analyses for negation detection, along with the use of lexicons and dedicated preprocessing techniques for detecting and correcting frequent errors and expressions in tweets. The obtained results o er a basis for the design of future strategies for systems to tackle Sentiment Analysis in Twitter.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Recent boom on Sentiment Analysis, partly
due to Natural Language Processing (NLP)
new techniques and to the wide use of
social networks in everyday life by Internet
users, has derived into the creation of new
resources and techniques to analyze opinions
in almost every possible eld. One of the
evidences of this growing importance of
Opinion Mining is its use by brands in order to
discover customer's opinions. These opinions
can be found in posts in the social media,
being possible to some extent to
automatically evaluate their polarity and the impact of
marketing campaigns. According to Nielsen
        <xref ref-type="bibr" rid="ref20">(Nielsen, 2012)</xref>
        , up to 70% of users take into
account the product experience published by
other users, being this analysis therefore
extremely valuable for companies.
      </p>
      <p>
        The OEG, together with Havas
Media, has participated in the LPS-BIGGER
project1, where software components have
been developed for the categorization of
brand-related messages into four categories
framed in marketing analysis, being one of
them a sentiment analysis task. This
software is capable of classifying Twitter
messages in Spanish and English into one or
more of eight pre-de ned emotions
(lovehate, satisfaction-dissatisfaction, trust-fear,
happiness-sadness). An adaptation of this
infrastructure has been used to detect polarity
in the Spanish messages proposed by Task
1 of the TASS 2017 workshop. The TASS
workshop
        <xref ref-type="bibr" rid="ref15">(Mart nez-Camara et al., 2017)</xref>
        has
1http://www.cienlpsbigger.es/
      </p>
      <p>Copyright © 2017 by the paper's authors. Copying permitted for private and academic purposes.
become one of the rst e orts to cover
Sentiment Analysis in Spanish, challenging since
2012 both researchers and industry to
analyze di erent annotated or tagged corpora.</p>
      <p>In this edition, two tasks have been proposed,
but the OEG participation just covers the</p>
      <p>rst one, dealing with Sentiment Analysis at
tweet level.</p>
      <p>The reminder will be as follows. Section
2 covers related works in the area, including
both general approaches and proposals.
Section 3 exposes our proposals in detail.
Section 4 includes the results and an analysis on
them, and Section 5 presents our conclusions
on our participation and future lines.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        Several authors, such as Pang and Lee
        <xref ref-type="bibr" rid="ref21">(Pang,
Lee, and others, 2008)</xref>
        and Liu
        <xref ref-type="bibr" rid="ref14">(Liu, 2010)</xref>
        ,
have published comprehensive reviews of
research in Sentiment Analysis, often seen as
a classi cation problem (understood as the
task of classifying a text document on a
bunch of prede ned categories
        <xref ref-type="bibr" rid="ref12 ref14">(Liu, 2010;
Kleinginna and Kleinginna, 1981)</xref>
        ) and
therefore addressed with di erent approaches:
Rule-based systems
        <xref ref-type="bibr" rid="ref11 ref5">(Ding and Liu, 2007;
Kan, 2011)</xref>
        . Applied both on plain texts
and on POS-annotated texts, they
usually rely on sentiment lexicons relating
lexical units to sentiments. They
usually present good precision results but
require a big set of rules to get a great
recall.
      </p>
      <p>
        Machine Learning systems
        <xref ref-type="bibr" rid="ref18 ref28">(Mullen and
Collier, 2004; Turney, 2002)</xref>
        , using
supervised or unsupervised techniques.
They are usually trained with di erent
kind of features, such as the words of
the sentence, their lemmas, or n-grams.
These systems require of a training
process, but they are usually unable to
capture irony and other more complex
linguistic phenomena. Classical
algorithms used in this approach are Nave
Bayes
        <xref ref-type="bibr" rid="ref17">(Minsky, 1961)</xref>
        and Support
Vector Machines
        <xref ref-type="bibr" rid="ref4">(Cortes and Vapnik, 1995)</xref>
        ;
more modern approaches include recent
NLP proposals such as Word
Embeddings
        <xref ref-type="bibr" rid="ref16">(Mikolov et al., 2013)</xref>
        , as happens
in
        <xref ref-type="bibr" rid="ref9">(Giatsoglou et al., 2017)</xref>
        . Also di
erent ways of handling texts, such as using
Bag of Words and Bag of Lemmas, can
be found along with the use of lexicons
and FSS (Feature Subset Selection)
techniques, since they have demonstrated to
be useful for Sentiment Analysis tasks
        <xref ref-type="bibr" rid="ref8">(Gamon, 2004)</xref>
        .
      </p>
      <p>
        Hybrid Systems
        <xref ref-type="bibr" rid="ref21 ref22">(Pang, Lee, and others,
2008; Prabowo and Thelwall, 2009)</xref>
        ,
trying to avoid handicaps of each of the
previous approaches.
      </p>
      <p>
        All these kind of systems usually rely on
lexicons to enrich post representation, such as
SentiWordnet
        <xref ref-type="bibr" rid="ref13 ref6">(Esuli and Sebastiani, 2006)</xref>
        ,
the MPQA (Multi-Perspective Question
Answering) Subjectivity Lexicon
        <xref ref-type="bibr" rid="ref30 ref31">(Wiebe,
Wilson, and Cardie, 2005)</xref>
        , and the Harvard
General Inquirer
        <xref ref-type="bibr" rid="ref26">(Stone et al., 1968)</xref>
        for English,
associating polarity values to lexical atoms.
However, the polarity of words varies from its
use context and without a term
disambiguation, the value of sentilexicons is limited.
      </p>
      <p>
        Additionally, analyzing social media posts
has a complex plus due to the usual presence
of irony and other linguisitc phenomena of
the sort
        <xref ref-type="bibr" rid="ref3">(Chatzakou and Vakali, 2015)</xref>
        , and
the fact of post being often written fast and in
an informal way, leading both to typos and to
the inclusions of symbols, shorthands,
emoticons or slung expressions that change every
day following di erent Internet trends.
      </p>
      <p>
        Nevertheless studies and resources on
languages di erent from English are scarce.
Some examples of works for Spanish are the
adaptation performed in
        <xref ref-type="bibr" rid="ref2 ref22">(Brooke, To loski,
and Taboada, 2009)</xref>
        of the system described
in
        <xref ref-type="bibr" rid="ref27">(Taboada et al., 2011)</xref>
        by translating the
core lexicons and adapting other resources in
various ways; also
        <xref ref-type="bibr" rid="ref24">(Sidorov et al., 2012)</xref>
        presented an analysis of various parameter
settings for most popular machine learning
classi ers for the Spanish language; nally,
syntactic structure of the text was used in
        <xref ref-type="bibr" rid="ref29">(Vilares, Alonso, and Gomez-Rodr guez, 2013)</xref>
        to tackle negation, among others. A
corpus in the speci c eld of sentiments towards
brands has recently appeared
        <xref ref-type="bibr" rid="ref19">(Navas-Loro et
al., 2017)</xref>
        .
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Classi cation approach</title>
      <p>The described system uses a Machine
Learning approach with a Java-based standard
pipeline. Some linguistic aspects such as
negation and some di erent labeling strategies
have been explored in more depth.
3.1</p>
      <sec id="sec-3-1">
        <title>Labeling strategy</title>
        <p>A binary classi cation system assigns or not
a label to a text document, e.g. the classi
cation of an email text as ham/spam. The
labeling strategy is a straightforward operation
{when a threshold is reached the message is
categorised as spam, otherwise the message
is classi ed as ham. In case of classifying
the polarity of a text, the labeling strategy
is somewhat more complex, as the number
of possible results is at least three, positive,
negative or neutral. For the TASS task 1,
there is one category more: \no sentiment",
which is di erent from neutral.</p>
        <p>In order to classify a message as P
(positive), N (negative), NEU (neutral) and
NONE (no polarity), di erent strategies can
be followed:
1. Labeling each document with one of the
four categories independently and using
four binary classi ers. The category is
assigned to the class with the highest
score among the four results. In order to
have an optimal Bayesian classi er, the
results for each class can be weighted by
their a priori probabilities.
2. Labeling each document as either
positive/non-positive and
negative/nonnegative. Two binary classi ers are
used, and depending on the individual
classi er results (P and N) the classi ed
category is determined as follows:</p>
        <sec id="sec-3-1-1">
          <title>If a tweet has similar values for P</title>
          <p>and N (this is, the distance d = jP
N j is less than some threshold td),
we can consider it as neutral (NEU).</p>
        </sec>
        <sec id="sec-3-1-2">
          <title>If a tweet has despicable values for</title>
          <p>P and N but not extremely close to
zero (this is, N and P values area
between some some thresholds tnmin &lt;
P &lt; tnmax and tnmin &lt; N &lt; tnmax
with 0 &lt; tnmin ), we can consider it
as neutral (NEU).</p>
          <p>
            If a tweet has close-to-zero values
for P and N (this is, N and P are
such that P tnmin and N
tnmin ), we can consider it as NONE.
3. Labeling each document with one of the
three tags NEU, P and N, and then using
one single classi er. The classi er score
determines if the document is classi ed
as P (high values), as N (low values) or
as NEU (intermediate values). The
system would never return NONE (which is
only present in 14% of the documents in
the InterTASS corpus). Ignoring neutral
values has been reported as a bad
strategy, as studied by
            <xref ref-type="bibr" rid="ref13 ref6">(Koppel and Schler,
2006)</xref>
            .
4. Using a multi-sentiment classi er, where
some categories (happiness, love,
satisfaction, trust) lead to choosing P, and
some categories lead to choosing to N
(sadness, hate, dissatisfaction, fear).
5. Using a two-stages classi er, where the
rst one determines the subjectivity
(sentiment/no sentiment) and the
second determines the polarity, as
proposed by
            <xref ref-type="bibr" rid="ref30 ref31">(Wilson, Wiebe, and Ho
mann, 2005)</xref>
            .
3.2
          </p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>Linguistic considerations</title>
        <p>During our processing, diverse linguistic
features were taken into account.
3.2.1 Features and negation</p>
        <p>treatment
We tested two di erent lexical units in our
systems: tokens and lemmas. In some tested
systems we also considered as features words
extracted from lexicons, and also the
presence of negation in verbs, detected by using
trees extracted by deep syntactic analysis.</p>
        <p>
          Negation was tackled by detecting the
presence of \NEG" constituents in the
verbal groups, with the logic that if a negation
is present within a verbal group, the polarity
is inverted. Double negation was not
considered.
3.2.2 Preprocessing
For preprocessing the tweets, often full of
grammar errors and social networks
expressions that are highly decisive in polarity (such
as emoticons), we have developed a lter able
to detect these phenomena partly, similar to
the one described by Quiros et al.
          <xref ref-type="bibr" rid="ref10 ref23">(Quiros,
Segura-Bedmar, and Mart nez, 2016)</xref>
          . More
concretely, it is able to recognize:
        </p>
        <sec id="sec-3-2-1">
          <title>Several laugh \ja"...). patterns</title>
          <p>(\hahaha",
URL formats (in order to delete them,
since they give no information).</p>
          <p>Slang expressions and replacements in
Spanish social networks, such as:
{ `q', `k', `qu', `ke', `qe' ! `que'.
{ `d' ! `de'.
{ `tb' ! `tambien'.
{ `lol' ! `ja'.</p>
          <p>{ `xq', `pq', `porq' ! ` porque' .</p>
          <p>Typos related to repeated letters
(`LOOOOOL' ! `LOL' )
Suppression of numbers, as they just
tend to carry polarity in concrete
expressions.</p>
          <p>Emoticons detection and polarization,
such as:
{ positive polarity : f `:-)', `;)', `:D',
`&lt;3', `:&lt;', `:P', `o:', `*.*' g.
{ negative polarity : f `:'(', `:S', `:$',
`: (', `:C' g.</p>
          <p>Additionally, for some systems a stopword
lter was tested. This lter based on a list of
common words that carry no semantic or
polarity meaning, created combining the results
of algorithms (such as TF-IDF) and manual
revision.
3.3</p>
        </sec>
      </sec>
      <sec id="sec-3-3">
        <title>Means</title>
        <p>
          3.3.1 External resources
IXA-pipes
          <xref ref-type="bibr" rid="ref1">(Agerri, Bermudez, and Rigau,
2014)</xref>
          has been the NLP suite of choice, being
the POS tagger, lemmatizer and constituent
extractor the key components that were used.
Weka
          <xref ref-type="bibr" rid="ref7">(Frank, Hall, and Witten, 2016)</xref>
          has
been used as the implementation of the
machine learning algorithms, given its exibility
and matureness.
        </p>
        <p>
          In order to assess the a ectivity of
documents, a dataset of Spanish words and their
arousal (the level of activation or intensity
that a stimulus elicits) and valence (how
pleasant a stimulus is) was also used. The 875
words studied by Hinojosa et al.
          <xref ref-type="bibr" rid="ref10">(Hinojosa
et al., 2016)</xref>
          were lemmatized and matched
against the lemmas in the document. The
sum of the matched tokens' arousal was
normalized and compared against a threshold to
determine whether a non polarized message
was NEU or NONE. Equivalently, the
normalized value of the sum of the matched
tokens' valence was used as an additional
feature for the feature vector. Both uses of the
dataset of Hinojosa proved to be e
ectiveless, as a low percent of the messages actually
matched any word in the dictionary and
results did not improve. Arguably, other larger
datasets might have been used
(
          <xref ref-type="bibr" rid="ref25">StadthagenGonzalez et al., 2017</xref>
          ).
3.4
        </p>
      </sec>
      <sec id="sec-3-4">
        <title>Algorithms used</title>
        <p>Nave Bayes is a generative model assuming
that features are independent given a class
and that calculates the probability given a
class using Bayes theorem. Even when
independence does not happen in Natural
Language, this technique has shown to deliver
good results in NLP. Multinomial version of
Nave Bayes is speci cally performant when
dealing with language, being lexical units
(words, lemmas...) frequency the data
frequently used. Also SMO (Sequential Minimal
Optimization for Support Vector Machines)
classi ers have been tested, but results did
not improve those from Nave Bayes.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Final systems and results</title>
      <p>
        We present now the con gurations that
turned out to be the best ones among the
di erent options exposed before:
laOEG implements the second
labelbased strategy described in section 3.1,
which performed better than any of the
other approaches. Several thresholds
were tested: Multinomial Nave Bayes
algorithm's threshold was set to 0:30
(also executions with values from 0:01 to
0:50 were performed), while di erent
values were tested for internal thresholds,
with values such as td = 0:10, tnmax =
0:15 and tnmin = 0:10. The feature
vector was built simply using tokens after
the pipeline described above. The
Multinomial Nave Bayes performed slightly
better than the SMO classi er, being
also one of its strengths its versatility
(training and features can be easily
managed in di erent manners) and its
velocity, being the fastest system among the
proposed; however, besides these strong
aspects, it is also the simplest approach,
processing just shallowly at token level
and being therefore unable to detect
nuances such as negation or irony.
victor0 Same as above, but using
lemmas in the feature vector and
considering negation. The presence of
negation in the verbal group at di erent
constituent levels led to the addition of extra
features (e.g. `don't like' is handled as a
single feature instead as three words or
lemmas in a bag) .
victor2 Same as victor0, but also
considers the presence of stopwords and
using the Hinojosa dataset
        <xref ref-type="bibr" rid="ref10">(Hinojosa et al.,
2016)</xref>
        to better distinguish between NEU
and NONE.
victor3b Same as victor2, but using IBM
Watson Natural Language
Understanding2 module when its output was clear
(con dence level bigger than 0.75).
      </p>
      <p>Numerical results of the systems proposed
by OEG are exposed in Table 1 (for the
InterTASS corpus), Table 2 (for the full test
General Corpus of TASS) and Table 3 (for
the 1k General Corpus of TASS), along with
the highest and the lowest results of the
overall of the participants for each corpus.</p>
      <p>System
victor2
victor0
laOEG
Max. result
Min. result
The mere adaptation of a di erently
purposed classi er does not yield optimal results
for the TASS challenge. However, our
results have proved to be one of the most stable
among the three corpora for testing, since we
obtained similar results with each of the
systems in all of them, fact that is not common
to other participant's proposals. Groups
being the rst classi ed in a corpus can be in
the last half of the ranking in another one.
We acknowledge that corpora from previous
TASS editions should have been used, and
that additional machine learning approaches,
2https://www.ibm.com/watson/developercloud
/doc/natural-language-understanding/</p>
      <p>System
victor2
laOEG
Max. result
Min. result</p>
      <p>External out-of-the-box software did not
prove to work any better. IBM Waton's
Natural Language Understanding was tested,
because no train is needed, Spanish language is
covered and emotion and sentiment analysis
functionality is ready to be used. However,
its results did not prove any better than the
system described in this paper.</p>
      <p>The distinction between NEU and NON
is a very speci c feature of this challenge
that justi es speci c research on the
strategies presented in Section 3.1. Also, in future
TASS editions, special focus will be given
to word sense disambiguation, introducing
concepts as tokens rather than simple words
or n-grams; and we will extensively use the
broader sentilexicons newly appeared.</p>
    </sec>
    <sec id="sec-5">
      <title>Agradecimientos</title>
      <p>This work has been partially supported by
LPS-BIGGER (IDI-20141259, Ministerio de
Econom a y Competitividad), a research
assistant grant by the Consejer a de
Educacion, Juventud y Deporte de la Comunidad
de Madrid partially founded by the
European Social Fund (PEJ16/TIC/AI-1984) and
a Juan de la Cierva contract.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Agerri</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Bermudez</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Rigau</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>IXA pipeline: E cient and Ready to Use Multilingual NLP tools</article-title>
          .
          <source>In LREC</source>
          , volume
          <year>2014</year>
          , pages
          <fpage>3823</fpage>
          {
          <fpage>3828</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Brooke</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , M. To loski, and
          <string-name>
            <given-names>M.</given-names>
            <surname>Taboada</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Cross-Linguistic Sentiment Analysis: From English to Spanish</article-title>
          .
          <source>In RANLP</source>
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Chatzakou</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Vakali</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Harvesting opinions and emotions from social media textual resources</article-title>
          .
          <source>IEEE Internet Computing</source>
          ,
          <volume>19</volume>
          (
          <issue>4</issue>
          ):
          <volume>46</volume>
          {
          <fpage>50</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Cortes</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>V.</given-names>
            <surname>Vapnik</surname>
          </string-name>
          .
          <year>1995</year>
          .
          <article-title>Supportvector networks</article-title>
          .
          <source>Machine learning</source>
          ,
          <volume>20</volume>
          (
          <issue>3</issue>
          ):
          <volume>273</volume>
          {
          <fpage>297</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Ding</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>B.</given-names>
            <surname>Liu</surname>
          </string-name>
          .
          <year>2007</year>
          .
          <article-title>The utility of linguistic rules in opinion mining</article-title>
          .
          <source>In Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval</source>
          , pages
          <volume>811</volume>
          {
          <fpage>812</fpage>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Esuli</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>F.</given-names>
            <surname>Sebastiani</surname>
          </string-name>
          .
          <year>2006</year>
          .
          <article-title>SENTIWORDNET: A high-coverage lexical resource for opinion mining</article-title>
          .
          <source>In Institute of Information Science and Technologies (ISTI) of the Italian National Research Council (CNR).</source>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Frank</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Hall</surname>
          </string-name>
          ,
          <string-name>
            <given-names>and I. H.</given-names>
            <surname>Witten</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>The WEKA Workbench</article-title>
          .
          <article-title>Online Appendix for "Data Mining: Practical machine learning tools and techniques"</article-title>
          . Morgan Kaufmann.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Gamon</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <year>2004</year>
          .
          <article-title>Sentiment classi cation on customer feedback data: noisy data, large feature vectors, and the role of linguistic analysis</article-title>
          .
          <source>In Proceedings of the 20th international conference on Computational Linguistics</source>
          , page 841.
          <article-title>Association for Computational Linguistics</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Giatsoglou</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>M. G.</given-names>
            <surname>Vozalis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Diamantaras</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Vakali</surname>
          </string-name>
          , G. Sarigiannidis, and
          <string-name>
            <given-names>K. C.</given-names>
            <surname>Chatzisavvas</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Sentiment analysis leveraging emotions and word embeddings</article-title>
          .
          <source>Expert Systems with Applications</source>
          ,
          <volume>69</volume>
          :
          <fpage>214</fpage>
          {
          <fpage>224</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Hinojosa</surname>
            ,
            <given-names>J. A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>N.</surname>
          </string-name>
          <article-title>Mart nez-Garc a, C. Villalba-Garc a, U. FernandezFolgueiras, A</article-title>
          .
          <string-name>
            <surname>Sanchez-Carmona</surname>
            ,
            <given-names>M. A.</given-names>
          </string-name>
          <string-name>
            <surname>Pozo</surname>
            , and
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Montoro</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>A ective norms of 875 spanish words for ve discrete emotional categories and two emotional dimensions</article-title>
          .
          <source>Behavior research methods</source>
          ,
          <volume>48</volume>
          (
          <issue>1</issue>
          ):
          <volume>272</volume>
          {
          <fpage>284</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Kan</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <year>2011</year>
          .
          <article-title>Rule-based approach to sentiment analysis at ROMIP 2011</article-title>
          .
          <article-title>In Sentiment analysis track at ROMIP2011</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Kleinginna</surname>
            ,
            <given-names>P. R.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>A. M.</given-names>
            <surname>Kleinginna</surname>
          </string-name>
          .
          <year>1981</year>
          .
          <article-title>A categorized list of emotion definitions, with suggestions for a consensual de nition</article-title>
          .
          <source>Motivation and emotion</source>
          ,
          <volume>5</volume>
          (
          <issue>4</issue>
          ):
          <volume>345</volume>
          {
          <fpage>379</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Koppel</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Schler</surname>
          </string-name>
          .
          <year>2006</year>
          .
          <article-title>The importance of neutral examples for learning sentiment</article-title>
          .
          <source>Computational Intelligence</source>
          ,
          <volume>22</volume>
          (
          <issue>2</issue>
          ):
          <volume>100</volume>
          {
          <fpage>109</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <year>2010</year>
          .
          <article-title>Sentiment analysis and subjectivity</article-title>
          . In N. Indurkhya and
          <string-name>
            <surname>F. J</surname>
          </string-name>
          . Damerau, editors,
          <source>Handbook of Natural Language Processing. Chapman and Hall/CRC</source>
          , pages
          <volume>627</volume>
          {
          <fpage>666</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>Mart</surname>
            nez-Camara,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>M. C.</surname>
          </string-name>
          <article-title>D az-</article-title>
          <string-name>
            <surname>Galiano</surname>
            ,
            <given-names>M. A.</given-names>
          </string-name>
          <string-name>
            <surname>Garc</surname>
            a-Cumbreras,
            <given-names>M.</given-names>
          </string-name>
          <article-title>Garc aVega, and</article-title>
          <string-name>
            <given-names>J.</given-names>
            <surname>Villena-Roman</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Overview of tass 2017</article-title>
          . In J. Villena Roman,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Garc a Cumbreras</surname>
          </string-name>
          ,
          <string-name>
            <surname>D. G. M. C. Mart</surname>
            nez-Camara, Eugenio, and
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Garc</surname>
          </string-name>
          a Vega, editors,
          <source>Proceedings of TASS 2017: Workshop on Semantic Analysis at SEPLN (TASS</source>
          <year>2017</year>
          ), volume
          <volume>1896</volume>
          <source>of CEUR Workshop Proceedings</source>
          , Murcia, Spain, September. CEURWS.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <surname>Mikolov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Chen</surname>
          </string-name>
          , G. Corrado, and
          <string-name>
            <given-names>J.</given-names>
            <surname>Dean</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>E cient estimation of word representations in vector space</article-title>
          .
          <source>arXiv preprint arXiv:1301</source>
          .
          <fpage>3781</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <surname>Minsky</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <year>1961</year>
          .
          <article-title>Steps toward arti - cial intelligence</article-title>
          .
          <source>Proceedings of the IRE</source>
          ,
          <volume>49</volume>
          (
          <issue>1</issue>
          ):8{
          <fpage>30</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <surname>Mullen</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>N.</given-names>
            <surname>Collier</surname>
          </string-name>
          .
          <year>2004</year>
          .
          <article-title>Sentiment Analysis using Support Vector Machines with Diverse Information Sources</article-title>
          .
          <source>In Proceedings of EMNLP 2004</source>
          , pages
          <fpage>412</fpage>
          {
          <fpage>418</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <surname>Navas-Loro</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>V.</surname>
          </string-name>
          <article-title>Rodr guez-Doncel, I. Santana-Perez, and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Sanchez</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Spanish Corpus for Sentiment Analysis towards Brands</article-title>
          .
          <source>In Proc. of the 19th Int. Conf. on Speech and Computer (SPECOM)</source>
          , pages
          <fpage>680</fpage>
          {
          <fpage>689</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <surname>Nielsen</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>The social media report</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <surname>Pang</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Lee</surname>
          </string-name>
          , et al.
          <year>2008</year>
          .
          <article-title>Opinion mining and sentiment analysis</article-title>
          .
          <source>Foundations and Trends R in Information Retrieval</source>
          ,
          <volume>2</volume>
          (
          <issue>1</issue>
          { 2):1{
          <fpage>135</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <string-name>
            <surname>Prabowo</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Thelwall</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Sentiment analysis: A combined approach</article-title>
          .
          <source>Journal of Informetrics</source>
          ,
          <volume>3</volume>
          (
          <issue>2</issue>
          ):
          <volume>143</volume>
          {
          <fpage>157</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <surname>Quiros</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Segura-Bedmar</surname>
          </string-name>
          , and P. Mart nez.
          <year>2016</year>
          .
          <article-title>Labda at the 2016 tass challenge task: Using word embeddings for the sentiment analysis task</article-title>
          .
          <source>In TASS@ SEPLN</source>
          , pages
          <volume>29</volume>
          {
          <fpage>33</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <string-name>
            <surname>Sidorov</surname>
            , G.,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Miranda-Jimenez</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>ViverosJimenez</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Gelbukh</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Castro-Sanchez</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Velasquez</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          <article-title>D az-</article-title>
          <string-name>
            <surname>Rangel</surname>
            ,
            <given-names>S. SuarezGuerra</given-names>
          </string-name>
          , A. Trevin~o, and
          <string-name>
            <given-names>J.</given-names>
            <surname>Gordon</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Empirical study of machine learning based approach for opinion mining in tweets</article-title>
          .
          <source>In Mexican international conference on Articial intelligence</source>
          , pages
          <fpage>1</fpage>
          <lpage>{</lpage>
          14. Springer.
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          <string-name>
            <surname>Stadthagen-Gonzalez</surname>
            , H.,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Imbault</surname>
            ,
            <given-names>M. A. P.</given-names>
          </string-name>
          <string-name>
            <surname>Sanchez</surname>
            , and
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Brysbaert</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Norms of valence and arousal for 14,031 spanish words</article-title>
          .
          <source>Behavior research methods</source>
          ,
          <volume>49</volume>
          (
          <issue>1</issue>
          ):
          <volume>111</volume>
          {
          <fpage>123</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          <string-name>
            <surname>Stone</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>D. C.</given-names>
            <surname>Dunphy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. S.</given-names>
            <surname>Smith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>and D. M.</given-names>
            <surname>Ogilvie</surname>
          </string-name>
          .
          <year>1968</year>
          .
          <article-title>The general inquirer: A computer approach to content analysis</article-title>
          .
          <source>Journal of Regional Science</source>
          ,
          <volume>8</volume>
          (
          <issue>1</issue>
          ):
          <volume>113</volume>
          {
          <fpage>116</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          <string-name>
            <surname>Taboada</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Brooke</surname>
          </string-name>
          , M. To loski, K. Voll, and
          <string-name>
            <given-names>M.</given-names>
            <surname>Stede</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>Lexiconbased methods for sentiment analysis</article-title>
          .
          <source>Computational linguistics</source>
          ,
          <volume>37</volume>
          (
          <issue>2</issue>
          ):
          <volume>267</volume>
          {
          <fpage>307</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          <string-name>
            <surname>Turney</surname>
            ,
            <given-names>P. D.</given-names>
          </string-name>
          <year>2002</year>
          .
          <article-title>Thumbs up or thumbs down?: semantic orientation applied to unsupervised classi cation of reviews</article-title>
          .
          <source>In Proceedings of the 40th annual meeting on association for computational linguistics</source>
          , pages
          <volume>417</volume>
          {
          <fpage>424</fpage>
          .
          <article-title>Association for Computational Linguistics</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          <string-name>
            <surname>Vilares</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Alonso</surname>
          </string-name>
          , and
          <string-name>
            <surname>C.</surname>
          </string-name>
          <article-title>GomezRodr guez</article-title>
          .
          <year>2013</year>
          .
          <article-title>Clasi cacion de polaridad en textos con opiniones en espan~ol mediante analisis sintactico de dependencias</article-title>
          .
          <source>Procesamiento del Lenguaje Natural</source>
          ,
          <volume>50</volume>
          :
          <fpage>13</fpage>
          {
          <fpage>20</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          <string-name>
            <surname>Wiebe</surname>
            , J., T. Wilson, and
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Cardie</surname>
          </string-name>
          .
          <year>2005</year>
          .
          <article-title>Annotating expressions of opinions and emotions in language</article-title>
          .
          <source>Language resources and evaluation</source>
          ,
          <volume>39</volume>
          (
          <issue>2</issue>
          ):
          <volume>165</volume>
          {
          <fpage>210</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          <string-name>
            <surname>Wilson</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wiebe</surname>
          </string-name>
          , and P. Ho mann.
          <year>2005</year>
          .
          <article-title>Recognizing contextual polarity in phrase-level sentiment analysis</article-title>
          .
          <source>In Proceedings of the conference on human language technology and empirical methods in natural language processing</source>
          , pages
          <volume>347</volume>
          {
          <fpage>354</fpage>
          .
          <article-title>Association for Computational Linguistics</article-title>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>