<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>DeustoTech Internet at TASS 2015: Sentiment analysis and polarity classi cation in spanish tweets</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Juan Sixto Cesteros</string-name>
          <email>jsixto@deusto.es</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Aitor Almeida</string-name>
          <email>aitor.almeida@deusto.es</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Diego Lopez de Ipin~a</string-name>
          <email>dipina@deusto.es</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>DeustoTech{Deusto, Institute of Technology, Universidad de Deusto</institution>
          ,
          <addr-line>48007 Bilbao</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>DeustoTech{Deusto, Institute of Technology, Universidad de Deusto</institution>
          ,
          <addr-line>48007 Bilbao</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>DeustoTech{Deusto, Institute of Technology, Universidad de Deusto</institution>
          ,
          <addr-line>48007 Bilbao</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2015</year>
      </pub-date>
      <fpage>23</fpage>
      <lpage>28</lpage>
      <abstract>
        <p>This article describes our system presented at the workshop for sentiment analysis TASS 2015. Our system approaches the task 1 of the workshop, which consists on performing an automatic sentiment analysis to determine the global polarity of a set of tweets in Spanish. To do this, our system is based on a model supervised Linear Support Vector Machines combined with some polarity lexicons. The in uence of the di erent linguistic features and the di erent sizes of n-grams in improving algorithm performance. Also the results obtained, the various tests that have been conducted, and a discussion of the results are presented.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Since the origin of Web 2.0, Internet contains
a very large amounts of user-generated
information on an unlimited number of topics.</p>
      <p>Many entities such as corporations or
political groups try to learn about that knowledge
to know the opinion of users. Social Media
platforms such as Facebook or Twitter have
proven to be useful for this tasks,due to the
very high volume of messages that these
platforms generate in real time and the very large
number of users that use them everyday.</p>
      <p>Faced with this challenge, in the last years
the number of the Sentiment Analysis
researches has increased appreciably, especially
those based in Twitter and microblogging.</p>
      <p>
        It should be taken into account that the
performance of these researches is
languagedependent, re ecting the considerable di
erences between languages and the di culty of
establish standard linguistic rules
        <xref ref-type="bibr" rid="ref11 ref20 ref22 ref8">(Han,
Cook, and Baldwin, 2013)</xref>
        .
      </p>
      <p>
        In this context, the TASS1 workshop
        <xref ref-type="bibr" rid="ref26">(Villena-Roman et al., 2015)</xref>
        is an
evaluation workshop for sentiment analysis focused
on Spanish language, organized as a
satellite event of the annual conference of the
Spanish Society for Natural Language Processing
(SEPLN)2. This paper is focused on the rst
task of the workshop consist on determining
the global polarity of twitter messages.
      </p>
      <p>This paper presents a global polarity
clas1Taller de Analisis de Sentimientos en la SEPLN
2http://www.sepln.org/
Publicado en http://ceur-ws.org/Vol-1397/. CEUR-WS.org es una publicación en serie con ISSN reconocido
si cation in Spanish tweets based on polarity
lexicons and linguistic features. It is adapted
to Spanish tweet texts, which involve
particular linguistic characteristics like short length,
limited to 140 characters, slang, spelling and
grammatical errors and other user mentions.</p>
      <p>The rest of the paper is organized as
follows: the sentiment analysis related works
are described in Section 2, the developed
system's description is presented in Section 3,
evaluation and results in Section 4 and
conclusion and future work are discussed in
Section 5.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related work</title>
      <p>
        There exists a large amount of literature
addressing the sentiment analysis eld,
especially applied to Twitter and microblogging
context. General surveys about Opinion
Mining and Sentiment Analysis may be found
        <xref ref-type="bibr" rid="ref18">(Pang and Lee, 2008)</xref>
        ,
        <xref ref-type="bibr" rid="ref14">(Martinez-Camara et
al., 2014)</xref>
        , although due to the enormous
diversity of applications on this eld, di erent
approaches to solve problems in numerous
scopes have been generated, like user
classi cation
        <xref ref-type="bibr" rid="ref1 ref19 ref7">(Pennacchiotti and Popescu, 2011)</xref>
        ,
Spam detection in social media
        <xref ref-type="bibr" rid="ref5">(Gao et al.,
2010)</xref>
        , classi cation of product reviews
        <xref ref-type="bibr" rid="ref2">(Dave, Lawrence, and Pennock, 2003)</xref>
        ,
demographic studies
        <xref ref-type="bibr" rid="ref15">(Mislove et al., 2011)</xref>
        ,
political sentiment and election results
prediction
        <xref ref-type="bibr" rid="ref1 ref19 ref7">(Bermingham and Smeaton, 2011)</xref>
        and
even clinical depression prediction via
Twitter
        <xref ref-type="bibr" rid="ref4">(De Choudhury et al., 2013)</xref>
        .
      </p>
      <p>
        Twitter has certain speci c
characteristics which distinguish them from other
social networks, e.g. short texts, @user
mentions, #hashtags and retweets. All of these
characteristics have been extensively studied
        <xref ref-type="bibr" rid="ref10 ref17 ref3">(Pak and Paroubek, 2010)</xref>
        ,
        <xref ref-type="bibr" rid="ref1 ref19 ref7">(Han and
Baldwin, 2011)</xref>
        . Some of them have been
resolved through the text normalization approach
        <xref ref-type="bibr" rid="ref11 ref20 ref22 ref8">(Ruiz, Cuadros, and Etchegoyhen, 2013)</xref>
        while others have been used as key elements in
classi cation approach
        <xref ref-type="bibr" rid="ref28">(Wang et al., 2011)</xref>
        .
Indeed, several researches prove that the
indepth knowledge of these characteristics will
signi cantly improve the social media based
applications
        <xref ref-type="bibr" rid="ref11">(Jungherr, 2013)</xref>
        ,
        <xref ref-type="bibr" rid="ref27">(Wang et al.,
2013)</xref>
        .
      </p>
      <p>
        For several years we assist to an
exponential increase of studies based on
sentiment analysis and opinion mining in
Twitter. According to the state of art, two main
approaches exist in sentiment analysis:
supervised learning and unsupervised learning.
Supervised systems implement classi cation
models based on classi cation algorithms,
being the most frequent the Support Vector
Machine (SVM)
        <xref ref-type="bibr" rid="ref11 ref6">(Go, Bhayani, and Huang,
2009)</xref>
        , Logistic Regression (LR)
        <xref ref-type="bibr" rid="ref16 ref23">(Thelwall,
Buckley, and Paltoglou, 2012)</xref>
        , Conditional
Random Fields (CRF)
        <xref ref-type="bibr" rid="ref10 ref17 ref3">(Jakob and Gurevych,
2010)</xref>
        and K Nearest Neighbors (KNN)
        <xref ref-type="bibr" rid="ref10 ref17 ref3">(Davidov, Tsur, and Rappoport, 2010)</xref>
        .
Unsupervised systems are based on the use of
lexicons to calculate the semantic
orientation
        <xref ref-type="bibr" rid="ref24">(Turney, 2002)</xref>
        and present a new
perspective for classi cation tasks, most e
ective in cross-domain and multilingual
applications.
      </p>
      <p>
        During the last TASS workshop in 2014
        <xref ref-type="bibr" rid="ref26">(Villena-Roman et al., 2015)</xref>
        , LyS presented a
supervised liblinear classi er with several
lexicons of Spanish language, whose results are
among the best in task 1 (Sentiment Analysis
at the tweet level)
        <xref ref-type="bibr" rid="ref25">(Vilares et al., 2014)</xref>
        .
Further,
        <xref ref-type="bibr" rid="ref21 ref9">(San Vicente and Saralegi, 2014)</xref>
        presented a Support Vector Machine (SVM) based
on a classi er that merges polarity lexicons
with several linguistic features as
punctuation marks or negation signs. Finally, the best
results in task 1 correspond to
        <xref ref-type="bibr" rid="ref21 ref9">(Hurtado and
Pla, 2014)</xref>
        , who present a Linear-SVM
based classi er that addresses the task using a
one-vs-all strategy in conjuction with a
vectorized list of tf-idf coe cients as text
representation.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>System description</title>
      <p>
        Several tools and datasets have been used
during the experiments to develop our nal
system. Because our system only approaches
the Task 1: Sentiment Analysis at global
level, this consists in a unique pipeline that
reaches the process completely. At the
beginning, a naive normalization system is
applied to the tweet texts with the purpose to
standardize several Twitter own features,
like #Hashtags or @User mentions. Then, the
Freeling language analysis tool3
        <xref ref-type="bibr" rid="ref16 ref23">(Padro and
Stanilovsky, 2012)</xref>
        is used to tokenize,
lemmatize and annotate the texts with
part-ofspeech tags (pos-tagging).
      </p>
      <p>During this step, based on a list of stop
words for Spanish language, this words are
annotated to be ignored by polarity ranking
steps.</p>
      <sec id="sec-3-1">
        <title>3http://nlp.lsi.upc.edu/freeling/</title>
        <p>
          The task has been addressed as an
automatic multi-class classi cation job. For this
reason, it has been considered appropriate to
focus this problem with a one-vs-all strategy,
in a similar way to the presented by
          <xref ref-type="bibr" rid="ref21 ref9">(Hurtado
and Pla, 2014)</xref>
          in TASS 2014. These binary
classi ers have been developed using two
different approaches, LinearSVC Machines and
Support Vector Regression (SVR) Machines.
the comparison of machine-learning based
results is shown in Results section.
        </p>
        <p>To represent the text's as vectorized
features, two main sources have been used: the
polarity lexicon punctuations and the
Okapi BM25 ranking function, to represent
document's scoring (Robertson et al., 1995).
BM25 is a bag-of-words retrieval function
that ranks a set of documents based on the
query terms appearing in each document.
The formula used to implement BM25 in the
system is de ned below:
score(D; Q) =
n
X IDF (qi) T F (qi)
i=1
f (qi; D) (k1 + 1)
b + b ajvDgdjl )
(2)
T F (qi) =</p>
        <p>f (qi; D) + k1 (1
IDF (qi) = log</p>
        <p>N</p>
        <p>n(qi) + 0;5
n(qi) + 0;5</p>
        <p>To calculate the score of a document D,
f (qi; D) is the frecuency of each word lemma
(qi), jDj is the length of the text D in words
and avgdl is the average text length. After
several experiments over the training corpus,
the free parameters k1 and b have been
optimized to k1 = 76 and b = 0;75. System
develops one BM25 dictionary for each one-vs-all
classi er.</p>
        <p>In conjunction with the document's score,
each tweet has been represented using
different polarity lexicons in order to classify
them into the six (P+, P, NEU, N, N+ and
NONE) and the four (P, N, NEU and
NONE) polarities. We use several datasets to
score the polarity levels of words and lemmas.
Owing to di erent characteristics of each
dataset, such as semantic-orientation values,
scores are calculated separately and
considered as independent attributes in the system.
(1)
(3)</p>
        <sec id="sec-3-1-1">
          <title>LYSA Twitter lexicon v0.1. LYSA</title>
          <p>
            is an automatically-built polarity lexicon
for Spanish language that was created
by downloading messages from Twitter,
and includes both negative and
positive Spanish words
            <xref ref-type="bibr" rid="ref25">(Vilares et al., 2014)</xref>
            .
The lexicon entries includes a
semanticorientation values ranged from -5 to 5,
making it a good resource for multiple
sentiment levels identi cation.
          </p>
        </sec>
        <sec id="sec-3-1-2">
          <title>ElhPolar dictionary v1.0. The Elh</title>
          <p>
            Polar polarity lexicon for Spanish was
created from di erent sources, and
includes both negative and positive words
            <xref ref-type="bibr" rid="ref11 ref20 ref22 ref8">(Saralegi and San Vicente, 2013)</xref>
            .
          </p>
          <p>
            The Spanish Opinion Lexicon
(SOL). The Spanish Opinion Lexicon
(SOL) is composed by 1,396 positive
and 3,151 negative words, thus in
total SOL has 4,547 opinion words4
            <xref ref-type="bibr" rid="ref13">(Mart nez-Camara et al., 2013)</xref>
            . The
lexicon has been elaborated from the
Bing Liu's word list using Reverso as
translator
            <xref ref-type="bibr" rid="ref12">(M. and L., 2004)</xref>
            .
          </p>
          <p>Negation Words List. A list of
negation spanish words has been created
during the experiments. This list is used as
a text feature in order to detect
negative sentences and possible polarity
inversions.</p>
          <p>We also consider other text characteristics
as classi er features, like text length in words
quantity or a list of sentiments represented by
emoticons using the Wikipedia's list of
emoticons5. To conclude the system's prediction,
another automatic classi er has been
implemented, trained with the predictions of the
binary results to select one label.
4</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Results</title>
      <p>Our results are relative to the Task 1:
Sentiment Analysis at global level of TASS 2015.
This task consists on performing an
automatic sentiment analysis to determine the
global polarity of each message in the provided
corpus. There are two di erent evaluations:
one based on 6 di erent polarity labels (P+,
P, NEU, N, N+, NONE) and another based
on just 4 labels (P, N, NEU, NONE). Also
there are two test sets: complete set and 1k</p>
      <sec id="sec-4-1">
        <title>4http://sinai.ujaen.es/sol/ 5https://en.wikipedia.org/wiki/List of emoticons</title>
        <p>set, a subset of the rst one containing only
1000 tweets with a similar distribution to the
training corpus was extracted to be used for
an alternate evaluation of the performance of
systems.</p>
        <p>Tables 1 and 2 show the performance of
di erent tested models using the full and 1k
sets. For the rating of the developed system,
3 di erent systems have been presented for
each subtask. Our submitted models consist
in di erent features as follows:</p>
        <p>Run 1: Words and lemmas based
polarity dictionaries as features, di ering
between positive and negative scores
and between di erent datasets. Okapi
BM25 scores of mono-grams used as
features with the lemmas of the tweet
texts. Binary classi ers were
implemented using LinearSVC Machines and the
global classi er uses their predictions
(True or False).</p>
        <p>Run 2: Words and lemmas based
polarity dictionaries as features, di ering
between positive and negative scores and
between di erent datasets. Okapi BM25
scores of mono-grams and bi-grams used
as features with the lemmas of the tweet
texts. Binary classi ers were
implemented using LinearSVC Machines and the
global classi er uses their predictions
(True or False).</p>
        <p>Run 3: Similar to Run 2, with the
exception of the binary classi ers that were
implemented using Support Vector
Regression (SVR) Machines and the global
classi er uses their predictions (0 to 1
oat values).</p>
        <sec id="sec-4-1-1">
          <title>6 Labels</title>
        </sec>
        <sec id="sec-4-1-2">
          <title>4 Labels Run</title>
          <p>Run1
Run2
Run3
Run1
Run2
Run3</p>
        </sec>
        <sec id="sec-4-1-3">
          <title>6 Labels (1k)</title>
        </sec>
        <sec id="sec-4-1-4">
          <title>4 Labels (1k) Run</title>
          <p>Run1
Run2
Run3
Run1
Run2
Run3
based in SVR. This suggests that the
precision of the regression values, in contrast with
the binary values of the SVM classi ers, has
a negative impact on the global classi er.
However, the use of mono-grams and bi-grams
as features presents di erent success rates
depending of the test. This part of the system
must be analysed in-depth in order to
comprehend the performance di erence between
both systems.
5</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusions and Future work</title>
      <p>This paper describes the participation of the
DeustoTech Internet research group in the
Task 1: Sentiment Analysis at global level
at TASS 2015. In our rst participation, our
team presents a system based in Support
Vector Machines in conjunction with several well
established polarity lexicons. Experimental
results present a good baseline to continue
working through the development of new
models and developing an structure able to take
full advantage of multiple supervised learning
systems.</p>
      <p>As future work, we propose to research
on di erent approaches to aboard the
measure of sentiment analysis problems, especially
those related to sentiment degrees with the
aim to detect clearly di erences between
different sentiment levels (Good vs Very Good,
for example).</p>
      <p>For further work, we would like to
improve the present system including some steps
previously to the classi er module, that have
been demonstrated to improve the nal
results like a normalization pipeline based on
tweets. Also, the necessity of improving the
tokenization module to include features like
punctuation signs, web addresses, and named
entities has become apparent.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>The research activities described in this
paper are funded by DeustoTech INTERNET,
Deusto Institute of Technology, a research
institute within the University of Deusto.
Robertson, S.E., S. Walker, S. Jones,
Hancock-Beaulieu M. M., and Gatford M.
1995. Okapi at trec-3. NIST SPECIAL
PUBLICATION SP, 109-109.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Bermingham</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>A. F.</given-names>
            <surname>Smeaton</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>On using Twitter to monitor political sentiment and predict election results</article-title>
          .
          <source>In Bermingham, Adam and Smeaton</source>
          , Alan F. (
          <year>2011</year>
          )
          <article-title>On using Twitter to monitor political sentiment and predict election results</article-title>
          .
          <source>In: Sentiment Analysis where AI meets Psychology (SAAIP) Workshop at the International Joint Conference for Natural Language Processing (IJCNLP)</source>
          ,
          <source>13th November</source>
          <year>2011</year>
          ,
          <string-name>
            <given-names>Chiang</given-names>
            <surname>Mai</surname>
          </string-name>
          , Thailand.,
          <string-name>
            <surname>Chiang</surname>
            <given-names>Mai</given-names>
          </string-name>
          , Thailand, November.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Dave</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Lawrence</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Pennock</surname>
          </string-name>
          .
          <year>2003</year>
          .
          <article-title>Mining the Peanut Gallery: Opinion Extraction and Semantic Classi cation of Product Reviews</article-title>
          .
          <source>In Proceedings of the 12th International Conference on World Wide Web, WWW '03</source>
          , pages
          <fpage>519</fpage>
          {
          <fpage>528</fpage>
          , New York, NY, USA. ACM.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Davidov</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Tsur</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Rappoport</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>Enhanced sentiment learning using twitter hashtags and smileys</article-title>
          .
          <source>In Proceedings of the 23rd International Conference on Computational Linguistics: Posters</source>
          , pages
          <volume>241</volume>
          {
          <fpage>249</fpage>
          .
          <article-title>Association for Computational Linguistics</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>De Choudhury</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Gamon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Counts</surname>
          </string-name>
          , and
          <string-name>
            <given-names>E.</given-names>
            <surname>Horvitz</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Predicting Depression via Social Media</article-title>
          .
          <source>In ICWSM.</source>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Gao</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Hu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Wilson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Chen</surname>
          </string-name>
          , and
          <string-name>
            <given-names>B. Y.</given-names>
            <surname>Zhao</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>Detecting and Characterizing Social Spam Campaigns</article-title>
          .
          <source>In Proceedings of the 10th ACM SIGCOMM Conference on Internet Measurement, IMC '10</source>
          , pages
          <fpage>35</fpage>
          {
          <fpage>47</fpage>
          , New York, NY, USA. ACM.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Go</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Bhayani</surname>
          </string-name>
          , and
          <string-name>
            <given-names>L.</given-names>
            <surname>Huang</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Twitter sentiment classi cation using distant supervision</article-title>
          .
          <source>CS224N Project Report</source>
          , Stanford,
          <volume>1</volume>
          :
          <fpage>12</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Han</surname>
            ,
            <given-names>B</given-names>
          </string-name>
          . and
          <string-name>
            <given-names>T.</given-names>
            <surname>Baldwin</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>Lexical Normalisation of Short Text Messages: Makn Sens a #twitter.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Han</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          , P. Cook, and
          <string-name>
            <given-names>T.</given-names>
            <surname>Baldwin</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>unimelb: Spanish Text Normalisation</article-title>
          .
          <source>In Tweet-Norm@ SEPLN</source>
          , pages
          <volume>32</volume>
          {
          <fpage>36</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Hurtado</surname>
            , L
            <given-names>F.</given-names>
            and F.
          </string-name>
          <string-name>
            <surname>Pla</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>ELiRF-UPV en TASS 2014</article-title>
          : Analisis de sentimientos, deteccion de topicos y analisis de sentimientos de aspectos en twitter.
          <source>Procesamiento del Lenguaje Natural.</source>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Jakob</surname>
            ,
            <given-names>N. and I.</given-names>
          </string-name>
          <string-name>
            <surname>Gurevych</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>Extracting opinion targets in a single-and crossdomain setting with conditional random elds</article-title>
          .
          <source>In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing</source>
          , pages
          <volume>1035</volume>
          {
          <fpage>1045</fpage>
          .
          <article-title>Association for Computational Linguistics</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Jungherr</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <year>2013</year>
          .
          <article-title>Tweets and Votes, a Special Relationship: The 2009 Federal Election in Germany</article-title>
          .
          <source>In Proceedings of the 2Nd Workshop on Politics, Elections and Data</source>
          ,
          <source>PLEAD '13</source>
          , pages
          <fpage>5</fpage>
          {
          <fpage>14</fpage>
          , New York, NY, USA. ACM.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>M.</surname>
          </string-name>
          , Hu and
          <string-name>
            <surname>Bing L.</surname>
          </string-name>
          <year>2004</year>
          .
          <article-title>Mining and summarizing customer reviews</article-title>
          .
          <source>Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining.</source>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Mart</surname>
            nez-Camara,
            <given-names>E.</given-names>
          </string-name>
          , M. T. Mart nValdivia,
          <string-name>
            <given-names>M. L.</given-names>
            <surname>Molina-Gonzalez</surname>
          </string-name>
          , and
          <string-name>
            <given-names>L. A.</given-names>
            <surname>Uren~</surname>
          </string-name>
          a-Lopez.
          <year>2013</year>
          .
          <article-title>Bilingual experiments on an opinion comparable corpus</article-title>
          .
          <source>Proceedings of the 4th Workshop on Computational Approaches</source>
          to Subjectivity,
          <article-title>Sentiment and Social Media Analysis</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <surname>Martinez-Camara</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>M. T. MartinValdivia</surname>
            ,
            <given-names>L. A.</given-names>
          </string-name>
          <string-name>
            <surname>Uren</surname>
          </string-name>
          <article-title>~a-Lopez, and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Montejo-Raez</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Sentiment analysis in Twitter</article-title>
          .
          <source>Natural Language Engineering</source>
          ,
          <volume>20</volume>
          (
          <issue>01</issue>
          ):
          <volume>1</volume>
          {
          <fpage>28</fpage>
          ,
          <string-name>
            <surname>January</surname>
          </string-name>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>Mislove</surname>
            ,
            <given-names>A</given-names>
          </string-name>
          .,
          <string-name>
            <given-names>S.</given-names>
            <surname>Lehmann</surname>
          </string-name>
          , YY. Ahn, JP. Onnela, and
          <string-name>
            <given-names>J. N.</given-names>
            <surname>Rosenquist</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>Understanding the Demographics of Twitter Users</article-title>
          .
          <source>In ICWSM.</source>
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <surname>Padro</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>E.</given-names>
            <surname>Stanilovsky</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Freeling 3.0: Towards wider multilinguality</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <surname>Pak</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Paroubek</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>Twitter as a Corpus for Sentiment Analysis and Opinion Mining</article-title>
          .
          <source>In LREC.</source>
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <surname>Pang</surname>
            ,
            <given-names>B. and L.</given-names>
          </string-name>
          <string-name>
            <surname>Lee</surname>
          </string-name>
          .
          <year>2008</year>
          .
          <article-title>Opinion Mining and Sentiment Analysis</article-title>
          .
          <source>Found. Trends Inf. Retr.</source>
          ,
          <volume>2</volume>
          (
          <issue>1</issue>
          -2):1{
          <fpage>135</fpage>
          ,
          <string-name>
            <surname>January</surname>
          </string-name>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <surname>Pennacchiotti</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>AM.</given-names>
            <surname>Popescu</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>A Machine Learning Approach to Twitter User Classi cation</article-title>
          .
          <source>In ICWSM.</source>
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <surname>Ruiz</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>M. Cuadros</surname>
            , and
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Etchegoyhen</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Lexical Normalization of Spanish Tweets with Preprocessing Rules, Domain-Speci c Edit Distances, and Language Models</article-title>
          .
          <source>In Proceedings of the Tweet Normalization Workshop at SEPLN</source>
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <given-names>San</given-names>
            <surname>Vicente</surname>
          </string-name>
          , I. and
          <string-name>
            <given-names>X.</given-names>
            <surname>Saralegi</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Looking for features for supervised tweet polarity classi cation</article-title>
          .
          <source>Procesamiento del Lenguaje Natural.</source>
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <string-name>
            <surname>Saralegi</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          <source>and I. San Vicente</source>
          .
          <year>2013</year>
          .
          <article-title>Elhuyar at TASS 2013</article-title>
          .
          <article-title>In Proceedings of "XXIX Congreso de la Sociedad Espan~ola de Procesamiento de lenguaje natural"</article-title>
          .
          <source>Workshop on Sentiment Analysis at SEPLN (TASS</source>
          <year>2013</year>
          ). Madrid. pp.
          <fpage>143</fpage>
          -
          <lpage>150</lpage>
          . ISBN:
          <fpage>978</fpage>
          -
          <lpage>84</lpage>
          -695-8349-4.
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <surname>Thelwall</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Buckley</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Paltoglou</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Sentiment strength detection for the social web</article-title>
          .
          <source>Journal of the American Society for Information Science and Technology</source>
          ,
          <volume>63</volume>
          (
          <issue>1</issue>
          ):
          <volume>163</volume>
          {
          <fpage>173</fpage>
          ,
          <string-name>
            <surname>January</surname>
          </string-name>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <string-name>
            <surname>Turney</surname>
            ,
            <given-names>P. D.</given-names>
          </string-name>
          <year>2002</year>
          .
          <article-title>Thumbs up or thumbs down?: semantic orientation applied to unsupervised classi cation of reviews</article-title>
          .
          <source>In Proceedings of the 40th annual meeting on association for computational linguistics</source>
          , pages
          <volume>417</volume>
          {
          <fpage>424</fpage>
          .
          <article-title>Association for Computational Linguistics</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          <string-name>
            <surname>Vilares</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Doval</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>A Alonso, and C. Gomez-Rodr guez</article-title>
          .
          <year>2014</year>
          . Lys at TASS 2014:
          <article-title>A prototype for extracting and analysing aspects from spanish tweets</article-title>
          .
          <source>Procesamiento del Lenguaje Natural.</source>
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          <string-name>
            <surname>Villena-Roman</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Garc</surname>
          </string-name>
          a-Morera,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Garc</surname>
          </string-name>
          a-Cumbreras,
          <string-name>
            <surname>E.</surname>
          </string-name>
          <article-title>Mart nez-</article-title>
          <string-name>
            <surname>Camara</surname>
            ,
            <given-names>M. T.</given-names>
          </string-name>
          <string-name>
            <surname>Mart</surname>
            n-Valdivia, and
            <given-names>L. A.</given-names>
          </string-name>
          <string-name>
            <surname>Uren</surname>
          </string-name>
          ~aLopez.
          <year>2015</year>
          .
          <article-title>Overview of TASS</article-title>
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Tokarchuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Cuadrado</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Poslad</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Exploiting Hashtags for Adaptive Microblog Crawling</article-title>
          .
          <source>In Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM '13</source>
          , pages
          <fpage>311</fpage>
          {
          <fpage>315</fpage>
          , New York, NY, USA. ACM.
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Wei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Zhou</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Zhang</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>Topic Sentiment Analysis in Twitter: A Graph-based Hashtag Sentiment Classi cation Approach</article-title>
          .
          <source>In Proceedings of the 20th ACM International Conference on Information and Knowledge Management</source>
          ,
          <source>CIKM '11</source>
          , pages
          <fpage>1031</fpage>
          {
          <fpage>1040</fpage>
          , New York, NY, USA. ACM.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>