<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>RETUYT-InCo at TASS 2018: Sentiment Analysis in Spanish Variants using Neural Networks and SVM</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Luis Chiruzzo Aiala Rosa</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Facultad de Ingenier a, Universidad de la Republica Montevideo</institution>
          ,
          <country country="UY">Uruguay</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>URL references were replaced by the token</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2018</year>
      </pub-date>
      <fpage>57</fpage>
      <lpage>63</lpage>
      <abstract>
        <p>This paper presents three approaches for classifying the sentiment of tweets for di erent Spanish variants in the TASS 2018 challenge. The classi ers are based on Support Vector Machines (SVM), Convolutional Neural Netowrks (CNN) and Long Short Term Memory networks (LSTM). Although di erent classi ers worked better for di erent language variants, the use of word embeddings was key for obtaining performance improvements. Also, using a mixed-balanced training method for the LSTM resulted in a signi cant improvement in the detection of neutral tweets.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Sentiment analysis is one of the most
important tasks related to subjectivity
analysis within Natural Language Processing. The
sentiment analysis of tweets is especially
interesting due to the large volume of
information generated every day, the subjective
nature of most messages, and the easy access
to this material for analysis and processing.</p>
      <p>The existence of speci c tasks related to this</p>
      <p>eld, for several years now, shows the
interest of the NLP community in working on this
subject. The International Workshop on
Semantic Evaluation (SemEval) includes a task
on Tweets Sentiment Analysis since 2013 1.</p>
      <p>For Spanish, the TASS workshop, organized
by the SEPLN (Sociedad Espan~ola para el
Procesamiento del Lenguaje Natural),
focuses on this task since 20122.</p>
      <p>1https://www.cs.york.ac.uk/semeval-2013/
task2.html
2http://www.sepln.org/workshops/tass/2012/</p>
      <p>
        In the TASS editions prior to 2017, most of
the participants presented machine learning
systems based on hand crafted features. For
example, in TASS 2016
        <xref ref-type="bibr" rid="ref1 ref13 ref16 ref3 ref4">(Garc a-Cumbreras
et al., 2016)</xref>
        best results were obtained by a
system based on an ensemble of Logistic
Regression classi ers including features derived
from a subjective lexicon, negation
processing, and n-grams
        <xref ref-type="bibr" rid="ref3">(Ceron-Guzman, 2016)</xref>
        ;
and a system based on a set of SVM classi ers
with morpho-syntactic information and
ngrams as features
        <xref ref-type="bibr" rid="ref1 ref13 ref6">(Hurtado y Pla, 2016)</xref>
        .
Other authors
        <xref ref-type="bibr" rid="ref1 ref1 ref13 ref13 ref16 ref6 ref6">(Montejo-Raez y D az-Galiano,
2016; Quiros, Segura-Bedmar, y Mart nez,
2016)</xref>
        used word embeddings, reaching lower
results.
      </p>
      <p>
        In TASS 2017
        <xref ref-type="bibr" rid="ref9">(Mart nez-Camara et al.,
2017)</xref>
        (task 1) several systems used deep
learning approaches. The best results
were obtained by:
        <xref ref-type="bibr" rid="ref5">Hurtado, Pla, y Gonzalez
(2017</xref>
        ), who experimented with di erent deep
neural network architectures, using as
in
      </p>
      <p>
        Copyright © 2018 by the paper's authors. Copying permitted for private and academic purposes.
put domain-speci c and general-domain sets
of embeddings; Ceron-Guzman (2017), who
presented an ensemble of SVM and Logistic
Regression classi ers;
        <xref ref-type="bibr" rid="ref17">Rosa et al. (2017)</xref>
        , who
presented an SVM classi er based on the
centroid of the tweets embeddings, a deep neural
network (CNN), and a combination of both;
and
        <xref ref-type="bibr" rid="ref11">Moctezuma et al. (2017)</xref>
        , who combined
an SVM classi er with genetic programming.
      </p>
      <p>
        On the other hand, for the rst time,
SemEval 2018
        <xref ref-type="bibr" rid="ref12">(Mohammad et al., 2018)</xref>
        included a dataset for Spanish tweets sentiment
analysis. The corpus used in task 1.4 (ordinal
classi cation of sentiment) is annotated with
7 values, indicating di erent levels of
positive or negative sentiment. The best results for
Spanish were obtained by systems based on
deep neural networks.
      </p>
      <p>
        In this paper we describe di erent
approaches for Spanish tweet classi cation
presented by the RETUYT-InCo team for the TASS
2018 sentiment analysis challenge
        <xref ref-type="bibr" rid="ref8">(Mart
nezCamara et al., 2018)</xref>
        : an SVM-based classi er
which uses a set of features, including word
embeddings; and two deep neural network
approaches: CNN and LSTM.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Corpus pre-processing</title>
      <p>For this year's edition of the challenge, the
organizers provided three sets of corpora for
Spanish variants spoken in di erent
countries: Spain (ES), Costa Rica (CR) and Peru
(PE). For each of the variants, training,
development and test data was provided. The
training and development sets were
annotated with four possible polarity categories per
tweet: P, N, NEU or NONE. The test corpora had
no annotations.</p>
      <p>For some of our experiments, we also used
the general TASS training data from a
previous edition of the competition. This corpus
was divided in training (85 %) and
development (15 %) subsets. Table 1 shows the sizes
of the di erent corpora and the number of
tweets for each class.</p>
      <p>Each corpus was pre-processed as follows:
Redundant space characters and ellipsis
were removed.</p>
      <sec id="sec-2-1">
        <title>Corpus</title>
      </sec>
      <sec id="sec-2-2">
        <title>General</title>
      </sec>
      <sec id="sec-2-3">
        <title>InterTASS-ES</title>
      </sec>
      <sec id="sec-2-4">
        <title>InterTASS-CR</title>
      </sec>
      <sec id="sec-2-5">
        <title>InterTASS-PE</title>
      </sec>
      <sec id="sec-2-6">
        <title>Category</title>
      </sec>
      <sec id="sec-2-7">
        <title>Train</title>
        <p>N
NEU
NONE</p>
        <p>P
Total</p>
        <p>N
NEU
NONE</p>
        <p>P
Total</p>
        <p>N
NEU
NONE</p>
        <p>P
Total</p>
        <p>N
NEU
NONE</p>
        <p>P
Total</p>
        <p>Sequences of three or more occurrences
of the same character were replaced by
a unique occurrence of that character.
For instance, \holaaaa" was replaced by
\hola".</p>
        <p>Interjections denoting laughter
(\jajajaja", \jejeje", \jajaj") were replaced by
the token \jaja".</p>
        <p>The text was converted to lowercase.</p>
        <p>We did not include any grammatical
information, like lemma, POS-tag, morphological
or syntactic information.
3
3.1</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Resources</title>
      <sec id="sec-3-1">
        <title>Positive and Negative</title>
      </sec>
      <sec id="sec-3-2">
        <title>Lexicons</title>
        <p>
          We built a subjective lexicon consisting of
the union of three subjective lexicons
available for Spanish
          <xref ref-type="bibr" rid="ref18 ref2">(Cruz et al., 2014; Saralegi
y San Vicente, 2013; Brooke, To loski, y
Taboada, 2009)</xref>
          . The lexicon, containing 6875
negative lemmas and 4853 positive lemmas,
was expanded with the in ectional forms of
each lemma, reaching a total of 76291 words
(48959 negative and 27332 positive). This
was done in order to alleviate the fact that
tweets were not lemmatized. For the
lexicon expansion we used the FreeLing
dictionary
          <xref ref-type="bibr" rid="ref14">(Padro y Stanilovsky, 2012)</xref>
          .
        </p>
        <p>
          In a previous work
          <xref ref-type="bibr" rid="ref17">(Rosa et al., 2017)</xref>
          ,
we used the same three Spanish lexicons,
but we took the intersection instead of the
union, obtaining a lexicon with 4730 words.
Some experiments showed that the largest
lexicon provides a small improvement on
results when used to calculate some SVM
features (as described below).
3.2
        </p>
      </sec>
      <sec id="sec-3-3">
        <title>Word embeddings set</title>
        <p>
          We used a 300 dimension word embeddings
set, trained by
          <xref ref-type="bibr" rid="ref1 ref13 ref6">(Azzinnari y Mart nez, 2016)</xref>
          using word2vec
          <xref ref-type="bibr" rid="ref10">(Mikolov et al., 2013)</xref>
          . These
embeddings are based on a corpus of almost
six billion words in Spanish. Most of the texts
come from Internet media sites.
3.3
        </p>
      </sec>
      <sec id="sec-3-4">
        <title>Word Polarity Predictor</title>
        <p>We built a regression algorithm based on
SVM using the subjective lexicon as training
set. This model should be able to predict a
real number representing the polarity of each
word. The model takes as input the 300 real
values of the vector representing the word
and returns a real value for the word polarity.
For training, we assigned the value 1 to
positive words and the value -1 to negative words.
In table 2 we show the result of applying this
classi er to some words.</p>
        <p>Word</p>
        <p>Prediction
apoyamos 1.09973945</p>
        <p>amigo 0.89985318
excelente 1.04574863
cansancio -0.98582263
abat an -1.02370082
horrible -0.91882273
apartamento -0.30991363</p>
        <p>telefono -0.48884958</p>
        <p>As these examples show, words expected
to be positive have values close to 1 and
words expected to be negative have values
close to -1. On the other hand, neutral words
have values closer to 0 than to 1 or -1.
3.4</p>
      </sec>
      <sec id="sec-3-5">
        <title>Category Markers</title>
        <p>We obtained the list of all the words in the
training corpus and for each one we
calculated the distribution of the four categories
in all the tweets where this word occurs. We
consider that a word is a category marker if
it occurs at least 75 % times in this category.
Using this information we built markers lists
for the four categories: 429 positive words,
438 negative words, 12 neutral words, and 33
no opinion markers.
4
4.1</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Classi ers</title>
      <sec id="sec-4-1">
        <title>SVM based approach</title>
        <p>
          The SVM classi er con gurations are almost
the same as the ones described in
          <xref ref-type="bibr" rid="ref17">(Rosa et
al., 2017)</xref>
          . However, the use of the new
positive and negative word lexicons implied
retraining the word polarity predictor and
rebuilding the feature sets. These are the features
used by the SVM classi ers:
        </p>
        <p>
          Centroid of tweet word embeddings.
Previous works showed that, while using
the centroid (or mean vector) is a
simple technique, it reaches good results for
several NLP problems, particularly for
sentiment analysis
          <xref ref-type="bibr" rid="ref19">(White et al., 2015)</xref>
          .
(300 real values)
Polarity of the nine more relevant words
of the tweet according to the polarity
predictor. The number nine is the
average length of tweets in the training
corpus, ltering stop words. We considered
that the more relevant words are those
words whose polarities have the highest
absolute value. If the tweet has less than
nine words we completed the nine values
repeating the polarities of the words in
the tweet. (9 real values)
Number of words belonging to the
positive lexicon and to the negative lexicon.
(2 natural values)
Number of words whose vector
representations are close to the mean vector of
the positive and the negative lexicons.
(2 natural values)
Number of words belonging to the lists
of category markers. (4 natural values)
Features indicating if the original tweet
has repeated characters or some word
written entirely in upper case. (2
boolean values)
Tentative polarity (P, N, NEU, NONE)
of the tweet, based on the number of
positive and negative words in the tweet,
taking into account negation markers
(from a list). We inverted the polarity
of words occurring between the negation
marker and a punctuation mark. (4
classes)
The ve more relevant words from the
training corpus, according to a bag of
words classi er. The value ve was
experimentally de ned. We ltered out
words belonging to a list of stop words
adapted for this task (some words
relevant for sentiment analysis, such as \no"
and \pero" were removed from the stop
words list). (5 boolean values)
        </p>
        <p>
          As in the previous editions, the SVM
experiments were done using the scikit-learn
toolkit
          <xref ref-type="bibr" rid="ref15">(Pedregosa et al., 2011)</xref>
          and trained using
the multiclass probability estimation method
based on
          <xref ref-type="bibr" rid="ref20">(Wu, Lin, y Weng, 2004)</xref>
          .
4.2
        </p>
      </sec>
      <sec id="sec-4-2">
        <title>CNN based approach</title>
        <p>
          Our CNN approach uses a simpler network
than the one used in
          <xref ref-type="bibr" rid="ref17">(Rosa et al., 2017)</xref>
          .
In that case it was a convolutional network
with three branches considering two, three
and four words of context, but in our case
only one convolutional branch considering
three words of context was used, as shown in
gure 1. The input of the network is the
sequence of word embeddings corresponding to
each word in the tweet, up to a maximum of
32 words. This input is fed to the
convolutional layer, then the output goes to a max
pooling layer and a dense layer with a
dropout of 0.2 before going to a softmax layer
for output. For training this network we keep
a 70 %-30 % split for validation and use early
stopping over the validation set.
4.3
        </p>
      </sec>
      <sec id="sec-4-3">
        <title>LSTM based approach</title>
        <p>Our LSTM neural network architecture uses
the embedding for each word as input, up to
a maximum of 32 words. This input is sent
through a LSTM layer and then a dense
layer with a dropout of 0.2, before getting the
output through a softmax layer, as shown in
gure 2.</p>
        <p>The initial experiments using this network
yielded good accuracy results, but the
macroF measure was very low because the network
did not predict any output for the class NEU.
This class has proven to be the most di cult
to learn throughout our experiments.
However, we started to get better results using
a di erent training strategy: we created two
versions of the training corpus, one of them
with all the tweets, and the other one taking
the same number of tweets for each category
(exactly the same number of tweets as the
NEU category, which was the one with the
fewest tweets). We call this set the balanced
corpus.</p>
        <p>The training strategy involves training one
epoch with the whole corpus and one epoch
with the balanced corpus, then iterate this
training process until the performance over
the development set stopped improving.
Training the network in this fashion yields a little
less accuracy but it compensates in macro-F
measure, as it captures a lot more tweets of
the NEU category.</p>
        <p>
          Both neural network approaches (CNN
and LSTM) were implemented using the
Keras library (Chollet, 2015) and trained using
the adam optimization algorithm
          <xref ref-type="bibr" rid="ref7">(Kingma y
Ba, 2014)</xref>
          .
5
        </p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Results</title>
      <p>Three di erent corpora considering three
Spanish variants were used for this task:
from Spain (ES), Costa Rica (CR) and
Peru (PE). Furthermore, the systems could be
trained with training data for the
corresponding Spanish variant (monolingual case), or
they could be trained using data from other
variants (cross-lingual case). We decided to
submit the two best results for each classi er
family on each of the variants and training
combinations. Our results are shown in
tables 3 and 4.</p>
      <p>Taking in consideration the macro-F
measure, our systems achieved good performance
in all the test variants, ranking top 1 for
monolingual CR and PE and cross-lingual ES
and CR; and ranking top 2 for monolingual
ES and cross-lingual PE. The best results for
our systems in the monolingual training
case were achieved by the neural networks
approaches: in two cases, the best systems were
LSTMs and in the other case it was a CNN.
In the cross-lingual training cases, on the
other hand, the three best systems were SVMs.</p>
      <p>
        We submitted another system that
combined the output probabilities of the best
LSTM and SVM, in order to leverage the
information of both classi ers. This approach
had yielded good results in the past
        <xref ref-type="bibr" rid="ref17">(Rosa et
al., 2017)</xref>
        . In this case, although the
performance of the combined approach was good
(49.1 % macro-F for the ES corpus), it was
still a little lower than the LSTM approaches.
      </p>
      <p>As can be seen in table 5, one of the
reasons the LSTM could have gotten better
results over the test set was because it could
We presented three approaches for TASS
2018 Task 1 about classifying the sentiment
of tweets in di erent Spanish variants. The
approaches we used are: SVM using word
embedding centroids and manually crafted
features, CNN using word embeddings as input,
and LSTM using word embeddings, trained
with focus on improving the recognition of
neutral tweets. None of the classi ers was a
clear winner in our experiments, as some of
them worked better than others for di erent
Spanish variants. However, we found that the
training method used for the LSTMs signi
cantly improved their macro-F measure by
improving the detection of neutral tweets. In
all cases, the use of word embeddings was key
to improve the performance of the methods.</p>
      <p>Dev</p>
      <sec id="sec-5-1">
        <title>Test</title>
        <p>F1</p>
        <p>P
F1
46.4
47.1
45.0
44.8
43.8
47.0 Ceron-Guzman, J. A. 2017. Classi er
ensembles that push the state-of-the-art in
sentiment analysis of spanish tweets. En
Proceedings of TASS.
47.6
47.4
42.1
46.2 Chollet, F. 2015. Keras. https://github.
47.3 com/fchollet/keras.
44.4</p>
        <p>Cruz, F. L., J. A. Troyano, B. Pontes, y F. J.</p>
        <p>Ortega. 2014. Building layered,
multilingual sentiment lexicons at synset and
lemma levels. Expert Systems with
Applications, 41(13):5984{5994.
Guillena A. Piad Mor s, y J.
VillenaRoman, editores, Proceedings of TASS
2018: Workshop on Semantic Analysis
at SEPLN (TASS 2018), volumen 2172
de CEUR Workshop Proceedings, Sevilla,
Spain, September. CEUR-WS.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Azzinnari</surname>
            ,
            <given-names>A</given-names>
            . y A
          </string-name>
          . Mart nez.
          <year>2016</year>
          . Representacion de Palabras en Espacios de Vectores. Proyecto de grado, Universidad de la Republica, Uruguay.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Brooke</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , M. To loski, y M.
          <year>Taboada</year>
          .
          <year>2009</year>
          .
          <article-title>Cross-linguistic sentiment analysis: From english to spanish</article-title>
          .
          <source>En RANLP</source>
          , paginas
          <volume>50</volume>
          {
          <fpage>54</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Ceron-Guzman</surname>
            ,
            <given-names>J. A.</given-names>
          </string-name>
          <year>2016</year>
          . Jacerong at tass
          <year>2016</year>
          :
          <article-title>An ensemble classi er for sentiment tweets at global level</article-title>
          .
          <source>En Proceedings of TASS 2016: Workshop on Sentiment Analysis at SEPLN co-located with 32nd SEPLN Conference (SEPLN</source>
          <year>2016</year>
          ), Salamanca, Spain.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          44.4 Garc
          <string-name>
            <surname>a-Cumbreras</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Villena-Roman</surname>
          </string-name>
          ,
          <string-name>
            <surname>E.</surname>
          </string-name>
          <article-title>Mart nez-</article-title>
          <string-name>
            <surname>Camara</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <article-title>D az-</article-title>
          <string-name>
            <surname>Galiano</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Mart</surname>
          </string-name>
          n-Valdivia, y L.
          <source>na Lopez</source>
          .
          <year>2016</year>
          .
          <article-title>Overview of tass 2016</article-title>
          .
          <source>En Proceedings of TASS</source>
          <year>2016</year>
          :
          <article-title>Workshop on Sentiment Analysis at SEPLN co-located with the 32nd SEPLN Conference (SEPLN</article-title>
          <year>2016</year>
          ), Salamanca, Spain, September.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Hurtado</surname>
            , L.-
            <given-names>F.</given-names>
            , F.
          </string-name>
          <string-name>
            <surname>Pla</surname>
            ,
            <given-names>y J.</given-names>
          </string-name>
          <string-name>
            <surname>Gonzalez</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>ELiRF-UPV at TASS 2017: Sentiment Analysis in Twitter based on Deep Learning</article-title>
          .
          <source>En Proceedings of TASS.</source>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Hurtado</surname>
            ,
            <given-names>L. F. y F.</given-names>
          </string-name>
          <string-name>
            <surname>Pla</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Elirfupv en tass 2016: Analisis de sentimientos en twitter</article-title>
          .
          <source>En Proceedings of TASS 2016: Workshop on Sentiment Analysis at SEPLN co-located with 32nd SEPLN Conference (SEPLN</source>
          <year>2016</year>
          ), Salamanca, Spain.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Kingma</surname>
            ,
            <given-names>D. P. y J.</given-names>
          </string-name>
          <string-name>
            <surname>Ba</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Adam: A method for stochastic optimization</article-title>
          .
          <source>CoRR, abs/1412</source>
          .6980.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Mart</surname>
            nez-Camara, E.,
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Almeida Cruz</surname>
          </string-name>
          , M. C.
          <article-title>D az-</article-title>
          <string-name>
            <surname>Galiano</surname>
            ,
            <given-names>S. Estevez</given-names>
          </string-name>
          <string-name>
            <surname>Velarde</surname>
            ,
            <given-names>M. A.</given-names>
          </string-name>
          <string-name>
            <surname>Garc</surname>
            a-Cumbreras, M. Garc aVega,
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Gutierrez Vazquez</surname>
            ,
            <given-names>A. Montejo</given-names>
          </string-name>
          <string-name>
            <surname>Raez</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Montoyo</surname>
            <given-names>Guijarro</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Mun</surname>
          </string-name>
          <article-title>~oz Guillena, A. Piad Mor s</article-title>
          , y J.
          <source>VillenaRoman</source>
          .
          <year>2018</year>
          .
          <article-title>Overview of TASS 2018: Opinions, health and emotions</article-title>
          . En E. Mart
          <string-name>
            <surname>nez-Camara Y. Almeida Cruz M. C. D az-Galiano S. Estevez Velarde M. A. Garc</surname>
          </string-name>
          a
          <string-name>
            <surname>-Cumbreras M. Garc aVega Y. Gutierrez Vazquez A. Montejo Raez A. Montoyo Guijarro R. Mun</surname>
          </string-name>
          ~oz
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Mart</surname>
            nez-Camara,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>M. C.</surname>
          </string-name>
          <article-title>D az-</article-title>
          <string-name>
            <surname>Galiano</surname>
            ,
            <given-names>M. A.</given-names>
          </string-name>
          <string-name>
            <surname>Garc</surname>
          </string-name>
          a-Cumbreras, M. Garc aVega, y J.
          <string-name>
            <surname>Villena-Roman</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Overview of tass 2017</article-title>
          .
          <string-name>
            <surname>En J. Villena Roman M. A. Garc a Cumbreras D. G. M. C. Mart</surname>
          </string-name>
          nez-Camara, Eugenio, y M.
          <article-title>Garc a Vega, editores</article-title>
          ,
          <source>Proceedings of TASS 2017: Workshop on Semantic Analysis at SEPLN (TASS</source>
          <year>2017</year>
          ), volumen 1896 de CEUR Workshop Proceedings, Murcia, Spain, September. CEURWS.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Mikolov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Chen</surname>
          </string-name>
          , G. Corrado,
          <string-name>
            <given-names>y J.</given-names>
            <surname>Dean</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>E cient estimation of word representations in vector space</article-title>
          .
          <source>arXiv preprint arXiv:1301</source>
          .
          <fpage>3781</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Moctezuma</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Gra</surname>
          </string-name>
          , S. MirandaJimenez, E. Tellez,
          <string-name>
            <given-names>A.</given-names>
            <surname>Coronado</surname>
          </string-name>
          , C. Sanchez, y J.
          <string-name>
            <surname>Ortiz-Bejar</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>A genetic programming approach to sentiment analysis for twitter:</article-title>
          <source>Tass'17. En Proceedings of TASS.</source>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Mohammad</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Bravo-Marquez</surname>
          </string-name>
          , M. Salameh, y
          <string-name>
            <given-names>S.</given-names>
            <surname>Kiritchenko</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Semeval2018 task 1: A ect in tweets</article-title>
          .
          <source>En Proceedings of The 12th International Workshop on Semantic Evaluation, paginas</source>
          <volume>1</volume>
          {
          <fpage>17</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Montejo-Raez</surname>
          </string-name>
          , A.
          <string-name>
            <surname>y M. C. D</surname>
          </string-name>
          az-Galiano.
          <year>2016</year>
          .
          <article-title>Participacion de SINAI en TASS 2016</article-title>
          .
          <source>En Proceedings of TASS 2016: Workshop on Sentiment Analysis at SEPLN co-located with 32nd SEPLN Conference (SEPLN</source>
          <year>2016</year>
          ), Salamanca, Spain.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <surname>Padro</surname>
            , L. y
            <given-names>E.</given-names>
          </string-name>
          <string-name>
            <surname>Stanilovsky</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Freeling 3.0: Towards wider multilinguality</article-title>
          .
          <source>En Proceedings of the Language Resources and Evaluation Conference (LREC</source>
          <year>2012</year>
          ), Istanbul, Turkey, May. ELRA.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>Pedregosa</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Varoquaux</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gramfort</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Michel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Thirion</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Grisel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Blondel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Prettenhofer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Weiss</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Dubourg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Vanderplas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Passos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Cournapeau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Brucher</surname>
          </string-name>
          , M. Perrot,
          <string-name>
            <given-names>y E.</given-names>
            <surname>Duchesnay</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>Scikit-learn: Machine learning in Python</article-title>
          .
          <source>Journal of Machine Learning Research</source>
          ,
          <volume>12</volume>
          :
          <fpage>2825</fpage>
          {
          <fpage>2830</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <surname>Quiros</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>I.</surname>
          </string-name>
          Segura-Bedmar, y P. Mart nez.
          <year>2016</year>
          .
          <article-title>LABDA at the 2016 TASS challenge task: Using word embeddings for the sentiment analysis task</article-title>
          .
          <source>En Proceedings of TASS 2016: Workshop on Sentiment Analysis at SEPLN co-located with 32nd SEPLN Conference (SEPLN</source>
          <year>2016</year>
          ), Salamanca, Spain.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <surname>Rosa</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Chiruzzo</surname>
          </string-name>
          , M. Etcheverry, y
          <string-name>
            <given-names>S.</given-names>
            <surname>Castro</surname>
          </string-name>
          .
          <year>2017</year>
          . RETUYT en TASS 2017: Analisis de Sentimientos de Tweets en Espan~
          <article-title>ol utilizando SVM y CNN</article-title>
          .
          <source>En Proceedings of TASS.</source>
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <surname>Saralegi</surname>
            ,
            <given-names>X. y I. San</given-names>
          </string-name>
          <string-name>
            <surname>Vicente</surname>
          </string-name>
          .
          <year>2013</year>
          . Elhuyar at tass
          <year>2013</year>
          . XXIX Congreso de la Sociedad Espaola de Procesamiento de lenguaje natural,
          <source>Workshop on Sentiment Analysis at SEPLN (TASS2013)</source>
          , paginas
          <volume>143</volume>
          {
          <fpage>150</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <surname>White</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Togneri</surname>
          </string-name>
          , W. Liu, y
          <string-name>
            <given-names>M.</given-names>
            <surname>Bennamoun</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>How well sentence embeddings capture meaning</article-title>
          .
          <source>En Proceedings of the 20th Australasian Document Computing Symposium, pagina 9</source>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <surname>Wu</surname>
          </string-name>
          , T.-F.,
          <string-name>
            <surname>C.-J. Lin</surname>
            , y
            <given-names>R. C.</given-names>
          </string-name>
          <string-name>
            <surname>Weng</surname>
          </string-name>
          .
          <year>2004</year>
          .
          <article-title>Probability estimates for multi-class classi cation by pairwise coupling</article-title>
          .
          <source>Journal of Machine Learning Research</source>
          ,
          <volume>5</volume>
          (Aug):
          <volume>975</volume>
          {
          <fpage>1005</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>