<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>ELiRF-UPV at IberEval 2017: Stance and Gender Detection in Tweets</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jose-Angel Gonzalez</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ferran Pla</string-name>
          <email>fpla@dsic.upv.es</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Llu s-F. Hurtado</string-name>
          <email>lhurtado@dsic.upv.es</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Departament de Sistemes Informatics i Computacio Universitat Politecnica de Valencia</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2017</year>
      </pub-date>
      <fpage>193</fpage>
      <lpage>198</lpage>
      <abstract>
        <p>This paper describes the participation of ELiRF-UPV team at the two Spanish subtasks of the Stance and Gender Detection in Tweets on Catalan Independence track of the IberEval workshop. We tested several approaches based on di erent models and tweet representations. Our best approaches are based on neural networks with one-hot vector representation and Support Vector Machines using bag-of-ngrams of chars. We achieved the rst place on the gender detection subtask and the fourth place on the stance detection subtask.</p>
      </abstract>
      <kwd-group>
        <kwd>Neural Networks</kwd>
        <kwd>Support Vector Machine</kwd>
        <kwd>bag-of-words</kwd>
        <kwd>one-hot vectors</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>The corpus is composed by tweets labeled with respect to the independence of
Catalonia (three classes: AGAINST, NEUTRAL, FAVOR) and with respect to
the gender of the author of each tweet (two classes: MALE and FEMALE).</p>
      <p>These tweets are provided in Spanish and Catalan, however, we have only
worked with the Spanish version of the proposed corpus. On the other hand, it is
necessary to take into account that the corpus is unbalanced in terms of stance
detection, being a clear bias between classes AGAINST and NEUTRAL with
respect to class FAVOR. This unbalance does not occur in the gender detection
subtask as can be seen in Table 1.
In this section we describe the main characteristics of the system developed to
the Stance and Gender Detection in Tweets on Catalan Independence track of
the IberEval workshop. This description includes the preprocessing used, the
di erent tweets representations used and, the di erent models that were taken
into account during the tuning phase.
The preprocessing process of the tweets was a bit di erent depending on the
subtasks. In both cases, we removed the accents and converted all the text to
lowercase. The web links (URL), and the numbers were substituted by a speci c
label.</p>
      <p>We assumed that the hashtags, the emoticons and the mentions to other users
would be informative to determine the opinions of a user but not his/her gender.
Accordingly to this assumption, we substituted the hashtags, the emoticons and
user's mentions by a speci c label for the Gender subtask, but we kept their
values for the Stance subtask.</p>
      <p>
        Since stance detection may be related, in some way, to sentiment analysis, we
tested the use of polarity lexicons for the Stance subtask. Speci cally, we tried
to include NRC lexicon [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] as extra features for stance detection.
3.3
      </p>
      <sec id="sec-1-1">
        <title>Models</title>
        <p>
          We explored di erent models depending on the representation of the tweets.
This way, Long short-term memory (LSTM) neural networks [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] assembled with
Convolutional neural networks (CNN) were used to deal with the sequential
representations (embeddings) and the one-hot vectors of chars representation,
while for the bag-of-ngrams (both at the word level and at the char level)
representation Support Vector Machines (SVM) with linear kernel and Multilayer
Perceptrons (MLP) were used.
        </p>
        <p>
          The NRC polarity lexicon was used only with the embeddings representation.
The topology used was similar to the one described in [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ] but without the subnet
dedicated to processing the sequences formed with embeddings obtained from
the training corpus.
3.4
        </p>
      </sec>
      <sec id="sec-1-2">
        <title>Tuning</title>
        <p>In order to select the representations and the models (including their parameters)
more appropriated to each subtask, a tuning process was performed. The corpus
provided by the organizers of the task was split into two sets, a set with the
80% of the tweet for learning the model and the remaining 20% of the corpus
was used as tuning. The partitions were the same for all the tuning process.
For tuning the models of each subtask, the o cial evaluation measure of each
subtask was taken into account as optimization criterion.</p>
        <p>Faced with the impossibility of testing all combinations of models and
representations, only those combinations we thought that made more sense were
considered. Table 2 shows the most relevant combinations of features and
models as well as the results obtained during the tuning phase.</p>
        <p>Regarding the stance detection subtask, as can be seen in the rst row of
Table 2, with sequential representation at word level (Wikipedia embeddings)
the result obtained was 51.84 for F1 measure. Unfortunately, worse results were
obtained when the polarity sequence according to the NRC lexicon was added
to the embeddings representation (Emb+NRC).</p>
        <p>We also tried the use of a di erent sequential representation formed by
onehot vectors at character level and processed with the same network used in the
previous experiments. This representation together with CNN + LSTM obtained
the best results on tuning, 55.10 of F1 measure as can be seen in the third row
of Table 2.</p>
        <p>Regarding the gender detection subtask, although the sequential
representation at word level (embeddings from Wikipedia) processed using CNN + LSTM
obtained good results, 64.47% in Accuracy, the best results in the tuning phase
were achieved by the representations based on bag-of-ngrams of chars. The
representation based on bag-of-ngrams of words achieved signi cantly worst results.</p>
        <p>The models that obtained better results were the Support Vector Machines
with linear kernel. Speci cally, the SVM model using bag-of-unigram of chars
as representation of the tweets achieved 66.92% of Accuracy; while adding
bagof-bigrams of chars to the previous model slightly increases the Accuracy to
66.99%. These results correspond to the last two rows in Table 2.
4</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>Results</title>
      <p>In view of the results obtained during the tuning phase and due the limitation
of the track, we decided to send the following two runs to the competition.
{ run1
stance detection: CNN + LSTM + char-one-hot
gender detection: SVM + bag-of-2grams of chars
{ run2
stance detection: CNN + LSTM + char-one-hot (the same as in run1)
gender detection: SVM + bag-of-1grams of chars + bag-of-2grams of
chars (the best accuracy at tunning)</p>
      <p>Once we have analyzed the results, both in the tuning phase and in the o cial
competition, we want to point out some interesting things.</p>
      <p>In both subtasks, methods based on deep-learning have shown to o er
competitive results. However, in the case of gender detection subtask, the best results
have been obtained with a priori simple model, SVM and bag-of-chars. We
hypothesize that the good results achieved by SVM models in this subtask is due
to the greater robustness of these models (compared with deep-learning based
models) to deal with the bias problem. If the imbalance is very large, it can cause
that the network assigns all the samples only to the majority classes. The
solution used for the stance detection subclass was to perform a scaling of the loss
function during the training phase. This has prevented the network from
classifying all tweets in the AGAINST and NEUTRAL classes (the majority classes
with much di erence in the Spanish version of the stance detection corpus).</p>
      <p>
        Regarding the stance detection subtask, a sequential character-level
representation has been chosen due to the increasing interest this kind of representations
are having in the deep-learning area and the good results they are achieving
[
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. In this way, we have been able to verify that, e ectively, this type of
representations (in conjunction with neural networks that handle sequences) provide
competitive results in text classi cation tasks such as the stance detection
subtask.
5
      </p>
    </sec>
    <sec id="sec-3">
      <title>Conclusions and Future work</title>
      <p>We have presented the participation of the ELiRF-UPV team at the Stance
and Gender Detection in Tweets on Catalan Independence track of the IberEval
workshop. Our team has participated in the two Spanish subtasks of the track
and has achieved competitive results. Our best approaches were based on
neural networks with sequential representation of the tweets and Support Vector
Machines with bag-of-ngrams of chars.</p>
      <p>As future work, we plan to use representations based on one-hot vectors
at character level and CNN + LSTM on other tweet classi cation problems
(TASS, SemEval, ...) in order to study their behavior in tasks other than stance
detection.</p>
    </sec>
    <sec id="sec-4">
      <title>Acknowledgements</title>
      <p>This work has been partially supported by the Spanish MINECO and FEDER
founds under project ASLP-MULAN: Audio, Speech and Language Processing
for Multimedia Analytics, TIN2014-54288-C4-3-R.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Gonzalez</surname>
            ,
            <given-names>J.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pla</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hurtado</surname>
            ,
            <given-names>L.F.</given-names>
          </string-name>
          :
          <article-title>ELiRF-UPV at SemEval-2017 Task 4: Sentiment Analysis using Deep Learning</article-title>
          .
          <source>In: Proceedings of the 11th International Workshop on Semantic Evaluation</source>
          . pp.
          <volume>722</volume>
          {
          <fpage>726</fpage>
          . SemEval '17,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computational Linguistics, Vancouver, Canada (
          <year>August 2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Hochreiter</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schmidhuber</surname>
            ,
            <given-names>J.: Long</given-names>
          </string-name>
          <string-name>
            <surname>Short-Term Memory</surname>
          </string-name>
          .
          <source>Neural Computation</source>
          <volume>9</volume>
          (
          <issue>8</issue>
          ),
          <volume>1735</volume>
          {
          <fpage>1780</fpage>
          (
          <year>1997</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Mikolov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Corrado</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dean</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>E cient estimation of word representations in vector space</article-title>
          .
          <source>CoRR abs/1301</source>
          .3781 (
          <year>2013</year>
          ), http://arxiv.org/abs/ 1301.3781
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Mikolov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sutskever</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Corrado</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dean</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>Distributed representations of words and phrases and their compositionality</article-title>
          .
          <source>CoRR abs/1310</source>
          .4546 (
          <year>2013</year>
          ), http://arxiv.org/abs/1310.4546
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Mohammad</surname>
            ,
            <given-names>S.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kiritchenko</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sobhani</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhu</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cherry</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Semeval-2016 task 6: Detecting stance in tweets</article-title>
          .
          <source>In: Proceedings of the International Workshop on Semantic Evaluation. SemEval '16</source>
          , San Diego, California (
          <year>June 2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Mohammad</surname>
            ,
            <given-names>S.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Turney</surname>
          </string-name>
          , P.D.:
          <article-title>Crowdsourcing a Word-</article-title>
          <source>Emotion Association Lexicon</source>
          <volume>29</volume>
          (
          <issue>3</issue>
          ),
          <volume>436</volume>
          {
          <fpage>465</fpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Rehurek</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sojka</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Software Framework for Topic Modelling with Large Corpora</article-title>
          .
          <source>In: Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks</source>
          . pp.
          <volume>45</volume>
          {
          <fpage>50</fpage>
          . ELRA, Valletta, Malta (May
          <year>2010</year>
          ), http://is.muni.cz/ publication/884893/en
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Rosso</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rangel</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Potthast</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stamatatos</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tschuggnall</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stein</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <source>Overview of PAN'16</source>
          , pp.
          <volume>332</volume>
          {
          <fpage>350</fpage>
          . Springer International Publishing (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Taule</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mart</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rangel</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosso</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bosco</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Patti</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Overview of the task of Stance and Gender Detection in Tweets on Catalan Independence at IBEREVAL 2017</article-title>
          .
          <source>In: Proceedings of the Second Workshop on Evaluation of Human Language Technologies for Iberian Languages (IberEval</source>
          <year>2017</year>
          ).
          <source>CEUR Workshop Proceedings. CEUR-WS.org</source>
          ,
          <year>2017</year>
          , Murcia (Spain) (
          <year>September 2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10. Wikipedia:
          <article-title>Wikipedia spanish dumps (</article-title>
          <year>2017</year>
          ), https://dumps.wikimedia.org/ eswiki/, [Online; accessed 18-May-2017]
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhao</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , LeCun, Y.:
          <article-title>Character-level Convolutional Networks for Text Classi cation</article-title>
          .
          <source>In: Proceedings of the 28th International Conference on Neural Information Processing Systems</source>
          . pp.
          <volume>649</volume>
          {
          <fpage>657</fpage>
          . NIPS'15, MIT Press, Cambridge, MA, USA (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>