<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>ELiRF-UPV at MultiStanceCat 2018</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jose-Angel Gonzalez[</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Llu s-Felip Hurt</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Departament de Sistemes Informatics i Computacio Universitat Politecnica de Valencia</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2018</year>
      </pub-date>
      <fpage>173</fpage>
      <lpage>179</lpage>
      <abstract>
        <p>This paper describes the participation of ELiRF-UPV team at the Spanish subtasks of the MultiModal Stance Detection in tweets on Catalan #1Oct Referendum workshop. Our best approach is based on Convolutional Neural Networks using word embeddings and polarity/emotion lexicons. We obtained competitive results on the Spanish subtask using only the text of the tweet, dispensing with contexts and images.</p>
      </abstract>
      <kwd-group>
        <kwd>Deep Learning Networks</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introducction</title>
      <p>The corpus is composed by tweets labeled with respect to the stance of the
Catalan rst October Referendum (2017). There are three classes: AGAINST
(AG), NEUTRAL (NE) and FAVOR (FA). These tweets are provided in Spanish
and Catalan, however, we worked only with the Spanish subtask. Moreover,
although context of the tweet and images are also provided by the organizers,
we only used the text of the tweet.</p>
      <sec id="sec-1-1">
        <title>J-A. Gonzalez et al.</title>
        <p>
          From the o cial training corpus, we randomly selected 80% in order to train
our models. The remaining 20% was used as development set. Table 1 shows the
sample distribution per class in the Spanish corpus.
In this Section, we describe the two models used in the competition. Both
models share the same preprocessing of the tweets by means of the TweetMotif [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]
package. We applied a normalization step consisted on lowercasing the words,
removing some language-speci c characters such as accent, dieresis, special
language characters, and normalizing Twitter-speci c tokens (hashtags, user
mentions and urls) by replacing them for a xed word e.g. #1octL6 ! #hashtag.
        </p>
        <p>As rst model for the experimentation, we used a Support Vector Machine
(SVM) classi er with di erent representations of the tweets. Concretely, we used
bag-of-word-ngrams and bag-of-char-ngrams with several values of n (including
combination of ngrams e.g. bag-of-word-1-4grams means the concatenation of
n = [1; 4]grams).</p>
        <p>
          As second model for the experimentation, we used a Convolutional Neural
Network (CNN) architecture inspired by the work presented in [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ], with the
aim of obtaining representations of the tweets similar to continuous versions
of the bag-of-ngrams. We represented the tweets using Word2Vec distributed
representations of words [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. Moreover, to enrich the system, we used several
polarity/emotion lexicons combined with the word embeddings.
        </p>
        <p>
          We used ELHPolar [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ], ISOL [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ], MLSenticon [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ] and the Spanish version
of NRC [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] as lexicons. As word embeddings, we trained a skip-gram model,
with 300 dimensions for each word, from 87 million Spanish tweets collected for
previous experimental work.
        </p>
        <p>We represent each tweet x as a matrix S 2 Rn (d+v), where n is the maximum
number of words per tweet, d is the dimensionality of word embeddings and v is
the dimensionality of the polarity/emotion features, that is, the number of
polarity/emotion lexicons. In order to obtain this representation, we use an embedding
model h(w) 2 Rd and a set of lexicons h0 (w) = [h01(w); h02(w); :::; h0l(w)] 2 Rv,
where h0k(w) is the polarity value of the word w in the lexicon k.</p>
        <p>Therefore, given a tweet x with n tokens, x = w1; w2; :::; wn, we represent
it as a matrix S in which, each row i is the concatenation of the embedding of
wi (h(wi)) and a vector with the polarity values of wi in each lexicon (h0 (wi)),</p>
      </sec>
      <sec id="sec-1-2">
        <title>ELiRF-UPV at MultiStanceCat 2018 3</title>
        <p>S = [h(w1)jh0 (w1); h(w2)jh0 (w2); :::; h(wn)jh0 (wn)]. In the case where a word wi
is out of vocabulary for the embedding models, we replace its embedding by the
embedding of the word \unknown", h(wi) = h(\unknown"). Similarly, if wi is
not included in any lexicon, h0 (wi) = [0; 0; :::; 0] 2 Rv.</p>
        <p>Due to the variable length of the tweets, we used zero padding at the start
of a tweet if it does not reach the maximum speci ed length. Otherwise, if the
length of a tweet is greater than the maximum, we only consider the rst n words
of the tweet. In this task, the average number of words per tweet is navg = 18:5,
and the maximum length is nmax = 34. We decided to set the length n = 26
which is the mean of navg and nmax.</p>
        <p>Regarding the CNN architecture, we applied one-dimensional convolutions
with variable height lters in order to extract the temporal structure of the
tweet over several region sizes. Figure 1 summarizes the model architecture and
its hyperparameters.</p>
        <p>Batch Normalization + ReLU
Convolution
1D Global Max Pooling</p>
        <p>
          Concat + Batch Normalization Softmax fully-connected layer
4 region sizes ([
          <xref ref-type="bibr" rid="ref1 ref4">1, 4</xref>
          ])
256 filters for each region
size 1024 different filters
256 feature maps for
each region size
ℝ4× ×256
256 salient features
for each region size
ℝ4×256
        </p>
        <p>Concatenated salient
features
ℝ1024
do
you
think
humans
have
sense</p>
        <p>Batch Normalization
Sentence matrix
ℝ ×( + )
…
…
…
…
…
…
…
………
………
………
………
………
………
…
……
…
…</p>
        <p>⋮
⋮
…
⋮
3 classes</p>
        <p>As can be seen in Figure 1, we used 4 di erent region sizes (the lter height range
from 1 to 4) and 256 lters for each region size. We used this range of region
sizes because, in the development phase, the best baseline was SVM using
bagof-word-1-4grams. After the lters were applied, we obtained 256 output feature
maps for each region size.</p>
        <p>In order to extract the most salient features for each region size, we applied
1D Global Max Pooling to the feature maps of each region size. Therefore, we
obtained 4 vectors with 256 components, that were concatenated and used as
4
input to a fully-connected layer which performs the classi cation task. We used
a softmax activation function to model the posterior distribution of each class
at the output layer.
4</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>Experimental results</title>
      <p>In this section, we describe the experimental work conducted by ELiRF team in
the MultiStanceCat task. In addition, we present a study of the performance of
our best system in the competition.</p>
      <p>Table 2 summarizes the results obtained in the development phase. Three
di erent classi ers were considered: Linear SVM with bag-of-ngrams of words,
Linear SVM with bag-of-ngrams of chars and CNNs.</p>
      <p>For the SVM approaches we tested di erent values of n. With respect to
the CNN we explored three loss functions: Cross Entropy (CCE), Mean Squared
Error (MSE) and di erentiable approximation of the F1 measure (SMF1).
It can be observed in Table 2 that generally bag-of-chars performs worse than
bag-of-words. Note that CNN models outperform the results achieved by the
SVM classi ers. Moreover, CNN classi er with SMF1 loss function outperforms
the results of all the other classi ers. However, a deeper study about which
factors such as embeddings or lexicons are more relevant in the results would be
interesting.</p>
      <sec id="sec-2-1">
        <title>ELiRF-UPV at MultiStanceCat 2018 5</title>
        <p>It can be also observed that the value of the F1 measure for the NEUTRAL
class (F1(N E)) is generally lower than the F1 measures for AGAINST and
FAVOR classes (F1(AG); F1(F A)). We hypothesize this is due to the fact that
NEUTRAL class has less samples in the corpus. However, low values of F1(N E)
measure do not a ect the o cial evaluation measure that is de ned as the
average between F1(AG) and F1(F A).</p>
        <p>For the Spanish subtask competition, we selected the best CNN and SVM
models according to the results obtained in the development phase. Concretely,
our rst run (ELiRF-1) was the CNN model trained using SMF1 loss function. As
a second run (ELiRF-2) we selected a Linear SVM with bag-of-word-1-4grams.</p>
        <p>Table 3 shows the confusion matrices of the two submitted systems. It can be
observed that both systems confuse the NEUTRAL and the AGAINST classes
in a similar way. The best performance achieved by ELiRF-1 run is because it
predicts better the FAVOR class.
We have also performed a study of the samples that ELiRF-1 system misclassi ed
with high con dence. Some of these samples are shown in Table 4. We think
that in some cases, errors could be avoided by considering hashtags (sample 5,
#4gatos) or user mentions (error 2, @CatalunyaPlural). Unfortunately, we have
not included this information in our models.
6</p>
        <p>Table 5 shows the results on the test set for all the participating teams in the
Spanish task. The ELiRF-1 run obtained competitive results without using the
text of previous and next tweets or the images in the user timeline. Moreover, we
can observe that the context seems to be useful for this task because all the best
participating teams used this information. Finally, we would like to highlight
the great di erence observed in the results obtained on the development and the
test sets. We have no explanation for this, but we think that a study about this
aspect should be done when the test set will be available.
In this paper, we have presented the participation of the ELiRF team at
MultiStanceCat track of the IberEval workshop. Our team participated in the Spanish
subtask of this track and competitive results were achieved using only the text of
the tweets. Our best approach is based on CNN with sequential representation
of the tweets using word embedding, and polarity/emotion lexicons.</p>
        <p>
          As future work, we plan to include the context of the tweet in our deep
learning system in a similar way as Hierarchical Attention Networks [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] do. Moreover,
we think that data augmentation could help to improve the performance of the
models.
        </p>
        <p>We have observed that hashtags and user mentions contains relevant
information for this task. For this reason, as future work, we want to explore the
inclusion of this information in the tweet representation.
6</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Acknowledgements</title>
      <p>This work has been partially supported by the Spanish MINECO and FEDER
founds under project AMIC (TIN2017-85854-C4-2-R). Work of Jose-Angel Gonzalez
is also nanced by Universitat Politecnica de Valencia under grant PAID-01-17.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Cruz</surname>
            ,
            <given-names>F.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Troyano</surname>
            ,
            <given-names>J.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pontes</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ortega</surname>
            ,
            <given-names>F.J.:</given-names>
          </string-name>
          <article-title>Building layered, multilingual sentiment lexicons at synset and lemma levels</article-title>
          .
          <source>Expert Systems with Applications</source>
          <volume>41</volume>
          (
          <issue>13</issue>
          ),
          <volume>5984</volume>
          {
          <fpage>5994</fpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Krieger</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ahn</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Tweetmotif: exploratory search and topic summarization for twitter</article-title>
          .
          <source>In: In Proc. of AAAI Conference on Weblogs and Social</source>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Mikolov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Corrado</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dean</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>E cient estimation of word representations in vector space</article-title>
          .
          <source>CoRR abs/1301</source>
          .3781 (
          <year>2013</year>
          ), http://arxiv.org/abs/1301.3781
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Mikolov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sutskever</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Corrado</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dean</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>Distributed representations of words and phrases and their compositionality</article-title>
          .
          <source>CoRR abs/1310</source>
          .4546 (
          <year>2013</year>
          ), http://arxiv.org/abs/1310.4546
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Mohammad</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kiritchenko</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sobhani</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhu</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cherry</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Semeval-2016 task 6: Detecting stance in tweets</article-title>
          .
          <source>In: Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016)</source>
          . pp.
          <volume>31</volume>
          {
          <issue>41</issue>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Mohammad</surname>
            ,
            <given-names>S.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Turney</surname>
          </string-name>
          , P.D.:
          <article-title>Crowdsourcing a word-emotion association lexicon</article-title>
          .
          <source>Computational Intelligence</source>
          <volume>29</volume>
          (
          <issue>3</issue>
          ),
          <volume>436</volume>
          {
          <fpage>465</fpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Molina-Gonzalez</surname>
            ,
            <given-names>M.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mart</surname>
            nez-Camara,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mart</surname>
            n-Valdivia,
            <given-names>M.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>PereaOrtega</surname>
            ,
            <given-names>J.M.:</given-names>
          </string-name>
          <article-title>Semantic orientation for polarity classi cation in spanish reviews</article-title>
          .
          <source>Expert Systems with Applications</source>
          <volume>40</volume>
          (
          <issue>18</issue>
          ),
          <volume>7250</volume>
          {
          <fpage>7257</fpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Saralegi</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          , San Vicente, I.:
          <article-title>Elhuyar at tass 2013</article-title>
          . In: XXIX Congreso de la Sociedad Espaola de Procesamiento de lenguaje natural,
          <source>Workshop on Sentiment Analysis at SEPLN (TASS2013)</source>
          . pp.
          <volume>143</volume>
          {
          <issue>150</issue>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Taule</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mart</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rangel</surname>
            ,
            <given-names>F.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosso</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bosco</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Patti</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          , et al.:
          <article-title>Overview of the task on stance and gender detection in tweets on catalan independence at ibereval 2017</article-title>
          . In: 2nd Workshop on
          <article-title>Evaluation of Human Language Technologies for Iberian Languages</article-title>
          ,
          <string-name>
            <surname>IberEval</surname>
          </string-name>
          <year>2017</year>
          . vol.
          <year>1881</year>
          , pp.
          <volume>157</volume>
          {
          <fpage>177</fpage>
          .
          <string-name>
            <surname>CEURWS</surname>
          </string-name>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Taule</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rangel</surname>
            ,
            <given-names>F.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mart</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosso</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Overview of the task on multimodal stance detection in tweets on catalan #1oct referendum'</article-title>
          .
          <source>In: Third Workshop on Evaluation of Human Language Technologies for Iberian Languages, IberEval</source>
          <year>2018</year>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dyer</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smola</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hovy</surname>
          </string-name>
          , E.:
          <article-title>Hierarchical attention networks for document classi cation</article-title>
          .
          <source>In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</source>
          . pp.
          <volume>1480</volume>
          {
          <issue>1489</issue>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wallace</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>A sensitivity analysis of (and practitioners' guide to) convolutional neural networks for sentence classi cation</article-title>
          .
          <source>In: Proceedings of the Eighth International Joint Conference on Natural Language Processing (Volume 1: Long Papers)</source>
          . pp.
          <volume>253</volume>
          {
          <fpage>263</fpage>
          .
          <source>Asian Federation of Natural Language Processing</source>
          (
          <year>2017</year>
          ), http://aclweb.org/anthology/I17-1026
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>