<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>GRU with Author Profiling Information to Detect Aggressiveness</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>María Guadalupe Garrido-Espinosa</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alejandro Rosales-Pérez</string-name>
          <email>alejandro.rosales@cimat.mx</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Adrián Pastor López-Monroy</string-name>
          <email>pastor.lopez@cimat.mx</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Mathematics Reseach Center (CIMAT) Monterrey, Alianza Centro 502</institution>
          ,
          <addr-line>66629, Nuevo León</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Mathematics Research Center (CIMAT)</institution>
          ,
          <addr-line>Jalisco s/n Valenciana, 36023, Guanajuato</addr-line>
        </aff>
      </contrib-group>
      <fpage>246</fpage>
      <lpage>251</lpage>
      <abstract>
        <p>This paper describes our participation for the Aggressiveness Identification Track in the third edition of MEX-A3T. The task focuses on the detection of aggressive tweets in Mexican Spanish. Our approach consists in the use of a Bidirectional Gated Recurrent Unit merged with author profiling derived features. The challenge results indicate that our proposal exceeds a Support Vector Machine baseline.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Aggressiveness Detection</kwd>
        <kwd>Bidirectional GRU</kwd>
        <kwd>Author profiling</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>The social media enables users to be in contact with others they care about. It also ofers a way
to discuss, and disseminate information as well as share opinions with the particularity that
the people can decide to show or hide their identity; this makes easier for the users to express
themselves freely, but also removes the face to face incentives to avoid being ofensive.</p>
      <p>Given the huge amount of shared data, it is dificult to manually catch all aggressive messages.
So, there is a need to construct mechanisms that help to detect them automatically to avoid
harassment on social media and prevent physical assaults derived from aggressive comments.</p>
      <p>
        The Aggressiveness Identification Track in MEX-A3T [ 1] encouraged the development of
methods to determine whether a tweet written in Mexican Spanish is aggressive or not. Based
on the results obtained by [
        <xref ref-type="bibr" rid="ref1">2</xref>
        ] to tackle the aggressiveness identification problem, we evaluated
the usage of author profiling derived characteristics along with a Gated Recurrent Unit (GRU)
network. The challenge results showed that our proposal exceeds a Support Vector Machine
(SVM) baseline.
      </p>
      <p>This article is organized as follows, Section 2 details the proposed method and the way that
author profiling characteristics were predicted. Section 3 describes the corpus and the results
obtained with the training set. Subsequently, in Section 4 the results of the competition are
presented and finally, the conclusions and future work are presented in Section 5.</p>
    </sec>
    <sec id="sec-2">
      <title>2. System</title>
      <p>We preserved all the content words in the tweets. To tokenize, all punctuation marks were
removed, converting the text into space separated sequences of words. These sequences were
split into a list of tokens to form a vocabulary. Each word in the vocabulary is represented as a
vector with a pretrained word embeddings. We used FastText embeddings from Spanish Billion
Word Corpus [3] of size 300.</p>
      <p>A bi-directional GRU model using words as inputs is proposed, this model is combined
with the predictions on gender and occupation of users (using a reference model and using a
one-hot-encoding). Then a ReLU activation is applied, followed by a dropout, and a dense layer
for making predictions; Fig. 1 shows the architecture diagram. At the end, the model considered
only the gender and Sciences-Student occupation categories (the remaining categories were
discarded by a  2 criterion).</p>
      <sec id="sec-2-1">
        <title>2.1. Bidirectional Gated Recurrent Unit</title>
        <p>The bidirectional recurrent neural networks perform better on certain tasks where the order is
meaningful and are frequently used on natural language processing [4].</p>
        <p>The Bidirectional GRU is formed by two regular GRU, each of which processes the input
sequence in one direction, left-to-right and right-to-left, and then it merges their representations.
By proceeding in this way, the Bidirectional GRU can capture patterns that might be pass over
by a unidirectional GRU.</p>
        <p>A regular GRU calculates each hidden state ℎ as follows:
  =  (    +   ℎ −1)
  =  (    +   ℎ −1)
ℎ̃  = ℎ</p>
        <p>(   +  (  ⊙ ℎ −1)
ℎ = (1 −   ) ∗ ℎ −1 +   ∗ ℎ̃ 
(Update Gate)
(Reset Gate)
(Candidate)
(Output)
where   ,   ,   ,   ,  and  are the parameters to be learned in the training phase. The
function  is the logistic sigmoid function and ⊙ is the element-wise multiplication [5].</p>
        <p>The method proposed in this work uses a Bidirectional GRU network with ℎ̂  = ℎ⃖⃖⃗ + ⃖ℎ⃖ ⃖ as the
way of merging the two GRUs.</p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Author Profiling features</title>
        <p>In order to introduce more information to the model, we used the Mexican corpus for author
profiling from MEX-A3T 2019 [ 6] to predict three labels: gender, place of residence, and
occupation, where we considered a diferent model for each label. The occupation label has eight
classes: arts, student, social, sciences, sports, administrative, health, and others, while the place
of residence has six classes: north, northwest, northeast, center, west, and southeast.</p>
        <p>We adopted the n-gram ensemble approach proposed by [7] for each one of the attributes to
forecast with a little variation in the size of n-grams. The n-gram ensemble approach involves
four steps: the first extracts groups of n-grams of size one to three at word level and size three
to five at character level. In the second step, for each group, the best n-grams are selected using
 2 criterion. This process led to choose the best five thousand, two thousand, and thousand
n-grams at word level, and the best two thousand, three thousand, and five thousand at character
level. All of them are concatenated in the third step and used to classify with a SVM in the
fourth step.</p>
        <p>Once the prediction is done, the one-hot-encoding is applied to each label, and the resulting
features are further filtered with the  2 criterion to select the best three features. This process
leaved three author profiling features: gender, student, and sciences occupation.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Experimental Settings and Preliminary Evaluation</title>
      <p>In this section we describe the corpus provided by the organizers, the partitions used to make
experiments, the architecture used, and the preliminary results obtained. Table 1 shows the
tweets distribution in the training and test set</p>
      <p>To perform experiments, we made a partition with the 7,332 samples set: 70% was taken
to train, 10% to validate, and 20% to test. Fig. 1 shows the architecture of our Bidirectional
GRU model. The embedding layer outputs an embedding vector of size 86 × 300 and feeds a
Bidirectional GRU layer with 128 hidden units. Next, a global max pooling layer and a global
average pooling layer flatten the Bidirectional GRU output by taking the average and max value,
both of them are concatenated into a vector of size 1 × 256.</p>
      <p>In other channel, the author profiling features feed a dense layer with identity activation and
with 16 hidden units. The outcome of this layer is concatenated with the pooling outcome and
form a vector of size 1 × 272. It is then passed to another layer with ReLU activation and 64 units.
Before the final prediction, a dropout layer with a rate of 0.10 is used to regularize the network.</p>
      <p>Table 2 shows the results obtained by the method described in Section 2 at the validation
stage. The F1 obtained with the fusion of gender, sciences, and bi-GRU features is slightly better
than the model that incorporates student variable but is nearly a point better than the method
without author profiling features.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Competition Results</title>
      <p>In this section we will present our results in the competition. Table 3 lists the final rankings for
the challenge in the aggressiveness detection task. DeepMath-1 corresponds to the experiment
with gender and sciences while DeepMath-2 also includes the student trait. They ranked ninth
and tenth correspondingly.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusions and future work</title>
      <p>In this paper, we reported our participation in the MEX-A3T 2020 project to classify aggressive
and non-aggressive tweets written in Mexican Spanish. We proposed a Bidirectional GRU
at word level with author profiling information. The results showed that the use of extra
information as gender and sciences occupation allows us to get a better performance than
those obtained without author profiling features. The competition results also showed that the
proposed method was able to outperform the BoW-SVM baseline provided by the organizers as
well as several proposed methods by other competitors.</p>
      <p>Future work includes conducting experiments with Bidirectional GRU at the character level
to capture dependencies in text missed by the one at the word level.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments References</title>
      <p>First author would like to thank CONACyT for financial support through scholarship number
718246.</p>
      <p>[1] M. E. Aragón, H. Jarquín, M. Montes-y Gómez, H. J. Escalante, L. Villaseñor-Pineda,
H. Gómez-Adorno, G. Bel-Enguix, J.-P. Posadas-Durán, Overview of MEX-A3T at IberLEF
2020: Fake News and Aggressiveness Analysis in Mexican Spanish, in: Notebook Papers of</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <source>2nd SEPLN Workshop on Iberian Languages Evaluation Forum (IberLEF)</source>
          , Malaga, Spain,
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>September</surname>
          </string-name>
          ,
          <year>2020</year>
          . [2]
          <string-name>
            <given-names>M.</given-names>
            <surname>Casavantes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>López</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. C.</given-names>
            <surname>González</surname>
          </string-name>
          , UACh at MEX-A3T
          <year>2019</year>
          : Preliminary Results
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <source>(IberLEF</source>
          <year>2019</year>
          ),
          <source>CEUR WS Proceedings</source>
          ,
          <year>2019</year>
          . [3]
          <string-name>
            <given-names>C.</given-names>
            <surname>Cardellino</surname>
          </string-name>
          ,
          <source>Spanish Billion Words Corpus and Embeddings</source>
          ,
          <year>2019</year>
          . URL: https://
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          crscardellino.github.io/SBWCE/. [4]
          <string-name>
            <given-names>F.</given-names>
            <surname>Chollet</surname>
          </string-name>
          ,
          <article-title>Deep Learning with Python</article-title>
          ,
          <source>Manning Publications</source>
          ,
          <year>2018</year>
          . [5]
          <string-name>
            <given-names>J.</given-names>
            <surname>Chung</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Gulcehre</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Cho</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Bengio</surname>
          </string-name>
          , Empirical Evaluation of Gated Recurrent Neural
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <source>Networks on Sequence Modeling, arXiv preprint arXiv:1412.3555</source>
          (
          <year>2014</year>
          ). [6]
          <string-name>
            <given-names>M. E.</given-names>
            <surname>Aragón</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Á</surname>
          </string-name>
          .
          <string-name>
            <surname>Álvarez-Carmona</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Montes-y Gómez</surname>
            ,
            <given-names>H. J.</given-names>
          </string-name>
          <string-name>
            <surname>Escalante</surname>
          </string-name>
          , L. Villasenor-
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Pineda</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Moctezuma</surname>
          </string-name>
          ,
          <article-title>Overview of MEX-A3T at IberLEF 2019: Authorship</article-title>
          and Aggres-
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <article-title>siveness Analysis in Mexican Spanish Tweets</article-title>
          ,
          <source>in: Notebook Papers of 1st SEPLN Workshop</source>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <source>on Iberian Languages Evaluation Forum (IberLEF)</source>
          , Bilbao, Spain,
          <year>2019</year>
          . [7]
          <string-name>
            <given-names>M. E.</given-names>
            <surname>Aragón</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. P.</given-names>
            <surname>López-Monroy</surname>
          </string-name>
          ,
          <article-title>Author Profiling</article-title>
          and Aggressiveness Detection in
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Spanish tweets:</surname>
          </string-name>
          MEX-A3T
          <year>2018</year>
          ., in: IberEval@ SEPLN,
          <year>2018</year>
          , pp.
          <fpage>134</fpage>
          -
          <lpage>139</lpage>
          . [8]
          <string-name>
            <given-names>M.</given-names>
            <surname>Graf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Miranda-Jiménez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. S.</given-names>
            <surname>Tellez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Moctezuma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Salgado</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ortiz-Bejar</surname>
          </string-name>
          ,
          <string-name>
            <surname>C. N.</surname>
          </string-name>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Sánchez</surname>
          </string-name>
          , INGEOTEC at MEX-A3T:
          <article-title>Author Profiling and Aggressiveness Analysis in</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <given-names>Twitter</given-names>
            <surname>Using</surname>
          </string-name>
          <article-title>tc and EvoMSA</article-title>
          , in: IberEval@ SEPLN,
          <year>2018</year>
          , pp.
          <fpage>128</fpage>
          -
          <lpage>133</lpage>
          . [9]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Kim</surname>
          </string-name>
          ,
          <article-title>Convolutional Neural Networks for Sentence Classification</article-title>
          , arXiv preprint
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <source>arXiv:1408.5882</source>
          (
          <year>2014</year>
          ). [10]
          <string-name>
            <given-names>N.</given-names>
            <surname>Albadi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kurdi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Mishra</surname>
          </string-name>
          ,
          <source>Investigating the Efect of Combining GRU Neural Networks</source>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <source>Network Analysis and Mining</source>
          <volume>9</volume>
          (
          <year>2019</year>
          )
          <fpage>41</fpage>
          . [11]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Robinson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Tepper</surname>
          </string-name>
          ,
          <source>Detecting Hate Speech on Twitter Using a Convolution</source>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <source>GRU Based Deep Neural Network, in: European semantic web conference</source>
          , Springer,
          <year>2018</year>
          ,
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          pp.
          <fpage>745</fpage>
          -
          <lpage>760</lpage>
          . [12]
          <string-name>
            <surname>C. E. M. Cuza</surname>
          </string-name>
          , G. L.
          <string-name>
            <surname>De la Peña Sarracén</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Rosso</surname>
          </string-name>
          , Attention Mechanism for Aggressive
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <surname>Detection</surname>
          </string-name>
          , in: CEUR Workshop Proc., volume
          <volume>2150</volume>
          ,
          <year>2018</year>
          , pp.
          <fpage>114</fpage>
          -
          <lpage>118</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>