<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Using Lexical Resources for Detecting O ensiveness in Mexican Spanish Tweets</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Daniel Abraham Huerta-Velasco</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hiram Calvo</string-name>
          <email>hcalvog@cic.ipn.mx</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Centro de Investigacion en Computacion, Instituto Politecnico Nacional</institution>
          ,
          <addr-line>Ciudad de Mexico</addr-line>
          ,
          <country country="MX">Mexico</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This work presents a description of our participation in subtasks 3 and 4 at MeO endEs@IberLEF 2021 which consisted in classifying tweets as o ensive or non-o ensive in the O endMEX corpus. For both subtasks, we proposed to use several Spanish lexicons which have a collection of words that have been weighted according to di erent criteria like a ective, dimensional, and emotional values. In addition to them, structural values, word-embeddings and one-hot codi cation were taken into account. The scores of recall metric obtained in both subtasks was competitive comparing to both the baseline of the competition's and the other teams'.</p>
      </abstract>
      <kwd-group>
        <kwd>Lexical Resources</kwd>
        <kwd>Sentiment Analysis</kwd>
        <kwd>Mexican Spanish Tweets</kwd>
        <kwd>Text Classi cation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>Social media have had a great impact in the history of humanity. Nowadays
it is very easy to share information, thoughts, images, videos, etc, only with
a click. Despite there are positive aspects associated with social media usage,
there are negative ones that many social media users have to face daily. One of
the most dangerous for most people is that many users take advantage of the
anonymity that social media gives them and insult, harass, provoke and threat
to an individual or a group of people.</p>
      <p>O ensiveness has been a topic studied by various disciplines. Computational
linguistics has studied it as a binary classi cation problem and good results are
being obtained by using some machine learning techniques which include classic
classi ers (Support Vector Machines, Logistic Regression, Random Forests) and
neural networks. Some organizations focus their the investigation on this topic
and organize competitions where, mainly, ask for new proposals that can classify
as good as possible whether a tweet is o ensive or not, among other labels, such
as if a tweet is vulgar but not o ensive, not vulgar and o ensive, if the aggression
of the tweet is targeted to a person or a group of people, etc.</p>
      <p>
        This year (2021) MeO endEs competition [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] at the Iberian Languages
Evalutation Forum (IberLEF) [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] was organized. The aim of this competition was
to boost research on a sensitive topic for the Spanish language. 4 subtasks were
part of this competition. This work presents our solution for the last two, which
consisted in classifying tweets as o ensive or non-o ensive in the O endMEX
corpus. It should be said that metadata about each tweet were provided in Task
4.
      </p>
      <p>As further detailed next, our proposed features derived from several lexicons
which have a collection of Spanish words that have been weighted according to
di erent criteria like a ective, dimensional, and emotional value among others
derived from POS-tagging analysis of the tweets and other models which have
been already proved such as word-embeddings and one-hot codi cation. This
data representation was the input of a Support Vector Machine and obtained
competitive scores of recall metric in the subtasks, and the usefulness percentage
of the lexical features overcame the 50% in each subtask.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Model's Description</title>
      <p>For these two tasks, the O endMEX corpus was used. It is divided into 2 sets:
The Training set is formed by 5,060 tweets where 3,679 of them were labeled as
non-o ensive, and the rest (1,381) as o ensive. The Test set is formed by 2,183
tweets. In addition to these sets, another one was released named as trial set
and was formed by 76 tweets (35 non-o ensive, 41 o ensive).</p>
      <p>
        Figure 1 shows the ow process of how we faced these two subtasks. In a
nutshell, the process consists in the extraction of the features similar to [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. In
this work, the authors extracted features from lexical resources called lexicons,
which are lists of words weighted according a value, in this case, the polarity
value of a word or a phrase in English to detect irony in English tweets. Then,
they used these features as inputs of some machine learning algorithms such as
Support Vector Machines, Decision Trees, and Naive-Bayes. The same strategy is
followed here, but we used di erent lexicons and proposed other kind of features
which include an special treatment for emojis and hashtags. These steps will be
explained in detail in the following sections.
Before the feature extraction process, a data preprocessing is performed. In this
step, four operations are applied:
{ Mentions cleaning: In the social media slang, a mention means that an user
is tagged in a post. In this operation, all mentions are removed in the post
but the frequency of them is saved because it will be considered as a feature.
{ Hashtag treatment : Hashtag is a term associated with topics of discussions
that users choose to be indexed in social networks, inserting the hash symbol
(#) before the word, phrase or expression with no whitespaces, allowing
only the underscore symbol ( ) to \separate" the words if wanted. In this
preprocessing, word segmentation is used in order to have the words as if
the user had not used a hashtag. The corpus used by word segment model
to learn how to split Spanish words was Spanish Billion Words Corpus [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>
        The frequency of hashtags is used as a feature.
{ Emojis cleaning: All emotional polarity values of emojis which are present
in the post are summed both positive and negative values individually, and
the combination of them according to values in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. It should be said that not
all emojis1 are present in the work of Kralj and her team. That is why six
features are extracted: the sum of the polarity of positive and negative emojis
in the post, the sum of polarity of positive and negative emojis separately,
the number of total emojis which are in the post, and the number of emojis
which are both in the work of Kralj and not. Finally, all emojis are removed
from the post.
      </p>
      <p>{ URLs cleaning: URLs are counted and then removed from the post.
2.2</p>
      <sec id="sec-2-1">
        <title>Features' Extraction</title>
        <p>After all tweets have been preprocessed, the next step is to extract the features
of the text. As it is widely known, most machine learning algorithms require a
numeric representation of text as the input, so it has to be casted to a vectorial
representation where each element represents a feature. They are categorized
depending on their nature.</p>
        <p>Structural features consist in the quanti cation of features that can be
obtained based on Part-Of-Speech classi cation. Table 1 shows the features which
fall under this description.</p>
        <p>
          A ective features consist in both positive and negative polarity values that
a tweet has according to the sum of the words' polarity present in it. To do
that, several lists of Spanish words (lexicons) classi ed by an amount (positive
amounts means positive emotional polarity, otherwise, negative) or a label
(positive, negative, neutral) are used. Table 2 shows the features which fall in this
description and the name of the lexicon which was used for computing each
feature.
1 https://unicode.org/Public/UNIDATA/emoji/emoji-data.txt
Features Description
eqxucelsatmmmarakrsks The frequency of each punctuation mark in a tweet
singulars The frequency of each in ectional feature of nouns, pronouns,
plurals adjectives, determiners, numerals, and verbs.
words
chars
upper The total amount of uppercase characters in a tweet
verbs
aaddjv The frequency of each POS-tag in a tweet
nouns
hashtags
mentions The frequency of each speci c marker in a tweet
urls
epmoloajrisemojis The frequency of emojis in a tweet and a counter of emojis that
non polar emojis appear in [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] or not, respectively
        </p>
        <p>
          The total amount of words and characters in a tweet, respectively
Dimensional Features consist in those which are inspired in some theories
which propose that the nature of an emotional state is determined by its position
in a space of independent dimensions. According to a dimensional approach,
emotions can be de ned as a coincidence of values on a number of di erent
strategic dimensions. Table 3 shows the features inspired by these theories.
Emotional Features consist in those which are inspired in the work of [18] and
[
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] who de ned 8 and 6 basic emotions, respectively: anger, disgust, fear, joy,
sadness, surprise, anticipation, and trust. Table 4 shows the features inspired by
these emotions.
        </p>
        <p>Contextual Features consist in those which are meta-data of the tweet. These
features were only used for subtask 4. Table 5 shows a description of the
metadata given for this subtask and how we used them as features.</p>
        <p>
          In total, a tweet is represented as a vector composed by 114 features for
subtask 3, and by 126 for subtask 4. In the future, they will be refered as CVAD
features. One thing to note is that the lexicons used in a ective, dimensional and
emotional features contain words or phrases not in a speci c variant of Spanish
except the Mexican Slang Lexicon. In addition to them, 300 word-embeddings
and a one-hot codi cation features are added. The way in which these
wordembeddings were trained is described in [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. For one-hot codi cation, all words
in the training dataset are obtained. Then, these n-features (where n depends
on how many words are used at least m-times in the whole training dataset)
are vectorized as zeros. Finally, if each feature (word) is present in the post, its
        </p>
        <sec id="sec-2-1-1">
          <title>Features</title>
          <p>emojis polarity
pos emojis
neg emojis
HL insults
HL xenoph
HL misog
HL inmigrants
EMOLEX n+
EMOLEX
nISOL 1+
ISOL
1MXSL int1+
MXSL
int1MXSL phrn+
MXSL
phrnML SENTICON n+
ML SENTICON
nMS 1+
MS
1SSL 1+
SSL
1ELHPOLAR n+
ELHPOLAR
nSENTICNET +
SENTICNET</p>
        </sec>
        <sec id="sec-2-1-2">
          <title>Description</title>
          <p>Sum of tweet's polarity according to the emojis present in the
post
Sum of polarity value of \positive" and \negative" emojis,
respectively.</p>
          <p>
            Hate speech Spanish lexicons[
            <xref ref-type="bibr" rid="ref17">17</xref>
            ] contain 4 lexicons which
described general insults, hateful lexicons toward immigrants
and women, and words that refer to the nationality of an
immigrant in Spanish. Each lexicon contains 279, 44, 183, and
250 words respectively.
          </p>
          <p>
            NRC Word-Emotion Association Lexicon (aka EMOLEX) [
            <xref ref-type="bibr" rid="ref12">12</xref>
            ]
is a list of English and Spanish words/phrases and their
associations with two sentiments (positive and negative). Each
feature is the sum of positive and negative (separately) per
n-gram in the lexicon. n goes from 1 to 4
iSOL[
            <xref ref-type="bibr" rid="ref13">13</xref>
            ] is a list of words labeled as positive or negative. Each
feature is the sum of positive and negative words in the post.
Mexican Slang lexicon [
            <xref ref-type="bibr" rid="ref5">5</xref>
            ] consists in lists of interjections and
phrases used in mexican slang. Each feature is the sum of
positive and negative (separately) per n-gram in the lexicon. n
goes from 1 to 4. We added 1,373 Mexican expressions from our
own knowledge to this list.
          </p>
          <p>
            ML-Senticon [
            <xref ref-type="bibr" rid="ref6">6</xref>
            ] is a list of Spanish words/phrases which, for
each lemma, provides an estimation of polarity (from very
negative -1.0 to very positive +1.0). Each feature is the sum of
positive and negative words in the post per n-gram in the
lexicon. n goes from 1 to 4
Multilingual Sentiment lexicon [
            <xref ref-type="bibr" rid="ref10">10</xref>
            ] is a list of Spanish words
labeled as positive or negative. Each feature is the sum of
positive and negative words in the post
Sentiment Lexicons in Spanish [
            <xref ref-type="bibr" rid="ref15">15</xref>
            ] is a list of Spanish words
which are labeled as positive and negative according to English
and Spanish annotations
Elhpolar lexicon[22] is a list of Spanish words/phrases labeled
as positive and negative. Each feature is the sum of positive
and negative words in the post per n-gram. n goes from 1 to 4
SenticNet [
            <xref ref-type="bibr" rid="ref2">2</xref>
            ] is a list of words which have an emotional
polarity oating value from -1 (negative) to +1 (positive). Each
feature is the sum of these values according their polarity
          </p>
        </sec>
        <sec id="sec-2-1-3">
          <title>Features Description</title>
          <p>SSSSEEEENNNNTTTTIIIICCCCNNNNEEEETTTT saapetplentteasitnsituatiidnvoteintnyess aSHseosnuotrcicgiaNlateestsd [ow2f]iEtihsmatohtleiisotfnoisusrmadolidimsetelno[3fsi]SopnasnoifshthweoCrdasmwbrhiiach are
S-ANEW val
S-ANEW aro
S-ANEW dom
SDAL pleasantness
SDAL activation
SDAL imagery</p>
        </sec>
        <sec id="sec-2-1-4">
          <title>Features</title>
          <p>EMOLEX n anger
EMOLEX n disgust
EMOLEX n fear
EMOLEX n joy
EMOLEX n sadness
EMOLEX n surprise
EMOLEX n anticipation
EMOLEX n trust
SEL 1 anger
SEL 1 disgust
SEL 1 fear
SEL 1 joy
SEL 1 sadness
SEL 1 surprise</p>
          <p>
            Spanish ANEW [20] is a list of words which is inspired by
A ective Norms for English Words (ANEW) [
            <xref ref-type="bibr" rid="ref1">1</xref>
            ]. Words
are associated with emotional ratings in terms of the
Valence-Arousal-Dominance model
Spanish DAL (SDAL) [21] is a list of Spanish words
which are manually annotated with regard to this three
dimensions. SDAL is inspired by [23]
EMOLEX [
            <xref ref-type="bibr" rid="ref12">12</xref>
            ] is a list of English and Spanish words or
phrases and their associations with the 8 basic emotions
identi ed by Plutchik. Each feature is the sum of each
emotion per n-gram in the lexicon. n goes from 1 to 4
Spanish Emotion Lexicon (SEL) [
            <xref ref-type="bibr" rid="ref11">11</xref>
            ][19] is a list of Spanish
words that are associated with the measure of Probability
Factor of A ective use (PFA) with respect to the 6 basic
emotions identi ed by Ekman
These features describe the data of the user who twitted: whether
his/her account is veri ed, how many followers he or she has, how
many users he or she is following, how many public lists that he or
she is a member of, how many tweets he or she has published, if he
or she has altered the theme or background of his/her pro le, and
if he or she has his/her own pro le image
These are the information about the tweet itself: how many
retweets it has, how many times it has been marked as favorite, if
it is a reply of another tweet, and if it is a quote of a tweet.
representation in the vector is changed to 1. It should be noted that tweets in
the trial dataset were included into training dataset.
2.3
          </p>
        </sec>
      </sec>
      <sec id="sec-2-2">
        <title>Model's training</title>
        <p>These features were the inputs of a Support Vector Machine (SVM). SVM
hyperparameters' tuning and cross validation over training dataset were performed to
know which con guration of both features and hyperparameters yielded the best
theoretical results and then, predict the labels of testing dataset using them. We
used scikit-learn GridSearchCV2 and cross validate3 methods to perform this
step. The metric used for optimizing the hyperparameters was F1 macro. Cross
validation was performed using the K-Fold technique which consists in dividing
all samples in k groups (k-folds). The prediction function is learned using k 1
folds, and the fold left out is used for testing. The value of k used in the
experiments was 5. Finally, to obtain one-hot codi cation, tested word frequencies
were from bigger or equal than 1 to 5, separately.</p>
        <p>Tables 6 and 7 show the ranked results of the experimentation for
subtask 3 and 4, respectively. All experiments include CVAD features, 300
wordembeddings and n-one hot codi cation. Tables show the experimentation among
the di erent number of features derived of the number of words which frequencies
are bigger or equal to n.</p>
        <p>There are 11,544 di erent words in the training dataset of which 4,102 are
used at least twice, 2,462 at least thrice, 1,721 at least four times, and 1,333 at
least ve times.
Using the con guration of the best experimental results, labels from the test
dataset are obtained and the results of these are shown in Table 8.
2 https://scikit-learn.org/stable/modules/generated/sklearn.model_
selection.GridSearchCV.html.
3 https://scikit-learn.org/stable/modules/cross_validation.html.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Results in the competition</title>
      <p>
        The organizers of MeO endEs [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] reported a baseline performance per subtask.
For Subtask 3 they reported 0.719, 0.41, and 0,522 scores for precision, recall and
F1 score respectively, and for Subtask 4, 0.663, 0.698, and 0.68. As they ranked
the participants by using the F1 macro metric, our solution was better ranked
than baseline for Subtask 3, but it was not able to outperform it in Subtask 4.
      </p>
      <p>After analyzing cross validation process to nd out what type of tweets in
training dataset our proposal was not able to classify correctly in both subtasks,
we realized that tweets with sexual connotations or with negative words (not
vulgar) but not attacking someone are some of them. Table 9 shows some instances
which falls under these descriptions.
Tweet Actual label
@USUARIO como luchar contra la corrupcion de los o ciales no
solo nos enfoquemos en la de los ciudadanos esa moneda tiene dos Non-aggressive
caras feas
yWcoloauvarrusleolainvita..yo tambien quiero mamar esa panocha deliciosa Aggressive</p>
      <p>Comparing our results to the rest of competitors, our solution was ranked
at 7th place of 10 teams for Subtask 3, and at 2nd place out of 3 participants
for Subtask 4. In order to know which CVAD features (i.e. the ones derived by
lexical resources) were useful for these problems, a feature selection process was
performed. To do this, we used the SelectFromModel4 method, which selects
features based on importance weights, on our top solutions per subtask.</p>
      <p>For Subtask 3, 13 structural features out of 17 (76.47%), 26 a ective ones out
of 49 (53.06%), 9 dimensional of 10 (90%), and 13 emotional of 38 (34.21%) were
found useful. For Subtask 4, 16 (94.12%), 30 (61.22%), 9 (90%), 16 (42.10%),
and 8 contextual features out of 12 (66.67%) were selected.</p>
      <p>As can be seen, the usage percentage per type of CVAD feature increased
when the metadata of the tweet was supplied to detect whether a tweet is
offensive or not. This phenomenon can be observed in the obtained scores which
showed a slightly better classi cation in subtask 4 than 3.</p>
      <p>
        Another interesting feature to be observed is that both a ective and
emotional features were less useful in subtasks 3 and 4 compared to the other
features. The reason of this is that phrases with 3 or 4 words (i.e. trigrams and
4-grams) which are present in the used a ective and emotional lexicons are not
frequently used by Mexican users except for those present in the combination
of the Mexican Slang lexicon [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] and our list. If we removed these features, the
usage percentage turns into 70.27% a ective features, and 59.09% emotional
features for Subtask 3. For Subtask 4, the percentages after removing said features
are 81.08% and 72.73%, respectively.
4
      </p>
    </sec>
    <sec id="sec-4">
      <title>Conclusions and Future Work</title>
      <p>For these subtasks, a relatively simple model was proposed to classify Mexican
Spanish tweets as o ensive or non-o ensive. This model was mainly based on
lexical resources as features, as well as other kind of features which have been used
previously. This representation allowed our model to learn contextual features
which are the meta-data provided for subtask 4.</p>
      <p>One thing to be noted is that our recall scores obtained in both subtasks
were better than the majority of competitors' whose models were better ranked,
but our precision scores were not as good as theirs. This evidence suggests that
using lexical resources to detect o ensiveness in Mexican Spanish tweets is a
good option when there is a high cost associated with False Negatives, i.e. when
a model is preferred to detect o ensiveness or non-o ensiveness in tweets when
they actually are.</p>
      <p>As a future work, we plan to perform experiments using these features with
di erent Machine Learning algorithms such as the multilayer perceptron;
additionally, we plan to update the used lexicons with words or phrases which
mexicans actually use both in the real life and on social media according to the
criteria adopted to make these lists.
4 https://scikit-learn.org/stable/modules/generated/sklearn.feature\
_selection.SelectFromModel.html.
18. Plutchik, R.: The nature of emotions: Human emotions have deep evolutionary
roots, a fact that may explain their complexity and provide tools for clinical
practice. American scientist 89(4), 344{350 (2001)
19. Rangel, I.D., Sidorov, G., Guerra, S.S.: Creacion y evaluacion de un diccionario
marcado con emociones y ponderado para el espan~ol. Onomazein 5(29), 31{46
(2014)
20. Redondo, J., Fraga, I., Padron, I., Comesan~a, M.: The Spanish adaptation of anew
(a ective norms for english words). Behavior research methods 39(3), 600{605
(2007)
21. R os, M.D., Gravano, A.: Spanish dal: a spanish dictionary of a ect in language. In:
Proceedings of the 4th Workshop on Computational Approaches to Subjectivity,
Sentiment and Social Media Analysis. pp. 21{28 (2013)
22. Urizar, X.S., Roncal, I.S.V.: Elhuyar at tass 2013. In: Proceedings of the Workshop
on Sentiment Analysis at SEPLN (TASS 2013). pp. 143{150 (2013)
23. Whissell, C.: Using the revised dictionary of a ect in language to quantify the
emotional undertones of samples of natural language. Psychological reports 105(2),
509{521 (2009)</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Bradley</surname>
            ,
            <given-names>M.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lang</surname>
            ,
            <given-names>P.J.:</given-names>
          </string-name>
          <article-title>A ective norms for english words (ANEW): Instruction manual and a ective ratings</article-title>
          .
          <source>Tech. rep.</source>
          ,
          <source>Technical report C-1</source>
          , the center for research in psychophysiology (
          <year>1999</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Cambria</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xing</surname>
            ,
            <given-names>F.Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Poria</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kwok</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Senticnet 6: Ensemble application of symbolic and subsymbolic AI for sentiment analysis</article-title>
          .
          <source>In: Proceedings of the 29th ACM International Conference on Information &amp; Knowledge Management</source>
          . pp.
          <volume>105</volume>
          {
          <issue>114</issue>
          (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Cambria</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Livingstone</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hussain</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>The hourglass of emotions</article-title>
          .
          <source>In: Cognitive behavioural systems</source>
          , pp.
          <volume>144</volume>
          {
          <fpage>157</fpage>
          . Springer (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Cardellino</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Spanish Billion Words Corpus and Embeddings (August</article-title>
          <year>2019</year>
          ), https://crscardellino.github.io/SBWCE/
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Castro-Sanchez</surname>
            ,
            <given-names>N.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baca-Gomez</surname>
            ,
            <given-names>Y.R.</given-names>
          </string-name>
          , Mart nez, A.:
          <article-title>Development of a ective lexicon for spanish with mexican slang expressions</article-title>
          .
          <source>Res. Comput. Sci. 100</source>
          ,
          <issue>9</issue>
          {
          <fpage>18</fpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Cruz</surname>
            ,
            <given-names>F.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Troyano</surname>
            ,
            <given-names>J.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pontes</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ortega</surname>
            ,
            <given-names>F.J.:</given-names>
          </string-name>
          <article-title>Building layered, multilingual sentiment lexicons at synset and lemma levels</article-title>
          .
          <source>Expert Systems with Applications</source>
          <volume>41</volume>
          (
          <issue>13</issue>
          ),
          <volume>5984</volume>
          {
          <fpage>5994</fpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Ekman</surname>
            ,
            <given-names>P.:</given-names>
          </string-name>
          <article-title>An argument for basic emotions</article-title>
          .
          <source>Cognition &amp; emotion 6(3-4)</source>
          ,
          <volume>169</volume>
          {
          <fpage>200</fpage>
          (
          <year>1992</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8. Far as,
          <string-name>
            <given-names>D.I.H.</given-names>
            ,
            <surname>Patti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            ,
            <surname>Rosso</surname>
          </string-name>
          ,
          <string-name>
            <surname>P.</surname>
          </string-name>
          :
          <article-title>Irony detection in twitter: The role of a ective content</article-title>
          .
          <source>ACM Transactions on Internet Technology (TOIT) 16(3)</source>
          ,
          <volume>1</volume>
          {
          <fpage>24</fpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>Kralj</given-names>
            <surname>Novak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Smailovic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Sluban</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Mozetic</surname>
          </string-name>
          ,
          <string-name>
            <surname>I.</surname>
          </string-name>
          :
          <article-title>Sentiment of emojis</article-title>
          .
          <source>PloS one</source>
          <volume>10</volume>
          (
          <issue>12</issue>
          ),
          <year>e0144296</year>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Lab</surname>
            ,
            <given-names>D.S.</given-names>
          </string-name>
          : Multilingual sentiment, https://sites.google.com/site/ datascienceslab/projects/multilingualsentiment
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Miranda-Jimenez</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Viveros-Jimenez</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gelbukh</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Castro-Sanchez</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Velasquez</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <article-title>D az-</article-title>
          <string-name>
            <surname>Rangel</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gordon</surname>
          </string-name>
          , J.:
          <article-title>Empirical study of opinion mining in Spanish Tweets (</article-title>
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Mohammad</surname>
            ,
            <given-names>S.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Turney</surname>
          </string-name>
          , P.D.:
          <article-title>Crowdsourcing a word-emotion association lexicon 29(3</article-title>
          ),
          <volume>436</volume>
          {
          <fpage>465</fpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Molina-Gonzalez</surname>
            ,
            <given-names>M.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mart</surname>
            nez-Camara,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mart</surname>
            n-Valdivia,
            <given-names>M.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>PereaOrtega</surname>
            ,
            <given-names>J.M.:</given-names>
          </string-name>
          <article-title>Semantic orientation for polarity classi cation in Spanish reviews</article-title>
          .
          <source>Expert Systems with Applications</source>
          <volume>40</volume>
          (
          <issue>18</issue>
          ),
          <volume>7250</volume>
          {
          <fpage>7257</fpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Montes</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosso</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gonzalo</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aragon</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Agerri</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Alvarez-Carmona</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Alvarez Mellado</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Carrillo-de Albornoz</surname>
          </string-name>
          , J.,
          <string-name>
            <surname>Chiruzzo</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Freitas</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gomez</surname>
            <given-names>Adorno</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            ,
            <surname>Gutierrez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            ,
            <surname>Jimenez-Zafra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.M.</given-names>
            ,
            <surname>Lima</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Plaza-de Arco</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.M.</given-names>
            ,
            <surname>Taule</surname>
          </string-name>
          , M. (eds.):
          <source>Proceedings of the Iberian Languages Evaluation Forum (IberLEF</source>
          <year>2021</year>
          ) (
          <year>2021</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Perez-Rosas</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Banea</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mihalcea</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          :
          <article-title>Learning sentiment lexicons in Spanish</article-title>
          .
          <source>In: LREC</source>
          . vol.
          <volume>12</volume>
          , p.
          <fpage>73</fpage>
          .
          <string-name>
            <surname>Citeseer</surname>
          </string-name>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <article-title>Plaza-del-</article-title>
          <string-name>
            <surname>Arco</surname>
            ,
            <given-names>F.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Casavantes</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Escalante</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Martin-Valdivia</surname>
            ,
            <given-names>M.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Montejo-Raez</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Montes-</surname>
            y-Gomez,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jarqu</surname>
            n-Vasquez,
            <given-names>H.</given-names>
          </string-name>
          ,
          <article-title>Villasen~or-</article-title>
          <string-name>
            <surname>Pineda</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Overview of the MeO endEs task on o ensive text detection at IberLEF 2021</article-title>
          .
          <source>Procesamiento del Lenguaje Natural</source>
          <volume>67</volume>
          (
          <issue>0</issue>
          ) (
          <year>2021</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Plaza-Del-Arco</surname>
            ,
            <given-names>F.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Molina-Gonzalez</surname>
            ,
            <given-names>M.D.</given-names>
          </string-name>
          ,
          <article-title>Uren~a-</article-title>
          <string-name>
            <surname>Lopez</surname>
            ,
            <given-names>L.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mart</surname>
            <given-names>nValdivia</given-names>
          </string-name>
          , M.T.:
          <article-title>Detecting misogyny and xenophobia in Spanish tweets using language technologies</article-title>
          .
          <source>ACM Transactions on Internet Technology (TOIT) 20(2)</source>
          ,
          <volume>1</volume>
          {
          <fpage>19</fpage>
          (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>