<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Sexism Identification In Tweets Using Machine Learning Approaches</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Murari Sreekumar</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Shreyas Karthik</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Durairaj Thenmozhi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Shriram Gopalakrishnan</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Krithika Swaminathan</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Sri Sivasubramaniya Nadar College Of Engineering</institution>
          ,
          <addr-line>Rajiv Gandhi Salai (OMR), Kalavakkam 603 110, Tamil Nadu</addr-line>
          ,
          <country country="IN">India</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Sexism poses significant challenges in sentiment analysis, as it can manifest in subtle and nuanced ways, often embedded within seemingly benign language. On social media, where communications are frequently code-mixed, particularly in Dravidian languages, there is an increasing demand for identifying sexist content to ensure healthy online interactions. The EXIST 2024 shared task aims to detect sexism in Spanish and English tweets collected from social media platforms. Various traditional machine learning approaches are employed to identify whether the comments contain sexist content in Spanish and English languages. Utilizing Support Vector Machines (SVM), Random Forest and Logistic Regression as a classifier, we achieve F1 scores of 0.6299, 0.6074 and 0.5518 respectively for English dataset.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Sexism Identification</kwd>
        <kwd>Traditional Machine Learning Algorithms</kwd>
        <kwd>Natural Language Processing</kwd>
        <kwd>Sentiment Analysis</kwd>
        <kwd>Text Analytics</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Sexism is prejudice or discrimination based on one’s sex or gender. Sexism can afect anyone, but
primarily afects women and girls. It has been linked to gender roles and stereotypes, and may include
the belief that one sex or gender is intrinsically superior to another. With the advent of social media
people have begun misusing the freedom speech and expression and instead have engaged in lot of hate
speech on women politicians, journalists, personalities etc. This has especially risen in social media
platforms such as twitter during the pandemic time [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>
        Women who experience online abuse often alter their online behaviour, self-censor their content
and limit their interactions on platforms out of fear of violence and abuse. By silencing or driving
women away from online spaces, online violence can afect their economic outcomes, leading to
loss of employment and societal status. Additionally, online gender-based violence may serve as a
predictor of violent crimes in the physical world [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ][
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. it is crucial to address these aspects of sexism
in social networks and hence Natural Language Processing research is crucial in providing insights into
identifying the tweets and classifying them as Sexist and Non-Sexist. Computational understanding of
natural language has been used in addressing issues such as sentiment analysis[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], human behaviour
detection, fake news detection[
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], question answering and depression and threat detection across
diferent forms of media.
      </p>
      <p>Our research paper presents various innovative solutions contributing to the field of sexism
identification in significant ways:
• This project can be used for real time applications in social media platforms like Twitter, Instagram,</p>
      <p>Facebook, LinkedIn etc in order to maintain a healthy and safe online environment.</p>
      <p>The task that we have performed in EXIST 2024 is Sexism Identification in Tweets. In this task, the
systems have to decide whether the tweets are Sexist or Not Sexist.</p>
      <p>In this research paper, we have discussed the research works that we have done for Task 1. The rest
of the paper is organised as follows: Section 2 presents a literature survey explaining the key theories
and concepts, research methodologies and the trends and patterns common in the field of sexism
identification. Section 3 describes the diferent datasets used and the task performed. Section 4 talks
about the methodology like preprocessing, lemmatization, vectorization and the various models used
for our task. Section 5 talks about our results and performance analysis with other teams participating
in the task. Finally, in Section 6 we talk about the conclusions and the future prospects of the research
work.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>Various works in the field of Sexism Identification were studied and diverse methodologies and approach
for sexism identification and classification were employed to solve this issue. Significant eforts have
been made by researchers around the world to develop annotated datasets and apply deep learning
models such as Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs). In
addition to these, various transformer based models like BERT have been used as they have consistently
provided excellent accuracy in identifying sexist tweets.</p>
      <p>
        Rodríguez-Sánchez et al. (2020) [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] undertook a research on automatic classification of sexism in
social networks. They specialized mainly on Twitter data in Spanish. They developed the MeTwo
dataset that labels the tweets into sexist, non-sexist and doubtful. This is the first dataset in Spanish
used to identify sexism in a broad sense, ranging from hostile to subtle sexism.To classify the tweets
into three categories, they have used various traditional Machine Learning models like Support Vector
Machine (SVM), Logistic Regression, Random Forest, and Naive Bayes. Various advanced deep learning
models like Bidirectional Encoder Representations from Transformers (BERT), Bidirectional Long Short
Term Memory (Bi-LSTM), Convolutional Neural Networks (CNN) and Recurrent Neural Networks
(RNN) have also been used. This research done by them can be used in fields such as misogyny detection
in tweets and various other texts.
      </p>
      <p>
        Davidson et al. (2017) [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] in their research worked to distinguish hate speech from ofensive language
on social media. They collected the tweets and labeled them into three categories namely hate speech,
ofensive language and neither. First, they converted all the text into lowercase, stemmed the text to
obtain the root words using PorterStemmer, create bigram, unigram and trigram features using TF-IDF.
They used Penn Part-Of-Speech (POS) tagging and included count indicators for r hashtags, mentions,
retweets, and URLs, as well as features for the number of characters, words, and syllables in each tweet.
Then various models like Logistic Regression, naive Bayes, decision trees, random forests, and linear
SVMs. These models successfully classified racist and homophobic slurs as hate speech, while sexist
language was more frequently categorized as ofensive.
      </p>
      <p>
        Harika Abburi et al. (2021) [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] worked on Fine-Grained Multi-label Sexism Classification Using a
Semi-Supervised Multi-level Neural Approach. They initially employed the technique of Self-training,
which is a semi supervised learning approach that helps augment the set of labeled instances by
selectively adding unlabeled samples. Then it applies the models to the unlabeled instances and
identifies a subset of them to be added to the training set, along with the predicted labels. To address
categories with scarce labeled data, they propose a multi-level training approach. The model trains
initially on a reduced set of broader categories (coarse), then refines its understanding on the full set of
ifne-grained categories. To begin with, the data was tested on various Traditional Machine Learning
models like logistic regression (LR), Support Vector Machine (SVM) and Random Forests (RF) Classifiers.
These were applied on two feature sets namely TF-IDF on word unigrams and bigrams (Word ngrams)
and the average of the ELMo vectors. Then various Deep Learning techniques like BiLSTM, BERT and
other CNN based architectures were used. Thus, this approach can be used to analyze online sexism by
using unlabeled data and various Deep Learning and Neural Network models.
      </p>
      <p>S Sharifirad et al. (2019) worked on a comprehensive classification of diferent online harassment
categories and explain its challenges using NLP. The tweets have been classified into Indirect Harassment,
Information Threat, Sexual Threat and Non Sexist. They have used various classification methods like
bigrams, threegrams, Two Character Grams, Word2Vec, Doc2Vec, Long Short Term Memory (LSTM)
among others. These techniques help identify boundaries between words or phrases in text, especially
in languages without explicit word separators. By analyzing sequences of words, n-grams can be used
to predict the next word in a sequence, which is useful for tasks like text generation. They have used
neural networks and the traditional machine learning technique Naive Bayes. The tweets were classified
correctly in their categories with accuracy ranging from 0.66 to 0.91 for LSTM.</p>
      <p>Thus, it is found that while significant progress has been made in identifying and mitigating various
forms of sexism on social networks, many existing studies primarily focus on explicit instances of
sexist language. However, the detection and analysis of more subtle, implicit forms of sexism remain
under-explored. Additionally, the intersection of sexism with other forms of discrimination, such as
racism or homophobia, has not been thoroughly investigated. This research aims to address these gaps
by developing more sophisticated algorithms that can identify both explicit and implicit sexist content,
considering the broader context of intersectional discrimination in social network environments. In
addition to these, we also aim to integrate these techniques in various social media platforms to ensure
safe and healthy online environments.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Task and Dataset</title>
      <p>
        The task organizers of CLEF2024 provided a dataset called EXIST2024 [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ][10]. The EXIST2024 dataset
contains exactly 6920 tweets for training, 1038 tweets for development and 2076 tweets for testing
which adds upto to an overall of more than 10000 tweets.
      </p>
      <p>From the above table, it can be observed that the training, development and testing dataset contain
English and Spanish tweets in the same ratio.</p>
      <p>TASK 1: Sexism Identification in Tweets The first task is a binary classification. The systems
have to decide whether or not a given tweet contains sexist expressions or behaviours (i.e., it is sexist
itself, describes a sexist situation or criticizes a sexist behaviour). The following tweets show examples
of sexist and not sexist messages. The opinions of Six annotators were also given. These annotators
classified the tweets into Sexist and Non Sexist using "YES" and "NO". The opinion given by the majority
of the annotators was taken into account for every tweet and then used for identifying whether a tweet
is sexist.
"People really try to convince women with little
to no ass that they should go out and buy a body.</p>
      <p>Like bih, I don’t need a fat ass to get a man. Never
have."</p>
      <p>Non-Sexist
"Alguien me explica que zorra hace la gente en el
cajero que se demora tanto."
"@messyworldorder it’s honestly so embarrassing
to watch and they’ll be like ’not all white women
are like that’"</p>
    </sec>
    <sec id="sec-4">
      <title>4. Methodology</title>
      <sec id="sec-4-1">
        <title>4.1. Preprocessing</title>
        <p>We trained the traditional machine learning models such as Support Vector Machine (SVM) [11, 12],
Random Forest and Logistic Regression on the training dataset, evaluated the models on the dev dataset
and submitted our runs by applying the ML models on the test dataset.</p>
        <p>Our first step was to clean the data given in order to improve the performance of the machine learning
models:
1. Converting the text to lowercase: This ensures consistency in text data. By doing this the
vocabulary size is reduced and it reduces the computational requirements.
2. Removing punctuation marks:They often point to external resources that are not relevant to the
context of the text being analyzed.
3. Removing http links and emoticons:These do not contribute to the semantic meaning of the text.
4. Removing twitter mentions like @username
5. Removing all the numbers from the tweet column: These do not contribute towards sexist words.
6. Removing stop words like "a", "an", "the", "is" and so on to improve the accuracy of the models.</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Lemmatization</title>
        <p>Lemmatization is a crucial step in preprocessing data where the words in the text are converted to
the base form. We have preferred to use Lemmatization as it followed grammatical rules better than
Stemming. This process involves:
• Identifying the part of speech: Understanding whether a word is a noun, verb, adjective, etc.,
which helps in determining the correct lemma.
• Morphological analysis: Analyzing the structure and form of the word to convert it to its base
form.</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.3. Vectorization</title>
        <p>In order to ensure that the data is understood well by the model we need to convert the data into
a format that machines can understand,typically vectors or array of numbers. Among vectorization
techniques we found TF-IDF vectorization to give a better accuracy. Basically it adjusts the frequency
of words by how commonly they appear across all documents, giving more weight to less common but
significant words.</p>
      </sec>
      <sec id="sec-4-4">
        <title>4.4. Model Evaluation</title>
        <p>
          We have used three models using hard-hard labels such as Sexist and Non-Sexist. They are:
1. Support Vector Machines: A supervised machine learning algorithm that we used for classification
and regression tasks. It operates by creating a decision boundary that separates n-dimensional
spaces into classes so that a new data point can be assigned to its relevant category.
2. Logistic Regression: It is a regression model mainly used for classification problems. Logistic
regression models the probability that a given input belongs to a particular class. It uses the
logistic function, also known as the sigmoid function, to map any real-valued number into the
range [
          <xref ref-type="bibr" rid="ref1">0, 1</xref>
          ].
3. Random Forest: It is an ensemble learning method in which multiple decision trees are built
during training and merges their results to improve accuracy and over-fitting .
4. Decision Trees: A tree-like model of decisions and their possible consequences, including
outcomes, resource costs, and utility.
5. Hyper-parameter tuning: This is an essential step that helps in optimizing the performance of the
models used for classifying the tweets. Hyper-parameters are configurations external to the model
that cannot be learned from the data, such as learning rate, batch size, and the number of layers
in a neural network. Since the data used in NLP is highly complex and multi-dimensional
hyperparameter tuning is used to identify optimal hyper-parameter configurations in order to make
the models more eficient and accurate. There are various methods of hyper-parameter tuning
like GridSearchCV, RandomSearchCV, Bayesian Optimization and Gradient-based Optimization.
We have used GridSearchCV for our research.
        </p>
        <p>For SVM, we have tuned the hyper-parameters like regularization parameter (C) and the kernel
parameters, such as the gamma parameter for the radial basis function (RBF) kernel. For Logistic
Regression, we have tuned hyper-parameters like the regularization strength (often denoted as
C). Regularization techniques such as L1 (lasso) and L2 (ridge) are also tuned to improve model
generalization.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Results and Performance Analysis</title>
      <sec id="sec-5-1">
        <title>5.1. Performance Analysis</title>
        <p>Scikit-learn, also known as sklearn, is an open-source, machine learning and data modeling library for
Python. It features various classification, regression and clustering algorithms including support vector
machines, random forests, gradient boosting, k-means and DBSCAN, and is designed to interoperate
with the Python libraries, NumPy and SciPy. The sklearn metrics library also provides the classification
report for evaluation of the performance of the model. The performance is measured using the following
metrics:
1) Precision: Precision is defined as the ratio of true positives to sum of true and false positives.
2) Recall: Recall is defined as the ratio of true positives to sum of true positives and false negatives.
3) F1-Score: The F1 is the weighted harmonic mean of precision and recall. The closer the value of F1
is to 1, better is the performance of the model.</p>
        <p>The result of the task is represented in the form of the table below. Among the 3 being used SVM
had the best F1 score of 0.6299. The next best F1 score came from Random Forest which is of 0.6074.
Logistic Regression had an F1 score 0.5518. Table 4 also displays the ranking of our submissions based
on the shared task oficial ranking in (hard-hard) evaluation scenario.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6. Reflections</title>
      <p>Through this paper we learnt about important methods in the filed of natural language processing and
the steps involved in it .We learnt through this task that SVM in general is a very good model for text
classification as they are particularly efective in cases where the number of dimensions (features) is
greater than the number of samples. This makes them suitable for applications like text classification,
where each word can be considered a feature. While SVMs work with linear hyperplanes by default,
the ‘kernel trick’ allows them to handle non-linear relationships between features. This is crucial for
text, where complex semantic relationships exist between words.</p>
    </sec>
    <sec id="sec-7">
      <title>7. Conclusions</title>
      <p>Through the scope of the paper we have explored traditional models to perform classification of Sexist
and Not-Sexist speech on the given data by EXIST in English Language. It was noted that the SVM
had the best F1 score of 0.6299. This research contributes to the field of natural language processing
and provides valuable insights into addressing social issues in online platforms. Future work can be
done in incorporating more advanced techniques and also introduce more pre-processing techniques in
order to improve the performance of the model. Additionally the model can be deployed in real world
applications in order to monitor sexist tweets on social platforms.Future work can focus on expanding
the model to handle multi-class classification problems,incorporating more advanced techniques such
as attention mechanisms, and exploring additional preprocessing steps to improve model performance.
Additionally, the model can be deployed in real-world applications to mitigate and monitor instances of
sexism on social media platforms. We hope these eforts will contribute towards fight against sexism.
A. Maeso, V. Ruiz, Exist 2024: sexism identification in social networks and memes, in: European
Conference on Information Retrieval, Springer, 2024, pp. 498–504.
[10] L. Plaza, J. Carrillo-de-Albornoz, V. Ruiz, A. Maeso, B. Chulvi, P. Rosso, E. Amigó, J. Gonzalo,
R. Morante, D. Spina, Overview of EXIST 2024 – Learning with Disagreement for Sexism
Identification and Characterization in Social Networks and Memes (Extended Overview), in: G. Faggioli,
N. Ferro, P. Galuščáková, A. G. S. de Herrera (Eds.), Working Notes of CLEF 2024 – Conference
and Labs of the Evaluation Forum, 2024.
[11] S. L. Salzberg, C4. 5: Programs for machine learning by j. ross quinlan. morgan kaufmann publishers,
inc., 1993, 1994.
[12] T. Pranckevičius, V. Marcinkevičius, Comparison of naive bayes, random forest, decision tree,
support vector machines, and logistic regression classifiers for text reviews classification, Baltic
Journal of Modern Computing 5 (2017) 221.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>N.</given-names>
            <surname>Dehingia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>McAuley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>McDougal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Reed</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. G.</given-names>
            <surname>Silverman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Urada</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Raj</surname>
          </string-name>
          ,
          <article-title>Violence against women on twitter in india: Testing a taxonomy for online misogyny and measuring its prevalence during covid-19</article-title>
          , PLoS one
          <volume>18</volume>
          (
          <year>2023</year>
          )
          <article-title>e0292121</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>A.</given-names>
            <surname>Chaudhary</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <article-title>Sexism identification in social networks</article-title>
          ,
          <source>Working Notes of CLEF</source>
          (
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>R.</given-names>
            <surname>Ouedraogo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Stenzel</surname>
          </string-name>
          ,
          <article-title>How domestic violence is a threat to economic development</article-title>
          ,
          <source>IMF Blog Insights &amp; Analysis on Economics and Finance</source>
          (
          <year>2021</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>L.</given-names>
            <surname>Khan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Amjad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ashraf</surname>
          </string-name>
          , H.-
          <string-name>
            <given-names>T.</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gelbukh</surname>
          </string-name>
          ,
          <article-title>Urdu sentiment analysis with deep learning methods</article-title>
          ,
          <source>IEEE access 9</source>
          (
          <year>2021</year>
          )
          <fpage>97803</fpage>
          -
          <lpage>97812</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Khanam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Alwasel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Sirafi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Rashid</surname>
          </string-name>
          ,
          <article-title>Fake news detection using machine learning approaches</article-title>
          ,
          <source>in: IOP conference series: materials science and engineering</source>
          , volume
          <volume>1099</volume>
          ,
          <string-name>
            <given-names>IOP</given-names>
            <surname>Publishing</surname>
          </string-name>
          ,
          <year>2021</year>
          , p.
          <fpage>012040</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>F.</given-names>
            <surname>Rodríguez-Sánchez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Carrillo-de Albornoz</surname>
          </string-name>
          , L. Plaza,
          <article-title>Automatic classification of sexism in social networks: An empirical study on twitter data</article-title>
          ,
          <source>IEEE Access 8</source>
          (
          <year>2020</year>
          )
          <fpage>219563</fpage>
          -
          <lpage>219576</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>T.</given-names>
            <surname>Davidson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Warmsley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Macy</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Weber</surname>
          </string-name>
          ,
          <article-title>Automated hate speech detection and the problem of ofensive language</article-title>
          ,
          <source>in: Proceedings of the international AAAI conference on web and social media</source>
          , volume
          <volume>11</volume>
          ,
          <year>2017</year>
          , pp.
          <fpage>512</fpage>
          -
          <lpage>515</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>P.</given-names>
            <surname>Parikh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Abburi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Chhaya</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Varma</surname>
          </string-name>
          ,
          <article-title>Categorizing sexism and misogyny through neural approaches</article-title>
          ,
          <source>ACM Transactions on the Web (TWEB) 15</source>
          (
          <year>2021</year>
          )
          <fpage>1</fpage>
          -
          <lpage>31</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>L.</given-names>
            <surname>Plaza</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Carrillo-de Albornoz</surname>
          </string-name>
          , E. Amigó,
          <string-name>
            <given-names>J.</given-names>
            <surname>Gonzalo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Morante</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Rosso</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Spina</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Chulvi</surname>
          </string-name>
          ,
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>