<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Bias Mitigation in Misogynous Meme Recognition: A Preliminary Study</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Gianmaria Balducci</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Giulia Rizzi</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Elisabetta Fersini</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>PMI Reboot S.r.l.</institution>
          ,
          <addr-line>Milan</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Universitat Politècnica de València</institution>
          ,
          <addr-line>Valencia</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Milano-Bicocca</institution>
          ,
          <addr-line>Milan</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this paper, we address the problem of automatic misogynous meme recognition by dealing with potentially biased elements that could lead to unfair models. In particular, a bias estimation technique is proposed to identify those textual and visual elements that unintendedly afect the model prediction, together with a naive bias mitigation strategy. The proposed approach is able to achieve good recognition performance characterized by promising generalization capabilities.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Bias Mitigation</kwd>
        <kwd>Bias Estimation</kwd>
        <kwd>Misogyny Identification</kwd>
        <kwd>Meme</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <sec id="sec-1-1">
        <title>Most of the investigations propose a few bias estimation</title>
        <p>metrics and related mitigation policies that are based on
a fixed set of seed words to quantify and minimize the
bias at the dataset or model level. When dealing with
misogynous memes recognition, metrics to estimate the
bias and techniques to mitigate it are still missing.</p>
        <p>
          To this purpose, we provide the following main
contributions:
In the context of social media, memes have become
popular as a means of expressing irony or opinions on
various topics. However, these memes can also perpetuate
discriminatory behaviours towards certain groups and
minorities. Misogyny, in particular, has gained attention
as a form of hateful language conveyed through memes
in various ways, such as female stereotyping, shaming,
objectification, and violence. While misogyny recogni- • a candidate biased elements identification in a
tion mechanisms have been widely investigated focusing multi-modal setting, focusing on both textual and
on textual sources (i.e., tweets) [
          <xref ref-type="bibr" rid="ref1 ref2 ref3 ref4">1, 2, 3, 4</xref>
          ], misogynous visual constituents of a meme;
identification in multimodal settings, and in particular • a mitigation strategy at training time, named
on memes, is still in its infancy. In [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ], a few naive uni- Masking Mitigation, that masks the candidate
bimodal and multimodal approaches have been investi- ased elements to reduce the distortion introduced
gated to understand the contribution of textual and vi- by their presence.
sual cues. Further investigations from the same authors
[
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] have introduced a multimodal approach that consid- The rest of the paper is organized as follows. In Section
ers both visual (in the form of captioning) and textual 2 a summary of the state of the art is reported. In Section
information to distinguish between misogynous and non- 3 the candidate biased element identification strategy is
misogynous memes. Recently, the performance of mul- detailed. In Section 4 the proposed mitigation strategy
tiple pre-trained and trained from scratch models have is presented. In Section 5 the experimental results are
been compared to verify if domain-specific pre-training reported. In Section 6 conclusion are reported.
could help to improve the recognition performance [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ].
        </p>
        <p>
          Independently on the textual, visual or multimodal 2. Related work
sources, several authors highlighted how the
classification models may be subject to bias that could afect the The majority of works on hate content detection focus
real performance of the models [
          <xref ref-type="bibr" rid="ref8 ref9">8, 9</xref>
          ] in a real setting. on tweets, while, only in recent years, they have started
to address multimodal content such as memes. For
inCLiC-it 2023: 9th Italian Conference on Computational Linguistics, stance, the approach proposed in [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] aims to counter the
Nov 30 — Dec 02, 2023, Venice, Italy phenomenon of memes that can convey sexist messages
*$Cogr.breasldpuocncdii1n@g caaumthpours..unimib.it (G. Balducci); ranging from stereotyping women to shaming,
objectig.rizzi10@campus.unimib.it (G. Rizzi); elisabetta.fersini@unimib.it fication, and violence, investigating both unimodal and
(E. Fersini) multimodal approaches to understand the contribution
0000-0002-0619-0760 (G. Rizzi); 0000-0002-8987-100X (E. Fersini) of textual and visual cues. In [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ], the authors indicate
CPWrEooUrckReshdoinpgs IhStpN:/c1e6u1r3-w-0s.o7r3g ©ACt2tEr0i2bU3utCRioonpWy4r.0igoIhnrttekfornsrahtthiooisnppaalp(PCerCrboByYcite4s.0ea)u.dthionrsg.Usse( CpeErmUittRed-uWndeSr.Correagti)ve Commons License how the visual mode may be much more informative
for detecting hate speech than the linguistic mode in and (iii) the definition of a metric to quantify how a
memes. More recently, two benchmark datasets have model could be biased from such elements. The proposed
been proposed to facilitate the investigation related to method has been evaluated on the MAMI Dataset [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]
misogynous meme detection. The first benchmark pre- consisting of 10.000 memes for training and 1.000 memes
sented in [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] is composed of 800 memes from the most for testing. The MAMI test set will later be referred to as
popular social media platforms. The dataset has been raw.
labelled through a crowdsourcing platform, involving 60
subjects, in order to collect three evaluations for each 3.1. Candidate Bias Elements Estimation
instance. Each instance, labelled according to misogyny,
aggressiveness and irony, has been labelled by three an- As highlighted in the literature, classification models may
notators from the crowd and three expert labellers. A be afected by bias: the presence of specific elements can
more recent benchmark has been collected for MAMI lead the model to an erroneous behaviour by predicting
shared task at SemEval 2022 [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]. The dataset, composed a specific label due to the presence of such elements.
of 10.000 memes for training and 1.000 memes for testing, This distortion in the investigated data-derived models
allowed to approach: (i) the identification of misogynistic can be in fact caused by an imbalance distribution, in
memes, and (ii) the recognition of the type of misogyny relation to the prediction label, of specific terms or visual
among potential overlapping categories. For the MAMI elements strongly associated with a given class label.
challenge, most of the participants [
          <xref ref-type="bibr" rid="ref13">13, 14, 15, 16</xref>
          ] ex- Those candidate biased elements can be distinguished
ploited pre-trained models and ensemble strategies. in candidate biased terms, which are related to the
        </p>
        <p>
          Regarding the potential bias that the models could in- superimposed text of a meme, and candidate biased
herit from the training dataset, most of the investigations tags, which are concerned with the objects that describe
focus only on a unimodal setting and more precisely on the scene of a meme. We exploit a novel estimation
the textual component [17, 18, 19]. In particular, special for identifying candidate biased elements [26] that
attention has been devoted to identity terms, i.e. those overcomes the limitations of the Polarized Weirdness
terms frequently associated with hateful expressions in Index (PWI) [27], which is unbounded and does not
the dataset referred to a specific target (e.g., woman, consider the context in which the elements appear, and
wife, girlfriend, etc...). It has been demonstrated that extended the estimation process to address more than
such identity terms lead the models to biased implicit one modality.
associations between such terms and a given class label,
ifnally originating unfair predictions. In order to coun- Given a multimodal dataset ,  is a visual or textual
teract the potential bias, several mitigation strategies element belonging to the set  that comprises all the
have been proposed in the literature. One of the most terms and tags of . A bias score () can be estimated
widely used strategies is data augmentation [
          <xref ref-type="bibr" rid="ref4">4, 20, 21</xref>
          ], for each element  according to the following formula:
which consists in adding data containing examples of
tnhoant-thoaxviec cthomemmoensttsdtihspartobprionrgtiobancaktethdoissteriibdeuntitoitny itnertmhes S(e) = |ℳ1 | ∑|ℳ=︁1|  (+ | ) −  (+ | { − }) (1)
dataset. Alternative solutions are focused on
mitigating directly the models by means of specific objective where ℳ is the set of memes containing , + represents
functions [22, 23] or optimization strategies [24, 25, 26]. the misogynous label and  denotes the set of terms
Although the above-mentioned strategies represent a fun- and tags in a given meme .  (+ | ) represents the
damental step towards bias mitigation, they are defined probability of a meme  of being associated with the
for unimodal settings. Bias estimation and mitigation for misogynous label, given the terms and tags  within the
multimodal perspective are still missing for misogynous meme itself, and, analogously,  (+ | { − }) denotes
meme identification. the probability of a meme  of being associated with
the misogynous label +, given the text (tags) present
3. Bias Estimation in the instance (meme), excluding the evaluated element
 except for the term (tag) in analysis. The proposed
In order to understand if a given misogyny identifica- bias score ranges into the interval [− 1; +1]. The higher
tion model is biased, three main steps are performed: (i) positive the score, the more likely the element would
Candidate Biased Elements Estimation, which allows us induce bias towards the positive class (misogynous). On
to identify specific textual or visual elements that could the other hand, the lower negative the score, the more
lead a model to unfair predictions, (ii) the creation of a likely the element would be associated with the negative
Synthetic Dataset with specific characteristics that allow class (not misogynous). Terms and tags with a score close
evaluating models behaviours in challenging examples, to zero, are considered neutral with respect to a given
label.
        </p>
      </sec>
      <sec id="sec-1-2">
        <title>We report in Tables 1 and 2 the set of biased terms</title>
        <p>and biased tags identified on the MAMI training dataset.
As we can see, the set of candidate biased terms with
the highest score for the misogynous class is composed
of words that are typically associated with some
specific misogyny categories like dishwasher and chick for
stereotype and whore for objectification. The remaining
tokens are websites that have been used to collect only
misogynous memes. A few terms identified as convey
potential bias are related to the seed words used to
collect the dataset (e.g. whore), confirming the ability of
the proposed approach to capture the bias introduced in
the dataset-creation phase (Selection Bias). On the other
hand, the presence of other terms (e.g. chloroform)
highlights the ability of the proposed approach to generalize
with respect to the dataset creation process and include
elements that may induce bias due to their unintended
unbalanced distribution. Concerning the set of terms
with the highest negative bias score for the not
misogynous class, it is composed of words that are very general
and commonly used in a variety of popular memes. An
analogous consideration can be drawn for the candidate
biased tags.</p>
        <sec id="sec-1-2-1">
          <title>Term</title>
        </sec>
        <sec id="sec-1-2-2">
          <title>Candidate Biased Terms</title>
        </sec>
        <sec id="sec-1-2-3">
          <title>Misogynous Not Misogynous</title>
          <p>Score Term Score
demotivational
dishwasher
promotion
whore
chick
motivate
chloroform
blond
diy
belong
0.39
0.38
0.35
0.35
0.34
0.33
0.30
0.30
0.30
0.28
towards the not misogynous class. Given a specific
element + ∈ + and + ∈ +, we collected misogynous
and not misogynous memes according to the following
criteria:
• a not misogynous meme is part of the synthetic
dataset if it contains + (or +) and it does not
contain any biased candidate terms (or tags) with
a negative score. This is to evaluate the impact
of + (or +) in introducing a bias towards the
misogynous class in not misogynous memes;
• a misogynous meme is part of the synthetic
dataset if it contains + (or +) and it does not
contain any other element in + (or +). This
is to verify if the model, given the presence of +
(or +), is able to perform well on misogynous
memes.</p>
        </sec>
      </sec>
      <sec id="sec-1-3">
        <title>An analogous procedure has been adopted to create</title>
        <p>misogynous and not misogynous memes according to
the candidate biased terms and tags with a negative score.</p>
        <p>The synthetic test set will later be recalled as synt.
3.3. Multimodal Bias Estimation (MBE)
3.2. Synthetic Dataset In order to measure if a given model is afected by bias we
introduce the Multimodal Bias Estimation (MBE)
metIn order to measure the bias of the models when making ric, which combines the area under the curve ( )
predictions, a synthetic dataset has been created with spe- estimated on a test set belonging to the original MAMI
cific characteristics that can efectively help to highlight test set and the area under curve estimated on the test
the bias of the models given the presence of the candidate set belonging to the synthetic dataset ( ):
biased elements.</p>
        <p>In particular, let + and + be respectively the set of
sacllotrhee,wbihaiscehdqcuaanldifieisdaetlee mteernmtss tahnadt taargesewxpitehctaedpotosiitniv-e   = 12   + 21   (2)
troduce the bias towards the misogynous class. Also, where   is computed as reported in Equation 3.
let − and − be respectively the set of all the biased ℳ represents the subgroup of memes identified by the
candidate terms and tags with a negative score, which presence of a biased term ,  is the subset of selected
qualifies elements that are expected to introduce the bias</p>
        <p>∑︀  Subgroup(ℳ) + ∑︀    (ℳ) + ∑︀   (ℳ)
1 ∈ ∈ ∈
2</p>
        <p>∑︀  Subgroup(ℳ) + ∑︀    (ℳ) + ∑︀   (ℳ)
+ 1 ∈ ∈ ∈
2
(3)
| |
||
man
0.87
man
0.87
biased terms. ℳ denotes the subgroup of memes iden- unimodal representation of the memes. In particular, the
tified by the presence of a biased tag  and  denotes the following modalities have been considered as (separate)
subset of selected biased tags. input space:</p>
        <p>is a three per-element AUC-based measure,
which considers both the biased terms and the biased
tags, composed of the following estimations:
desk
0.8
desk
0.8
chair
0.43
chair
0.43
car</p>
      </sec>
      <sec id="sec-1-4">
        <title>The MBE metric, which ranges into the interval [0, 1], esti</title>
        <p>mates the ability of the models on performing a good
prediction on the raw test data and simultaneously achieving
a significant performance on memes that, by
construction, can lead to a biased prediction.</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>4. Debiasing Strategy</title>
      <sec id="sec-2-1">
        <title>Several baseline models have been initially considered for</title>
        <p>distinguishing between misogynous and not misogynous
memes. We trained SVM, KNN, Naive Bayes, Decision
Tree, and Multi-layer Perception independently on each
• textual component, that is the transcription of
the text contained within the meme (obtained
with OCR) embedded through the Universal
Sentence Encoder (USE) [28].
• visual component, expressed by the objects
identified within the meme ( object tags) by the
Scene Graph Generation method [29] and
represented through a n-dimensional vector that
denotes if a given meme contains one or more
predefined objects with the corresponding
probabilities.</p>
      </sec>
      <sec id="sec-2-2">
        <title>The classifiers have been combined, accordingly to</title>
        <p>each modality (e.g. visual or textual), through a Bayesian
Model Averaging (BMA) [30] ensemble paradigm. BMA
has been employed also for creating a multimodal
ensemble that considers all the predictions provided by the
above-mentioned models trained on each representation
independently.
4.1. Mitigation Strategy</p>
      </sec>
      <sec id="sec-2-3">
        <title>Bias mitigation is adopted in both unimodal and</title>
        <p>multi-modal contexts. In an unimodal setting, only
the considered modality is mitigated. In a multi-modal
scenario, all the models based on visual and textual
components that compose the ensemble are mitigated.</p>
        <p>In order to debias the model at training time (and
inference time), a Masking Mitigation is proposed. In
particular, for what concerns the textual component,
each biased term is masked according to the class label
that they afect more (see Table 1). Any given biased
term, estimated using to the strategy presented in
section 3, is masked in the training dataset according
to the class towards they induce bias. In particular,
if a candidate biased term induces a bias towards the
misogynous label, then it is replaced with a positive
mask [POS-MASK] in misogynous memes. On the
contrary, if a candidate biased term induces a bias
towards the not misogynous label, then it is replaced
with a negative mask [NEG-MASK] in not misogynous
memes. An example is reported in the following.</p>
        <p>Original Text:
dishwasher so you...</p>
        <p>When you can’t aford a new
Masked Text: When you can’t aford a new [POS-MASK]
so you...</p>
        <p>Textual Component Only
   
0.7202 0.7801
0.7173 0.7041
0.7010 0.7687
0.6301 0.7475
0.7257 0.7521
0.7326 0.7841
0.6775 0.6811
0.7325 0.8052
component only: (1) training on the textual component
Regarding the visual component, when a candi- only lead all the models to obtain good results on both
date biased tag is present, the probability value of  and  test sets, (2) BMA is able to achieve
rethat tag is set equal to 0 and a new feature indicating markable results compared with the baselines, (3) the
the presence of the masking is added to the original proposed Masking Mitigation strategy (BMA-MM)
sign-dimensional vector. A toy example is reported in nificantly outperforms all the baseline models and the
Figure 1. original BMA, but also the REPAIR strategy. BMA-MM
is able to maintain good recognition performance on the
5. Experimental Results  test set, still improving significantly the
generalization capabilities on the controversial memes available in
We report in this section the results of the proposed miti- the  test set.
gation strategy, comparing the performance with several
approaches. In particular, we report  ,   Visual Component Only
and   related to each model enclosed in the en- Model      
semble, i.e., Support Vector Machines (SVM), K-Nearest SKVNMN 00..66860283 00..55994128 00..66326833
Neighbour (KNN), Naive Bayes (NB), Decision Tree (DT), NB 0.6635 0.5773 0.6204
and Multi-layer Perception (MLP) together with their DT 0.6499 0.5888 0.6194
Bayesian Model Averaging (BMA). We also show the per- MLP 0.6912 0.6047 0.6480
formance of the proposed Masking Mitigation on BMA BMA 0.6870 0.5990 0.6430
(BMA-MM). Finally, we report a baseline debiasing tech- REPAIR 0.6651 0.5922 0.6286
nique available in the state of the art. In particular, we BMA-MM 0.6655 0.6416 0.6535*
used REPAIR [31] as a benchmark mitigation model. It Table 4
computes a weight  for each sample based on its pro- Model performance using the visual component only. Bold
portional loss contribution with respect to a reference denotes the best MBE, while (*) reflects that the mitigated
model and resamples the original training dataset accord- model outperforms the best non-mitigated approach (BMA)
ing to several strategies. In particular, given a weight  and the improvement is statistically significant.
for each meme , it keeps  = 50% examples with the
largest weight  from each class. For what concerns Table 4, where the models have</p>
        <p>We show in Tables 3-5, the comparison between all been trained using the visual component only, the
conthe considered models, distinguished according to the siderations are a bit diferent. As demonstrated in other
modalities used to perform the training and the corre- state-of-the-art studies [26], the visual component is less
sponding mitigation phase. A T-test has been performed impactful on the recognition capabilities than the textual
to compute the statistical equality with a pairwise analy- one. We hypothesize that the reduced contribution of the
sis between the best-performing approach (BMA) against pictorial component is mainly due to conceptualization
the compared mitigation strategies, i.e. BMA-MM and issues to relate a given object to a an abstract concept
REPAIR. (e.g. dishwasher). However, also in this case, BMA is able</p>
        <p>A few considerations can be derived from Table 3, to achieve better results than the baselines and BMA-MM
where the models have been trained using the textual is still able to significantly outperform the original BMA
and REPAIR.</p>
      </sec>
      <sec id="sec-2-4">
        <title>Regarding the performance of the multimodal settings</title>
        <p>reported in Table 5, we can assert that not only the
proposed mitigation strategy significantly outperforms all
the other configurations presented above, but it is also
able to achieve a very promising compromise between
 and  samples that facilitate the adoption of the
BMA-MM in a real setting.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>6. Conclusions</title>
      <p>This paper addressed the problem of mitigating
misogynous meme detection. In particular, a candidate biased
element estimation and a corresponding mitigation
strategy is proposed to perform fair prediction in a real
setting. The proposed approach, validated on a benchmark
dataset, achieved remarkable results both in terms of
prediction and generalization capabilities, reducing the bias
in a significant way.</p>
    </sec>
    <sec id="sec-4">
      <title>Acknowledgments</title>
      <sec id="sec-4-1">
        <title>The work of Elisabetta Fersini has been partially funded</title>
        <p>by the European Union – NextGenerationEU under the
National Research Centre For HPC, Big Data and
Quantum Computing - Spoke 9 - Digital Society and Smart
Cities (PNRR-MUR), and by MUR under the grant
“Dipartimenti di Eccellenza 2023-2027" of the Department
of Informatics, Systems and Communication of the
University of Milano-Bicocca, Italy.
DD-TIG at semeval-2022 task 5: Investigating the of the 2022 Conference of the North American
relationships between multimodal and unimodal Chapter of the Association for Computational
Lininformation in misogynous memes detection and guistics: Human Language Technologies,
Associaclassification, in: The 16th International Workshop tion for Computational Linguistics, Seattle, United
on Semantic Evaluation, 2022. States, 2022, pp. 811–826. URL: https://aclanthology.
[14] L. Chen, H. W. Chou, RIT boston at semeval-2022 org/2022.naacl-main.59. doi:10.18653/v1/2022.
task 5: Multimedia misogyny detection by using naacl-main.59.
coherent visual and language features from CLIP [24] V. Perrone, M. Donini, M. B. Zafar, R. Schmucker,
model and data-centric AI principle, in: The 16th K. Kenthapadi, C. Archambeau, Fair bayesian
optiInternational Workshop on Semantic Evaluation, mization, in: Proceedings of the 2021 AAAI/ACM
2022. Conference on AI, Ethics, and Society, 2021, pp.
[15] S. Hakimov, G. S. Cheema, R. Ewerth, TIB-VA at 854–863.</p>
        <p>semeval-2022 task 5: A multimodal architecture [25] S. Sikdar, F. Lemmerich, M. Strohmaier, Getfair:
for the detection and classification of misogynous Generalized fairness tuning of classification models,
memes, in: The 16th International Workshop on in: Proceedings of the 2022 ACM Conference on
Semantic Evaluation, 2022. Fairness, Accountability, and Transparency, 2022,
[16] J. M. ZHI, Z. Mengyuan, M. Yuan, D. Hu, X. Du, pp. 289–299.</p>
        <p>L. Jiang, Y. Mo, X. Shi, PAIC at semeval-2022 task [26] G. Rizzi, F. Gasparini, A. Saibene, P. Rosso, E. Fersini,
5: Multi-modal misogynous detection in MEMES Recognizing misogynous memes: Biased models
with multi-task learning and multi-model fusion, and tricky archetypes, Information Processing &amp;
in: The 16th International Workshop on Semantic Management 60 (2023) 103474.</p>
        <p>Evaluation, 2022. [27] F. Poletto, V. Basile, M. Sanguinetti, C. Bosco,
[17] D. Nozza, C. Volpetti, E. Fersini, Unintended bias V. Patti, Resources and benchmark corpora for hate
in misogyny detection, in: IEEE/WIC/ACM Inter- speech detection: a systematic review, Language
national Conference on Web Intelligence, WI ’19, Resources and Evaluation 55 (2021) 477–523.
Association for Computing Machinery, New York, [28] D. Cer, Y. Yang, S.-y. Kong, N. Hua, N. Limtiaco,
NY, USA, 2019, p. 149–155. URL: https://doi.org/ R. St. John, N. Constant, M. Guajardo-Cespedes,
10.1145/3350546.3352512. doi:10.1145/3350546. S. Yuan, C. Tar, B. Strope, R. Kurzweil,
Univer3352512. sal Sentence Encoder for English, in: Empirical
[18] N. Zueva, M. Kabirova, P. Kalaidin, Reducing unin- Methods in Natural Language Processing (EMNLP):
tended identity bias in russian hate speech detec- System Demonstrations, 2018, pp. 169–174.
tion, in: Proceedings of the Fourth Workshop on [29] X. Han, J. Yang, H. Hu, L. Zhang, J. Gao, P. Zhang,
Online Abuse and Harms, 2020, pp. 65–69. Image scene graph generation (sgg) benchmark,
[19] F. R. Nascimento, G. D. Cavalcanti, M. Da Costa- 2021. arXiv:2107.12604.</p>
        <p>Abreu, Unintended bias evaluation: An analysis of [30] E. Fersini, E. Messina, F. A. Pozzi, Sentiment
analyhate speech detection and gender bias mitigation sis: Bayesian Ensemble Learning, Decision Support
on social media using ensemble learning, Expert Systems 68 (2014) 26–38.</p>
        <p>Systems with Applications 201 (2022) 117032. [31] Y. Li, N. Vasconcelos, Repair: Removing
representa[20] R. Zmigrod, S. J. Mielke, H. Wallach, R. Cotterell, tion bias by dataset resampling, in: Proceedings of
Counterfactual data augmentation for mitigating the IEEE/CVF conference on computer vision and
gender stereotypes in languages with rich morphol- pattern recognition, 2019, pp. 9572–9581.
ogy, in: Proceedings of the 57th Annual Meeting
of the Association for Computational Linguistics,
2019, pp. 1651–1661.
[21] K. Guo, R. Ma, S. Luo, Y. Wang, Coco at
semeval2023 task 10: Explainable detection of online sexism,
in: Proceedings of the The 17th International
Workshop on Semantic Evaluation (SemEval-2023), 2023,
pp. 469–476.
[22] M. Xia, A. Field, Y. Tsvetkov, Demoting racial bias in
hate speech detection, in: Proceedings of the Eighth
International Workshop on Natural Language
Processing for Social Media, 2020, pp. 7–14.
[23] R. Sridhar, D. Yang, Explaining toxic text via
knowledge enhanced text generation, in: Proceedings</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M.</given-names>
            <surname>Anzovino</surname>
          </string-name>
          , E. Fersini,
          <string-name>
            <given-names>P.</given-names>
            <surname>Rosso</surname>
          </string-name>
          ,
          <article-title>Automatic identiifcation and classification of misogynistic language on twitter</article-title>
          ,
          <source>in: International Conference on Applications of Natural Language to Information Systems</source>
          , Springer,
          <year>2018</year>
          , pp.
          <fpage>57</fpage>
          -
          <lpage>64</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Bashar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Nayak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Suzor</surname>
          </string-name>
          ,
          <article-title>Regularising lstm classifier by transfer learning for detecting misogynistic tweets with small training set</article-title>
          ,
          <source>Knowledge and Information Systems</source>
          <volume>62</volume>
          (
          <year>2020</year>
          )
          <fpage>4029</fpage>
          -
          <lpage>4054</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>H. T.</given-names>
            <surname>Ta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. B. S.</given-names>
            <surname>Rahman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Najjar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gelbukh</surname>
          </string-name>
          ,
          <article-title>Transfer learning from multilingual deberta for sexism identification</article-title>
          ,
          <source>in: CEUR Workshop Proceedings</source>
          , volume
          <volume>3202</volume>
          ,
          <string-name>
            <surname>CEUR-WS</surname>
          </string-name>
          ,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>R.</given-names>
            <surname>Calderón-Suarez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. M.</given-names>
            <surname>Ortega-Mendoza</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Montes-Y-Gómez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Toxqui-Quitl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Márquez-Vera</surname>
          </string-name>
          ,
          <article-title>Enhancing the detection of misogynistic content in social media by transferring knowledge from song phrases</article-title>
          ,
          <source>IEEE Access 11</source>
          (
          <year>2023</year>
          )
          <fpage>13179</fpage>
          -
          <lpage>13190</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>E.</given-names>
            <surname>Fersini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Gasparini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Corchs</surname>
          </string-name>
          ,
          <article-title>Detecting sexist MEME on the Web: A study on textual and visual cues</article-title>
          ,
          <source>in: 8th International Conference on Afective Computing and Intelligent Interaction Workshops and Demos (ACIIW)</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>226</fpage>
          -
          <lpage>231</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>E.</given-names>
            <surname>Fersini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Rizzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Saibene</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Gasparini</surname>
          </string-name>
          ,
          <article-title>Misogynous meme recognition: A preliminary study</article-title>
          ,
          <source>in: International Conference of the Italian Association for Artificial Intelligence</source>
          , Springer,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>S.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Haridasan</surname>
          </string-name>
          , R. Mooney, “
          <article-title>female astronaut: Because sandwiches won't make themselves up there”: Towards multimodal misogyny detection in memes</article-title>
          ,
          <source>in: The 7th Workshop on Online Abuse and Harms (WOAH)</source>
          ,
          <year>2023</year>
          , pp.
          <fpage>150</fpage>
          -
          <lpage>159</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>R.</given-names>
            <surname>Song</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Giunchiglia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Shi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <article-title>Measuring and mitigating language model biases in abusive language detection</article-title>
          ,
          <source>Information Processing &amp; Management</source>
          <volume>60</volume>
          (
          <year>2023</year>
          )
          <fpage>103277</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>T.</given-names>
            <surname>Shen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. R.</given-names>
            <surname>Bouadjenek</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Mai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Sanner</surname>
          </string-name>
          ,
          <article-title>Towards understanding and mitigating unintended biases in language model-driven conversational recommendation</article-title>
          ,
          <source>Information Processing &amp; Management</source>
          <volume>60</volume>
          (
          <year>2023</year>
          )
          <fpage>103139</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>B. O.</given-names>
            <surname>Sabat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. C.</given-names>
            <surname>Ferrer</surname>
          </string-name>
          ,
          <string-name>
            <surname>X. G. i Nieto</surname>
          </string-name>
          ,
          <article-title>Hate speech in pixels: Detection of ofensive memes towards automatic moderation</article-title>
          ,
          <year>2019</year>
          . arXiv:
          <year>1910</year>
          .02334.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>F.</given-names>
            <surname>Gasparini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Rizzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Saibene</surname>
          </string-name>
          , E. Fersini,
          <article-title>Benchmark dataset of memes with text transcriptions for automatic detection of multi-modal misogynistic content</article-title>
          ,
          <source>Data in brief 44</source>
          (
          <year>2022</year>
          )
          <fpage>108526</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>E.</given-names>
            <surname>Fersini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Gasparini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Rizzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Saibene</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Chulvi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Rosso</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Lees</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. Sorensen,</surname>
          </string-name>
          <article-title>SemEval2022 task 5: Multimedia automatic misogyny identification</article-title>
          ,
          <source>in: Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022)</source>
          ,
          <article-title>Association for Computational Linguistics</article-title>
          , Seattle, United States,
          <year>2022</year>
          , pp.
          <fpage>533</fpage>
          -
          <lpage>549</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Dong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ding</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Liu</surname>
          </string-name>
          , K. Zhang,
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>