<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>DSVS at MiSonGyny 2025: Multiple Instance Learning for Misogyny Speech Detection in Song Lyrics</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Sergio Damián-Sandoval</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>David Vázquez-Santana</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Centro de Investigación en Computación (CIC), Instituto Politécnico Nacional (IPN)</institution>
          ,
          <country country="MX">México</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <abstract>
        <p>Detecting misogynistic content in song lyrics is a challenging task due to the informal, metaphorical, and often ambiguous nature of language used in music. In this work, we explore the use of Multiple Instance Learning (MIL) for the automatic classification of Spanish song lyrics as either misogynistic or non-misogynistic. Each song is represented as a bag of sentences to mitigate the limitations of sequence length in transformer-based models and to enable the model to focus on the most informative parts of the lyrics. We employ an attention-based pooling mechanism that learns to weight sentences according to their relevance to the task. Additionally, we address class imbalance by applying class weighting for binary classification and focal loss for the multiclass setting. We experiment with three diferent dataset versions, varying the granularity and overlap of instances within each bag. Our results show that the MIL framework, combined with a transformer model pre-trained on social media data, achieves robust performance on both binary and multi-class misogyny detection tasks, demonstrating its efectiveness in handling weak supervision scenarios and noisy textual data.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Multiple Instance Learning</kwd>
        <kwd>Misogyny Detection</kwd>
        <kwd>Attention Pooling</kwd>
        <kwd>Spanish Lyrics</kwd>
        <kwd>Transformer Models</kwd>
        <kwd>Focal Loss</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Misogyny is a term used to describe a group of beliefs, actions, or expressions that show disrespect,
animosity, or prejudice against women [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. It can manifest explicitly, such as through verbal or physical
abuse, or covertly, such as through stereotypes or other content that perpetuates women’s objectification
or subordination. This is a systemic problem with roots in several institutional and cultural frameworks
that continue to have some influence on women’s lives today.
      </p>
      <p>
        Misogyny has a prominent and alarming social impact. Women, who are the most impacted group,
may experience low self-esteem and the normalization of violence and discrimination as a result of
being exposed to misogynistic attitudes and discourse through interpersonal relationships or cultural
products like fashion, music, or movies [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Women have historically faced many obstacles to their
professional and personal growth [
        <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
        ], including disproportionately bearing the burden of unpaid
and frequently unappreciated domestic work, limited access to leadership roles, and exclusion from
educational institutions. Their mental and physical health have surely been directly impacted by these
disparities, which still exist today in more subdued but no less detrimental forms, like the wage gap
and workplace harassment.
      </p>
      <p>
        Misogyny also has a negative impact on men and society as a whole by maintaining toxic forms of
masculinity, limiting male emotional expression, and reinforcing strict gender roles. Some sectors have
normalized the notion that men are innately oppressive [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], while others continue to hold the view that
feminism hates men [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. This polarization of views makes mutual understanding harder and creates
mistrust, which gets in the way of building healthy and positive relationships between men and women.
      </p>
      <p>This way, misogyny not only keeps gender inequality going, but also twists human relationships,
stopping us from building connections based on respect, fairness, and empathy.</p>
      <p>
        In this context, music holds a central role as one of the most widely used forms of cultural expression.
Song lyrics not only convey emotional states or personal narratives, but also serve as mirrors of
broader social conditions and, in many cases, actively contribute to shaping them. Various studies
have documented how certain music genres function as vehicles for expressing structural issues. For
example, corridos and narcocorridos have been extensively analyzed for their role in normalizing narco
culture [
        <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
        ], reflecting environments marked by violence, organized crime, and the tensions of illicit
economic power. Rap and hip-hop, which emerged from marginalized urban contexts, have served as
powerful tools of protest against poverty, racism, police brutality, and drug addictions [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], although they
have also been criticized for reproducing misogynistic patterns [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. In the case of reggaetón—a genre
with global reach—numerous analyses [
        <xref ref-type="bibr" rid="ref11 ref12">11, 12</xref>
        ] have highlighted its frequent use of sexualized language,
stereotypical representations of women, and the glorification of male dominance. Even genres such as
rock or Latin American regional music are not exempt, often portraying female figures in subordinate
or idealized roles rooted in patriarchal perspectives.
      </p>
      <sec id="sec-1-1">
        <title>1.1. Motivation</title>
        <p>Analyzing song lyrics to detect misogynistic content is an important task for several reasons. Music
reaches a wide audience and can strongly influence how people think and feel. When listeners are
repeatedly exposed to sexist messages or lyrics that promote the idea of female inferiority, it can make
these ideas seem normal, reinforce harmful beliefs, and contribute to gender inequality. This is especially
worrying in the case of young people, whose ideas about gender roles and relationships are still being
shaped.</p>
        <p>
          In addition, previous studies have shown that sexist content in media, including music [
          <xref ref-type="bibr" rid="ref13 ref14">13, 14, 15</xref>
          ],
can increase tolerance for violence against women and reduce empathy for victims. For this reason,
identifying and helping to reduce these messages is an important social responsibility.
        </p>
        <p>The MiSonGyny shared task [16], organized as part of IberLEF 2025 [17], directly addresses this
challenge by promoting the development of automatic systems for detecting misogynistic speech in
song lyrics. This initiative provides a valuable framework for advancing research on harmful discourse
in culturally influential domains, such as music.</p>
      </sec>
      <sec id="sec-1-2">
        <title>1.2. Relevance and challenges of the tasks</title>
        <p>This study addresses the need for tools capable of automatically detecting and characterizing
misogynistic discourse in songs through the use of natural language processing (NLP) techniques. The
proposed tasks allow for evaluating the efectiveness of models not only in identifying the presence
of misogynistic content, but also in assessing their ability to distinguish between diferent types of
misogyny.</p>
        <p>These tasks present several significant challenges. One of the most prominent is the ambiguity
inherent in artistic language. Song lyrics often rely on figurative expressions such as metaphors,
symbolism, and poetic structures, which complicate the direct interpretation of the message. The
frequent use of words or phrases deeply rooted in specific cultural contexts further increases the
complexity of automatic detection and analysis.</p>
        <p>Another major challenge is the subjectivity involved in interpreting misogynistic content. The same
phrase may or may not be considered misogynistic by diferent groups, depending on cultural context
or the listener’s prior knowledge of the artist or song. This subjectivity poses a challenge both for the
annotations of the corpus and for the evaluation of detection models.</p>
        <p>Despite these dificulties, this work represents an initial step toward the development of automated
tools capable of accurately identifying and finely classifying various types of misogynistic discourse in
music, which could be highly valuable for monitoring new content. The source code for this work can
be found at https://github.com/sdamians/DSVS_MiSonGyny2025.</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>Research on misogyny detection has gained momentum in recent years, particularly in the context of
social media platforms, where users often express gender-based hostility either explicitly or implicitly.
Many studies have focused on developing computational approaches to detect sexist and misogynistic
content on X, Reddit, or YouTube comments [18, 19, 20]. These approaches typically leverage supervised
learning models trained on annotated datasets, and often utilize linguistic features, syntactic patterns,
and more recently, contextual embeddings from pretrained language models such as BERT or RoBERTa
[21, 22].</p>
      <p>A notable portion of this research has centered on the automatic classification of hate speech and
ofensive language [ 23]. While misogyny is often a subset of these broader categories, dedicated
misogyny detection remains a relatively underexplored area, particularly when considering fine-grained
categorization of types of misogynistic discourse, such as sexualization, verbal aggression, or
dehumanization.</p>
      <p>
        In contrast to the significant attention given to misogyny in social media, relatively little research has
addressed the presence of gendered hate speech in musical lyrics. Some exceptions include qualitative
analyses in the humanities and cultural studies, which have examined the prevalence of sexist or violent
messages in genres such as rap and reggaetón [
        <xref ref-type="bibr" rid="ref10 ref11 ref12">10, 11, 12</xref>
        ]. These works provide critical insight into
how misogyny is perpetuated through artistic expression.
      </p>
      <p>More recently, there has been a growing interest in using NLP techniques to study song lyrics. Eforts
include sentiment analysis of lyrics across genres [24], as well as the use of machine learning for genre
classification [ 25]. However, the application of NLP for misogyny detection in music remains a nascent
ifeld. Existing datasets for misogyny detection rarely include song lyrics.</p>
      <p>To address the challenges of our corpus, mainly long lyrics where misogynistic content appears
in only a few lines, we explore the use of Multiple Instance Learning (MIL). This approach allows
the model to detect harmful expressions within a larger text, even when only song-level labels are
available. MIL has been successfully used in similar weakly supervised tasks like sentiment analysis in
long documents [26], making it suitable for identifying localized misogynistic content without needing
detailed annotations.</p>
      <p>Furthermore, due to the imbalanced and multi-class nature of the corpus in the second task, we use
Focal Loss [27] to improve learning. Originally developed for object detection, Focal Loss helps the
model focus on underrepresented classes and rare instances, such as the sparse presence of misogynistic
expressions in lyrics, by reducing the influence of well-classified examples.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Dataset and Preprocessing</title>
      <p>We used the datasets provided by the shared task composed of Spanish song lyrics (with some segments
occasionally written in English), curated for the task of misogyny detection. Due to the length of most
songs exceeding the input limits of standard Transformer-based models, each song was segmented into
smaller textual units and treated as a bag of sentences, aligning with the Multiple Instance Learning
(MIL) framework.</p>
      <p>For the first subtask, which involves binary classification (Misogynistic – M / Non-Misogynistic –
NM), the training dataset comprises a total of 2,104 songs, of which 1,462 are labeled as NM and 642
as M. For the second subtask, which addresses multi-class classification, the training set consists of
1,168 entries distributed across four categories: NR (Not Related) with 526 samples, S (Sexualization)
with 435, V (Violent) with 129, and H (Hate) with 78. The corresponding test sets contain 527 and 293
samples respectively. Due to the class distribution in both tasks, specific techniques were applied to
address the class imbalance.</p>
      <sec id="sec-3-1">
        <title>3.1. Preprocessing</title>
        <p>Several preprocessing steps were applied to ensure high-quality input data for MIL-based learning:
• Character encoding correction: Accent marks and special characters were misread due to
encoding inconsistencies; these were systematically corrected.
• Structural tag removal: Sentences containing metatextual cues used to structure lyrics, such as</p>
        <p>Coro, Verso, Letra de, and copyright notices (e.g., todos los derechos reservados) were eliminated.
• Normalization of contractions: A custom dictionary was built to standardize common informal
contractions in lyrics, such as pa’ → para or to’s → todos.
• Handling bilingual content: Some lyrics contain English sentences. These were retained and
normalized using the same contraction rules when applicable.
• Sentence capitalization: All sentences were capitalized to maintain consistency across instances.
• Removal of low-information tokens: Interjections and non-contextual elements such as Ey,</p>
        <p>Yeah, Ah, Oh, and jaja were removed, as they do not contribute meaningful content to the task.</p>
        <p>To explore diferent levels of granularity in the MIL setup, we created three dataset versions for each
subtask:
• v1: Each instance corresponds to a single sentence from the song.
• v2: Each instance is a fixed group of four consecutive sentences without overlap.
• v3: Each instance is a group of four sentences with overlap, meaning that consecutive instances
share some sentences.</p>
        <p>These versions allow us to investigate the impact of instance length and contextual overlap on the
performance of MIL-based misogyny detection.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Methodology</title>
      <p>In this work, we address misogyny detection in Spanish song lyrics using a Multiple Instance Learning
(MIL) framework. Each song is represented as a bag of textual instances (sentences or grouped sentences),
allowing the model to make predictions at the bag level while learning from instance-level evidence.</p>
      <sec id="sec-4-1">
        <title>4.1. Model Architecture</title>
        <p>We adopted a small Transformer-based model (SLM) as the sentence encoder, specifically the
pysentimiento/robertuito-base-cased pretrained model [28]. This language model is tailored
for informal Spanish text, trained on corpora from social media platforms such as X, making it
particularly well-suited to the slang, profanity, and informal expressions found in song lyrics. Due to the
model’s token limit of 128, we verified that none of the sentence instances exceeded 64 tokens, which
simplified the training process and avoided the need for aggressive truncation.</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. MIL and Attention Pooling</title>
        <p>To aggregate information from the bag of instances, we implemented an attention-based pooling
mechanism [29]. The model learns attention weights over the instances within each bag, emphasizing
those that contribute more significantly to the presence of misogynistic content. These weights are
learned during training and allow the model to focus on contextually relevant sentences.</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.3. Loss Functions and Class Imbalance Handling</title>
        <p>Diferent strategies were used to address class imbalance in the two subtasks:
• For the binary classification task (Subtask 1), we computed class weights based on the class
distribution in the training set and incorporated them into the loss function to reduce bias toward
the majority class.
• For the multi-class classification task (Subtask 2), which presents a more severe class imbalance,
we employed Focal Loss to dynamically scale the loss, focusing learning on harder-to-classify
minority classes.</p>
      </sec>
      <sec id="sec-4-4">
        <title>4.4. Dataset Variants and Instance Configuration</title>
        <p>We evaluated three versions of the dataset: Version 1 serves as the baseline; Version 2 applies Multiple
Instance Learning (MIL) with one bag of instances per song lyric; and Version 3 introduces overlapping
instances to create some data redundancy, aiming to preserve the sequential order of the song. Figure 1
and Figure 2 illustrate this configuration.</p>
      </sec>
      <sec id="sec-4-5">
        <title>4.5. Training and Evaluation</title>
        <p>We employed cross-validation to ensure robustness in model performance. Experiments were conducted
using 3-fold, 5-fold, and 10-fold cross-validation schemes. Given the presence of class imbalance in
both subtasks, we focused our final evaluation on the F1-score, which provides a balanced measure of
precision and recall. To better understand the variability and robustness of the models across diferent
folds, we report the following:
• F1-score lower bound: the minimum F1-score observed across the  folds, indicating the
worst-case performance.
• F1-score upper bound: the maximum F1-score observed, showing the best-case scenario.
• F1-score std dev: the standard deviation of the F1-scores, reflecting the model’s consistency
across the folds.</p>
        <p>• F1-score mean: the average F1-score across all folds, providing an overall estimate of model
performance.</p>
        <p>Model performance was compared across the three dataset versions (v1, v2, v3), for both subtasks.</p>
      </sec>
      <sec id="sec-4-6">
        <title>4.6. System Overview</title>
        <p>Figure 3 presents a comprehensive overview of the entire methodology, illustrating how the raw song
lyrics are first segmented and transformed into instances, how these instances are organized into bags,
and finally how the MIL model processes and aggregates these bags to perform the classification task.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Results</title>
      <p>This section presents the results obtained during the model validation phase and describes the final
submissions made to the test set. The performance of each dataset version (v1, v2, v3) was evaluated
using 5-fold cross-validation and several classification metrics.</p>
      <sec id="sec-5-1">
        <title>5.1. Cross-Validation Performance</title>
      </sec>
      <sec id="sec-5-2">
        <title>5.2. Observations</title>
        <p>The results indicate that the use of overlapping windows in grouped sentences (v3) yielded the best
performance for Subtask 1, suggesting that overlapping context can enhance the model’s ability to
detect subtle misogynistic patterns distributed across song lyrics. In contrast, for Subtask 2, the model
performed best with version 2, which uses a single bag of non-overlapping instances per song, indicating
that overlapping introduced noise in certain classification tasks.</p>
        <p>The attention-based pooling mechanism enabled the model to focus on semantically rich instances,
which proved especially efective when contextual redundancy was beneficial, as in version v3. However,
these findings also suggest that overlapping context is not universally advantageous and its impact
may vary depending on the specific task.</p>
        <p>Moreover, the use of class weighting in Subtask 1 and Focal Loss in Subtask 2 helped mitigate class
imbalance, particularly in the multi-class setting where certain classes had significantly fewer training
examples.</p>
      </sec>
      <sec id="sec-5-3">
        <title>5.3. Test Set Submissions</title>
        <p>Three separate submissions were made to the test set, one for each dataset version. Table 3 and Table 4
summarize the results obtained.</p>
        <p>Version v3 achieved the highest F1-score in subtask 1, while version v2 achieved the highest in subtask
2, confirming the findings from the validation phase. These results demonstrate the efectiveness of
incorporating instance overlap and leveraging sentence-level attention in a Multiple Instance Learning
setup.</p>
        <p>Submit</p>
        <p>Recall F1-score
Submit</p>
      </sec>
      <sec id="sec-5-4">
        <title>5.4. Preliminary Error Analysis (optional)</title>
        <p>Although detailed error analysis will be presented in future work, initial observations show that:
• Songs mixing English and Spanish can confuse the model, especially when misogynistic
expressions appear in English.
• Informal contractions and slang may still pose challenges despite the use of a domain-specific
language model.
• Some false positives involve songs that use misogynistic language ironically or metaphorically,
or the intention is to generate general hate speech which the model fails to disambiguate.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusions</title>
      <p>In this work, we explored the use of Multiple Instance Learning (MIL) for the detection of misogynistic
content in song lyrics written in Spanish. Due to the length and informal nature of the texts, traditional
document-level classification approaches are not well-suited. Instead, we modeled each song as a bag of
sentences, allowing the model to focus on the most relevant parts of the text through an attention-based
pooling mechanism.</p>
      <p>Our experiments with three diferent dataset versions demonstrated that using grouped and
overlapping sentence blocks (v3) provided the best performance, suggesting that additional context and
redundancy are beneficial for detecting subtle or context-dependent expressions of misogyny.</p>
      <p>The incorporation of techniques to address class imbalance—such as class weighting and Focal
Loss—contributed to improving the learning process, particularly in the more challenging multi-class
setting. Furthermore, the use of a domain-specific transformer model pre-trained on social media texts
(robertuito-base-cased) allowed for better handling of informal language, slang, and contractions
frequently present in lyrics.</p>
      <p>Our findings show that MIL is a promising framework for tasks involving weak supervision and
noisy data, where the label of a full document does not necessarily apply uniformly to all its parts.</p>
      <sec id="sec-6-1">
        <title>Future Work</title>
        <p>Future directions for this work include:
• A deeper error analysis to better understand model behavior in edge cases, such as irony and
code-switching.
• Exploring hierarchical MIL models or hybrid architectures with token- and sentence-level
supervision.
• Expanding the dataset with more annotated samples to improve generalization, particularly for
the underrepresented classes in Subtask 2.
• Incorporating linguistic features or external knowledge bases to support better interpretability of
the model decisions.</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments</title>
      <p>This work was done with partial support from the Mexican Government through Secretaría de Ciencia,
Humanidades, Tecnología e Innovación (SECIHTI) and Instituto Politécnico Nacional (IPN).</p>
    </sec>
    <sec id="sec-8">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this work, ChatGPT-4.0, ChatGPT-3.5 and Writefull for Overleaf were utilized
as assistants for the following tasks: Grammar and spelling check, Paraphrase and reword and Text
Translation (Spanish to English). After using these tools, the authors reviewed, verified and edited the
content as required. The authors take full responsibility for the content of this work.
[15] D. Yadav, D. Kalyani, Role of media in increasing violence against women and girls in the society,
International Journal of Scientific Research in Modern Science and Technology 3 (2024) 21–26.
doi:10.59828/ijsrmst.v3i2.183.
[16] T. Alcántara, M. Soto, C. Macias, O. Garcia-Vazquez, A. Espinosa-Juarez, H. Calvo, J. E.
ValdezRodríguez, E. Felipe-Riveron, Overview of MiSonGyny at IberLEF 2025: Misogyny Speech
Detection in Spanish Language Song Lyrics, Procesamiento del Lenguaje Natural 75 (2025).
[17] J. Á. González-Barba, L. Chiruzzo, S. M. Jiménez-Zafra, Overview of IberLEF 2025: Natural
Language Processing Challenges for Spanish and other Iberian Languages, in: Proceedings of the
Iberian Languages Evaluation Forum (IberLEF 2025), co-located with the 41st Conference of the
Spanish Society for Natural Language Processing (SEPLN 2025), CEUR-WS. org, 2025.
[18] Z. Talat, D. Hovy, Hateful symbols or hateful people? predictive features for hate speech detection
on twitter, in: Proceedings of the NAACL student research workshop, 2016, pp. 88–93.
[19] A. Founta, C. Djouvas, D. Chatzakou, I. Leontiadis, J. Blackburn, G. Stringhini, A. Vakali, M.
Sirivianos, N. Kourtellis, Large scale crowdsourcing and characterization of twitter abusive behavior,
in: Proceedings of the international AAAI conference on web and social media, volume 12, 2018.
[20] G. K. Shahi, A. K. Jaiswal, D. Nandini, L.-D. Ibáñez, T. Mandl, H. Liu, Report on the 1st workshop
on difusion of harmful content on online web (dhow) at websci 2024, in: Companion Publication
of the 16th ACM Web Science Conference, 2024, pp. 60–64.
[21] T. Caselli, V. Basile, J. Mitrović, M. Granitzer, Hatebert: Retraining bert for abusive language
detection in english, arXiv preprint arXiv:2010.12472 (2020).
[22] M. Zampieri, S. Malmasi, P. Nakov, S. Rosenthal, N. Farra, R. Kumar, Predicting the type and target
of ofensive posts in social media, arXiv preprint arXiv:1902.09666 (2019).
[23] M. Mozafari, R. Farahbakhsh, N. Crespi, Hate speech detection and racial bias mitigation in social
media based on bert model, PloS one 15 (2020) e0237861.
[24] D. Yang, W.-S. Lee, Music emotion identification from lyrics, in: 2009 11th IEEE International</p>
      <p>Symposium on Multimedia, IEEE, 2009, pp. 624–629.
[25] Q. H. Nguyen, T. T. Do, T. B. Chu, L. V. Trinh, D. H. Nguyen, C. V. Phan, T. A. Phan, D. V. Doan,
H. N. Pham, B. P. Nguyen, et al., Music genre classification using residual attention network, in:
2019 International conference on system science and engineering (ICSSE), IEEE, 2019, pp. 115–119.
[26] S. Angelidis, M. Lapata, Multiple instance learning networks for fine-grained sentiment analysis,</p>
      <p>Transactions of the Association for Computational Linguistics 6 (2018) 17–31.
[27] T.-Y. Lin, P. Goyal, R. Girshick, K. He, P. Dollár, Focal loss for dense object detection, in: Proceedings
of the IEEE international conference on computer vision, 2017, pp. 2980–2988.
[28] J. M. Pérez, D. A. Furman, L. Alonso Alemany, F. M. Luque, RoBERTuito: a pre-trained language
model for social media text in Spanish, in: Proceedings of the Thirteenth Language Resources and
Evaluation Conference, European Language Resources Association, Marseille, France, 2022, pp.
7235–7243. URL: https://aclanthology.org/2022.lrec-1.785.
[29] M. Ilse, J. Tomczak, M. Welling, Attention-based deep multiple instance learning, in: International
conference on machine learning, PMLR, 2018, pp. 2127–2136.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M. P.</given-names>
            <surname>Johnson</surname>
          </string-name>
          , K. J. Ferraro,
          <article-title>Research on domestic violence in the 1990s: Making distinctions</article-title>
          ,
          <source>Journal of Marriage and Family</source>
          <volume>62</volume>
          (
          <year>2000</year>
          )
          <fpage>948</fpage>
          -
          <lpage>963</lpage>
          . doi:https://doi.org/10.1111/j.1741-
          <fpage>3737</fpage>
          .
          <year>2000</year>
          .
          <volume>00948</volume>
          .x.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>P.</given-names>
            <surname>Homan</surname>
          </string-name>
          ,
          <article-title>Health consequences of structural sexism: Conceptual foundations, empirical evidence and priorities for future research</article-title>
          ,
          <source>Social Science &amp; Medicine</source>
          <volume>351</volume>
          (
          <year>2024</year>
          )
          <article-title>116379</article-title>
          . doi:https: //doi.org/10.1016/j.socscimed.
          <year>2023</year>
          .
          <volume>116379</volume>
          , gender, power, and
          <article-title>health: Modifiable factors and opportunities for intervention.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>K.</given-names>
            <surname>Miner-Rubino</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. M.</given-names>
            <surname>Cortina</surname>
          </string-name>
          ,
          <article-title>Beyond targets: Consequences of vicarious exposure to misogyny at work</article-title>
          .,
          <source>Journal of Applied psychology</source>
          <volume>92</volume>
          (
          <year>2007</year>
          )
          <fpage>1254</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>E. M.</given-names>
            <surname>Archer</surname>
          </string-name>
          ,
          <article-title>Gender-based inequality in the modern american society</article-title>
          , Exploring Gender at Work: Multiple Perspectives (
          <year>2021</year>
          )
          <fpage>45</fpage>
          -
          <lpage>63</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>I.</given-names>
            <surname>Jablonka</surname>
          </string-name>
          ,
          <article-title>A history of masculinity: From patriarchy to gender justice</article-title>
          ,
          <string-name>
            <surname>Penguin</surname>
            <given-names>UK</given-names>
          </string-name>
          ,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>A.</given-names>
            <surname>Hopkins-Doyle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. L.</given-names>
            <surname>Petterson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Leach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Zibell</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Chobthamkit</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Binti Abdul Rahim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Blake</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Bosco</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Cherrie-Rees</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Beadle</surname>
          </string-name>
          , et al.,
          <article-title>The misandry myth: An inaccurate stereotype about feminists' attitudes toward men</article-title>
          ,
          <source>Psychology of women quarterly 48</source>
          (
          <year>2024</year>
          )
          <fpage>8</fpage>
          -
          <lpage>37</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>M.</given-names>
            <surname>Mulligan</surname>
          </string-name>
          ,
          <article-title>Corridos en La guerra contra el narco: Estéticas necropolíticas en México</article-title>
          ,
          <source>Ph.D. thesis</source>
          , University of Missouri-Columbia,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>S.</given-names>
            <surname>Sorcia</surname>
          </string-name>
          <string-name>
            <surname>Reyes</surname>
          </string-name>
          ,
          <article-title>The music ecology of narcocorridos in mexico's everyday life (</article-title>
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>H.</given-names>
            <surname>Malcomson</surname>
          </string-name>
          ,
          <article-title>Making subjects grievable: narco rap, moral ambivalence and ethical sense making</article-title>
          ,
          <source>in: Ethnomusicology Forum</source>
          , volume
          <volume>30</volume>
          ,
          <string-name>
            <surname>Taylor</surname>
          </string-name>
          &amp; Francis,
          <year>2021</year>
          , pp.
          <fpage>205</fpage>
          -
          <lpage>225</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>R.</given-names>
            <surname>Weitzer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. E.</given-names>
            <surname>Kubrin</surname>
          </string-name>
          ,
          <article-title>Misogyny in rap music: A content analysis of prevalence and meanings</article-title>
          ,
          <source>Men and masculinities 12</source>
          (
          <year>2009</year>
          )
          <fpage>3</fpage>
          -
          <lpage>29</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>S. S.</given-names>
            <surname>Campo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Faure-Carvallo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. M. V.</given-names>
            <surname>Carrasco</surname>
          </string-name>
          , Reguetón y representaciones de la mujer:
          <article-title>un estudio en educación secundaria</article-title>
          ,
          <source>Revista Internacional de Educación Musical</source>
          <volume>10</volume>
          (
          <year>2022</year>
          )
          <fpage>25</fpage>
          -
          <lpage>32</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>E.-J.</given-names>
            <surname>Díez-Gutiérrez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Palomo-Cermeño</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Mallo-Rodríguez</surname>
          </string-name>
          ,
          <article-title>(in)equality and the influence of reggaeton music as a socialisation factor: A critical analysis</article-title>
          ,
          <source>Gender Studies</source>
          <volume>21</volume>
          (
          <year>2022</year>
          )
          <fpage>66</fpage>
          -
          <lpage>85</lpage>
          . URL: https://doi.org/10.2478/genst-2023-0005. doi:
          <volume>10</volume>
          .2478/genst-2023-0005.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>S.</given-names>
            <surname>Galdi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Guizzo</surname>
          </string-name>
          ,
          <article-title>Media-induced sexual harassment: The routes from sexually objectifying media to sexual harassment</article-title>
          ,
          <source>Sex Roles</source>
          <volume>84</volume>
          (
          <year>2021</year>
          )
          <fpage>645</fpage>
          -
          <lpage>669</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>J.</given-names>
            <surname>Burnay</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kepes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. J.</given-names>
            <surname>Bushman</surname>
          </string-name>
          ,
          <article-title>Efects of violent and nonviolent sexualized media on aggressionrelated thoughts, feelings, attitudes, and behaviors: A meta-analytic review</article-title>
          ,
          <source>Aggressive behavior 48</source>
          (
          <year>2022</year>
          )
          <fpage>111</fpage>
          -
          <lpage>136</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>