<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">From Explanation to Detection: Multimodal Insights into Disagreement in Misogynous Memes</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Giulia</forename><surname>Rizzi</surname></persName>
							<email>g.rizzi10@campus.unimib.it</email>
							<affiliation key="aff0">
								<orgName type="institution">University of Milano-Bicocca</orgName>
								<address>
									<settlement>Milan</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="institution">Universitat Politècnica de València</orgName>
								<address>
									<settlement>Valencia</settlement>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Paolo</forename><surname>Rosso</surname></persName>
							<email>prosso@dsic.upv.es</email>
							<affiliation key="aff1">
								<orgName type="institution">Universitat Politècnica de València</orgName>
								<address>
									<settlement>Valencia</settlement>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Elisabetta</forename><surname>Fersini</surname></persName>
							<email>elisabetta.fersini@unimib.it</email>
							<affiliation key="aff0">
								<orgName type="institution">University of Milano-Bicocca</orgName>
								<address>
									<settlement>Milan</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff2">
								<orgName type="department">Tenth Italian Conference on Computational Linguistics</orgName>
								<address>
									<addrLine>Dec 04 -06</addrLine>
									<postCode>2024</postCode>
									<settlement>Pisa</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">From Explanation to Detection: Multimodal Insights into Disagreement in Misogynous Memes</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">C362E1D9E94083E2570A8A11F658C37C</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T17:38+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Disagreement</term>
					<term>Perspectivism</term>
					<term>Multimodal</term>
					<term>Misogyny</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Warning: This paper contains examples of language and images that may be offensive. This paper presents a probabilistic approach to identifying the disagreement-related elements in misogynistic memes by considering both modalities that compose a meme (i.e., the visual and textual sources). Several methodologies to exploit such elements in the identification of disagreement among annotators have been investigated and evaluated on the Multimedia Automatic Misogyny Identification (MAMI) [1] dataset. The proposed unsupervised approach reaches performance comparable to, and in some cases better than, state-of-the-art approaches, but with a reduced number of parameters to be estimated. The source code of our approaches is publicly available † .</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Online hate has become a serious concern in recent years, spreading across internet platforms and causing harm to individuals across various communities. In the online environment, users have found new modes of representation to express various types of hatred, including deeply rooted ideologies and beliefs with historical origins, for example towards women <ref type="bibr" target="#b1">[2]</ref>. Detecting abusive language has therefore become an increasingly important task. The challenges introduced by these new modes of representation, which require a multimodal analysis, are further compounded by the subjectivity of the task, which derives from the fact that individuals' perceptions of what characterizes a hateful message vary widely. Such diversification is reflected in the labeling phase in the form of disagreement among annotators. Identifying elements within a sample that can lead to disagreement is of paramount importance for several reasons: for content that can lead to disagreement, specific annotation policies might be introduced, and the number of annotators might be enlarged to capture multiple perspectives <ref type="bibr" target="#b2">[3,</ref><ref type="bibr" target="#b3">4,</ref><ref type="bibr" target="#b4">5]</ref>. In this work, we propose a methodology to identify the disagreement-related elements in multimodal samples by exploring both their visual and textual components.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Related Works</head><p>Many natural language tasks, such as hate speech detection, humor detection, and sentiment analysis, involve subjectivity, since they require an interpretation based on human judgment, cultural context, or personal opinion <ref type="bibr" target="#b5">[6]</ref>. This phenomenon is reflected in datasets through multiple labels from different annotators or through confidence levels attached to ground-truth labels. Labels derived from different interpretations are therefore able to capture multiple perspectives and understandings <ref type="bibr" target="#b5">[6]</ref>. Information about annotators' disagreement has primarily been exploited as a means to improve data quality by excluding controversial instances <ref type="bibr" target="#b6">[7,</ref><ref type="bibr" target="#b7">8]</ref>. Alternatively, aiming to improve model performance, different strategies have been developed to exploit disagreement information in the training phase. For instance, in <ref type="bibr" target="#b8">[9]</ref>, the authors assign weights to instances to prioritize the ones with higher confidence levels. Another commonly adopted strategy <ref type="bibr" target="#b5">[6,</ref><ref type="bibr" target="#b9">10]</ref> aims at directly learning from disagreement without considering any aggregated label. 
While a considerable amount of research has been conducted to understand the reasons behind annotators' disagreement <ref type="bibr" target="#b10">[11,</ref><ref type="bibr" target="#b11">12,</ref><ref type="bibr" target="#b7">8]</ref> and to leverage disagreement when training classification models <ref type="bibr" target="#b12">[13,</ref><ref type="bibr" target="#b13">14,</ref><ref type="bibr" target="#b14">15,</ref><ref type="bibr" target="#b15">16,</ref><ref type="bibr" target="#b16">17,</ref><ref type="bibr" target="#b17">18,</ref><ref type="bibr" target="#b18">19]</ref>, comparatively little attention has been devoted to the explanation and a priori recognition of disagreement in hateful content. A taxonomy of possible reasons leading to annotators' disagreement has been proposed by <ref type="bibr" target="#b11">[12]</ref>. This taxonomy articulates four macro categories of reasons behind disagreement: sloppy annotations, ambiguity, missing information, and subjectivity. Moreover, the authors evaluate the impact of the different types on classification performance.</p><p>Only recently have works focused on the task of explaining disagreement <ref type="bibr" target="#b19">[20,</ref><ref type="bibr" target="#b20">21,</ref><ref type="bibr" target="#b21">22,</ref><ref type="bibr" target="#b22">23]</ref>. In <ref type="bibr" target="#b20">[21]</ref>, the authors propose exploratory text visualization techniques as a method for analyzing different perspectives from annotated data. In <ref type="bibr" target="#b21">[22]</ref>, the authors identify textual constituents that contribute to the explanation of hateful messages by exploiting integrated gradients within a filtering strategy. A more recent approach <ref type="bibr" target="#b22">[23]</ref> proposes a probabilistic semantic approach for the identification of disagreement-related constituents (e.g., textual elements) in hateful content. 
Overall, the findings indicate that, while LLMs can yield promising results, comparable outcomes can be attained with less complex strategies and fewer computational resources. While previous research has concentrated on the analysis of textual disagreement, this study represents, to the best of our knowledge, a first insight into the explanation of multimodal disagreement. In particular, we have revised the methodology proposed in <ref type="bibr" target="#b22">[23]</ref> and extended it to the multimodal environment in order to consider not only textual elements but also visual ones.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Proposed Approach</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Identification of Disagreement-Related Elements</head><p>The first phase of the proposed approach aims to evaluate the relationship between the elements (both visual and textual) that compose a meme and the disagreement among annotators. Preliminary preprocessing operations have been performed before identifying disagreement-related elements. Regarding the textual component, standard operations (i.e., tokenization, lemmatization, lowercasing, and stop-word removal) have been performed to identify a valid set of tokens that might be related to disagreement; to guarantee a more robust evaluation, tokens that appear fewer than 10 times in the dataset have been removed. Considering the image component, the set of 14 human-readable concepts (tags) identified by <ref type="bibr" target="#b23">[24]</ref> to capture specific characteristics of misogynous content has been adopted. As proposed by the authors, tags were extracted via the Clarifai API <ref type="bibr" target="#b24">[25]</ref>.</p><p>The preprocessing steps allowed us to extract a list of visual and textual elements from each meme in the dataset.</p><p>In order to measure the relationship between each element in the memes and the disagreement among annotators, the approach proposed in <ref type="bibr" target="#b22">[23]</ref> has been extended to a multimodal scenario. In particular, <ref type="bibr" target="#b22">[23]</ref> introduces a methodology to identify disagreement-related constituents that is, however, limited to textual content. The approach includes a strategy to identify disagreement-related textual constituents and an approach for generalization towards unseen textual constituents. 
Both methods have been extended to a multimodal scenario in order to identify disagreement-related elements in both the textual and visual sources that compose a meme.</p><p>Given an element 𝑒, a corresponding Element Disagreement Score (EDS(e)) has been computed according to the following equation:</p><formula xml:id="formula_0">𝐸𝐷𝑆(𝑒) = 𝑃 (𝐴𝑔𝑟𝑒𝑒|𝑒) − 𝑃 (¬𝐴𝑔𝑟𝑒𝑒|𝑒)<label>(1)</label></formula><p>where 𝑃 (𝐴𝑔𝑟𝑒𝑒|𝑒) represents the conditional probability that there is agreement on a meme given that the meme contains the element 𝑒. Analogously, 𝑃 (¬𝐴𝑔𝑟𝑒𝑒|𝑒) denotes the conditional probability that there is no agreement on a meme given that the meme contains the element 𝑒. Since the EDS is a difference between two complementary probabilities, it is bounded within the range of -1 to +1. A higher positive score indicates stronger agreement among annotators, whereas a more negative score indicates stronger disagreement.</p><p>The score can be estimated on the training data and exploited to identify additional disagreement-related elements on unseen memes.</p></div>
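As an illustrative sketch (not the authors' released code), the score in Eq. (1) can be estimated by counting, for each element, how many of the memes containing it received full annotator agreement; the input format (a list of (elements, agree) pairs) is an assumption made for the example:

```python
from collections import defaultdict

def element_disagreement_scores(memes):
    """Estimate EDS(e) = P(Agree | e) - P(not Agree | e) for every element.

    `memes` is a list of (elements, agree) pairs, where `elements` holds the
    textual tokens and visual tags of a meme and `agree` is True when all
    annotators assigned the same label (hypothetical input format).
    """
    agree_counts = defaultdict(int)  # memes containing e with full agreement
    total_counts = defaultdict(int)  # memes containing e overall
    for elements, agree in memes:
        for e in set(elements):
            total_counts[e] += 1
            agree_counts[e] += int(agree)
    scores = {}
    for e, n in total_counts.items():
        p_agree = agree_counts[e] / n
        # Difference of two complementary probabilities, bounded in [-1, +1].
        scores[e] = p_agree - (1 - p_agree)
    return scores

# Toy data: "flu" only occurs with agreement, "market" only with disagreement.
train = [({"flu", "woman"}, True), ({"market", "woman"}, False),
         ({"flu"}, True), ({"market"}, False)]
eds = element_disagreement_scores(train)
```

An element occurring only in memes with full agreement thus receives an EDS of +1, and one occurring only in contested memes an EDS of -1.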
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">Disagreement identification</head><p>Once the Element Disagreement Scores have been estimated for each visual and textual element in the training dataset, they can be exploited to qualify the level of disagreement on unseen samples. Analogously to what was carried out in <ref type="bibr" target="#b22">[23]</ref>, different aggregation strategies have been investigated, relying on the hypothesis that the identified elements can be exploited to identify disagreement thanks to their different distributions in samples with and without agreement.</p><p>For each meme in the test set, the corresponding list of elements and the corresponding Element Disagreement Scores estimated on the training data have been extracted. In particular, for each meme, the textual and visual elements have been identified and paired with the corresponding score, when available. The Multimodal Disagreement Score (MDS) has been estimated according to the following strategies: Sum, Mean, Median, and Minimum. A threshold 𝜏 has been estimated via a grid-search approach for each strategy.</p><p>A qualitative evaluation, comprising a comparison with specific misogynistic terminology and an evaluation of the keywords included in the dataset creation phase, has been performed to assess the quality of the EDS, while both the F1-score for the two considered classes (agreement (+) and disagreement (-)) and a global F1-score have been computed to validate the MDS.</p></div>
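The aggregation step can be sketched as follows; the function names and input format are illustrative assumptions, not the authors' implementation:

```python
import statistics

def multimodal_disagreement_score(elements, eds, strategy="mean"):
    """Aggregate the EDS values of a meme's elements into a single MDS."""
    scores = [eds[e] for e in elements if e in eds]  # skip unscored elements
    if not scores:
        return 0.0
    aggregate = {"sum": sum, "mean": statistics.mean,
                 "median": statistics.median, "minimum": min}[strategy]
    return aggregate(scores)

def predict_agreement(elements, eds, tau, strategy="mean"):
    """Predict full annotator agreement when the MDS exceeds threshold tau."""
    return multimodal_disagreement_score(elements, eds, strategy) > tau

# Toy scores for a meme mixing agreement- and disagreement-related elements.
toy_eds = {"flu": 1.0, "market": -0.64, "woman": 0.2}
meme = {"flu", "market", "woman"}
```

In practice, 𝜏 would be selected by a grid search maximising the F1-score on held-out data, as described above.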
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3.">Generalization towards unseen elements</head><p>The score estimation is strictly based on what is observed in the training data, resulting in missing scores for elements that do not appear in the training samples. This is particularly relevant for the textual components rather than the visual ones: while we can assume an open-world vocabulary for the textual source (where some terms in unseen data may not appear in the training set), we limited the visual tags to a closed-world setting (only the same 14 tags can be considered in both training and unseen memes). Since we need to generalize only on unseen textual constituents, for each (unseen) textual element 𝑒̂, an approximated EDS score has been computed as follows:</p><p>• Embeddings of the training lexicon: the contextualized embedding representation of each textual element 𝑒 has been obtained via mBERT <ref type="bibr" target="#b25">[26]</ref>.</p><p>An average embedding vector representation x ⃗ 𝑒 is computed to jointly represent the multiple embedding representations of 𝑒 derived from the different contexts where it occurs. In particular, given an element 𝑒 and 𝑁 sentences containing it, its vector representation x ⃗ 𝑒 is obtained by a simple average:</p><formula xml:id="formula_1">x ⃗ 𝑒 = 𝑁 ∑︀ 𝑖=1 v ⃗ 𝑖/𝑁</formula><p>where v ⃗ 𝑖 is the contextualized embedding vector related to the 𝑖 𝑡ℎ occurrence of 𝑒, obtained through mBERT. 
• Embeddings of the unseen term: given an unseen textual element 𝑒̂ within a given sentence, its contextualized embedding representation has been computed via mBERT <ref type="bibr" target="#b25">[26]</ref>.</p><p>• Most similar constituents: given an unseen textual element 𝑒̂ with the corresponding embedding v ⃗ 𝑒̂ and the average embedding x ⃗ 𝑒 of a training element 𝑒, the set 𝐷 of constituents most similar to 𝑒̂ is determined according to:</p><formula xml:id="formula_2">𝐷 = ⋃︁ 𝑒 {𝑒|𝑐𝑜𝑠(x ⃗ 𝑒, v ⃗ 𝑒̂) ≥ 𝜓}<label>(2)</label></formula><p>where</p><formula xml:id="formula_3">𝑐𝑜𝑠(x ⃗ 𝑒, v ⃗ 𝑒̂)</formula><p>is the cosine similarity between the average contextualized embedding representation of 𝑒 and the embedding of 𝑒̂, and 𝜓 is a threshold estimated via grid search.</p><p>• Unseen term score: the EDS score for an unseen textual element 𝑒̂ is computed as the weighted average of the scores of the most similar constituents 𝑒 of the training lexicon:</p><formula xml:id="formula_4">𝐸𝐷𝑆(𝑒̂) = ∑︀ 𝑒∈𝐷 [𝑐𝑜𝑠(x ⃗ 𝑒, v ⃗ 𝑒̂) • 𝐸𝐷𝑆(𝑒)] ∑︀ 𝑒∈𝐷 𝑐𝑜𝑠(x ⃗ 𝑒, v ⃗ 𝑒̂)<label>(3)</label></formula><p>• Multimodal Disagreement Score with unseen constituents: all the above strategies for MDS estimation have been extended to also include elements that do not belong to the training lexicon and for which the EDS score has been estimated. In particular, given a multimodal sample 𝑠, the aggregation functions presented in Section 3.2 consider in this case the 𝐸𝐷𝑆 values of both seen (via 𝐸𝐷𝑆(𝑒)) and unseen (via 𝐸𝐷𝑆(𝑒̂)) elements. Such generalized aggregation functions will later be referred to with the prefix 𝐺−.</p></div>
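The generalization step amounts to a similarity-weighted average over sufficiently similar training elements. A minimal sketch, assuming precomputed average mBERT vectors and treating 𝜓 as a lower bound on cosine similarity:

```python
import numpy as np

def cosine(u, v):
    """Cosine similarity between two vectors."""
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

def estimate_unseen_eds(v_unseen, train_embeddings, eds, psi):
    """Approximate the EDS of an unseen term as the cosine-weighted average
    of the EDS of training elements whose similarity reaches psi.

    `train_embeddings` maps each training element to its average embedding
    vector (hypothetical precomputed dictionary).
    """
    sims = {e: cosine(x, v_unseen) for e, x in train_embeddings.items()}
    similar = {e: s for e, s in sims.items() if s >= psi}
    if not similar:
        return None  # no sufficiently similar constituent: leave unscored
    weighted = sum(s * eds[e] for e, s in similar.items())
    return weighted / sum(similar.values())

# Toy 2-d "embeddings" standing in for averaged mBERT vectors.
emb = {"a": np.array([1.0, 0.0]), "b": np.array([0.0, 1.0])}
toy_eds = {"a": 1.0, "b": -1.0}
```

When no training element passes the similarity threshold, the unseen term is simply left unscored and ignored by the aggregation functions.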
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Results</head><p>The proposed approach has been evaluated on the Multimedia Automatic Misogyny Identification (MAMI) dataset <ref type="bibr" target="#b0">[1]</ref>, consisting of 10,000 memes for training and 1,000 memes for testing <ref type="foot" target="#foot_0">2</ref>. The dataset comprises a range of memes that exemplify various forms of misogyny, including shaming, stereotyping, objectification, and violence. Each meme has been labeled by three crowdsourced annotators for misogynistic content <ref type="foot" target="#foot_1">3</ref>, with an estimated Fleiss' kappa <ref type="bibr" target="#b26">[27]</ref> coefficient equal to 0.5767. In particular, the proposed approach has been adopted to estimate an Element Disagreement Score (EDS) for each element and, consequently, an MDS for each meme in the dataset.</p><p>Table <ref type="table" target="#tab_0">1</ref> reports the top-10 highest positive and lowest negative disagreement scores derived for the textual component. We can notice that terms rarely linked with misogynistic messages (e.g., flu), terms commonly used to address women in a harmful way (e.g., whale), and terms exploiting stereotypes (e.g., gamer and programmer) achieve a high positive score, indicating a strong relation with agreement. Additionally, some personal names of famous people (i.e., Bernie and Miley) appear within the ranking. In particular, such names might appear in memes as the target of a hateful message, referring to their personal life, physical appearance, or specific events that involved them. As a consequence, depending on the reasons that lead to such criticism (gender, physical appearance, and personal choices for Miley Cyrus vs. political stance and career, without the same gendered connotations, for Bernie Sanders), there might be disagreement about misogyny. 
Table <ref type="table">2</ref> reports the top-5 highest positive and lowest negative disagreement scores derived for the visual component. It is easy to notice that all the scores are positive and small, denoting that such tags are only weakly related to the agreement label. Figure <ref type="figure" target="#fig_0">1</ref> reports an example of a meme with disagreement, along with the visual representation of the EDS of its textual and visual elements. Moreover, as highlighted with a grey bar, some of the reported scores have been estimated: such scores correspond to constituents that are not present in the training dataset and for which it was not possible to calculate the EDS directly, so the visual representation of the scores related to such elements corresponds to the score obtained through the estimation strategy. Overall, it is easy to notice the presence of elements strongly related to disagreement (i.e., sexual and market), highlighted in pink.</p><p>The concept of the "sexual marketplace" is often the </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Table 2</head><p>Tags with the highest positive and lowest negative scores</p><p>subject of debate, particularly in relation to its intersection with misogynistic ideologies <ref type="bibr" target="#b27">[28,</ref><ref type="bibr" target="#b28">29]</ref>. Some supporters, often aligned with "manosphere" or "red pill" ideologies, argue that the sexual marketplace disproportionately empowers women, giving them more control over sexual selection and relationships, which can disadvantage men. Critics, on the other hand, assert that this perspective reduces human relationships to transactional exchanges and objectifies both genders, ultimately reinforcing misogynistic attitudes. This latter viewpoint holds that framing relationships in market terms devalues emotional connection and perpetuates harmful stereotypes about women's worth being tied solely to their sexual desirability. The achieved results suggest that the approach is able to detect this variety of interpretations and reflect it within the EDS scores. Figure <ref type="figure" target="#fig_1">2</ref> reports two memes that share the same text but differ in the image. Despite this commonality, the memes have been labeled differently: while the first meme has been labeled as misogynous by 2 annotators out of 3, the second one has been unanimously labeled as non-misogynous. Since the memes share a common textual representation, the derived textual elements and textual EDS are also equal, resulting in an indistinguishable representation that is ineffective for disagreement identification. Moreover, although the memes differ in their visual content, resulting in different tags and, therefore, different visual EDS, as previously mentioned, this component alone is not sufficient for disagreement prediction. 
The findings demonstrate the necessity of joint consideration of both modalities. All the proposed aggregation strategies have been implemented, both considering the modalities individually and jointly. Table <ref type="table" target="#tab_2">3</ref> and Table <ref type="table">4</ref> summarise the results achieved on disagreement identification considering only the scores of elements derived from the textual component (i.e., terms) and only the scores of elements derived from the visual component (i.e., tags), respectively. Table <ref type="table">5</ref>, instead, summarises the results achieved by aggregating the scores derived from all the elements (i.e., terms and tags). The results achieved on the textual component alone highlight G-Mean as the best-performing approach. Overall, the estimation strategy results in an improvement in performance of up to 6%, confirming the ability of the proposed strategy to capture disagreement relationships for unseen terms. Furthermore, BERT <ref type="bibr" target="#b29">[30]</ref>  <ref type="foot" target="#foot_2">4</ref> has been reported as a state-of-the-art baseline for unimodal textual classification. The achieved results show that BERT performs better on the majority class, struggling to predict the disagreement class. The proposed approach, instead, leads to a more balanced performance across the two classes.</p><p>Table <ref type="table">4</ref> reports the performances of the different approaches for disagreement identification considering the visual component only. While the Sum approach (i.e., the best-performing tag-based approach) demonstrates satisfactory performance in identifying positive instances (achieving an F1+ of 0.69), it exhibits considerable difficulty in accurately identifying negative instances.</p><p>Finally, Table <ref type="table">5</ref> reports the performances of the different approaches for disagreement identification jointly considering both modalities. 
Furthermore, for a better comparison of the performance achieved by the proposed approach, a state-of-the-art baseline for multimodal classification has been implemented: CLIP <ref type="bibr" target="#b30">[31]</ref> <ref type="foot" target="#foot_3">5</ref>. The inclusion of both modalities leads to a slight improvement in performance that, however, remains quite poor, highlighting the difficulty of the task. The inclusion of the unseen-constituent estimation leads to an improvement in performance (except for the sum-based method) of up to 8% for the mean-based approach. However, the best performances are achieved by the Minimum and G-Minimum approaches, for which the estimation methodology is not effective. Such behavior may be attributed to the imbalance in the dataset: the larger the number of samples with agreement, the greater the number of agreement-related terms that impact the estimation phase.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Table 5</head><p>Comparison of the different approaches for disagreement detection considering both textual and visual components. The agreement label (+) indicates complete annotator agreement, regardless of the misogyny value, while the agreement label (-) denotes samples without complete agreement. Bold denotes the best approach in terms of F1-score, and underline represents the best approach according to the disagreement label. 𝜓 and 𝜏 represent the best hyperparameters estimated via a grid-search approach, and 𝐸 is the set of elements.</p><p>Consequently, the estimation of scores for unseen elements is likely to be positive due to the aforementioned imbalance. Overall, the findings suggest that achieving a balanced performance remains challenging.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Conclusion and Future Works</head><p>This paper proposes a probabilistic approach to identify disagreement-related elements in multimodal content. The proposed approach allows for the identification of elements that could serve as a proxy to identify samples that might be perceived differently by the annotators and, therefore, could lead to disagreement. The achieved results highlight the difficulty of the task, denoting the need for more advanced approaches. Future work will include different strategies for image analysis in order to provide a better description of the image and of all the elements that compose it. Furthermore, a study of compositionality might be carried out to better represent the relationships among such elements inside the meme. The meaning of a meme is often derived from the meanings of its individual parts (i.e., the image and the text) and the way they are combined. By analyzing how the different elements interact and contribute to the overall message, it is possible to gain a deeper understanding of how the meaning is represented within the different modalities. This will help in identifying complex patterns and improve the accuracy of classification models.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Visual representation of disagreement scores distinguishing among textual and visual elements. Positive and negative scores are represented with green and pink respectively. The gray bar denotes elements for which the EDS has been estimated, while the white color represents elements with an EDS equal to zero.</figDesc><graphic coords="4,151.80,84.19,291.69,72.44" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: Visual representation of disagreement scores distinguishing among textual and visual elements for two samples in the dataset. Positive and negative scores are represented with green and pink respectively. The white color represents elements with EDS equal to zero.</figDesc><graphic coords="5,151.80,84.19,291.68,92.46" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1</head><label>1</label><figDesc>Terms with the highest positive and lowest negative scores</figDesc><table><row><cell>Term</cell><cell>EDS</cell><cell>Term</cell><cell>EDS</cell></row><row><cell>flu</cell><cell>1.00</cell><cell>market</cell><cell>−0.64</cell></row><row><cell>folk</cell><cell>1.00</cell><cell>fetish</cell><cell>−0.60</cell></row><row><cell>bug</cell><cell>1.00</cell><cell>nut</cell><cell>−0.57</cell></row><row><cell>Bernie</cell><cell>1.00</cell><cell>hotel</cell><cell>−0.50</cell></row><row><cell>whale</cell><cell>1.00</cell><cell>apologize</cell><cell>−0.45</cell></row><row><cell>feeling</cell><cell>0.90</cell><cell>Miley</cell><cell>−0.45</cell></row><row><cell>gamer</cell><cell>0.87</cell><cell>lonely</cell><cell>−0.43</cell></row><row><cell>rest</cell><cell>0.87</cell><cell>award</cell><cell>−0.43</cell></row><row><cell>programmer</cell><cell>0.87</cell><cell>coke</cell><cell>−0.43</cell></row><row><cell>san</cell><cell>0.83</cell><cell>blowjob</cell><cell>−0.43</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 3</head><label>3</label><figDesc>Comparison of the different approaches for disagreement detection considering the textual component only. The agreement label (+) indicates complete annotator agreement, regardless of the misogyny value, while the agreement label (-) denotes samples without complete agreement. Bold denotes the best approach in terms of F1-score, and underline represents the best approach according to the disagreement label.</figDesc><table><row><cell>Approach</cell><cell>𝜓</cell><cell>𝜏</cell><cell>F1+ F1-</cell><cell>F1 Score</cell></row><row><cell>Sum</cell><cell>-</cell><cell>3.1</cell><cell>0.61 0.39</cell><cell>0.50</cell></row><row><cell>Mean</cell><cell>-</cell><cell>0.2</cell><cell>0.78 0.20</cell><cell>0.49</cell></row><row><cell>Median</cell><cell>-</cell><cell>0.2</cell><cell>0.07 0.79</cell><cell>0.43</cell></row><row><cell>Minimum</cell><cell>-</cell><cell>-0.1</cell><cell>0.29 0.75</cell><cell>0.52</cell></row><row><cell>G-Sum</cell><cell>0.8</cell><cell>3.1</cell><cell>0.65 0.37</cell><cell>0.51</cell></row><row><cell>G-Mean</cell><cell>0.8</cell><cell>0.2</cell><cell>0.73 0.34</cell><cell>0.53</cell></row><row><cell>G-Median</cell><cell>0.8</cell><cell>0.2</cell><cell>0.77 0.21</cell><cell>0.49</cell></row><row><cell>G-Minimum</cell><cell>0.8</cell><cell>-0.1</cell><cell>0.75 0.30</cell><cell>0.52</cell></row><row><cell>BERT [30]</cell><cell>-</cell><cell>-</cell><cell>0.80 0.00</cell><cell>0.40</cell></row></table><note>𝜓 and 𝜏 represent the best hyperparameters estimated via a grid-search approach.</note></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_0">Although both a training and a test dataset are provided, only the training dataset is adopted, as the proposed work is focused on the analysis and prediction of disagreement and the test dataset is constructed to include only samples with complete agreement. The training dataset, instead, is characterized by 65% of samples with complete agreement. It has therefore been divided so as to use 90% for token estimation and the remaining 10% for evaluation.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_1">Additionally, a boolean disagreement label has been derived to represent complete agreement among annotators. In particular, this label is set to 1 if all the annotators indicated the same label, and to 0 otherwise.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_2">BERT has been implemented and fine-tuned using the HuggingFace framework with default hyperparameters. We adopted "bert-base-cased", available at https://huggingface.co/google-bert/bert-base-cased.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_3">CLIP has been implemented and fine-tuned using the HuggingFace framework with default hyperparameters. In particular, we used the version available at https://huggingface.co/openai/clip-vit-large-patch14, to which we concatenated a linear layer for binary classification.</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>We acknowledge the support of the PNRR ICSC National Research Centre for High Performance Computing, Big Data and Quantum Computing (CN00000013), under the NRRP MUR program funded by the NextGenerationEU. The work of Paolo Rosso was in the framework of the FairTransNLP-Stereotypes research project (PID2021-124361OB-C31) funded by MCIN/AEI/10.13039/501100011033 and by ERDF, EU A way of making Europe.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">SemEval-2022 task 5: Multimedia automatic misogyny identification</title>
		<author>
			<persName><forename type="first">E</forename><surname>Fersini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Gasparini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Rizzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Saibene</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Chulvi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Rosso</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Lees</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Sorensen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022), Association for Computational Linguistics</title>
				<meeting>the 16th International Workshop on Semantic Evaluation (SemEval-2022), Association for Computational Linguistics<address><addrLine>Seattle, United States</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page" from="533" to="549" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">How do we study misogyny in the digital age? a systematic literature review using a computational linguistic approach</title>
		<author>
			<persName><forename type="first">L</forename><surname>Fontanella</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Chulvi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Ignazzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Sarra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Tontodimamma</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Humanities and Social Sciences Communications</title>
		<imprint>
			<biblScope unit="volume">11</biblScope>
			<biblScope unit="page" from="1" to="15" />
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Handling disagreement in hate speech modelling</title>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">Kralj</forename><surname>Novak</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Scantamburlo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Pelicon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Cinelli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Mozetič</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Zollo</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page" from="681" to="695" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">GRaSP: A multilayered annotation scheme for perspectives</title>
		<author>
			<persName><forename type="first">C</forename><surname>Van Son</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Caselli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Fokkens</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Maks</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Morante</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Aroyo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Vossen</surname></persName>
		</author>
		<ptr target="https://aclanthology.org/L16-1187" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC&apos;16), European Language Resources Association (ELRA)</title>
				<editor>
			<persName><forename type="first">N</forename><surname>Calzolari</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">K</forename><surname>Choukri</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">T</forename><surname>Declerck</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Goggi</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><surname>Grobelnik</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">B</forename><surname>Maegaard</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><surname>Mariani</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">H</forename><surname>Mazo</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Moreno</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><surname>Odijk</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Piperidis</surname></persName>
		</editor>
		<meeting>the Tenth International Conference on Language Resources and Evaluation (LREC&apos;16), European Language Resources Association (ELRA)<address><addrLine>Portorož, Slovenia</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2016">2016</date>
			<biblScope unit="page" from="1177" to="1184" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Perspectivist approaches to natural language processing: a survey</title>
		<author>
			<persName><forename type="first">S</forename><surname>Frenda</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Abercrombie</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Basile</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Pedrani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Panizzon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">T</forename><surname>Cignarella</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Marco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Bernardi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Language Resources and Evaluation</title>
				<imprint>
			<date type="published" when="2024">2024</date>
			<biblScope unit="page" from="1" to="28" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Learning from disagreement: A survey</title>
		<author>
			<persName><forename type="first">A</forename><surname>Uma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Fornaciari</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Hovy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Paun</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Plank</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Poesio</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Artificial Intelligence Research</title>
		<imprint>
			<biblScope unit="volume">72</biblScope>
			<biblScope unit="page" from="1385" to="1470" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">From annotator agreement to noise models</title>
		<author>
			<persName><forename type="first">B</forename><surname>Beigman Klebanov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Beigman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Computational Linguistics</title>
		<imprint>
			<biblScope unit="volume">35</biblScope>
			<biblScope unit="page" from="495" to="503" />
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">The origin and value of disagreement among data labelers: A case study of individual differences in hate speech annotation</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Sang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Stanton</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Information for a Better World: Shaping the Global Future: 17th International Conference, iConference 2022, Virtual Event</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2022-03-04">February 28 - March 4, 2022</date>
			<biblScope unit="page" from="425" to="444" />
		</imprint>
	</monogr>
	<note>Proceedings, Part I</note>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">A crowdsourced frame disambiguation corpus with ambiguity</title>
		<author>
			<persName><forename type="first">A</forename><surname>Dumitrache</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Mediagroep</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Aroyo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Welty</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of NAACL-HLT</title>
				<meeting>NAACL-HLT</meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="2164" to="2170" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Beyond black &amp; white: Leveraging annotator disagreement via soft-label multi-task learning</title>
		<author>
			<persName><forename type="first">T</forename><surname>Fornaciari</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Uma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Paun</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Plank</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Hovy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Poesio</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Association for Computational Linguistics</title>
				<meeting>the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Association for Computational Linguistics</meeting>
		<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Crowd worker strategies in relevance judgment tasks</title>
		<author>
			<persName><forename type="first">L</forename><surname>Han</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Maddalena</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Checco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Sarasua</surname></persName>
		</author>
		<author>
			<persName><forename type="first">U</forename><surname>Gadiraju</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Roitero</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Demartini</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 13th international conference on web search and data mining</title>
				<meeting>the 13th international conference on web search and data mining</meeting>
		<imprint>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="241" to="249" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Why don&apos;t you do it right? analysing annotators&apos; disagreement in subjective tasks</title>
		<author>
			<persName><forename type="first">M</forename><surname>Sandri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Leonardelli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Tonelli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Ježek</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics</title>
				<meeting>the 17th Conference of the European Chapter of the Association for Computational Linguistics</meeting>
		<imprint>
			<date type="published" when="2023">2023</date>
			<biblScope unit="page" from="2428" to="2441" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<author>
			<persName><forename type="first">S</forename><surname>Shahriar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Solorio</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2305.01050</idno>
	<title level="m">SafeWebUH at SemEval-2023 task 11: Learning annotator disagreement in derogatory text: Comparison of direct training vs aggregation</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">eevvgg at SemEval-2023 task 11: Offensive language classification with rater-based information</title>
		<author>
			<persName><forename type="first">E</forename><surname>Gajewska</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2023.semeval-1.24</idno>
		<ptr target="https://aclanthology.org/2023.semeval-1.24" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023), Association for Computational Linguistics</title>
				<editor>
			<persName><forename type="first">A</forename><forename type="middle">K</forename><surname>Ojha</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><forename type="middle">S</forename><surname>Doğruöz</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">G</forename><surname>Da San Martino</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">H</forename><surname>Tayyar Madabushi</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">R</forename><surname>Kumar</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">E</forename><surname>Sartori</surname></persName>
		</editor>
		<meeting>the 17th International Workshop on Semantic Evaluation (SemEval-2023), Association for Computational Linguistics<address><addrLine>Toronto, Canada</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2023">2023</date>
			<biblScope unit="page" from="171" to="176" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">University at Buffalo at SemEval-2023 task 11: MASDA, modelling annotator sensibilities through disaggregation</title>
		<author>
			<persName><forename type="first">M</forename><surname>Sullivan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Yasin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">L</forename><surname>Jacobs</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 17th International Workshop on Semantic Evaluation</title>
				<meeting>the 17th International Workshop on Semantic Evaluation (SemEval-2023)</meeting>
		<imprint>
			<date type="published" when="2023">2023</date>
			<biblScope unit="page" from="978" to="985" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">AI-UPV at EXIST 2023: Sexism characterization using large language models under the learning with disagreements regime</title>
		<author>
			<persName><forename type="first">A</forename><surname>De Paula</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Rizzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Fersini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Spina</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">CEUR Workshop Proceedings</title>
				<imprint>
			<publisher>CEUR-WS</publisher>
			<date type="published" when="2023">2023</date>
			<biblScope unit="volume">3497</biblScope>
			<biblScope unit="page" from="985" to="999" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">When multiple perspectives and an optimization process lead to better performance, an automatic sexism identification on social media with pretrained transformers in a soft label context</title>
		<author>
			<persName><forename type="first">J</forename><surname>Erbani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Egyed-Zsigmond</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Nurbakova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P.-E</forename><surname>Portier</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Working Notes of CLEF</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<monogr>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">E</forename><surname>Vallecillo-Rodríguez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Del Arco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">A</forename><surname>Ureña-López</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">T</forename><surname>Martín-Valdivia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Montejo-Ráez</surname></persName>
		</author>
		<title level="m">Integrating annotator information in transformer finetuning for sexism detection</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
	<note>Working Notes of CLEF</note>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Perspectives on hate: General vs. domain-specific models</title>
		<author>
			<persName><forename type="first">G</forename><surname>Rizzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Fontana</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Fersini</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 3rd Workshop on Perspectivist Approaches to NLP (NLPerspectives)@ LREC-COLING 2024</title>
				<meeting>the 3rd Workshop on Perspectivist Approaches to NLP (NLPerspectives)@ LREC-COLING 2024</meeting>
		<imprint>
			<date type="published" when="2024">2024</date>
			<biblScope unit="page" from="78" to="83" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Change my mind: How syntax-based hate speech recognizer can uncover hidden motivations based on different viewpoints</title>
		<author>
			<persName><forename type="first">M</forename><surname>Mastromattei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Basile</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">M</forename><surname>Zanzotto</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">1st Workshop on Perspectivist Approaches to Disagreement in NLP, NLPerspectives 2022 as part of Language Resources and Evaluation Conference, LREC 2022 Workshop, European Language Resources Association (ELRA)</title>
				<imprint>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page" from="117" to="125" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Beyond explanation: A case for exploratory text visualizations of non-aggregated, annotated datasets</title>
		<author>
			<persName><forename type="first">L</forename><surname>Havens</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Bach</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Terras</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Alex</surname></persName>
		</author>
		<ptr target="https://aclanthology.org/2022.nlperspectives-1.10" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 1st Workshop on Perspectivist Approaches to NLP @LREC2022, European Language Resources Association</title>
				<editor>
			<persName><forename type="first">G</forename><surname>Abercrombie</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">V</forename><surname>Basile</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Tonelli</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">V</forename><surname>Rieser</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Uma</surname></persName>
		</editor>
		<meeting>the 1st Workshop on Perspectivist Approaches to NLP @LREC2022, European Language Resources Association<address><addrLine>Marseille, France</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page" from="73" to="82" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Integrated gradients as proxy of disagreement in hateful content</title>
		<author>
			<persName><forename type="first">A</forename><surname>Astorino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Rizzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Fersini</surname></persName>
		</author>
		<ptr target="CEUR-WS.org" />
	</analytic>
	<monogr>
		<title level="m">CEUR Workshop Proceedings</title>
				<imprint>
			<date type="published" when="2023">2023</date>
			<biblScope unit="volume">3596</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Unraveling disagreement constituents in hateful speech</title>
		<author>
			<persName><forename type="first">G</forename><surname>Rizzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Astorino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Rosso</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Fersini</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">European Conference on Information Retrieval</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2024">2024</date>
			<biblScope unit="page" from="21" to="29" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">Recognizing misogynous memes: Biased models and tricky archetypes</title>
		<author>
			<persName><forename type="first">G</forename><surname>Rizzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Gasparini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Saibene</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Rosso</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Fersini</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Information Processing &amp; Management</title>
		<imprint>
			<biblScope unit="volume">60</biblScope>
			<biblScope unit="page">103474</biblScope>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<monogr>
		<author>
			<persName><surname>Clarifai</surname></persName>
		</author>
		<ptr target="https://docs.clarifai.com/" />
		<title level="m">Clarifai guide</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">BERT: Pre-training of deep bidirectional transformers for language understanding</title>
		<author>
			<persName><forename type="first">J</forename><surname>Devlin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M.-W</forename><surname>Chang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Toutanova</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of NAACL-HLT</title>
				<meeting>NAACL-HLT</meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="4171" to="4186" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<analytic>
		<title level="a" type="main">Measuring nominal scale agreement among many raters</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">L</forename><surname>Fleiss</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Psychological Bulletin</title>
		<imprint>
			<biblScope unit="volume">76</biblScope>
			<biblScope unit="page">378</biblScope>
			<date type="published" when="1971">1971</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b27">
	<monogr>
		<author>
			<persName><forename type="first">D</forename><surname>Ging</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Neary</surname></persName>
		</author>
		<title level="m">Gender, sexuality, and bullying special issue editorial</title>
				<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b28">
	<analytic>
		<title level="a" type="main">Exploring misogyny through time: From historical origins to modern complexities</title>
		<author>
			<persName><forename type="first">E</forename><surname>Ignazzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Sarra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Fontanella</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Philosophies of Communication</title>
		<imprint>
			<biblScope unit="page" from="195" to="214" />
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b29">
	<analytic>
		<title level="a" type="main">BERT: Pre-training of deep bidirectional transformers for language understanding</title>
		<author>
			<persName><forename type="first">J</forename><surname>Devlin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M.-W</forename><surname>Chang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Toutanova</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of NAACL-HLT</title>
				<meeting>NAACL-HLT</meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="4171" to="4186" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b30">
	<analytic>
		<title level="a" type="main">Learning transferable visual models from natural language supervision</title>
		<author>
			<persName><forename type="first">A</forename><surname>Radford</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">W</forename><surname>Kim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Hallacy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Ramesh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Goh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Agarwal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Sastry</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Askell</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Mishkin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Clark</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International conference on machine learning</title>
				<meeting><address><addrLine>PMLR</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="8748" to="8763" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
