<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Forum for Information Retrieval Evaluation, December</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>SAViOR: Sentiment, Sarcasm, Abuse, and Vulgarity in Online Realities (Memes)</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Krishna Tewari</string-name>
          <email>krishnatewari.rs.cse24@itbhu.ac.in</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Supriya Chanda</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Aditya Namdeo</string-name>
          <email>aditya.namdeo.met23@itbhu.ac.in</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sukomal Pal</string-name>
          <email>spal.cse@itbhu.ac.in</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Bennett University</institution>
          ,
          <addr-line>Greater Noida</addr-line>
          ,
          <country country="IN">INDIA</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Indian Institute of Technology (BHU)</institution>
          ,
          <addr-line>Varanasi</addr-line>
          ,
          <country country="IN">INDIA</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Social Media</institution>
          ,
          <addr-line>Meme classification, Sentiment Analysis, Sarcasm Detection, Abuse Detection, XLM-RoBERTa</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <volume>1</volume>
      <fpage>7</fpage>
      <lpage>20</lpage>
      <abstract>
        <p>Meme-based communication on social media often encodes sentiment, sarcasm, vulgarity, and abuse in subtle and context-dependent ways, posing significant challenges for automated detection systems. These challenges are further amplified in low-resource Indian languages, where code-mixing, transliteration, and the scarcity of annotated data complicate modeling eforts. As part of the HASOC-meme shared task at FIRE 2025, we developed and evaluated two independent runs under the team name IReL. Run 1 employed XLM-RoBERTa ifne-tuned separately for each language (Bangla, Hindi, Gujarati, and Bodo), leveraging multilingual transformer embeddings to capture semantic nuances in noisy meme texts. Run 2 applied a zero-shot approach using ChatGPT specifically for Bodo, addressing the severe lack of annotated resources for this language. Both approaches processed text-based meme content exclusively and were evaluated on the oficial leaderboard using the macro-F1 metric. Our systems achieved macro-F1 scores of 0.5082 (Bangla), 0.4630 (Hindi), 0.5920 (Gujarati), and 0.5011 (Bodo), demonstrating the eficacy of multilingual transformers for high-resource languages and the viability of zero-shot large language models for severely underrepresented languages. These results highlight the strengths of transformer-based architectures in handling linguistic diversity while exposing limitations in handling subtle contextual cues in memes.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>The analysis of online memes presents a challenging problem in natural language processing, as these
short textual and visual artifacts often encode sentiment, sarcasm, vulgarity, and abuse in subtle,
context-dependent ways. Let  = {</p>
      <p>1,  2, … ,   } denote a corpus of memes, where each meme  
is represented by a textual component   written in one or more languages, potentially code-mixed
or transliterated. The task of automated classification is then to assign each meme   a set of labels
mitigate resource scarcity.</p>
      <p>CEUR
Workshop</p>
      <p>ISSN1613-0073</p>
      <p>Formally, given a meme   with text   , our model   predicts:
  ( ) =  ̂ = { ̂ ,  ̂ ,  ̂ ,  ̂ },



where  ̂ is the predicted label set and  denotes the model parameters.</p>
      <p>The remainder of this paper is structured as follows: Section 2 reviews prior research; Section 3
describes the HASOC 2025 dataset; Section 4 details our proposed methodology; Section 5 presents
experimental results; and Section 6 concludes with key findings and future directions.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        The analysis of online memes presents a multifaceted challenge in natural language processing (NLP),
especially within the context of low-resource Indian languages. Memes often encode sentiment, sarcasm,
vulgarity, and abuse in subtle, context-dependent ways. The short textual sequences, often code-mixed
and transliterated, require models that can capture cross-lingual semantic representations along with
nuanced contextual cues [
        <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
        ].
      </p>
      <p>
        Early research in sentiment and ofensive language detection relied on traditional machine learning
approaches with handcrafted features, such as n-grams, sentiment lexicons, and syntactic patterns
[
        <xref ref-type="bibr" rid="ref5 ref6 ref7">5, 6, 7</xref>
        ]. While efective for longer, formal text, these approaches struggle with the brevity and noisiness
of memes. To address these limitations, deep learning models, particularly recurrent and convolutional
networks, were introduced [
        <xref ref-type="bibr" rid="ref8 ref9">8, 9</xref>
        ]. However, these methods often fail to generalize across languages,
especially for low-resource languages with limited annotated data.
      </p>
      <p>
        Transformer-based architectures have become the dominant paradigm for multilingual NLP tasks.
Multilingual BERT (mBERT) [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] and XLM-RoBERTa [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] have been widely adopted for cross-lingual
text classification, demonstrating the ability to transfer knowledge across languages without requiring
large parallel corpora. Specifically for Indian languages, the IndicNLP Suite [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] provides preprocessing
tools such as tokenizers and normalizers for code-mixed text, while MuRIL [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] ofers pre-trained
embeddings tuned for multiple Indian languages. Dowlagar and Mamidi [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] further explored the
combination of XLM-RoBERTa with convolutional neural networks for sentence-level classification in
technical domains, which is applicable to meme text analysis.
      </p>
      <p>
        Shared tasks have played a key role in benchmarking and advancing research on ofensive language
and hate speech detection. The HASOC (Hate Speech and Ofensive Content) shared tasks, initiated
in 2019, provided multilingual datasets for Hindi, Bengali, and other Indian languages [
        <xref ref-type="bibr" rid="ref13">13, 14, 15</xref>
        ].
The tasks focused on the detection of hate speech, ofensive content, and profanity in social media
text, highlighting challenges such as data imbalance, code-mixing, and transliteratioN. The
DravidianCodeMix shared task at FIRE 2021 further extended this line of research by providing datasets in Tamil,
Malayalam, and Kannada, emphasizing ofensive language detection in code-mixed social media content
[16].
      </p>
      <p>Understanding memes often requires multimodal analysis. The Hateful Memes Challenge by Facebook
AI [17] introduced a dataset combining textual and visual content, encouraging the development of
models capable of reasoning across modalities. Suryawanshi, Chakravarthi, and others [18] proposed a
taxonomy and dataset for detecting opinion manipulation in troll memes, reinforcing the importance of
contextual and multimodal cues in meme comprehension.</p>
      <p>Low-resource languages, such as Bodo, pose a significant challenge due to the scarcity of annotated
data. Zero-shot and few-shot learning approaches have emerged as viable solutions. Large language
models such as ChatGPT, pre-trained on extensive multilingual corpora, can be leveraged in
zeroshot settings to classify memes in low-resource languages [19]. This approach mitigates the need for
task-specific fine-tuning and enables practical solutions for severely underrepresented languages.</p>
      <p>Despite the progress of transformer-based methods, several challenges remain. Sarcasm, cultural
references, and subtle forms of abuse are often context-dependent and dificult for models to detect
accurately [20, 21]. Additionally, code-mixed sequences with transliteration and spelling variation
complicate tokenization and embedding representation. Research has shown that combining multilingual
embeddings with task-specific fine-tuning and cross-lingual transfer can improve performance [</p>
      <p>
        Benchmark-driven research underscores the importance of shared tasks in evaluating system
performance. HASOC datasets [23], for instance, have demonstrated that fine-tuned transformer models
significantly outperform traditional approaches on sentiment, sarcasm, and abusive language
classification [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. Studies by Chanda et al. [24, 25, 26, 27] illustrate that language-specific fine-tuning,
combined with cross-lingual embeddings, is efective for medium and high-resource Indian languages,
while zero-shot inference remains the most practical solution for languages with extremely limited
annotated data. Recent research also explores hybrid approaches, integrating multimodal information
and linguistic heuristics to improve robustness [18, 17, 28, 29].
      </p>
      <p>In summary, prior work highlights several key themes: the necessity of multilingual and code-mixed
text processing, the dominance of transformer-based architectures, the critical role of benchmark-driven
evaluations such as HASOC and Dravidian-CodeMix, and the potential of zero-shot learning for
lowresource languages. Building upon these strands, our work implements two independent runs for the
HASOC-meme 2025 task: Run 1 fine-tunes XLM-RoBERTa separately for each language, while Run 2
applies zero-shot inference using ChatGPT for Bodo. This hybrid methodology leverages
languagespecific embeddings and zero-shot capabilities to address the linguistic diversity and annotation sparsity
inherent in social media memes.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Dataset</title>
      <p>The dataset provided in the HASOC-meme shared task at FIRE 2025 consisted of multimodal memes
(image + text) in four Indian languages: Hindi, Bengali, Gujarati, and Bodo. Each meme was annotated
for four text-based classification subtasks: Sentiment Detection, Sarcasm Detection, Vulgarity Detection,
and Abuse Detection. An additional task, Target Community Identification, was initially included
but later excluded from evaluation due to a high prevalence of null values and ambiguities in group
assignments. Consequently, our experiments focused exclusively on the first four classification tasks.
The detailed statistics of each dataset is presented in Table 1.</p>
      <p>The datasets exhibited significant class imbalance across all four subtasks, with minority classes
(e.g., Vulgar or Abusive memes) being severely underrepresented. Exploratory analysis revealed
several task-specific and target-group-specific trends. In the Gujarati dataset, abusive memes primarily
targeted genders and individuals, whereas most vulgar memes were associated with gender. Sarcasm
was frequent, but balanced sarcastic/non-sarcastic labels were observed mainly in memes targeting
individuals. In the Hindi dataset, individual-targeted memes were mostly non-abusive and non-vulgar
but often carried negative or neutral sentiment. Vulgar memes disproportionately targeted individuals
and gender categories, while abusive memes were directed at individuals, gender, and “others” categories.
Sarcasm was especially prevalent in memes targeting individuals.</p>
      <p>Figures 1a and 1b show example memes from the dataset, including the OCR-extracted text and their
corresponding annotations for the Bengali and religion-targeted categories, respectively.
(a) OCR-extracted text: “অনুজীব এবং পটস দর
বানােনা িকছু খাদ্য: ল্যাক েটাব্যা িসলাস,
মৌমািছ, ইস্ট, ভাম-প্যা ি ”. Sentiment: Negative;
Sarcasm: Sarcastic; Vulgarity: Not Vulgar;
Abuse: Abusive; Target: Political.
(b) OCR-extracted text: “যখন অেনক
খোাজাখোুজ কের কাগজ হােত পেয় দখা য তামার
ঠাকু রদার নাম দলুালচন্দর মন্ডল লখা আেছ”.</p>
      <p>Sentiment: Neutral; Sarcasm: Sarcastic;
Vulgarity: Not Vulgar; Abuse: Abusive;
Target: Religion.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Methodology</title>
      <p>We address the problem of robust multimodal meme classification in a multilingual setting. Formally,
given a meme composed of an image  and OCR-extracted text  in one of the target languages (Hindi,
Bengali, Bodo, or Gujarati), the goal is to predict class labels  = { 1,  2,  3,  4} corresponding to
Sentiment Analysis, Sarcasm Detection, Vulgarity Detection, and Abusive Content Classification.
 =̂ arg max  ( ∣  , ),</p>
      <p>∈
where  denotes the set of all possible label combinations across tasks.</p>
      <p>Our solution leverages two stages: preprocessing, fine-tuning XLM-RoBERTa (XLM-R) on all
languages (submitted as Run 1), and/or zero-shot classification using ChatGPT (submitted as Run 2)
exclusively for the Bodo dataset.</p>
      <sec id="sec-4-1">
        <title>4.1. Preprocessing</title>
        <p>In the preprocessing phase, we performed several essential cleaning steps to prepare the dataset.
Hashtags, punctuation marks, URLs, numbers, and user mentions were removed to eliminate noisy
tokens that did not contribute meaningfully to the classification tasks. Emojis were converted into their
corresponding text representations to preserve sentiment and emotion information in textual form.
Extra spaces were eliminated to ensure clean and consistent input formatting.</p>
        <p>An important step involved normalizing code-mixed content written in Latin script into the
appropriate native script (Hindi, Bengali, Bodo, or Gujarati) using the Indic-NLP transliteration tool. This was
critical for handling frequent use of Romanized text in social media posts and enabling the downstream
models to process language-specific tokens efectively.</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. XLM-RoBERTa Fine-Tuning Across All Languages (Run 1)</title>
        <p>
          To efectively model the multilingual and code-mixed text, we employed XLM-RoBERTa (XLM-R) [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ], a
transformer-based model pretrained on massive multilingual corpora. For each of the four classification
tasks, the model predicts a label  ̂ given the OCR-extracted text  as input. The fine-tuning objective
was formulated as a multi-class classification problem, optimized using cross-entropy loss.
        </p>
        <p>We fine-tuned XLM-R separately for each language, utilizing task-specific training datasets. A low
learning rate of 5 × 10−6 was adopted to stabilize training dynamics, accompanied by a cosine learning
rate scheduler. The models were trained for 15–20 epochs with early stopping based on the validation
macro-F1 score, which balances performance across all classes and mitigates bias towards majority
labels.</p>
        <p>To address the severe class imbalance, we weighted the loss function inversely proportional to class
frequencies. Additionally, data oversampling techniques were applied to minority classes, particularly
for the Vulgarity task which follows a tri-class scheme: Negative (-1), Neutral (0), and Positive (1). The
ifnal model used the xlm-roberta-base checkpoint from the Hugging Face library, with the input
consisting solely of the tokenized OCR text. Empirical results demonstrated that XLM-R consistently
outperformed other baselines, capturing the semantic subtleties of code-mixed and noisy text, and was
selected as our oficial Run 1 submission.</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.3. Zero-Shot Classification with ChatGPT (Run 2)</title>
        <p>Recognizing the extremely limited training data available for the Bodo dataset, we explored zero-shot
classification using ChatGPT (GPT-4o-mini) to evaluate its robustness in low-resource settings. Unlike
the fine-tuning approach, no task-specific training was conducted.</p>
        <p>For each example, we constructed detailed prompts incorporating both the OCR-extracted text and
the image modality. The prompt instructed the model to classify the meme strictly based on the given
image and text into Sentiment, Sarcasm, Vulgarity, and Abuse categories. An example of the prompt
template used is as follows:</p>
        <p>Prompt = “Classify the following meme into Sentiment, Sarcasm, Vulgarity, and Abuse categories
based on the image and OCR-extracted text.”</p>
        <sec id="sec-4-3-1">
          <title>Output format:</title>
        </sec>
        <sec id="sec-4-3-2">
          <title>Sentiment: [Negative/Neutral/Positive],</title>
        </sec>
        <sec id="sec-4-3-3">
          <title>Sarcasm: [Sarcastic/Non-Sarcastic],</title>
        </sec>
        <sec id="sec-4-3-4">
          <title>Vulgarity: [Vulgar/Non-Vulgar],</title>
        </sec>
        <sec id="sec-4-3-5">
          <title>Abuse: [Abusive/Non-Abusive].”</title>
          <p>This approach enabled the model to generate fluent, contextually complete classification outputs
without fine-tuning. Prompts were submitted in batches of 50 examples via the OpenAI API. The raw
outputs were parsed to extract structured labels, discarding any extraneous commentary. The inclusion
of the image alongside text proved essential for improving classification accuracy, particularly given
the subtle and ambiguous nature of the content.</p>
          <p>Overall, this zero-shot paradigm served as a complementary methodology to the XLM-R-based
supervised fine-tuning, providing a practical solution for extremely low-resource scenarios.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Results</title>
      <sec id="sec-5-1">
        <title>5.1. Bangla</title>
        <p>This section reports the oficial leaderboard results of the HASOC 2025 meme classification task across
four languages: Bangla, Hindi, Gujarati, and Bodo.
orthography and cultural humor patterns. The below-baseline placement indicates potential limitations
in script handling and adaptation to the semantic richness of Bangla memes.
5.2. Hindi
Table 3 shows the Hindi leaderboard. FiRC-NLP achieved the top score (0.6571), followed by NLPFusion
(0.6240) and KK_NLP_AI_IIIT_Ranchi (0.5960). Many teams clustered between 0.57–0.59, underscoring
the strength of the competition. IReL scored 0.4630, ranking 16th and below the baseline (0.5418). This
gap is larger than in Bangla, reflecting the greater dificulty of handling Hindi memes, which often
involve code-mixing, transliteration, and sarcasm. The results indicate that IReL’s system did not fully
capture the complexity of Hindi’s multilingual meme content.</p>
      </sec>
      <sec id="sec-5-2">
        <title>5.3. Gujarati</title>
        <p>Table 4 highlights the Gujarati results. FiRC-NLP (0.6750) and NLPFusion (0.6344) were the clear
leaders, followed by MUCS, SCaLAR, and KK_NLP_AI_IIIT_Ranchi around 0.61. IReL achieved its
best performance here, scoring 0.5920 and ranking 7th, well above the baseline (0.4929). This result
demonstrates that IReL’s model was able to align well with Gujarati meme characteristics, likely
benefiting from more efective script handling and better adaptation. The placement in the upper half
of the leaderboard shows competitiveness, though a gap remains compared to the leaders.
5.4. Bodo</p>
      </sec>
      <sec id="sec-5-3">
        <title>5.5. Cross-Language Observations</title>
        <p>Across the four languages, IReL’s results exhibit marked variability. The strongest performance was
in Gujarati (7th place, 0.5920), highlighting efective script handling and feature capture. In contrast,
Hindi (0.4630) and Bangla (0.5082) results fell below the baseline, primarily due to transliteration
inconsistencies, cultural humor cues, and label imbalance, which limited discriminative learning. Manual
inspection suggests that many errors originated from sarcasm-sentiment confusion and Romanized
abusive slang that XLM-R tokenization could not interpret. These findings motivate incorporating
Indian-language specific pretraining (e.g., MuRIL, IndicBERT v2) and data augmentation through
back-translation and code-mixing simulation to mitigate these shortcomings. Furthermore, we plan
to conduct granular error analyses using interpretability tools (e.g., SHAP, LIME) to systematically
diagnose class-wise failures.</p>
        <p>Notably, the zero-shot ChatGPT run for Bodo achieved a macro-F1 of 0.5011, exceeding the baseline
(0.3922) and validating large language models’ viability in extremely low-resource settings.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusion and Future Work</title>
      <p>The HASOC 2025 meme shared task underscored the complexity of detecting sentiment, sarcasm,
vulgarity, and abuse across Bangla, Hindi, Gujarati, and Bodo in highly code-mixed and noisy settings.
IReL submitted two complementary runs: the first fine-tuned XLM-RoBERTa separately for each
language, achieving macro-F1 scores of 0.5082 (Bangla), 0.4630 (Hindi), 0.5920 (Gujarati), and 0.5011
(Bodo), thereby demonstrating competitive performance in Gujarati but below-baseline outcomes
in Bangla and Hindi where transliteration and cultural nuance posed challenges. The second run
leveraged zero-shot prompting with ChatGPT for Bodo, surpassing the oficial baseline and illustrating
the potential of large language models for severely under-resourced languages. Overall, these results
highlight the strengths of multilingual transformers for medium and high-resource contexts, the utility
of zero-shot inference in low-resource scenarios, and the persistent dificulty of handling sarcasm,
transliterated slang, and community-specific references. These findings motivate several improvements:
(i) diversifying model architectures beyond XLM-R to include MuRIL, IndicBERT, and distilled variants;
(ii) enriching datasets through back-translation, balanced resampling, and multimodal fusion; (iii)
implementing explainable-AI diagnostics to interpret misclassifications; and (iv) enhancing scalability
through student–teacher distillation and quantized inference for real-world deployment. Collectively,
these directions aim to produce robust, eficient, and interpretable meme-understanding systems that
scale across diverse Indic languages.</p>
    </sec>
    <sec id="sec-7">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this work, the authors used ChatGPT, Grammarly in order to: Grammar and
spelling check, Paraphrase and reword. After using these tools, the authors reviewed and edited the
content as needed and take full responsibility for the publication’s content.
Evaluation, volume 2826 of CEUR-WS.org, CEUR Workshop Proceedings, Hyderabad, India, 2020,
pp. 87–111. doi:10.1145/3441501.3441517.
[14] T. Ranasinghe, K. Ghosh, A. S. Pal, A. Senapati, A. E. Dmonte, M. Zampieri, S. Modha, S. Satapara,
Overview of the hasoc subtracks at fire 2023: Hate speech and ofensive content identification in
assamese, bengali, bodo, gujarati and sinhala, in: Proceedings of the 15th annual meeting of the
forum for information retrieval evaluation, 2023, pp. 13–15.
[15] T. Ranasinghe, K. Ghosh, A. S. Pal, A. Senapati, A. E. Dmonte, M. Zampieri, S. Modha, S. Satapara,
Overview of the hasoc subtracks at fire 2023: Hate speech and ofensive content identification
in assamese, bengali, bodo, gujarati and sinhala, in: Proceedings of the 15th Annual Meeting of
the Forum for Information Retrieval Evaluation, FIRE ’23, Association for Computing Machinery,
New York, NY, USA, 2024, p. 13–15. URL: https://doi.org/10.1145/3632754.3633278. doi:10.1145/
3632754.3633278.
[16] B. R. Chakravarthi, P. K. Kumaresan, R. Sakuntharaj, A. K. Madasamy, S. Thavareesan, B. Premjith,
K. Sreelakshmi, S. C. Navaneethakrishnan, J. P. McCrae, T. Mandl, Overview of the
hasocdravidiancodemix shared task on ofensive language detection in tamil and malayalam, in: Working
Notes of FIRE 2021 – Forum for Information Retrieval Evaluation, volume 3159 of CEUR-WS.org,
CEUR Workshop Proceedings, Hyderabad, India, 2021. Track on ofensive language identification
in code-mixed Dravidian languages.
[17] D. Kiela, D. Hazarika, E. Fini, S. K. Deepak, S. Poria, The hateful memes challenge: Detecting hate
speech in multimodal memes, in: arXiv preprint arXiv:2005.04790, 2020.
[18] S. Suryawanshi, B. R. Chakravarthi, M. Arcan, P. Buitelaar, Trollswithopinion: A taxonomy
and dataset for predicting domain-specific opinion manipulation in troll memes, Multimedia
Tools Appl. 82 (2022) 9137–9171. URL: https://doi.org/10.1007/s11042-022-13796-x. doi:10.1007/
s11042-022-13796-x.
[19] Y. Bang, S. Karthik, R. Arora, Multilingual hate speech detection with limited resources, in:</p>
      <p>Proceedings of the 4th Workshop on NLP for Low-Resource Languages, ACL, 2023.
[20] K. Ghosh, A. Senapati, Hate speech detection in low-resourced indian languages: An analysis of
transformer-based monolingual and multilingual models with cross-lingual experiments, Natural
Language Processing 31 (2025) 393–414. doi:10.1017/nlp.2024.28.
[21] K. Ghosh, D. A. Senapati, Hate speech detection: a comparison of mono and multilingual
transformer model with cross-language evaluation, in: S. Dita, A. Trillanes, R. I. Lucas (Eds.),
Proceedings of the 36th Pacific Asia Conference on Language, Information and
Computation, Association for Computational Linguistics, Manila, Philippines, 2022, pp. 853–865. URL:
https://aclanthology.org/2022.paclic-1.94/.
[22] S. Modha, T. Mandl, G. K. Shahi, H. Madhu, S. Satapara, T. Ranasinghe, M. Zampieri, Overview
of the hasoc subtrack at fire 2021: Hate speech and ofensive content identification in english
and indo-aryan languages and conversational hate speech, in: Proceedings of the 13th Annual
Meeting of the Forum for Information Retrieval Evaluation (FIRE ’21), ACM, India, 2022. doi:10.
1145/3503162.3503176.
[23] K. Ghosh, S. Saha, T. Mandl, S. Modha, Findings from shared tasks on hate speech detection:
Performance patterns for low-resource languages, Pattern Recognition Letters (2025). URL:
https://www.sciencedirect.com/science/article/pii/S0167865525003150. doi:https://doi.org/10.
1016/j.patrec.2025.09.004.
[24] S. Chanda, A. Mishra, S. Pal, Advancing language identification in code-mixed tulu texts:
Harnessing deep learning techniques, in: K. Ghosh, T. Mandl, P. Majumder, M. Mitra (Eds.), Working
Notes of FIRE 2023 – Forum for Information Retrieval Evaluation (FIRE-WN 2023), volume 3681 of
CEUR Workshop Proceedings, CEUR-WS.org, Goa, India, 2023. CC BY 4.0.
[25] S. Chanda, S. Ujjwal, S. Das, S. Pal, Fine-tuning pre-trained transformer based model for hate
speech and ofensive content identification in english indo-aryan and code-mixed (english-hindi)
languages, in: Forum for Information Retrieval Evaluation (FIRE 2021) Working Notes, Forum for
Information Retrieval Evaluation, 2021. Shared Task HASOC 2021.
[26] S. Chanda, S. D. Sheth, S. Pal, Coarse and fine-grained conversational hate speech and ofensive
content identification in code-mixed languages using fine-tuned multilingual embedding, in:
Forum for Information Retrieval Evaluation (FIRE 2022) Working Notes, Forum for Information
Retrieval Evaluation, 2022. Shared Task HASOC 2022.
[27] A. Saroj, S. Chanda, S. Pal, Irlab iitv at semeval-2020 task 12: Multilingual ofensive language
identification in social media using svm, in: A. Hébelot, X. Zhu, A. Palmer, N. Schneider, J. May,
E. Shutova (Eds.), Proceedings of the Fourteenth Workshop on Semantic Evaluation (SemEval 2020),
International Committee for Computational Linguistics, Barcelona (online), 2020, pp. 2012–2016.
doi:10.18653/v1/2020.semeval- 1.265.
[28] K. Ghosh, N. K. Singh, J. Mahapatra, et al., SafeSpeech: a three-module pipeline for hate intensity
mitigation of social media texts in Indic languages, Social Network Analysis and Mining 14 (2024).</p>
      <p>URL: https://doi.org/10.1007/s13278-024-01393-9. doi:10.1007/s13278- 024- 01393- 9.
[29] M. Das, A. Mukherjee, BanglaAbuseMeme: A Dataset for Bengali Abusive Meme Classification,
in: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing,
2023, pp. 15498–15512.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>K.</given-names>
            <surname>Ghosh</surname>
          </string-name>
          ,
          <string-name>
            <surname>M. Das</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Narzary</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Saha</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Barman</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Mukherjee</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Modha</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Ganguly</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          <string-name>
            <surname>Garain</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Jaki</surname>
          </string-name>
          , T. Mandl,
          <article-title>Overview of the HASOC Track at FIRE 2025: Abusive Meme Identification - Shadows Behind the Laughter</article-title>
          , in: K. Ghosh,
          <string-name>
            <given-names>T.</given-names>
            <surname>Mandl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Pal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Majumdar</surname>
          </string-name>
          ,
          <string-name>
            <surname>A</surname>
          </string-name>
          . Chakraborty (Eds.),
          <source>Forum for Information Retrieval Evaluation (Working Notes) (FIRE 2025) December</source>
          <volume>17</volume>
          -20, Varanasi , India, CEUR-WS.org,
          <year>2025</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>K.</given-names>
            <surname>Ghosh</surname>
          </string-name>
          ,
          <string-name>
            <surname>M. Das</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Patel</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Bhandary</surname>
          </string-name>
          ,
          <string-name>
            <surname>A. Das</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Mukherjee</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Modha</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Ganguly</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          <string-name>
            <surname>Garain</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Jaki</surname>
          </string-name>
          , T. Mandl,
          <article-title>Overview of the HASOC Track at FIRE 2025: Abusive Meme Identification - Shadows Behind the Laughter</article-title>
          ,
          <source>in: FIRE '25: Proceedings of the 17th Annual Meeting of the Forum for Information Retrieval Evaluation. December</source>
          <volume>17</volume>
          -20, Varanasi , India, Association for Computing Machinery (ACM), New York, NY, USA,
          <year>2025</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>D.</given-names>
            <surname>Kakwani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kunchukuttan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Golla</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. C.</given-names>
            <surname>Gokul</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bhattacharyya</surname>
          </string-name>
          ,
          <string-name>
            <surname>M. M. Khapra</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Kumar</surname>
          </string-name>
          , Indicnlpsuite:
          <article-title>Monolingual corpora, evaluation benchmarks and pre-trained multilingual language models for indian languages, in: Findings of the Association for Computational Linguistics: EMNLP 2020, Association for Computational Linguistics</article-title>
          , Online,
          <year>2020</year>
          , pp.
          <fpage>4948</fpage>
          -
          <lpage>4961</lpage>
          . doi:
          <volume>10</volume>
          . 18653/v1/
          <year>2020</year>
          .findings- emnlp.445.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>S.</given-names>
            <surname>Khanuja</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Bansal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Mehtani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Khosla</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Dey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Gopalan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. K.</given-names>
            <surname>Margam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Aggarwal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. T.</given-names>
            <surname>Nagipogu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Dave</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. C. B.</given-names>
            <surname>Gali</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Subramanian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Talukdar</surname>
          </string-name>
          , Muril:
          <article-title>Multilingual representations for indian languages</article-title>
          ,
          <year>2021</year>
          . arXiv:
          <volume>2103</volume>
          .10730, submitted 19
          <source>Mar 2021; revised 2 Apr</source>
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>B.</given-names>
            <surname>Pang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <article-title>Opinion mining and sentiment analysis</article-title>
          ,
          <source>Foundations and Trends® in Information Retrieval</source>
          <volume>2</volume>
          (
          <year>2008</year>
          )
          <fpage>1</fpage>
          -
          <lpage>135</lpage>
          . doi:
          <volume>10</volume>
          .1561/1500000011.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>M.</given-names>
            <surname>Taboada</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Brooke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Tofiloski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Voll</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Stede</surname>
          </string-name>
          ,
          <article-title>Lexicon-based methods for sentiment analysis</article-title>
          ,
          <source>Computational Linguistics</source>
          <volume>37</volume>
          (
          <year>2011</year>
          )
          <fpage>267</fpage>
          -
          <lpage>307</lpage>
          . URL: https://aclanthology.org/J11-2001/. doi:
          <volume>10</volume>
          .1162/COLI_a_
          <fpage>00049</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>R.</given-names>
            <surname>Socher</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Perelygin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Chuang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. D.</given-names>
            <surname>Manning</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. Y.</given-names>
            <surname>Ng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Potts</surname>
          </string-name>
          ,
          <article-title>Recursive deep models for semantic compositionality over a sentiment treebank</article-title>
          ,
          <source>in: Proceedings of the Conference on Empirical Methods in Natural Language Processing</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Kim</surname>
          </string-name>
          ,
          <article-title>Convolutional neural networks for sentence classification</article-title>
          ,
          <source>in: Proceedings of the Conference on Empirical Methods in Natural Language Processing</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Dyer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Smola</surname>
          </string-name>
          , E. Hovy,
          <article-title>Hierarchical attention networks for document classification</article-title>
          ,
          <source>in: Proceedings of NAACL-HLT</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>J.</given-names>
            <surname>Devlin</surname>
          </string-name>
          , M.-
          <string-name>
            <given-names>W.</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Toutanova</surname>
          </string-name>
          , Bert:
          <article-title>Pre-training of deep bidirectional transformers for language understanding</article-title>
          ,
          <source>in: Proceedings of NAACL-HLT</source>
          ,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>A.</given-names>
            <surname>Conneau</surname>
          </string-name>
          , G. Lample,
          <article-title>Unsupervised cross-lingual representation learning at scale</article-title>
          ,
          <source>in: Proceedings of ACL</source>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>S.</given-names>
            <surname>Dowlagar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Mamidi</surname>
          </string-name>
          ,
          <article-title>Multilingual pre-trained transformers and convolutional nn classification models for technical domain identification</article-title>
          ,
          <source>in: Preprint</source>
          ,
          <year>2021</year>
          . ArXiv:
          <volume>2101</volume>
          .
          <fpage>00000</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>T.</given-names>
            <surname>Mandl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Modha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. K.</given-names>
            <surname>Shahi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. K.</given-names>
            <surname>Jaiswal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Nandini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Patel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Majumder</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Schäfer</surname>
          </string-name>
          ,
          <article-title>Overview of the hasoc track at fire 2020: Hate speech and ofensive content identification in indo-european languages</article-title>
          ,
          <source>in: Working Notes of FIRE 2020 - Forum for Information Retrieval</source>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>