<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Memotion 3: Dataset on Sentiment and Emotion Analysis of codemixed Hindi-English Memes</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Shreyash Mishra</string-name>
          <xref ref-type="aff" rid="aff7">7</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>S Suryavardan</string-name>
          <xref ref-type="aff" rid="aff7">7</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Parth Patwa</string-name>
          <xref ref-type="aff" rid="aff7">7</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Megha Chakraborty</string-name>
          <xref ref-type="aff" rid="aff6">6</xref>
          <xref ref-type="aff" rid="aff7">7</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Anku Rani</string-name>
          <xref ref-type="aff" rid="aff6">6</xref>
          <xref ref-type="aff" rid="aff7">7</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Aishwarya Reganti</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff7">7</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Aman Chadha</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff5">5</xref>
          <xref ref-type="aff" rid="aff7">7</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Amitava Das</string-name>
          <email>amitava@mailbox.sc.edu</email>
          <xref ref-type="aff" rid="aff6">6</xref>
          <xref ref-type="aff" rid="aff7">7</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Amit Sheth</string-name>
          <xref ref-type="aff" rid="aff6">6</xref>
          <xref ref-type="aff" rid="aff7">7</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Manoj Chinnakotla</string-name>
          <xref ref-type="aff" rid="aff4">4</xref>
          <xref ref-type="aff" rid="aff7">7</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Asif Ekbal</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
          <xref ref-type="aff" rid="aff7">7</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Srijan Kumar</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff7">7</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>IIIT Sri City</string-name>
          <xref ref-type="aff" rid="aff7">7</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>India</string-name>
          <xref ref-type="aff" rid="aff7">7</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Amazon AI</institution>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Carnegie Mellon University</institution>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Georgia Tech</institution>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>IIT Patna</institution>
          ,
          <country country="IN">India</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>Microsoft</institution>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff5">
          <label>5</label>
          <institution>Stanford University</institution>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff6">
          <label>6</label>
          <institution>University of South Carolina</institution>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff7">
          <label>7</label>
          <institution>Washington</institution>
          ,
          <addr-line>DC</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Memes are the new-age conveyance mechanism for humor on social media sites. Memes often include an image and some text. Memes can be used to promote disinformation or hatred, thus it is crucial to investigate in details. We introduce Memotion 3, a new dataset with 10,000 annotated memes. Unlike other prevalent datasets in the domain, including prior iterations of Memotion, Memotion 3 introduces Hindi-English Codemixed memes while prior works in the area were limited to only the English memes.</p>
      </abstract>
      <kwd-group>
        <kwd>Memes</kwd>
        <kwd>Hindi-English</kwd>
        <kwd>Multimodality</kwd>
        <kwd>Dataset</kwd>
        <kwd>Machine Learning</kwd>
        <kwd>Entailment</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>With the rise of social media platforms as a conduit for users to communicate their thoughts and
interact with one another, the amount of hate online has also parallely proliferated. The power
of free uncensored speech however can cause considerable angst in the online community by
demeaning other people. A popular form of producing such harmful content is the creation</p>
      <p>
        †Work does not relate to position at Amazon.
(A. Das)
© 2022 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
of memes. Memes generally consist of popular images and texts associated with them that
intend to spark humor among the readers. A popular definition of memes, now widely used in
the field, describes them as “a group of texts with shared characteristics, with a shared core of
content, form, and stance”. Broadly, “content” refers to ideas and ideologies, “form” refers to our
sensory experiences such as audio or visual, and “stance” refers to the tone or style, structures
for participation, and communicative functions of the meme [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The artistic use of images and
text makes the content relatable and viral. Although initially used for comic purposes only,
memes have quickly evolved as a mechanism used to taunt and demean certain sections of the
society. They are also used to spread misinformation and fake news. Memes are a language in
themselves, with a capacity to transcend cultures and construct collective identities between
people. These shareable visual jokes can also be powerful tools for self-expression, connection,
social influence and even political subversion [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>Social media platforms have many initiatives to moderate this kind of content, but memes have
managed to hold their relevance despite these eforts. Detecting hate-speech and aggression
on social media platforms is a popular research field, both in academia and industry, however,
memes are continuously evolving and outpace contemporary hate-classification systems because
(i) they can be multi-modal in nature, (ii) they might not use explicit hate content/words but
more subtler forms of aggression like satire or sarcasm, and (iii) they can contain code-mixed
content (languages like Hindi, Telugu, etc. written in Latin script) which is harder to parse and
detect. Code-mixed content is especially prevalent in multilingual societies.</p>
      <p>The previous iterations of Memotion each curated 10k multi-modal memes from various
social media websites like Reddit, Facebook, Imgur, and Instagram, and proposed emotion and
sentiment classification tasks on these datasets. In the current iteration- Memotion 3, we add an
additional layer of complexity by introducing memes that are Hindi-English code-mixed. This
addition ensures that models will see data that is more current and prevalent on social media
and hence improve robustness. The rest of the paper is organised as follows: we describe the
related work and the task in Section 2 and Section 3, respectively. Section 4 contains the details
of the dataset we collected for memotion analysis: Memotion 3; followed by a brief description
of baseline models 5 and their results in Section 6. We conclude with the mention of future
work and limitations, in Section 7.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>Analysis of data to extract the sentiment and emotion has gained a lot of traction in recent years.
This has been majorly focused on the large amount of data generated every second, thanks to
social media. Most research in this area are focused on textual modality with some inclusion of
multi-lingual data aimed at determining the polarity of the given data.</p>
      <p>
        Many of the existing sentiment analysis datasets are of textual in nature and are in English
[
        <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
        ] with negative/positive or neutral categories. This also includes the works on hate speech
detection in English from platforms such as Twitter [
        <xref ref-type="bibr" rid="ref5 ref6 ref7">5, 6, 7</xref>
        ] that classifies tweets based on
the detected racism, sexism etc. Further works in this area shed light on multilingual or
code-mixed data with inclusion of languages, such as Hindi [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], Hindi-English [
        <xref ref-type="bibr" rid="ref10 ref11 ref9">9, 10, 11</xref>
        ],
Spanish-English [
        <xref ref-type="bibr" rid="ref9">9, 12</xref>
        ], Malayalam-English [13] and more [14, 15, 16, 17]. Recent approaches
to solve this problem mostly involve the use of deep learning [18, 19, 20] and large language
models [21, 22, 23, 24].
      </p>
      <p>However, social media is a multi-modal platform, as a result a combination of textual and
visual data is vital to capture the context and analyse the data. Text-image pairs can be used
for image captioning, sentiment analysis, hate speech detection and mitigate cyberbullying as
shown by existing research [25, 26, 27, 28, 29]. Moreover, research has also been done towards
sentiment and emotion analysis of video based data [30, 31].</p>
      <p>One of the most commonly occurring formats of multi-modal data in social media is a meme.
Although there has only been limited work, specifically towards memes, research in this area
has grown in recent times. MultiOFF [32] is binary classification dataset that aims to detect
whether memes are ofensive or not. The Hateful memes dataset from Facebook [ 33] provides
memes collected from USA based social media groups along with some manually reconstructed
memes annotated for both uni-modal and multi-modal hate speech. The previous iterations of
Memotion i.e. Memotion 1 [34] and Memotion 2[35, 36] drew attention to analysis of English
memes that covered several categories, such as hatefulness, motivation, humour, sarcasm and
overall sentiment. TamilMemes [37] is a also meme classification dataset that categorizes memes
as being trolls or not, however this is one of the few datasets not in English. Some approaches
toward this include [38, 39, 40, 41].</p>
      <p>With Memotion 3, we present the first code-mixed Hinglish (Hindi-English) meme analysis
dataset with 10k memes annotated for the aforementioned categories in the previous iterations
of Memotion.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Memotion 3 task</title>
      <p>A dataset of 10,000 annotated Hindi-english memes is made available. Each data point has a
label for each sub-task as well as an accompanying image and text. Similar to Memotion 1 [34]
and Memotion 2 [35], we consider sentiment, emotions, and their intensities. Unlike previous
works, however, this iteration of the Memotion challenge focuses on Hinglish language memes.
Our subtasks are as follows:
• Task A: Sentiment Analysis - Classify a meme as positive, negative, or neutral. Figure
1 explains the potential negative connotations of a specific meme.
• Task B: Emotion Classification - Classify a meme into humorous, sarcastic, ofensive,
or inspirational. More than one category can apply to a meme. Tasks B can be clearly
understood by looking at the meme in Figure 2.
• Task C: Scales/Intensity of Emotion Classes - Calculate the degree to which a given
emotion is being conveyed is the third task. The intensity of each emotion is shown in
Figure 2.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Data</title>
      <p>In this section we describe the data collection, annotation and data analysis.
4.1. Data Collection
We downloaded the memes after on topics of interest, such as politics, sports etc. We also
collected memes using a Selenium-based web crawler. All memes are gathered from public
websites Reddit and Google images. We cleaned the data to remove redundancies and performed
random manual quality check. The memes are release along with the source URLs and OCR
text. For OCR, we utilised the Google Vision API1.</p>
      <p>(a)
(b)
(c)
4.2. Data Annotation
we recruited Undergraduate student proficient in English, Hindi and meme knowledge. For
annotation, they use an interface built by us, as shown in Figure 3. The annotators were asked
to assess whether the meme’s creator intended it to be positive, negative, or neutral in Task A.</p>
      <p>fuvenrnyy_funnonty_futwnnhisyitlaerdi_omuseaninggneonte_sraalrvcearsyt_ictnwoits_toefdfensivveersyhl_iagothfeftefunlns_oiovtfe_femnostiviveamtiootniavaltionanleutraplositinveevgearyti_vvpeeorsyit_inveegative
emotion (0-4 levels). Fig. 6 shows the distribution of memes across all the labels. Fig. 4 displays
the word occurrence in the dataset. From the wordcloud we can see that lot of code mixed
words like nahi, kya are prominent in the dataset.</p>
      <p>from the statistical features in Fig. 5, we can conclude that the emotions in memes overlap,
demonstrating the dificulty of the tasks. A number of intriguing facts are revealed, including the
fact that many ofending memes are humorous. Additionally, a lot of the memes are humorous
and lack inspiration, as can be seen. On average, the Code Mixed Index (CMI) [42] for the
training, validation and test set is 14.94, 20.19 and 20.06 respectively.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Baseline model</title>
      <p>The importance of considering both the visual and textual features is vital for multi-modal data,
especially in the case of memes where the context can only be captured using a combination of
both components. Attention models are exceptional at representing text with respect to context
and a widely known model with strong performance is BERT [43]. As the dataset is not in English
but instead in Hindi-English, we use a multilingual variant of BERT, specifically Hinglish-BERT
from Verloop [44], with both the backbone and linear layers of the LM finetuned. The model is
implemented using BERT-base-multilingual-cased, which is fine-tuned on Hinglish data. The
visual features are obtained from the pre-trained Vision transformer model (ViT) [45]. The ViT
model can outperform normal CNNs computationally and by accuracy, thanks to the positional
embedding of image patches done by ViT. The pooled output from the ViT model is concatenated
with the Hinglish-BERT embedding. The combined features are then classified after being passed
through a MLP. With changes to the MLP, the multi-modal features are used for all three
subtasks. The model architecture is displayed in Figure 7. The results for each task are provided in
Table 1. The codes will be made available at https://github.com/Shreyashm16/Memotion-3.0.</p>
    </sec>
    <sec id="sec-6">
      <title>6. Results</title>
      <p>Baseline results in Table 1 show Weighted F1 scores for each task and sub-task. Using ViT for
extracting visual features and Hinglish-BERT for the textual features, the baseline models scores
33.28% for Task A, 74.74% for Task B and 52.27% for Task C.</p>
      <p>This dataset will be made public, and we leave it to future research to develop more
sophisticated systems that go deeper into Memotion Analysis.</p>
      <sec id="sec-6-1">
        <title>Task</title>
      </sec>
      <sec id="sec-6-2">
        <title>Task-A</title>
      </sec>
      <sec id="sec-6-3">
        <title>Task-B</title>
      </sec>
      <sec id="sec-6-4">
        <title>Task-C</title>
      </sec>
      <sec id="sec-6-5">
        <title>Class</title>
      </sec>
      <sec id="sec-6-6">
        <title>Sentiment</title>
      </sec>
      <sec id="sec-6-7">
        <title>Humour</title>
      </sec>
      <sec id="sec-6-8">
        <title>Sarcasm</title>
      </sec>
      <sec id="sec-6-9">
        <title>Ofensive</title>
      </sec>
      <sec id="sec-6-10">
        <title>Motivation</title>
      </sec>
      <sec id="sec-6-11">
        <title>Average</title>
      </sec>
      <sec id="sec-6-12">
        <title>Humour</title>
      </sec>
      <sec id="sec-6-13">
        <title>Sarcasm</title>
      </sec>
      <sec id="sec-6-14">
        <title>Ofensive</title>
      </sec>
      <sec id="sec-6-15">
        <title>Motivation</title>
      </sec>
      <sec id="sec-6-16">
        <title>Average</title>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>7. Conclusion and Future Work</title>
      <p>In this study, we present a hindi-english dataset for the challenge of sentiment in a multimodal
environment. This is the first significant multimodal dataset for Hindi code-mixed meme
categorization that we are aware of. We provide annotated data for three tasks, namely sentiment
analysis, emotion classification, and strength of emotion, in order to provide a fine-grained and
thorough analysis of memes. By combining the image features extracted from the ViT model
and the textual features using Hinglish-BERT, and then passing these joint embeddings to a
simple MLP, we design the baseline for the tasks. It should be mentioned that our models are
preliminary and that more creative approaches will enhance performance much more. In the
future, we intend to extend our work by designing a single model for all languages, instead of
creating separate models for memes of diferent languages. We could also work on generating
memes for the task, instead of collection to customize the dataset.
[12] V. Basile, C. Bosco, E. Fersini, D. Nozza, V. Patti, F. M. Rangel Pardo, P. Rosso, M. Sanguinetti,
SemEval-2019 task 5: Multilingual detection of hate speech against immigrants and women
in Twitter, in: Proceedings of the 13th International Workshop on Semantic Evaluation,
Association for Computational Linguistics, Minneapolis, Minnesota, USA, 2019, pp. 54–63.</p>
      <p>URL: https://aclanthology.org/S19-2007. doi:10.18653/v1/S19- 2007.
[13] B. R. Chakravarthi, N. Jose, S. Suryawanshi, E. Sherly, J. P. McCrae, A sentiment analysis
dataset for code-mixed malayalam-english, CoRR abs/2006.00210 (2020). URL: https:
//arxiv.org/abs/2006.00210. arXiv:2006.00210.
[14] S. Bhattacharya, S. Singh, R. Kumar, A. Bansal, A. Bhagat, Y. Dawer, B. Lahiri, A. K.</p>
      <p>Ojha, Developing a multilingual annotated corpus of misogyny and aggression, in:
Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying, European
Language Resources Association (ELRA), Marseille, France, 2020, pp. 158–168. URL: https:
//www.aclweb.org/anthology/2020.trac2-1.25.
[15] B. R. Chakravarthi, V. Muralidaran, R. Priyadharshini, J. P. McCrae, Corpus creation for
sentiment analysis in code-mixed tamil-english text, CoRR abs/2006.00206 (2020). URL:
https://arxiv.org/abs/2006.00206. arXiv:2006.00206.
[16] T. Mandl, S. Modha, A. Kumar M, B. R. Chakravarthi, Overview of the hasoc track at fire
2020: Hate speech and ofensive language identification in tamil, malayalam, hindi, english
and german, in: Proceedings of the 12th Annual Meeting of the Forum for Information
Retrieval Evaluation, FIRE ’20, Association for Computing Machinery, New York, NY,
USA, 2021, p. 29–32. URL: https://doi.org/10.1145/3441501.3441517. doi:10.1145/3441501.
3441517.
[17] B. R. Chakravarthi, R. Priyadharshini, V. Muralidaran, S. Suryawanshi, N. Jose, E. Sherly, J. P.</p>
      <p>McCrae, Overview of the track on sentiment analysis for dravidian languages in code-mixed
text, in: Proceedings of the 12th Annual Meeting of the Forum for Information Retrieval
Evaluation, FIRE ’20, Association for Computing Machinery, New York, NY, USA, 2021, p.
21–24. URL: https://doi.org/10.1145/3441501.3441515. doi:10.1145/3441501.3441515.
[18] P. Patwa, S. Pykl, A. Das, P. Mukherjee, V. Pulabaigari, Hater-o-genius aggression
classification using capsule networks, in: Proceedings of the 17th International Conference on
Natural Language Processing (ICON), 2020, pp. 149–154.
[19] S. T. Aroyehun, A. Gelbukh, Aggression detection in social media: Using deep neural
networks, data augmentation, and pseudo labeling, in: Proceedings of the First Workshop
on Trolling, Aggression and Cyberbullying (TRAC-2018), 2018, pp. 90–97.
[20] A. Ribeiro, N. Silva, INF-HatEval at SemEval-2019 task 5: Convolutional neural networks
for hate speech detection against women and immigrants on Twitter, in: Proceedings of
the 13th International Workshop on Semantic Evaluation, Association for Computational
Linguistics, Minneapolis, Minnesota, USA, 2019, pp. 420–425. URL: https://aclanthology.
org/S19-2074. doi:10.18653/v1/S19- 2074.
[21] D. Tula, P. Potluri, S. Ms, S. Doddapaneni, P. Sahu, R. Sukumaran, P. Patwa,
Bitions@DravidianLangTech-EACL2021: Ensemble of multilingual language models with
pseudo labeling for ofence detection in Dravidian languages, in: Proceedings of the First
Workshop on Speech and Language Technologies for Dravidian Languages, Association
for Computational Linguistics, Kyiv, 2021, pp. 291–299. URL: https://aclanthology.org/2021.
dravidianlangtech-1.42.
[22] M. Mozafari, R. Farahbakhsh, N. Crespi, A bert-based transfer learning approach for hate
speech detection in online social media, in: Complex Networks and Their Applications
VIII: Volume 1 Proceedings of the Eighth International Conference on Complex Networks
and Their Applications COMPLEX NETWORKS 2019 8, Springer, 2020, pp. 928–940.
[23] D. Tula, M. Shreyas, V. Reddy, P. Sahu, S. Doddapaneni, P. Potluri, R. Sukumaran, P. Patwa,
Ofence detection in dravidian languages using code-mixing index-based focal loss, SN
Computer Science 3 (2022) 330.
[24] R. Ali, U. Farooq, U. Arshad, W. Shahzad, M. O. Beg, Hate speech detection on twitter
using transfer learning, Computer Speech &amp; Language 74 (2022) 101365.
[25] R. Jha, V. Kaki, V. Kolla, S. Bhagat, P. Patwa, A. Das, S. Pal, Image2tweet: Datasets
in Hindi and English for generating tweets from images, in: Proceedings of the 18th
International Conference on Natural Language Processing (ICON), NLP Association of
India (NLPAI), National Institute of Technology Silchar, Silchar, India, 2021, pp. 670–676.</p>
      <p>URL: https://aclanthology.org/2021.icon-main.84.
[26] A. Hu, S. Flaxman, Multimodal sentiment analysis to explore the structure of
emotions, Proceedings of the 24th ACM SIGKDD International Conference on
Knowledge Discovery |&amp; Data Mining (2018). URL: http://dx.doi.org/10.1145/3219819.3219853.
doi:10.1145/3219819.3219853.
[27] R. Gomez, J. Gibert, L. Gomez, D. Karatzas, Exploring hate speech detection in multimodal
publications, 2019. arXiv:1910.03814.
[28] H. Zhong, H. Li, A. Squicciarini, S. Rajtmajer, C. Grifin, D. Miller, C. Caragea,
Contentdriven detection of cyberbullying on the instagram social network, 2016.
[29] H. Hosseinmardi, S. A. Mattson, R. I. Rafiq, R. Han, Q. Lv, S. Mishra, Detection of
cyberbullying incidents on the instagram social network, 2015. arXiv:1503.03909.
[30] L.-P. Morency, R. Mihalcea, P. Doshi, Towards Multimodal Sentiment Analysis:
Harvesting Opinions from The Web, in: International Conference on Multimodal Interfaces
(ICMI 2011), Alicante, Spain, 2011. URL: http://ict.usc.edu/pubs/Towards%20Multimodal%
20Sentiment%20Analysis-%20Harvesting%20Opinions%20from%20The%20Web.pdf.
[31] A. Bagher Zadeh, P. P. Liang, S. Poria, E. Cambria, L.-P. Morency, Multimodal language
analysis in the wild: CMU-MOSEI dataset and interpretable dynamic fusion graph, in:
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics
(Volume 1: Long Papers), Association for Computational Linguistics, Melbourne, Australia,
2018, pp. 2236–2246. URL: https://aclanthology.org/P18-1208. doi:10.18653/v1/P18- 1208.
[32] S. Suryawanshi, B. R. Chakravarthi, M. Arcan, P. Buitelaar, Multimodal meme dataset
(MultiOFF) for identifying ofensive content in image and text, in: Proceedings of the
Second Workshop on Trolling, Aggression and Cyberbullying, European Language Resources
Association (ELRA), Marseille, France, 2020, pp. 32–41. URL: https://aclanthology.org/2020.
trac-1.6.
[33] D. Kiela, H. Firooz, A. Mohan, V. Goswami, A. Singh, P. Ringshia, D. Testuggine, The hateful
memes challenge: Detecting hate speech in multimodal memes, 2021. arXiv:2005.04790.
[34] C. Sharma, D. Bhageria, W. Paka, Scott, S. P Y K L, A. Das, T. Chakraborty, V. Pulabaigari,
B. Gambäck, SemEval-2020 Task 8: Memotion Analysis-The Visuo-Lingual Metaphor!, in:
Proceedings of the 14th International Workshop on Semantic Evaluation (SemEval-2020),
Association for Computational Linguistics, Barcelona, Spain, 2020.
[35] S. Ramamoorthy, N. Gunti, S. Mishra, S. Suryavardan, A. Reganti, P. Patwa, A. Das,
T. Chakraborty, A. Sheth, A. Ekbal, C. Ahuja, Memotion 2: Dataset on sentiment and
emotion analysis of memes, in: Proceedings of Defactify: First workshop on multimodal
Fact CHecking and Hatespeech detection, 2022.
[36] P. Patwa, S. Ramamoorthy, N. Gunti, S. Mishra, S. Suryavardan, A. Reganti, A. Das,
T. Chakraborty, A. Sheth, A. Ekbal, C. Ahuja, Findings of memotion 2: Sentiment and
emotion analysis of memes, in: Proceedings of Defactify: First workshop on multimodal
Fact CHecking and Hatespeech detection, 2022.
[37] S. Suryawanshi, B. R. Chakravarthi, P. Verma, M. Arcan, J. P. McCrae, P. Buitelaar, A dataset
for troll classification of Tamil memes, in: Proceedings of the 5th Workshop on Indian
Language Data Resource and Evaluation (WILDRE-5), European Language Resources
Association (ELRA), Marseille, France, 2020.
[38] S. Pramanick, S. Sharma, D. Dimitrov, M. S. Akhtar, P. Nakov, T. Chakraborty, Momenta:
A multimodal framework for detecting harmful memes and their targets, arXiv preprint
arXiv:2109.05184 (2021).
[39] A.-M. Bucur, A. Cosma, I.-B. Iordache, Blue at memotion 2.0 2022: You have my image,
my text and my transformer, arXiv preprint arXiv:2202.07543 (2022).
[40] N. Gunti, S. Ramamoorthy, P. Patwa, A. Das, Memotion analysis through the lens of
joint embedding (student abstract), in: Proceedings of the AAAI Conference on Artificial
Intelligence, 2022.
[41] G. G. Lee, M. Shen, Amazon pars at memotion 2.0 2022: Multi-modal multi-task learning
for memotion 2.0 challenge, Proceedings http://ceur-ws. org ISSN 1613 (2020) 0073.
[42] B. Gambäck, A. Das, On measuring the complexity of code-mixing, in: Proceedings of the
11th international conference on natural language processing, Goa, India, 2014, pp. 1–7.
[43] J. Devlin, M.-W. Chang, K. Lee, K. Toutanova, Bert: Pre-training of deep bidirectional
transformers for language understanding, arXiv preprint arXiv:1810.04805 (2018).
[44] M. Bhange, N. Kasliwal, Hinglishnlp: Fine-tuned language models for hinglish sentiment
detection, arXiv preprint arXiv:2008.09820 (2020).
[45] A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner, M.
Dehghani, M. Minderer, G. Heigold, S. Gelly, et al., An image is worth 16x16 words:
Transformers for image recognition at scale, arXiv preprint arXiv:2010.11929 (2020).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>L.</given-names>
            <surname>Shifman</surname>
          </string-name>
          ,
          <article-title>Memes in a Digital World: Reconciling with a Conceptual Troublemaker</article-title>
          ,
          <source>Journal of Computer-Mediated Communication</source>
          <volume>18</volume>
          (
          <year>2013</year>
          )
          <fpage>362</fpage>
          -
          <lpage>377</lpage>
          . URL: https://doi.org/10.1111/jcc4.12013. doi:
          <volume>10</volume>
          . 1111/jcc4.12013. arXiv:https://academic.oup.com/jcmc/articlepdf/18/3/362/19492245/jjcmcom0362.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>N.</given-names>
            <surname>Akhther</surname>
          </string-name>
          ,
          <article-title>Internet memes as form of cultural discourse: A rhetorical analysis on facebook</article-title>
          ,
          <year>2018</year>
          . doi:
          <volume>10</volume>
          .31234/osf.io/sx6t7.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A. L.</given-names>
            <surname>Maas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. E.</given-names>
            <surname>Daly</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. T.</given-names>
            <surname>Pham</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. Y.</given-names>
            <surname>Ng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Potts</surname>
          </string-name>
          ,
          <article-title>Learning word vectors for sentiment analysis, in: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Association for Computational Linguistics</article-title>
          , Portland, Oregon, USA,
          <year>2011</year>
          , pp.
          <fpage>142</fpage>
          -
          <lpage>150</lpage>
          . URL: http://www.aclweb.org/anthology/P11-1015.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>A.</given-names>
            <surname>Go</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Bhayani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <article-title>Twitter sentiment classification using distant supervision</article-title>
          ,
          <source>Processing</source>
          <volume>150</volume>
          (
          <year>2009</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Waseem</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Hovy</surname>
          </string-name>
          ,
          <article-title>Hateful symbols or hateful people? predictive features for hate speech detection on twitter</article-title>
          ,
          <year>2016</year>
          , pp.
          <fpage>88</fpage>
          -
          <lpage>93</lpage>
          . doi:
          <volume>10</volume>
          .18653/v1/
          <fpage>N16</fpage>
          -2013.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Waseem</surname>
          </string-name>
          ,
          <article-title>Are you a racist or am i seeing things? annotator influence on hate speech detection on twitter</article-title>
          ,
          <source>in: NLP+CSS@EMNLP</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>P.</given-names>
            <surname>Burnap</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Williams</surname>
          </string-name>
          ,
          <article-title>Us and them: identifying cyber hate on twitter across multiple protected characteristics</article-title>
          ,
          <source>EPJ Data Science</source>
          <volume>5</volume>
          (
          <year>2016</year>
          ). doi:
          <volume>10</volume>
          .1140/epjds/ s13688-016-0072-6.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>P.</given-names>
            <surname>Patwa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Bhardwaj</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Guptha</surname>
          </string-name>
          , G. Kumari,
          <string-name>
            <given-names>S.</given-names>
            <surname>Sharma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. PYKL</given-names>
            ,
            <surname>A. Das</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ekbal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Akhtar</surname>
          </string-name>
          , T. Chakraborty,
          <article-title>Overview of constraint 2021 shared tasks: Detecting english covid-19 fake news and hindi hostile posts</article-title>
          ,
          <source>in: Proceedings of the First Workshop on Combating Online Hostile Posts in Regional Languages during Emergency Situation (CONSTRAINT)</source>
          , Springer,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>P.</given-names>
            <surname>Patwa</surname>
          </string-name>
          , G. Aguilar,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Pandey</surname>
          </string-name>
          ,
          <string-name>
            <surname>S. PYKL</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Gambäck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Chakraborty</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Solorio</surname>
          </string-name>
          ,
          <string-name>
            <surname>A. Das</surname>
          </string-name>
          , SemEval
          <article-title>-2020 task 9: Overview of sentiment analysis of code-mixed tweets</article-title>
          ,
          <source>in: Proceedings of the Fourteenth Workshop on Semantic Evaluation</source>
          , International Committee for Computational Linguistics,
          <source>Barcelona (online)</source>
          ,
          <year>2020</year>
          . URL: https://aclanthology.org/
          <year>2020</year>
          .semeval-
          <volume>1</volume>
          .
          <fpage>100</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>A.</given-names>
            <surname>Joshi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Prabhu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Shrivastava</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Varma</surname>
          </string-name>
          ,
          <article-title>Towards sub-word level compositions for sentiment analysis of Hindi-English code mixed text</article-title>
          ,
          <source>in: Proceedings of COLING</source>
          <year>2016</year>
          ,
          <source>the 26th International Conference on Computational Linguistics: Technical Papers</source>
          ,
          <source>The COLING 2016 Organizing Committee</source>
          , Osaka, Japan,
          <year>2016</year>
          , pp.
          <fpage>2482</fpage>
          -
          <lpage>2491</lpage>
          . URL: https: //aclanthology.org/C16-1234.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>R.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. N.</given-names>
            <surname>Reganti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bhatia</surname>
          </string-name>
          , T. Maheshwari,
          <article-title>Aggression-annotated Corpus of HindiEnglish Code-mixed Data</article-title>
          , in: N. C. C. chair), K. Choukri,
          <string-name>
            <given-names>C.</given-names>
            <surname>Cieri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Declerck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Goggi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Hasida</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Isahara</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Maegaard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Mariani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Mazo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Moreno</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Odijk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Piperidis</surname>
          </string-name>
          , T. Tokunaga (Eds.),
          <source>Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC</source>
          <year>2018</year>
          ),
          <article-title>European Language Resources Association (ELRA), Miyazaki</article-title>
          , Japan,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>