<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>You Don't Say. . . Linguistic Features in Sarcasm Detection</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Martina Ducret</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Anna Feldman</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jing Peng</string-name>
          <email>pengjg@montclair.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Montclair State University Montclair</institution>
          ,
          <addr-line>New Jersey</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <fpage>1035</fpage>
      <lpage>1044</lpage>
      <abstract>
        <p>We explore linguistic features that contribute to sarcasm detection. The linguistic features that we investigate are a combination of text and word complexity, stylistic and psychological features. We experiment with sarcastic tweets with and without context. The results of our experiments indicate that contextual information is crucial for sarcasm prediction. One important observation is that sarcastic tweets are typically incongruent with their context in terms of sentiment or emotional load.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Sarcasm, or verbal irony, is a figurative language
device employed to convey the opposite meaning
of what is actually being said. In verbal
communication, a pause, intonation, or look can provide the
cues necessary to determine whether there is
sarcastic intent behind a comment. In writing, these
social cues are inaccessible. Thus, we must rely on
our understanding of the world, the speaker, and the
context beyond the statement to discern between
sarcasm and sincerity. This task has proven to be
so subjective that social media users moderate their
own comments using symbols and hashtags such as
/s and #sarcasm to denote the sentiment on Reddit
and Twitter, respectively. In fact, the dataset used
in this paper was collected using such hashtags
        <xref ref-type="bibr" rid="ref12">(Ghosh et al., 2020)</xref>
        .
      </p>
      <p>For machines, the lack of real-word knowledge
is detrimental to their understanding of sarcasm
as it hinders many natural language processing
applications. Beyond social-media conversations,
assessing product reviews as positive or negative
requires an understanding of both rhetorical and
literary devices. Back in 2012, BIC rolled out a “For
Her” line of pens which led their intended female
audience to poke fun at the misogynist message of
the product. One reviewer commented, “Well at
last pens for us ladies to use. . . now all we need is
“for her” paper and I can finally learn to write!”.
While this review seems positive and gave the
product four stars, our understanding of the social
climate today leads us to conclude that this review is
sarcastic and should be classified as such.</p>
      <p>In social media communication, new slang
words are introduced every day and emojis are
often used to negate the sentiment of the text. In
addition, stylistic devices and stylometric features are
also often employed to convey a meaning opposite
from its literal interpretation. While deep
learning models can be very effective in their detection
of sarcasm, they provide a “black box” approach
that gives linguists little to no insight into what
features are characteristic of sarcasm. The purpose
of the current work is to learn linguistic patterns
associated with sarcastic tweets and their contexts
and determine which are the strongest indicators
of sarcasm. The next step is to combine these
observations with transformer-based architectures to
achieve a better prediction accuracy.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Previous Work</title>
      <p>
        The field of automatic sarcasm recognition has
become quite active in recent years. The most current
event is the shared task
        <xref ref-type="bibr" rid="ref12">(Ghosh et al., 2020)</xref>
        organized as a part of the 2nd FigLang workshop at
ACL 2020. The task is typically framed as a
binary classification task (sarcastic vs. non-sarcastic)
considering either an utterance in isolation or in
combination with contextual information. Early
approaches to automatic sarcasm detection rely
on different types of features, including sarcasm
markers, word embeddings, emoticons, patterns
between positive and negative sentiment
        <xref ref-type="bibr" rid="ref11 ref13 ref15 ref16 ref17 ref21 ref23 ref25 ref28 ref30 ref30 ref30 ref32 ref33 ref4 ref7 ref7 ref7">(e.g.,
Davidov et al. 2010; Tsur et al. 2010; Gonza´lez-Iba´n˜ ez
et al. 2011; Riloff et al. 2013; Maynard and
Greenwood 2014; Wallace et al. 2015; Ghosh et al. 2015;
Joshi et al. 2015; Veale and Hao 2010; Liebrecht
et al. 2013)</xref>
        . Buschmeier et al. (2014) explore a
range of features, mainly focused on sentiment, for
the detection of verbal irony in product reviews.
While this paper provides a good baseline for irony
classification, our data differs in that it includes
a multi-speaker thread of context prior to the
sarcastic remark. More recent approaches apply deep
learning methods
        <xref ref-type="bibr" rid="ref29 ref33 ref8">(e.g., Ghosh and Veale 2016; Tay
et al. 2018; Wallace et al. 2015)</xref>
        . There is a great
amount of research exploring the role of contextual
information for sarcasm detection
        <xref ref-type="bibr" rid="ref11 ref11 ref14 ref14 ref15 ref15 ref15 ref16 ref16 ref16 ref16 ref2 ref20 ref22 ref24 ref24 ref24 ref27 ref3 ref3 ref33 ref33 ref5 ref5 ref6 ref9">(e.g., Joshi et al.
2015; Bamman and Smith 2015; Misra and Arora
2019; Bamman and Smith 2015; Khattri et al. 2015;
Amir et al. 2016; Rajadesingan et al. 2015; Ghosh
and Veale 2017; Schifanella et al. 2016; Cai et al.
2019; Castro et al. 2019)</xref>
        . Ghosh et al. (2020)
report that almost all systems submitted as part of the
shared task have used the transformer architecture,
such as BERT
        <xref ref-type="bibr" rid="ref31">(Turc et al. 2019)</xref>
        or RoBERTa
        <xref ref-type="bibr" rid="ref18">(Liu
et al. 2020)</xref>
        , and other variants. They performed
better than RNN architectures, even without any
task specific fine-tuning. Unfortunately, it is
difficult to interpret what these models capture about
sarcastic tweets and their context. Our approach
uses classical supervised algorithms to better
understand which elements characterize sarcasm in
a social media setting. We categorize linguistic
features, experiment with different combinations,
and take context into account when performing our
experiments.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Our Approach</title>
      <p>Our approach utilizes a combination of complex,
stylometric, and psychological linguistic features
to automatically detect the presence or absence of
sarcasm in a given text. We intentionally
experiment with classical machine learning classification
algorithms to get a better understanding of the
linguistic features contributing to the sarcasm
detection task. Our linguistic intuition is that there will
be a discordance between the linguistic features
corresponding with the responses and contexts
labeled as sarcastic. Sarcastic tweets are likely to be
semantically or emotionally incongruent with their
preceding tweets, while non-sarcastic tweets show
a greater harmony with their context. To measure
the emotional load of a response and its context,
we extract a number of sentiment- and
emotionrelated features. We also look at the distribution
of these features across the two classes.
Furthermore, we test the performance of our classifier and
importance of our features by considering just the
response tweet versus the response with its
accompanying context.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Data Set</title>
      <p>
        We use the Twitter Corpus from the CodaLab
shared task on sarcasm detection
        <xref ref-type="bibr" rid="ref12">(Ghosh et al.,
2020)</xref>
        . The training data consists of 2,500 tweets
labeled ‘SARCASM’ and 2,500 tweets labeled ‘NON
SARCASM’, the balanced test data consists of
an additional 1,800 labeled tweets. Ghosh et al.
(2020), this is a self-labeled data set where the
tweets are annotated as sarcastic based on the
hashtags used by the users. The non-sarcastic tweets are
the ones that do not contain the sarcasm hashtags,
but may be labeled with either positive or negative
sentiment hashtags, such as ’#happy’. Retweets,
duplicates, quotes, etc., are excluded
        <xref ref-type="bibr" rid="ref12">(see Ghosh
et al. 2020 for more details)</xref>
        . Each sarcastic and
non-sarcastic tweet is accompanied with an
hierarchical conversation thread, e.g., context/1 is the
immediate context, context/0 is the context that
preceded context/1, and so on. The training and test
data include up to 19 preceding tweets labeled as
context/0, context/1, . . . , context/19 (if available).
5
      </p>
    </sec>
    <sec id="sec-5">
      <title>Feature Extraction</title>
      <p>
        Our research focuses on the role linguistic features
play in sarcasm detection. We classify our features
into three categories: complexity, stylistic, and
psychological. Abonizio et al. (2020) defines
complexity features as linguistic features that capture
the overall objective of the context at the word and
sentence level. Stylistic features use natural
language techniques to gain grammatical information
to better understand the syntax and style of the
document. Psychological features are closest related
to emotions and the cognitive aspect of NLP. We
expand on these psychological features by utilizing
VAD (Valence, Arousal, Dominance) (Warriner
et al., 2013), emotional embeddings, and LIWC
        <xref ref-type="bibr" rid="ref28 ref30 ref32 ref7">(Tausczik and Pennebaker, 2010)</xref>
        . Lastly, we use
word-level count vectors, word-level tf-idf, n-gram
word-level tf-idf, n-gram character-level tf-idf. We
stack these features and refer to them as count
vectors for the remainder of this paper.
5.1
LIWC
        <xref ref-type="bibr" rid="ref28 ref30 ref32 ref7">(Tausczik and Pennebaker, 2010)</xref>
        is a text
analysis program with a built-in dictionary that
counts words in psychologically meaningful
categories. After all the words have been reviewed, the
module calculates the total percentages of words
that are similar and match that of the user
dictionary categories. We used LIWC to extract features
to detect and categorize the meaning, emotional
sentiment, and social relationship of the words in
the data set.
5.2
      </p>
      <sec id="sec-5-1">
        <title>Valence, Arousal, Dominance (VAD)</title>
        <p>VAD (Valence Arousal Dominance) (Warriner et al.,
2013) includes almost 14,000 lemmas rated on a
1-9 scale according to the emotions evoked by the
terms. Valence refers to the pleasantness of the
word, arousal determines how dull or exciting the
emotion is, and dominance ranges from
submission to feeling in control. The VAD dimensions
allow us to further explore the affective meanings
of tweets and determine their viability as a
predictor of sarcasm. We compute VAD scores for each
“response” and use the three scores obtained as a
feature in our classifiers. Furthermore, we explore
using the scores as a measure of congruity between
our response and contexts. We calculate the VAD
scores for each individual response and context and
then subtract the response scores by their
respective context scores. In other words, if a response
receives a valence score of 8 and its context/0
receives a valence score of 2, the valence congruity
score would be a 6. We hypothesize that sarcastic
tweets might show very little affective congruity
compared to their non-sarcastic counterparts.
5.3</p>
      </sec>
      <sec id="sec-5-2">
        <title>VADER</title>
        <p>
          VADER (Valence Aware Dictionary and sEntiment
Reasoner)
          <xref ref-type="bibr" rid="ref11 ref14 ref15 ref16 ref24 ref3 ref33">(Hutto and Gilbert, 2015)</xref>
          is a lexicon
and rule-based tool built especially for sentiment
analysis of social media texts. VADER maps
lexical features to emotions and provides insight into
the intensity of such emotions through a series of
polarity indices. VADER considers capitalization,
punctuation, degree modifiers, emojis, and
negations to compute its negative, positive and neutral
scores. Furthermore, VADER’s compound score
provides a normalized, weighted composite score
for a given tweet.
The emotions conveyed in our data set are
portrayed through emotional embeddings. Calculating
the emotions of the text goes a level deeper than
just looking at the word embeddings. Using a
pretrained model from Hugging Face
          <xref ref-type="bibr" rid="ref26">(Saravia et al.,
2018)</xref>
          , we categorize the tweets into six emotions.
The emotions include, joy, anger, fear, surprise,
sadness and love. Figure 1 above represents an
example of the distribution of emotions between
response and context/0 in the balanced training
data set. The results support our intuition that
sarcasm is typically associated with negative
emotions. When the context is labeled as “anger”,
nonsarcastic tweets tend to respond with joy, while
sarcastic tweets usually respond with anger. By
contrast, when the context is labeled as “joy”,
nonsarcastic tweets overwhelmingly respond with joy,
while sarcastic tweets still largely respond with
anger. There are 1,216 instances of the same
emotion expressed in both response and context for the
non- sarcasm class and 863 instances of this in the
sarcasm class. Sarcastic tweets are generally
incongruent with emotions throughout the response and
context, unless associated with a negative emotion,
e.g., anger.
5.5
        </p>
      </sec>
      <sec id="sec-5-3">
        <title>Tweet-Context Similarity Scores</title>
        <p>
          We use the standard document similarity estimation
technique using word embeddings
          <xref ref-type="bibr" rid="ref23">(GloVe,
Pennington et al. 2014)</xref>
          and emotional embeddings
          <xref ref-type="bibr" rid="ref26">(Saravia et al. 2018)</xref>
          , which consists of measuring
the similarity between the vector representations of
the two documents. Let x1; : : : xm and y1; : : : ; yn
be the emotion (or word embedding) vectors of two
documents. The cosine similarity value between
the two documents (e.g., a tweet and its context)
centroids Cx = 1 Pim=1 xi and Cy = n1 Pin=1 yi
m
is calculated as follows:
cos(Cx; Cy) =
hCx; Cyi ;
kCxkkCyk
(1)
where hx; yi denotes the inner product of two
vectors x and y.
        </p>
        <p>We compute two similarity scores: 1) semantic
cosine similarity using word embeddings; 2)
cosine similarity using emotional embeddings. Our
linguistic intuition is that a sarcastic response is
going to be semantically or emotionally incongruent
with its context and this is what creates the sarcasm
effect.</p>
        <p>Message
It’s no secret that this president has routinely targeted
religious and ethnic minorities. He has fanned the
flames of hate against refugees, Muslims, Africans,
immigrants, women and all racial and religious
minorities.</p>
        <p>He is routinely and openly hostile to any legitimate
Congressional oversight. He has made clear his
wanton corruption by soliciting a bribe from a foreign
government for his personal political gain.</p>
        <p>
          Yassss queen, you’re so brave and bold.
5.6
After running all of the features on the training data,
we implemented SHAP (SHapley Additive
exPlanations)
          <xref ref-type="bibr" rid="ref20 ref9">(Lundberg and Lee, 2017)</xref>
          to determine
which features are the most important for
classification. SHAP is a theoretic output technique that
explains predictions of our model, by producing
a SHAPLEY score that plots the most important
features in our model. The features produced by
SHAP were used in our experiments and are
referred to as our “select linguistic features”. The top
20 features SHAP selects contain a combination
of character features such as character count, as
well as a number of sentiment features, including
VADER scores, emotion scores for both a response
and its context as well as VAD features.
6
6.1
        </p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Experimental Evaluation</title>
      <sec id="sec-6-1">
        <title>Data Preprocessing</title>
        <p>
          Our preprocessing procedure consists of steps to
remove noisy and unnecessary data. First, we
tokenize and lemmatize the tweets using NLTK
          <xref ref-type="bibr" rid="ref19">(Loper
and Bird, 2002)</xref>
          . We also remove any instance of
“@USER” due to the repetition of this token in the
beginning of most tweets. Prior research
demonstrated that classifiers did not tend to benefit from
large quantities of additional context and we
noticed that a majority of the tweets only contained
context/0 and context/1. While we plan to
experiment further with additional context layers, in this
work we only report on experiments that involve
context/0 and context/1. We did not remove any
stop words due to the small amount of text in each
tweet. We also maintained punctuation and emojis
as they proved to be useful information during the
extraction of certain features, such as VADER.
7
        </p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>Results</title>
      <p>We use a Random Forest classifier and run 21
different experiments of which the most relevant ones
are outlined in Table 3. The baseline scores
represent an attention based LSTM model described
in Ghosh et al. (2018) and used in the CodaLab
Shared Task. We look at how each feature
performed on just the response versus the response
and context. We notice that for response, a
combination of all count features and all linguistic
features achieves the best F1 score of 67%. This score
is further increased to 70% when the context is
considered.
c/0
c/1
R
c/0
c/1</p>
      <p>R</p>
      <p>Message
A2 I revert back to Canvas. I am sure you can post
assignments for parents in this, (haven’t done this
yet). Canvas = #thebomb #KidsDeserveIt
Can you telk me more about Canvas? I haven’t heard
of it.</p>
      <p>It’s Edmodo with #MorePower You can create
assignments in it, post all work, the assignments can be
auto graded and imported into your Skyward grade
book.
Table 1 is an example of a sarcastic tweet whose
context/0, context/1 and response received an
emotion of anger, anger, and joy, respectively. Table 2
represents a non-sarcastic thread of tweets where
each message was classified as joy. This indicates
that non-sarcastic tweets tend to be more
emotionally similar to the preceding context while sarcastic
tweets tend to shift in emotion. As a result, when
compared to its contexts, the sarcastic tweet
received lower emotional similarity scores than the
non-sarcastic tweet.
In this paper we explored the role various
linguistic features play in computational sarcasm
detection. We investigated a combination of text and
word complexity features, stylistic and
psychological features. The result of our experiments
indicate that contextual information is crucial for
sarcasm detection. We also observed that sarcastic
tweets are often incongruent with their context in
terms of sentiment or emotional load. Using a
Random Forest classifier and the features we extracted
we obtain promising results. Our current work is
concerned with combining these observations with
transformer-based architectures to achieve a better
prediction accuracy.</p>
    </sec>
    <sec id="sec-8">
      <title>Acknowledgments</title>
      <p>This work is supported by the US National Science
Foundation under Grant No.: 1704113.</p>
      <p>Sarcasm</p>
      <p>ArXiv,</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <given-names>Hugo</given-names>
            <surname>Queiroz</surname>
          </string-name>
          <string-name>
            <surname>Abonizio</surname>
          </string-name>
          , Janaina Ignacio de Morais, Gabriel Marques Tavares, and Sylvio Barbon Junior.
          <year>2020</year>
          .
          <article-title>Language-independent fake news detection: English, portuguese, and spanish mutual features</article-title>
          .
          <source>Future Internet</source>
          ,
          <volume>12</volume>
          (
          <issue>5</issue>
          ):
          <fpage>87</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>Silvio</given-names>
            <surname>Amir</surname>
          </string-name>
          , Byron C Wallace,
          <string-name>
            <surname>Hao Lyu</surname>
            , and Paula Carvalho Ma´rio
            <given-names>J</given-names>
          </string-name>
          <string-name>
            <surname>Silva</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Modelling context with user embeddings for sarcasm detection in social media</article-title>
          .
          <source>arXiv preprint arXiv:1607</source>
          .
          <fpage>00976</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <given-names>David</given-names>
            <surname>Bamman</surname>
          </string-name>
          and
          <string-name>
            <given-names>Noah</given-names>
            <surname>Smith</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Contextualized sarcasm detection on twitter</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <given-names>Konstantin</given-names>
            <surname>Buschmeier</surname>
          </string-name>
          , Philipp Cimiano, and
          <string-name>
            <given-names>Roman</given-names>
            <surname>Klinger</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>An impact analysis of features in a classification approach to irony detection in product reviews</article-title>
          .
          <source>In Proceedings of the 5th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis</source>
          , pages
          <fpage>42</fpage>
          -
          <lpage>49</lpage>
          , Baltimore, Maryland. Association for Computational Linguistics.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <given-names>Yitao</given-names>
            <surname>Cai</surname>
          </string-name>
          , Huiyu Cai, and
          <string-name>
            <given-names>Xiaojun</given-names>
            <surname>Wan</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Multimodal sarcasm detection in twitter with hierarchical fusion model</article-title>
          .
          <source>In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics</source>
          , pages
          <fpage>2506</fpage>
          -
          <lpage>2515</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <given-names>Santiago</given-names>
            <surname>Castro</surname>
          </string-name>
          , Devamanyu Hazarika, Vero´nica Pe´rezRosas,
          <string-name>
            <surname>Roger</surname>
            <given-names>Zimmermann</given-names>
          </string-name>
          , Rada Mihalcea, and
          <string-name>
            <given-names>Soujanya</given-names>
            <surname>Poria</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Towards multimodal sarcasm detection (an obviously perfect paper)</article-title>
          .
          <source>arXiv preprint arXiv:1906</source>
          .
          <year>01815</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <given-names>Dmitry</given-names>
            <surname>Davidov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Oren</given-names>
            <surname>Tsur</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Ari</given-names>
            <surname>Rappoport</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>Semi-supervised recognition of sarcasm in Twitter and Amazon</article-title>
          .
          <source>In Proceedings of the Fourteenth Conference on Computational Natural Language Learning</source>
          , pages
          <fpage>107</fpage>
          -
          <lpage>116</lpage>
          , Uppsala, Sweden. Association for Computational Linguistics.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <given-names>Aniruddha</given-names>
            <surname>Ghosh</surname>
          </string-name>
          and
          <string-name>
            <given-names>Tony</given-names>
            <surname>Veale</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Fracking sarcasm using neural network</article-title>
          .
          <source>In Proceedings of the 7th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis</source>
          , pages
          <fpage>161</fpage>
          -
          <lpage>169</lpage>
          , San Diego, California. Association for Computational Linguistics.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <given-names>Aniruddha</given-names>
            <surname>Ghosh</surname>
          </string-name>
          and
          <string-name>
            <given-names>Tony</given-names>
            <surname>Veale</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Magnets for sarcasm: Making sarcasm detection timely, contextual and very personal</article-title>
          .
          <source>In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing</source>
          , pages
          <fpage>482</fpage>
          -
          <lpage>491</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <given-names>Debanjan</given-names>
            <surname>Ghosh</surname>
          </string-name>
          ,
          <string-name>
            <surname>Alexander R Fabbri</surname>
            , and
            <given-names>Smaranda</given-names>
          </string-name>
          <string-name>
            <surname>Muresan</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Sarcasm analysis using conversation context</article-title>
          .
          <source>Computational Linguistics</source>
          ,
          <volume>44</volume>
          (
          <issue>4</issue>
          ):
          <fpage>755</fpage>
          -
          <lpage>792</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <given-names>Debanjan</given-names>
            <surname>Ghosh</surname>
          </string-name>
          , Weiwei Guo, and
          <string-name>
            <given-names>Smaranda</given-names>
            <surname>Muresan</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Sarcastic or not: Word embeddings to predict the literal or sarcastic meaning of words</article-title>
          .
          <source>In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing</source>
          , pages
          <fpage>1003</fpage>
          -
          <lpage>1012</lpage>
          , Lisbon, Portugal. Association for Computational Linguistics.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <given-names>Debanjan</given-names>
            <surname>Ghosh</surname>
          </string-name>
          , Avijit Vajpayee, and
          <string-name>
            <given-names>Smaranda</given-names>
            <surname>Muresan</surname>
          </string-name>
          .
          <year>2020</year>
          .
          <article-title>A report on the 2020 sarcasm detection shared task</article-title>
          .
          <source>In Proceedings of the Second Workshop on Figurative Language Processing</source>
          , pages
          <fpage>1</fpage>
          -
          <lpage>11</lpage>
          , Online. Association for Computational Linguistics.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <given-names>Roberto</given-names>
            <surname>Gonza</surname>
          </string-name>
          <article-title>´lez-Iba´n˜ez, Smaranda Muresan</article-title>
          , and
          <string-name>
            <given-names>Nina</given-names>
            <surname>Wacholder</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>Identifying sarcasm in Twitter: A closer look</article-title>
          .
          <source>In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies</source>
          , pages
          <fpage>581</fpage>
          -
          <lpage>586</lpage>
          , Portland, Oregon, USA. Association for Computational Linguistics.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <given-names>C.J.</given-names>
            <surname>Hutto</surname>
          </string-name>
          and
          <string-name>
            <given-names>Eric</given-names>
            <surname>Gilbert</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Vader: A parsimonious rule-based model for sentiment analysis of social media text</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <given-names>Aditya</given-names>
            <surname>Joshi</surname>
          </string-name>
          , Vinita Sharma, and
          <string-name>
            <given-names>Pushpak</given-names>
            <surname>Bhattacharyya</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Harnessing context incongruity for sarcasm detection</article-title>
          .
          <source>In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)</source>
          , pages
          <fpage>757</fpage>
          -
          <lpage>762</lpage>
          , Beijing, China. Association for Computational Linguistics.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <given-names>Anupam</given-names>
            <surname>Khattri</surname>
          </string-name>
          , Aditya Joshi, Pushpak Bhattacharyya, and
          <string-name>
            <given-names>Mark</given-names>
            <surname>Carman</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Your sentiment precedes you: Using an author's historical tweets to predict sarcasm</article-title>
          .
          <source>In Proceedings of the 6th workshop on computational approaches to subjectivity, sentiment and social media analysis</source>
          , pages
          <fpage>25</fpage>
          -
          <lpage>30</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <given-names>Christine</given-names>
            <surname>Liebrecht</surname>
          </string-name>
          , Florian Kunneman, and Antal van den Bosch.
          <year>2013</year>
          .
          <article-title>The perfect solution for detecting sarcasm in tweets #not</article-title>
          .
          <source>In Proceedings of the 4th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis</source>
          , pages
          <fpage>29</fpage>
          -
          <lpage>37</lpage>
          , Atlanta, Georgia. Association for Computational Linguistics.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <given-names>Yinhan</given-names>
            <surname>Liu</surname>
          </string-name>
          , Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen,
          <string-name>
            <surname>Omer Levy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Mike</given-names>
            <surname>Lewis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Luke</given-names>
            <surname>Zettlemoyer</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Veselin</given-names>
            <surname>Stoyanov</surname>
          </string-name>
          .
          <year>2020</year>
          .
          <article-title>RoBERTa: A Robustly Optimized BERT Pretraining Approach</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <given-names>Edward</given-names>
            <surname>Loper</surname>
          </string-name>
          and
          <string-name>
            <given-names>Steven</given-names>
            <surname>Bird</surname>
          </string-name>
          .
          <year>2002</year>
          .
          <article-title>NLTK: The Natural Language Toolkit</article-title>
          . In
          <source>In Proceedings of the ACL Workshop on Effective Tools and Methodologies for Teaching Natural Language Processing and Computational Linguistics</source>
          . Philadelphia: Association for Computational Linguistics.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <surname>Scott M Lundberg</surname>
            and
            <given-names>Su-In</given-names>
          </string-name>
          <string-name>
            <surname>Lee</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>A unified approach to interpreting model predictions</article-title>
          . In I. Guyon,
          <string-name>
            <given-names>U. V.</given-names>
            <surname>Luxburg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Bengio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wallach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Fergus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Vishwanathan</surname>
          </string-name>
          , and R. Garnett, editors,
          <source>Advances in Neural Information Processing Systems</source>
          <volume>30</volume>
          , pages
          <fpage>4765</fpage>
          -
          <lpage>4774</lpage>
          . Curran Associates, Inc.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <given-names>Diana</given-names>
            <surname>Maynard</surname>
          </string-name>
          and
          <string-name>
            <given-names>Mark</given-names>
            <surname>Greenwood</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Who cares about sarcastic tweets? investigating the impact of sarcasm on sentiment analysis</article-title>
          .
          <source>In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)</source>
          , pages
          <fpage>4238</fpage>
          -
          <lpage>4243</lpage>
          , Reykjavik, Iceland.
          <source>European Language Resources Association (ELRA).</source>
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <string-name>
            <given-names>Rishabh</given-names>
            <surname>Misra</surname>
          </string-name>
          and
          <string-name>
            <given-names>Prahal</given-names>
            <surname>Arora</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>detection using hybrid neural network</article-title>
          . abs/
          <year>1908</year>
          .07414.
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <surname>Jeffrey</surname>
            <given-names>Pennington</given-names>
          </string-name>
          , Richard Socher, and
          <string-name>
            <given-names>Christopher D.</given-names>
            <surname>Manning</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Glove: Global vectors for word representation</article-title>
          .
          <source>In Empirical Methods in Natural Language Processing (EMNLP)</source>
          , pages
          <fpage>1532</fpage>
          -
          <lpage>1543</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <string-name>
            <given-names>Ashwin</given-names>
            <surname>Rajadesingan</surname>
          </string-name>
          , Reza Zafarani, and Huan Liu.
          <year>2015</year>
          .
          <article-title>Sarcasm detection on twitter: A behavioral modeling approach</article-title>
          .
          <source>In Proceedings of the eighth ACM international conference on web search and data mining</source>
          , pages
          <fpage>97</fpage>
          -
          <lpage>106</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          <string-name>
            <given-names>Ellen</given-names>
            <surname>Riloff</surname>
          </string-name>
          , Ashequl Qadir, Prafulla Surve, Lalindra De Silva, Nathan Gilbert, and
          <string-name>
            <given-names>Ruihong</given-names>
            <surname>Huang</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Sarcasm as contrast between a positive sentiment and negative situation</article-title>
          .
          <source>In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing</source>
          , pages
          <fpage>704</fpage>
          -
          <lpage>714</lpage>
          , Seattle, Washington, USA. Association for Computational Linguistics.
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          <string-name>
            <given-names>Elvis</given-names>
            <surname>Saravia</surname>
          </string-name>
          ,
          <string-name>
            <surname>Hsien-Chi Toby</surname>
            <given-names>Liu</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yen-Hao</surname>
            <given-names>Huang</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Junlin Wu</surname>
          </string-name>
          , and
          <string-name>
            <surname>Yi-Shin Chen</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>CARER: Contextualized affect representations for emotion recognition</article-title>
          .
          <source>In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing</source>
          , pages
          <fpage>3687</fpage>
          -
          <lpage>3697</lpage>
          , Brussels, Belgium. Association for Computational Linguistics.
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          <string-name>
            <given-names>Rossano</given-names>
            <surname>Schifanella</surname>
          </string-name>
          , Paloma de Juan, Joel Tetreault, and
          <string-name>
            <given-names>Liangliang</given-names>
            <surname>Cao</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Detecting sarcasm in multimodal social platforms</article-title>
          .
          <source>In Proceedings of the 24th ACM international conference on Multimedia</source>
          , pages
          <fpage>1136</fpage>
          -
          <lpage>1145</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          <string-name>
            <given-names>Yla R.</given-names>
            <surname>Tausczik</surname>
          </string-name>
          and
          <string-name>
            <given-names>James W.</given-names>
            <surname>Pennebaker</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>The psychological meaning of words: Liwc and computerized text analysis methods</article-title>
          .
          <source>Journal of Language and Social Psychology</source>
          ,
          <volume>29</volume>
          (
          <issue>1</issue>
          ):
          <fpage>24</fpage>
          -
          <lpage>54</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          <string-name>
            <given-names>Yi</given-names>
            <surname>Tay</surname>
          </string-name>
          , Anh Tuan Luu, Siu Cheung Hui, and
          <string-name>
            <given-names>Jian</given-names>
            <surname>Su</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Reasoning with sarcasm by reading inbetween</article-title>
          .
          <source>In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)</source>
          , pages
          <fpage>1010</fpage>
          -
          <lpage>1020</lpage>
          , Melbourne, Australia. Association for Computational Linguistics.
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          <string-name>
            <given-names>Oren</given-names>
            <surname>Tsur</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Dmitry</given-names>
            <surname>Davidov</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Ari</given-names>
            <surname>Rappoport</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>Icwsm - a great catchy name: Semi-supervised recognition of sarcastic sentences in online product reviews</article-title>
          .
          <source>In ICWSM. The AAAI Press.</source>
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          <string-name>
            <given-names>Iulia</given-names>
            <surname>Turc</surname>
          </string-name>
          ,
          <string-name>
            <surname>Ming-Wei</surname>
            <given-names>Chang</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Kenton</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>and Kristina</given-names>
            <surname>Toutanova</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Well-read students learn better: On the importance of pre-training compact models</article-title>
          . arXiv preprint arXiv:
          <year>1908</year>
          .08962v2.
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          <string-name>
            <given-names>Tony</given-names>
            <surname>Veale</surname>
          </string-name>
          and
          <string-name>
            <given-names>Yanfen</given-names>
            <surname>Hao</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>Detecting ironic intent in creative comparisons</article-title>
          .
          <source>In ECAI 2010 - 19th European Conference on Artificial Intelligence</source>
          , Lisbon, Portugal,
          <source>August 16-20</source>
          ,
          <year>2010</year>
          , Proceedings, volume
          <volume>215</volume>
          <source>of Frontiers in Artificial Intelligence and Applications</source>
          , pages
          <fpage>765</fpage>
          -
          <lpage>770</lpage>
          . IOS Press.
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          <string-name>
            <given-names>Byron C.</given-names>
            <surname>Wallace</surname>
          </string-name>
          , Do Kook Choe, and
          <string-name>
            <given-names>Eugene</given-names>
            <surname>Charniak</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Sparse, contextually informed models for irony detection: Exploiting user communities, entities and sentiment</article-title>
          .
          <source>In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Amy Warriner</source>
          , Victor Kuperman, and
          <string-name>
            <given-names>Marc</given-names>
            <surname>Brysbaert</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Norms of valence, arousal, and dominance for 13,915 english lemmas</article-title>
          .
          <source>Behavior research methods</source>
          ,
          <volume>45</volume>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>