<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Stance Classi cation for Rumour Analysis in Twitter: Exploiting A ective Information and Conversation Structure</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Endang Wahyu Pamungkas</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Valerio Basile</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Viviana Patti Dipartimento di Informatica</string-name>
          <email>pattig@di.unito.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Universita degli Studi di Torino</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>1http://www.journalism.org/2017/09/07/ news-use-across-social-media-platforms-2017/</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Analysing how people react to rumours
associated with news in social media is an important
task to prevent the spreading of
misinformation, which is nowadays widely recognized as
a dangerous tendency. In social media
conversations, users show di erent stances and
attitudes towards rumourous stories. Some
users take a de nite stance, supporting or
denying the rumour at issue, while others just
comment it, or ask for additional evidence
on the rumour's veracity. A shared task has
been proposed at SemEval-2017 (Task 8,
SubTask A), which is focused on rumour stance
classi cation in English tweets. The goal is
predicting user stance towards emerging
rumours in Twitter, in terms of supporting,
denying, querying, or commenting the original
rumour, looking at the conversation threads
originated by the rumour. This paper
describes a new approach to this task, where the
use of conversation-based and a ective-based
features, covering di erent facets of a ect, is
explored. Our classi cation model
outperforms the best-performing systems for stance
classi cation at SemEval-2017 showing the
effectiveness of the feature set proposed.</p>
      <p>Copyright © CIKM 2018 for the individual papers by the papers'
authors. Copyright © CIKM 2018 for the volume as a collection
by its editors. This volume and its papers are published under
the Creative Commons License Attribution 4.0 International (CC
BY 4.0).
1</p>
    </sec>
    <sec id="sec-2">
      <title>Introduction</title>
      <p>Nowadays, people increasingly tend to use social media
like Facebook and Twitter as their primary source of
information and news consumption. There are several
reasons behind this tendency, such as the simplicity
to gather and share the news and the possibility of
staying abreast of the latest news and updated faster
than with traditional media. An important factor is
also that people can be engaged in conversations on
the latest breaking news with their contacts by
using these platforms. Pew Research Center's newest
report1 shows that two-thirds of U.S. adults gather
their news from social media, where Twitter is the
most used platform. However, the absence of a
systematic approach to do some form of fact and veracity
checking may also encourage the spread of rumourous
stories and misinformation [PVV13]. Indeed, in social
media, unveri ed information can spread very quickly
and becomes viral easily, enabling the di usion of false
rumours and fake information.</p>
      <p>Within this scenario, it is crucial to analyse
people attitudes towards rumours in social media and to
resolve their veracity as soon as possible. Several
approaches have been proposed to check the rumour
veracity in social media [SSW+17]. This paper focus
on a stance-based analysis of event-related rumours,
following the approach proposed at SemEval-2017 in
the new RumourEval shared task (Task 8, sub-task
A) [DBL+17]. In this task English tweets from
conversation threads, each associated to a newsworthy event
and the rumours around it, are provided as data. The
goal is to determine whether a tweet in the thread
is supporting, denying, querying, or commenting the
original rumour which started the conversation. It can
be considered a stance classi cation task, where we
have to predict the user's stance towards the rumour
from a tweet, in the context of a given thread. This
task has been de ned as open stance classi cation task
and is conceived as a key step in rumour resolution,
by providing an analysis of people reactions towards
an emerging rumour [PVV13, ZLP+16]. The task is
also di erent from detecting stance towards a speci c
target entity [MKS+16].</p>
      <p>Contribution We describe a novel classi cation
approach, by proposing a new feature matrix, which
includes two new groups: (a) features exploiting the
conversational structure of the dataset [DBL+17]; (b)
a ective features relying on the use of a wide range
of a ective resources capturing di erent facets of
sentiment and other a ect-related phenomena. We were
also inspired by the fake news study on Twitter in
[VRA18], showing that false stories inspire fear,
disgust, and surprise in replies, while true stories inspire
anticipation, sadness, joy, and trust. Meanwhile, from
a dialogue act perspective, the study of [NS13] found
that a relationship exists between the use of an a
ective lexicon and the communicative intention of an
utterance which includes AGREE-ACCEPT (support),
REJECT (deny), INFO-REQUEST (question), and
OPINION (comment). They exploited several LIWC
categories to analyse the role of a ective content.</p>
      <p>Our results show that our model outperforms the
state of the art on the Semeval-2017 benchmark
dataset. Feature analysis highlights the contribution
of the di erent feature groups, and error analysis is
shedding some light on the main di culties and
challenges which still need to be addressed.</p>
      <p>
        Outline The paper is organized as follows.
Section 2
        <xref ref-type="bibr" rid="ref5">introduces the SemEval-2017</xref>
        Task 8. Section 3
describes our approach to deal with open stance
classication by exploiting di erent groups of features.
Section 4 describes the evaluation and includes a
qualitative error analysis. Finally, Section 5 concludes the
paper and points to future directions.
2
      </p>
    </sec>
    <sec id="sec-3">
      <title>SemEval-2017 Task 8: RumourEval</title>
      <p>The SemEval-2017 Task 8 Task A [DBL+17] has as
its main objective to determine the stance of the users
in a Twitter thread towards a given rumour, in terms
of support, denying, querying or commenting (SDQC)
on the original rumour. Rumour is de ned as a
\circulating story of questionable veracity, which is
apparently credible but hard to verify, and produces su cient
skepticism and/or anxiety so as to motivate nding out
the actual truth" [ZLP+15]. The task was very timing
due to the growing importance of rumour resolution
in the breaking news and to the urgency of preventing
the spreading of misinformation.</p>
      <sec id="sec-3-1">
        <title>Rumour</title>
        <p>Charlie Hebdo
Ebola Essien
Ferguson
Ottawa Shooting
Prince Toronto
Putin Missing
Sydney Siege</p>
      </sec>
      <sec id="sec-3-2">
        <title>Total</title>
      </sec>
      <sec id="sec-3-3">
        <title>Rumour</title>
        <p>Ferguson
Ottawa Shooting
Sydney Siege
Charlie Hebdo
Germanwings
Marina Joyce
Hillary's Illness</p>
      </sec>
      <sec id="sec-3-4">
        <title>Total</title>
        <p>Dataset2 The data for this task are taken from
Twitter conversations about news-related rumours
collected by [ZLP+16]. They were annotated using
four labels (SDQC): support - S (when tweet's
author support the rumour veracity); deny -D (when
tweet's author denies the rumour veracity); query
Q (when tweet's author ask for additional
information/evidence); comment -C (when tweet's author just
make a comment and does not give important
information to asses the rumour veracity). The distribution
consists of three sets: development, training and test
sets, as summarized in Table 1, where you can see also
the label distribution and the news related to the
rumors discussed. Training data consist of 297 Twitter
conversations and 4,238 tweets in total with related
direct and nested replies, where conversations are
associated to seven di erent breaking news. Test data
consist of 1049 tweets, where two new rumourous
topics were added.</p>
        <p>Participants Eight teams participated in the task.
The best performing system was developed by
Turing (78.4 in accuracy). ECNU, MamaEdha,
UWaterloo, and DFKI-DKT utilized ensemble classi er.
Some systems also used deep learning techniques,
including Turing, IKM, and MamaEdha. Meanwhile,
NileTRMG and IITP used classical classi er (SVM) to
2http://alt.qcri.org/semeval2017/task8/index.php?id=
data-and-tools
build their systems. Most of the participants exploited
word embedding to construct their feature space,
beside the Twitter domain features.
3</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Proposed Method</title>
      <p>We developed a new model by exploiting several
stylistic and structural features characterizing
Twitter language. In addition, we propose to utilize
conversational-based features by exploiting the
peculiar tree structure of the dataset. We also explored the
use of a ective based feature by extracting information
from several a ective resources including dialogue-act
inspired features.
3.1</p>
      <sec id="sec-4-1">
        <title>Structural Features</title>
        <p>They were designed taking into account several
Twitter data characteristics, and then selecting the most
relevant features to improve the classi cation
performance. The set of structural features that we used is
listed below.</p>
        <p>Retweet Count: The number of retweet of each
tweet.</p>
        <p>Question Mark: presence of question mark "?";
binary value (0 and 1).</p>
        <p>Question Mark Count: number of question
marks present in the tweet.</p>
        <p>Hashtag Presence: this feature has a binary
value 0 (if there is no hashtag in the tweet) or 1
(if there is at least one hashtag in the tweet).
Text Length: number of characters after
removing Twitter markers such as hashtags, mentions,
and URLs.</p>
        <p>URL Count: number of URL links in the tweet.
3.2</p>
      </sec>
      <sec id="sec-4-2">
        <title>Conversation Based Features</title>
        <p>These features are devoted to exploit the peculiar
characteristics of the dataset, which have a tree structure
re ecting the conversation thread3.</p>
      </sec>
      <sec id="sec-4-3">
        <title>Text Similarity to Source Tweet: Jaccard</title>
        <p>Similarity of each tweet with its source tweet.</p>
      </sec>
      <sec id="sec-4-4">
        <title>Text Similarity to Replied Tweet: the degree</title>
        <p>of similarity between the tweet with the previous
tweet in the thread (the tweet is a reply to that
tweet).</p>
        <p>Tweet Depth: the depth value is obtained by
counting the node from sources (roots) to each
tweet in their hierarchy.</p>
        <p>3The implementation of these features is inspired from
unpublished shared code [Gra17].
3.3</p>
      </sec>
      <sec id="sec-4-5">
        <title>A ective Based Features</title>
        <p>The idea to use a ective features in the context of our
task was inspired by recent works on fake news
detection, focusing on emotional responses to true and false
rumors [VRA18], and by the work in [NS13] re ecting
on the role of a ect in dialogue acts [NS13].
Multifaceted a ective features have been already proven to
be e ective in some related tasks [LFPR16], including
the stance detection task proposed at SemEval-2016
(Task 6).</p>
        <p>We used the following a ective resources relying on
di erent emotion models.</p>
        <p>Emolex: it contains 14,182 words associated
with eight primary emotion based on the Plutchik
model [MT13, Plu01].</p>
        <p>EmoSenticNet(EmoSN): it is an enriched
version of SenticNet [COR14] including 13,189 words
labeled by six Ekman's basic emotion [PGH+13,
Ekm92].</p>
      </sec>
      <sec id="sec-4-6">
        <title>Dictionary of A ect in Language (DAL): in</title>
        <p>cludes 8,742 English words labeled by three scores
representing three dimensions: Pleasantness,
Activation and Imagery [Whi09].</p>
      </sec>
      <sec id="sec-4-7">
        <title>A ective Norms for English Words</title>
        <p>(ANEW): consists of 1,034 English words
[BL99] rated with ratings based on the
ValenceArousal-Dominance (VAD) model [OST57].</p>
      </sec>
      <sec id="sec-4-8">
        <title>Linguistic Inquiry and Word Count</title>
        <p>(LIWC): this psycholinguistic resource [PFB01]
includes 4,500 words distributed into 64
emotional categories including positive (PosEMO)
and negative (NegEMO).
3.4</p>
      </sec>
      <sec id="sec-4-9">
        <title>Dialogue-Act Features</title>
        <p>We also included additional 11 categories from bf
LIWC, which were already proven to be e ective in
dialogue-act task in previous work [NS13]. Basically,
these features are part of the a ective feature group,
but we present them separately because we are
interested in exploring the contribution of such feature
set separately. This feature set was obtained by
selecting 4 communicative goals related to our classes
in the stance task: agree-accept (support), reject
(deny), info-request (question), and opinion
(comment). The 11 LIWC categories include:</p>
        <sec id="sec-4-9-1">
          <title>Agree-accept: Assent, Certain, A ect;</title>
        </sec>
        <sec id="sec-4-9-2">
          <title>Reject: Negate, Inhib;</title>
        </sec>
      </sec>
      <sec id="sec-4-10">
        <title>Info-request: You, Cause;</title>
        <sec id="sec-4-10-1">
          <title>Opinion: Future, Sad, Insight, Cogmech.</title>
          <p>We used the RumourEval dataset from SemEval-2017
Task 8 described in Section 2. We de ned the rumour
stance detection problem as a simple four-way
classication task, where every tweet in the dataset (source
and direct or nested reply) should be classi ed into one
among four classes: support, deny, query, and
comment. We conducted a set of experiments in order to
evaluate and analyze the e ectiveness of our proposed
feature set.4.</p>
          <p>
            The results are summarized in Table 2, showing
that our system outperforms all of the other systems
in terms of accuracy. Our best result was obtained
by a simple con guration with a support vector
classi er with radial basis function (RBF) kernel. Our
model performed better than the best-perform
            <xref ref-type="bibr" rid="ref5">ing
systems in SemEval 2017</xref>
            Task 8 Subtask A (Turing team,
[KLA17]), which exploited deep learning approach by
using LTSM-Branch model. In addition, we also got a
higher accuracy than the system described in [ADB17],
which exploits a Random Forest classi er and word
embeddings based features.
          </p>
          <p>We experimented with several classi ers, including
Naive Bayes, Decision Trees, Support Vector Machine,
and Random Forest, noting that SVM outperforms the
other classi ers on this task. We explored the
parameter space by tuning the SVM hyperparameters,
namely the penalty parameter C, kernel type, and class
weights (to deal with class imbalance). We tested
several values for C (0.001, 0.01, 0.1, 1, 10, 100, and 1000),
four di erent kernels (linear, RBF, polynomial, and
sigmoid) and weighted the classes based on their
distribution in the training data. The best result was
obtained with C=1, RBF kernel, and without class
weighting.</p>
          <p>An ablation test was conducted to explore the
contribution of each feature set. Table 5 shows the result
of our ablation test, by exploiting several feature sets
on the same classi er (SVM with RBF kernel) 5. This
evaluation includes macro-averages of precision, recall
and F1-score as well as accuracy. We also presented
4We built our system by using scikit-learn Python
Library: http://scikit-learn.org/</p>
          <p>5Source code is available on the GitHub platform:
https://github.com/dadangewp/SemEval2017-RumourEval</p>
        </sec>
      </sec>
      <sec id="sec-4-11">
        <title>Support</title>
      </sec>
      <sec id="sec-4-12">
        <title>Deny</title>
      </sec>
      <sec id="sec-4-13">
        <title>Query</title>
      </sec>
      <sec id="sec-4-14">
        <title>Comment</title>
      </sec>
      <sec id="sec-4-15">
        <title>Support</title>
      </sec>
      <sec id="sec-4-16">
        <title>Deny</title>
      </sec>
      <sec id="sec-4-17">
        <title>Query</title>
      </sec>
      <sec id="sec-4-18">
        <title>Comment</title>
        <p>the scores for each class in order to get a better
understanding of our classi er's performance.</p>
        <p>Using only conversational, a ective, or dialogue-act
features (without structural features) did not give a
good classi cation result. Set B (conversational
features only) was not able to detect the query and deny
classes, while set C (a ective features only) and D
(dialogue-act features only) failed to catch the
support, query, and deny classes. Conversational features
were able to improve the classi er performance
significantly, especially in detecting the support class. Sets
E, H, I, and K which utilize conversational features
induce an improvement on the prediction of the support
class (roughly from 0.3 to 0.73 on precision).
Meanwhile, the combination of a ective and dialogue-act
features was able to slightly improve the classi cation
of the query class. The improvement can be seen from
set E to set K where the F1-score of query class
increased from 0.52 to 0.58. Overall, the best result was
obtained by the K set which encompasses all sets of
features. It is worth to be noted that in our best
conguration system, not all of a ective and dialogue-act
features were used in our feature vector. After several
optimization steps, we found that some features were
not improving the system's performance. Our nal list
of a ective and dialogue-act based features includes:</p>
      </sec>
      <sec id="sec-4-19">
        <title>DAL Activation, ANEW Dominance, Emolex</title>
      </sec>
      <sec id="sec-4-20">
        <title>Negative, Emolex Fear, LIWC Assent, LIWC</title>
      </sec>
      <sec id="sec-4-21">
        <title>Cause, LIWC Certain and LIWC Sad. There</title>
        <p>fore, we have only 17 columns of features in the best
performing system covering structural, conversational,
a ective and dialogue-act features.</p>
        <p>We conducted a further analysis of the classi cation
result obtained by the best performing system (79.50
on accuracy). Table 3 shows the confusion matrix of
our result. On the one hand, the system is able to
detect the comment tweets very well. However, this
result is biased due to the number of comment data in
the dataset. On the other hand, the system is failing
to detect denying tweets, which were falsely classi ed
into comments (68 out of 71)6. Meanwhile,
approximately two thirds of supporting tweets and almost half
of querying tweets were classi ed into the correct class
by the system.</p>
        <p>In order to assess the impact of class imbalance on
the learning, we performed an additional experiment
with a balanced dataset using the best performing
conguration. We took a subset of the instances equally
distributed with respect to their class from the
training set (330 instances for each class) and test set (71
instances for each class). As shown in Table 4, our
classi er was able to correctly predict the
underrepresented classes much better, although the overall
accuracy is lower (59.9%). The result of this analysis
clearly indicates that class imbalance has a negative
impact on the system performance.
4.1</p>
        <p>Error analysis
We conducted a qualitative error analysis on the 215
misclassi ed in the test set, to shed some light on the
issues and di culties to be addressed in future work
and to detect some notable error classes.</p>
      </sec>
      <sec id="sec-4-22">
        <title>Denying by attacking the rumour's author. An</title>
        <p>interesting nding from the analysis of the Marina
Joyce rumour data is that it contains a lot of
denying tweets including insulting comments towards the
author of the source tweet, like in the following cases:</p>
        <sec id="sec-4-22-1">
          <title>Rumour: Marina Joyce</title>
          <p>Misclassi ed tweets:
(da1) stfu you toxic sludge
(da2) @sampepper u need rehab
Misclassi cation type: deny (gold)
comment (prediction)
Source tweet:
(s1) Anyone who knows Marina Joyce
personally knows she has a serious drug
addiction. she needs help, but in the form of rehab
#savemarinajoyce
Tweets like (da1) and (da2) seem to be more inclined
to show the respondent's personal hatred towards the
s1-tweet's author than to deny the veracity of the
rumour. In other words, they represent a peculiar form
of denying the rumour, which is expressed by personal
attack and by showing negative attitudes or hatred
towards the rumour's author. This is di erent from
denying by attacking the source tweet content, and it
was di cult to comprehend for our system, that often
misclassi ed such kind of tweets as comments.
Noisy text, speci c jargon, very short text. In
(da1) and (da2) (as in many tweets in the test set), we
also observe the use of noisy text (abbreviations,
misspellings, slang words and slurs, question statements
without question mark, and so on) that our classi er
struggles to handle . Moreover, especially in tweets
from the Marina Joyce rumour's group, we found some
very short tweets in the denying class that do not
provide enough information, e.g. tweets like \shut up!",
\delete", and \stop it. get some help".</p>
          <p>Argumentation context. We also observed
misclassi cation cases that seem to be related to a deeper
capability of dealing with the argumentation context
underlying the conversation thread.</p>
        </sec>
        <sec id="sec-4-22-2">
          <title>Rumour: Ferguson</title>
          <p>Misclassi ed tweet:
(arg1)@QuadCityPat @AP I join you in this
demand. Unconscionable.</p>
          <p>Misclassi cation type: deny (gold)
comment (prediction)
Source tweet:
(s2) @AP I demand you retract the lie that
people in #Ferguson were shouting \kill the
police", local reporting has refuted your ugly
racism
6A similar observation is reported by the best team at
Semeval-2017 [KLA17].</p>
          <p>Here the misclassi ed tweet is a reply including an
explicit expression of agreement with the author of the
source tweet (\I join you"). Tweet (s2) is one of the
rare cases of source tweets denying the rumor (source
tweets in the RumourEval17 dataset are mostly
supporting the rumor at issue). Our hypothesis is that it
is di cult for a system to detect such kind of stance
without a deeper comprehension of the argumentation
context (e.g., if the author's stance is denying the
rumor, and I agree with him, then I am denying the
rumor as well). In general, we observed that when the
source tweet is annotated by the deny label, most of
denying replies of the thread include features typical
of the support class (and vice versa), and this was a
criticism.</p>
          <p>Mixed cases. Furthermore, we found some
borderline mixed cases in the gold standard annotation. See
for instance the following case:</p>
        </sec>
        <sec id="sec-4-22-3">
          <title>Rumour: Ferguson</title>
          <p>Misclassi ed tweet:
(mx1) @MichaelSkolnik @MediaLizzy Oh
do tell where they keep track of "vigilante"
stats. That's interesting.</p>
          <p>Misclassi cation type: query (gold)
comment (prediction)
Source tweet:
(s3) Every 28 hours a black male is killed
in the United States by police or vigilantes.
#Ferguson
Tweet (mx1) is annotated with a query label rather
than as a comment (our system prediction), but we
can observe the presence of a comment (\That's
interesting") after the request for clari cation, so it seems
to be a kind of mixed case, where both labels make
sense.</p>
        </sec>
      </sec>
      <sec id="sec-4-23">
        <title>Citation of the source's tweet. We have noticed</title>
        <p>many misclassi ed cases of replying tweets with
error pattern support (gold) comment (our
prediction), where the text contains a literal citation of the
source tweet, like in the following tweet: THIS HAS
TO END \@MichaelSkolnik: Every 28 hours a black
male is killed in the United States by police or
vigilantes. #Ferguson" (the text enclosed in quotes is the
source tweet). Such kind of mistakes could be maybe
addressed by applying some pre-processing to the data,
for instance by detecting the literal citation and
replacing it with a marker.</p>
      </sec>
      <sec id="sec-4-24">
        <title>Figurative language devices. Finally, the use of</title>
        <p>gurative language (e.g., sarcasm) is also an issue that
should be considered for the future work. Let us
consider for instance the following misclassi ed tweets:</p>
        <sec id="sec-4-24-1">
          <title>Rumour: Hillary's Illness</title>
          <p>Misclassi ed tweets:
(fg1) @mitchellvii True, after all she can open
a pickle jar.
(fg2) @mitchellvii Also, except for having
a 24/7 MD by her side giving her
Valium injections, Hillary is in good health!
https://t.co/GieNxwTXX7
(fg3) @mitchellvii @JoanieChesnutt At the
very peak yes, almost time to go down a cli
and into the earth.</p>
          <p>Misclassi cation type: support (gold)
comment (prediction)
Source tweet:
(s4) Except for the coughing, fainting,
apparent seizures and "short-circuits," Hillary is in
the peak of health.</p>
          <p>All misclassi ed tweets (fg1-fg3) from the Hillary's
illness data are replies to a source tweet (s4), which is
featured by sarcasm. In such replies authors support
the rumor by echoing the sarcastic tone of the source
tweet. Such more sophisticated cases, where the
supportive attitude is expressed in an implicit way, were
challenging for our classi er, and they were quite
systematically misclassi ed as simple comments.
5</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>In this paper we proposed a new classi cation model
for rumour stance classi cation. We designed a set
of features including structural, conversation-based,
a ective and dialogue-act based feature.
Experiments on the SemEval-2017 Task 8 Subtask A dataset
show that our system based on a limited set of
wellengineered features outperforms the state-of-the-art
systems in this task, without relying on the use of
sophisticated deep learning approaches. Although
achieving a very good result, several research
challenges related to this task are left open. Class
imbalance was recognized as one the main issues in this
task. For instance, our system was struggling to
detect the deny class in the original dataset distribution,
but it performed much better in that respect when we
balanced the distribution across the classes.</p>
      <p>A re-run of the RumourEval shared task has been
proposed at SemEval 20197 and it will be very
interesting to participate to the new task with an evolution
of the system here described.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgements</title>
      <p>Endang Wahyu Pamungkas, Valerio Basile and
Viviana Patti were partially funded by Progetto di
Ateneo/CSP 2016 (Immigrants, Hate and Prejudice in
Social Media, S1618 L2 BOSC 01).</p>
      <p>7http://alt.qcri.org/semeval2019/
[ADB17]</p>
      <sec id="sec-6-1">
        <title>Ahmet Aker, Leon Derczynski, and Kalina</title>
        <p>
          Bontcheva. Simple open stance
classication for rumour analysis.
          <xref ref-type="bibr" rid="ref5">In Proc.
of RANLP 2017</xref>
          , pages 31{39.
          <xref ref-type="bibr" rid="ref5">INCOMA
Ltd., 2017</xref>
          .
        </p>
      </sec>
      <sec id="sec-6-2">
        <title>Margaret M Bradley and Peter J Lang.</title>
        <p>A ective norms for english words (anew):
Instruction manual and a ective ratings.</p>
        <p>Technical report, Technical Report C-1,
The Center for Research in
Psychophysiology, University of Florida., 1999.
[COR14] Erik Cambria, Daniel Olsher, and Dheeraj
Rajagopal. Senticnet 3: a common
and common-sense knowledge base for
cognition-driven sentiment analysis. In</p>
        <p>Proc. of AAAI 2014, 2014.
[Ekm92]
[KLA17]</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [DBL+17]
          <string-name>
            <surname>Leon</surname>
            <given-names>Derczynski</given-names>
          </string-name>
          , Kalina Bontcheva, Maria Liakata, Rob Procter, Geraldine Wong Sak Hoi, and Arkaitz Zubiaga.
          <article-title>Semeval2017 task 8: Rumoureval: Determining rumour veracity and support for rumours</article-title>
          .
          <source>In Proc. of SemEval-2017</source>
          , pages
          <fpage>69</fpage>
          {
          <fpage>76</fpage>
          .
          <string-name>
            <surname>ACL</surname>
          </string-name>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [Gra17]
          <string-name>
            <given-names>Paul</given-names>
            <surname>Ekman</surname>
          </string-name>
          .
          <article-title>An argument for basic emotions</article-title>
          .
          <source>Cognition &amp; emotion, 6</source>
          (
          <issue>3</issue>
          -4):
          <volume>169</volume>
          {
          <fpage>200</fpage>
          ,
          <year>1992</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <given-names>David</given-names>
            <surname>Graf</surname>
          </string-name>
          . Semeval-2017-t8,
          <year>June 2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <given-names>Elena</given-names>
            <surname>Kochkina</surname>
          </string-name>
          , Maria Liakata, and
          <string-name>
            <given-names>Isabelle</given-names>
            <surname>Augenstein</surname>
          </string-name>
          . Turing at SemEval
          <article-title>-2017 Task 8: Sequential Approach to Rumour Stance Classi cation with Branch-LSTM.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <source>In Proc. of SemEval-2017</source>
          , pages
          <fpage>475</fpage>
          {
          <fpage>480</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>ACL</surname>
          </string-name>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [LFPR16]
          <article-title>Mirko Lai, Delia Irazu Hernandez Far as, Viviana Patti, and Paolo Rosso. Friends and enemies of Clinton and Trump: using context for detecting stance in political tweets</article-title>
          .
          <source>In Proc. of MICAI</source>
          <year>2016</year>
          , volume
          <volume>10061</volume>
          <source>of LNCS</source>
          , pages
          <volume>155</volume>
          {
          <fpage>168</fpage>
          . Springer,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [MKS+16]
          <string-name>
            <surname>Saif</surname>
            <given-names>Mohammad</given-names>
          </string-name>
          , Svetlana Kiritchenko, Parinaz Sobhani,
          <string-name>
            <surname>Xiao-Dan Zhu</surname>
            , and
            <given-names>Colin</given-names>
          </string-name>
          <string-name>
            <surname>Cherry</surname>
          </string-name>
          .
          <article-title>Semeval-2016 task 6: Detecting stance in tweets</article-title>
          .
          <source>In Proc. of SemEval</source>
          <year>2016</year>
          , pages
          <fpage>31</fpage>
          {
          <fpage>41</fpage>
          .
          <string-name>
            <surname>ACL</surname>
          </string-name>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>[MT13] Saif M Mohammad and Peter D Turney</surname>
          </string-name>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <article-title>Crowdsourcing a word{emotion association lexicon</article-title>
          .
          <source>Computational Intelligence</source>
          ,
          <volume>29</volume>
          (
          <issue>3</issue>
          ):
          <volume>436</volume>
          {
          <fpage>465</fpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [OST57]
          <article-title>[PFB01] [Plu01] [PVV13] Nicole Novielli</article-title>
          and
          <string-name>
            <given-names>Carlo</given-names>
            <surname>Strapparava</surname>
          </string-name>
          .
          <article-title>The role of a ect analysis in dialogue act identi cation</article-title>
          .
          <source>IEEE Transactions on A ective Computing</source>
          ,
          <volume>4</volume>
          (
          <issue>4</issue>
          ):
          <volume>439</volume>
          {
          <fpage>451</fpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <given-names>C.E.</given-names>
            <surname>Osgood</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.J.</given-names>
            <surname>Suci</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.H.</given-names>
            <surname>Tenenbaum</surname>
          </string-name>
          .
          <article-title>The Measurement of meaning</article-title>
          . University of Illinois Press, Urbana:,
          <year>1957</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>James W Pennebaker</surname>
          </string-name>
          , Martha E Francis, and
          <string-name>
            <surname>Roger</surname>
          </string-name>
          J Booth.
          <article-title>Linguistic inquiry and word count (LIWC): LIWC 2001</article-title>
          . Mahway: Lawrence Erlbaum Associates,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [PGH+13]
          <string-name>
            <surname>Soujanya</surname>
            <given-names>Poria</given-names>
          </string-name>
          , Alexander Gelbukh, Amir Hussain, Newton Howard,
          <string-name>
            <surname>Dipankar Das</surname>
            , and
            <given-names>Sivaji</given-names>
          </string-name>
          <string-name>
            <surname>Bandyopadhyay</surname>
          </string-name>
          .
          <article-title>Enhanced senticnet with a ective labels for conceptbased opinion mining</article-title>
          .
          <source>IEEE Intelligent Systems</source>
          ,
          <volume>28</volume>
          (
          <issue>2</issue>
          ):
          <volume>31</volume>
          {
          <fpage>38</fpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>American</surname>
          </string-name>
          scientist,
          <volume>89</volume>
          (
          <issue>4</issue>
          ):
          <volume>344</volume>
          {
          <fpage>350</fpage>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <article-title>Reading the riots on twitter: methodological innovation for the analysis of big data</article-title>
          .
          <source>International journal of social research methodology</source>
          ,
          <volume>16</volume>
          (
          <issue>3</issue>
          ):
          <volume>197</volume>
          {
          <fpage>214</fpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [SSW+17]
          <string-name>
            <surname>Kai</surname>
            <given-names>Shu</given-names>
          </string-name>
          , Amy Sliva, Suhang Wang,
          <string-name>
            <given-names>Jiliang</given-names>
            <surname>Tang</surname>
          </string-name>
          , and Huan Liu.
          <article-title>Fake news detection on social media: A data mining perspective</article-title>
          .
          <source>ACM SIGKDD Explorations Newsletter</source>
          ,
          <volume>19</volume>
          (
          <issue>1</issue>
          ):
          <volume>22</volume>
          {
          <fpage>36</fpage>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [VRA18]
          <string-name>
            <given-names>Soroush</given-names>
            <surname>Vosoughi</surname>
          </string-name>
          , Deb Roy, and
          <string-name>
            <given-names>Sinan</given-names>
            <surname>Aral</surname>
          </string-name>
          .
          <article-title>The spread of true and false news online</article-title>
          .
          <source>Science</source>
          ,
          <volume>359</volume>
          (
          <issue>6380</issue>
          ):
          <volume>1146</volume>
          {
          <fpage>1151</fpage>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [Whi09]
          <string-name>
            <given-names>Cynthia</given-names>
            <surname>Whissell</surname>
          </string-name>
          .
          <article-title>Using the revised dictionary of a ect in language to quantify the emotional undertones of samples of natural language</article-title>
          .
          <source>Psychological reports</source>
          ,
          <volume>105</volume>
          (
          <issue>2</issue>
          ):
          <volume>509</volume>
          {
          <fpage>521</fpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [ZLP+15]
          <string-name>
            <surname>Arkaitz</surname>
            <given-names>Zubiaga</given-names>
          </string-name>
          , Maria Liakata, Rob Procter, Kalina Bontcheva, and
          <string-name>
            <given-names>Peter</given-names>
            <surname>Tolmie</surname>
          </string-name>
          .
          <article-title>Towards detecting rumours in social media</article-title>
          .
          <source>In AAAI Workshop: AI for Cities</source>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [ZLP+16]
          <string-name>
            <surname>Arkaitz</surname>
            <given-names>Zubiaga</given-names>
          </string-name>
          , Maria Liakata, Rob Procter, Geraldine Wong Sak Hoi, and
          <string-name>
            <given-names>Peter</given-names>
            <surname>Tolmie</surname>
          </string-name>
          .
          <article-title>Analysing how people orient to and spread rumours in social media by looking at conversational threads</article-title>
          .
          <source>PloS one</source>
          ,
          <volume>11</volume>
          (
          <issue>3</issue>
          ):e0150989,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>