<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Work-
shops October</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>False News Classi cation and Dissemination: The Case of the 2019 Indonesian Presidential Election</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Statistics</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Frank W. Takes</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Rayan Suryadikara</institution>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Suzan Verberne</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2020</year>
      </pub-date>
      <volume>1</volume>
      <fpage>9</fpage>
      <lpage>20</lpage>
      <abstract>
        <p />
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>In this paper we investigate automated
methods for understanding false news
dissemination on Twitter in relation to one particular
event: the 2019 Indonesian presidential
election. We collected a sample of 2,360 tweets
related to topics addressed by fact-checking
websites. The tweets were hand-labeled
according to their trustworthiness. We trained
several classi cation models on the
humanlabelled data, using three groups of text
features. The word n-gram features appeared to
be the most e ective, reaching a recall of 85%
for true news and 62% for false news. With
this classi er we labeled a larger sample of
tweets related to fact-checking topics in the
context of the 2019 Indonesian presidential
elections. We then analysed the dissemination
of true news and false news in the underlying
Twitter network using community detection
and centrality measures. The top in uential
users in the network disseminate more false
news, including a government institution
account and a veri ed politician's account. Our
results show that the combination of text
features and social network analysis can provide
valuable insights in detecting and preventing
the dissemination of false news. Moreover, we
make the dataset used in this research
available for reuse by the community.</p>
      <p>Copyright c by the paper's authors. Use permitted under
Creative Commons License Attribution 4.0 International (CC BY
4.0). CEUR Workshop Proceedings (CEUR-WS.org).</p>
    </sec>
    <sec id="sec-2">
      <title>Introduction</title>
      <p>
        A recent study strictly de ned fake news as news
articles that are intentionally and veri ably false and
could therefore mislead readers [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. In a political
context the de nition can be considered a bit wider. One
study argues that politicians tend to label any news
sources which do not support their positions as fake
news [
        <xref ref-type="bibr" rid="ref24">23</xref>
        ]. This is especially common in the context of
a large political event, e.g., an election. For example,
there was an allegation that Joko Widodo was both
a communist and Chinese in the Indonesia 2014
presidential election [
        <xref ref-type="bibr" rid="ref11">10</xref>
        ]. In this paper, we focus on the
2019 presedential election in Indonesia.
      </p>
      <p>
        Social media ourishes as an alternative
information source, in particular during elections, where many
politicians utilize social media as means to reach out to
the public more directly. Politicians prefer Twitter
because of its e ciency in spreading messages, sparking
conversations, building public opinion, or gaining
support [
        <xref ref-type="bibr" rid="ref20">19</xref>
        ]. Especially in volatile political times, there
are so-called buzzer teams that attempt to amplify
messages and creates a \buzz" on social networks to
spread positive content about one side of the political
spectrum, while disseminating negative content about
the other [
        <xref ref-type="bibr" rid="ref12">11</xref>
        ]. Hashtags are often used to increase
their visibility to Indonesian Twitter users, which
often become trending topics that then gain even more
attention [
        <xref ref-type="bibr" rid="ref12">11</xref>
        ].
      </p>
      <p>Because of these problems and their political
impact, there is an urgent need to automatically identify
and analyze false news in social media. This process
could then result in the identi cation of the actors
involved, as well as their networks that disseminated the
false news. This research studies how false news can be
detected based on the content of the messages posted,
and then analyses its dissemination using social
network analysis. The particular case that is considered
is the 2019 Indonesian presidential election on
Twitter, for which data was manually gathered and labeled
in light of this study.</p>
      <p>The contributions of this paper are:
A new hand-labeled dataset of 2,360 tweets for the
detection of false news in the Indonesian language;
A method based on word features that can
reasonably distinguish true news and false news in
this data.</p>
      <p>An analysis of how true news and false news
disseminate in the Twitter network related to the
2019 Indonesian elections, and what role
particular communities, accounts, and hashtags play in
the dissemination of false news.</p>
      <p>The remainder of the paper is organized as follows.
In Section 2 we discuss related work. In Section 3
we introduce the data and the annotation process. In
Section 4 we present the methods we use, followed by
experimental results in Section 5. Finally, the
conclusions of the research are outlined in Section 6.
2</p>
    </sec>
    <sec id="sec-3">
      <title>Related Work</title>
      <p>In this section, we discuss work on false news on
social media as well as methods for identifying this false
news.</p>
      <p>
        A recent study examined fake news from a political
perspective, inspired by the 2016 US presidential
elections [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. They di erentiated fake news and its close
cousins in the political subject: unintentional
reporting mistakes, rumors, conspiracy theories, satires, false
statements by politicians, and slanted or misleading
reports. The nature of the political world itself where a
great number of critical reports have been discredited
as fake news leads to rede ning fake news which spread
on social media [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. A relevant study by Vosoughi et
al. [
        <xref ref-type="bibr" rid="ref24">23</xref>
        ] focused on the veracity of Twitter posts which
have been true or false.
      </p>
      <p>
        In addition, they also de ned news (either true or
false) as any story or claim with an assertion in it,
especially in social media. This extends the de
nition scope of false news from `intentional'
characteristics, allowing to incorporate aforementioned fake news'
close cousins [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] into a single term. Therefore, the
`false news' term will be used throughout the paper
which incorporates fake news and its close cousins.
      </p>
      <p>
        In the text classi cation eld for Indonesian
language, most research focuses on hate speech
identication. One of the rst researches on Indonesian
hate speech was conducted with multiple text features
(character n-grams and negative sentiment) and
classi ers (Naive Bayes, SVM, and Random Forest) [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
This research and data set were expanded with adding
abusive language and hate speeches' target and levels
[
        <xref ref-type="bibr" rid="ref9">8</xref>
        ]. However, there has not been conducted research to
detect false news in the Indonesian language, despite
they are usually associated with hate speech.
      </p>
      <p>
        A study analyzed Australia's Department of
Immigration and Citizenship (DIAC) Twitter data to
identify topics over the DIAC Twitter account and
the spread of tweets, particularly the most retweeted
tweets [26]. Another study further explored the
analysis by taking the mention feature into account and
term co-occurrence analysis with Korean Presidential
Election on Twitter [
        <xref ref-type="bibr" rid="ref19">18</xref>
        ]. It marked the possibility to
analyse the real political situation from the social
network. On the other hand, one research utilized and
built hashtag co-occurrence graph [
        <xref ref-type="bibr" rid="ref25">24</xref>
        ] to discover
semantic relations between words in a tweet.
      </p>
      <p>
        Another study [
        <xref ref-type="bibr" rid="ref8">7</xref>
        ] investigated lter bubble e ects
which tend to be generated by recommender systems
that personalize and lter tweets via community
detection. Regarding in uential actors in a network, a
recent study with the main topic is the 2014 Malaysian
oods [
        <xref ref-type="bibr" rid="ref15">14</xref>
        ] utilized betweenness centrality to identify
the potentially key Twitter users during information
dissemination. Another study analyses false news
based on the impact of emotion [
        <xref ref-type="bibr" rid="ref6">5</xref>
        ] or the pro ling
of Twitter users [
        <xref ref-type="bibr" rid="ref5">4</xref>
        ].
      </p>
      <p>While these works present the analysis of lter
bubbles or the in uential users, our study will utilize
actual true news and false news labels of news messages
to assess which type of news is circulated inside certain
communities and/or spread of particular in uential
actors.
3
3.1</p>
    </sec>
    <sec id="sec-4">
      <title>Data</title>
      <sec id="sec-4-1">
        <title>Data collection</title>
        <p>For crawling tweets we use the GetOldTweets Library1
to bypass the limitations of the o cial Twitter API.
This allows us to to download historical Twitter data
within a speci c date range for a particular query. The
queries we used for crawling Twitter data are built
on topics that were published by two Indonesian
factchecking websites2. The tweets are in the Indonesian
language. We gathered data from the rst day of the
2019 Indonesian presidential campaign (September 23,
2018) to a week after the election result was publicized
(May 28, 2019).</p>
        <p>We selected 281 topics related to the presidential
elections from the above referenced fact-checking
websites with their corresponding supporting URLs. For
each topic we created a query. For example, for the
supporting URL that examines whether the 23
European Union ambassadors support Prabowo-Sandi or
1https://github.com/Jefferson-Henrique/
GetOldTweets-python</p>
        <p>2https://cekfakta.tempo.co/ (Cek Fakta Tempo from
Tempo) and https://turnbackhoax.id/ (Turn Back Hoax from
Ma ndo)
not3, we used the topic \European Ambassadors
Support Prabowo" as the query to extract the relevant
tweets.</p>
        <p>To ensure alignment between the extracted tweets
and the supporting URL, tweets from the rst time
the news aired in social media until its seventh day
are selected. After removal of duplicate tweets, this
resulted in a set of 8,784 tweets for the 281 topics. For
annotation, tweets that one retweet, one like, and one
reply, or less are removed resulting in a set of 2,360
that we use for annotation.
3.2</p>
      </sec>
      <sec id="sec-4-2">
        <title>Annotation</title>
        <p>We recruited 10 native Indonesian speakers to
annotate the data. They do not have political job, political
a liation, or belong to a political party to facilitate
the impartiality. Having 2,360 tweets as original data
set, and two annotators per tweet, each annotator had
to label 472 tweets.</p>
        <p>The information provided to the annotators was the
topic, the supporting URL, and the tweet text. One
topic is linked to one supporting URL and to multiple
tweets. We wrote an extensive annotation guideline
for Indonesian false news and validated it in several
short iterations before starting the actual annotation
process. 4 Annotators are asked to assign one of four
classes to each tweet:</p>
        <p>True: Tweets that relate to the topic and are true
or accurate according to the supporting URLs;
False: Tweets that relate to the topic and are false
or inaccurate according the to supporting URLs;
Misleading: Tweets that relate to the topic and
have accurate information according to
supporting URLs but lead to wrong conclusions;
Other: Tweets that do not relate to the topic or
are not discussed within supporting URLs.</p>
        <p>
          While misleading news is sometimes considered a
subset of false news, we decided to distinguish it
separately for text classi cation. According to [
          <xref ref-type="bibr" rid="ref22">21</xref>
          ],
misleading news tends to use correct facts and data, but
how the news is delivered or how conclusions are drawn
is false and therefore leads to the wrong interpretation.
This is consistent with other de nitions that
misleading news conceives false facts by topic changes,
irrelevant information, and equivocations to mislead the
audience [
          <xref ref-type="bibr" rid="ref23">22</xref>
          ].
        </p>
        <p>3https://cekfakta.tempo.co/fakta/111/fakta-atau-hoaxbenarkah-23-dubes-uni-eropa-dukung- prabowo-sandi,
determined to be false news</p>
        <p>4The annotation guideline can be found here: https:
//github.com/rayansuryadikara/false_news_detection_and_
dissemination_analysis</p>
      </sec>
      <sec id="sec-4-3">
        <title>Class</title>
        <sec id="sec-4-3-1">
          <title>True News False News Misleading News Other</title>
        </sec>
      </sec>
      <sec id="sec-4-4">
        <title>Total</title>
        <p>896
648
189
627
2,360</p>
        <p>The annotation process was conducted in two
stages. In the rst stage, two annotators annotated
the data. In the second stage, a third annotator (the
rst author of this paper) acted as a nal judge for
any tweet where two previous annotators disagreed.
We analyzed the inter-rater reliability of the
annotated data using Cohen's . Out of 10 annotator
pairs, there are ve pairs with moderate agreement
( = 0:41 0:60), four pairs with fair agreement
( = 0:21 0:40), and one pair with slight agreement
( = 0:01 0:20). The highest score is 0.52 and the
lowest is 0.07. As a whole, we obtain fair agreement
with a mean of 0.33. The statistics of the annotated
data are outlined in Table 1.
3.3</p>
      </sec>
      <sec id="sec-4-5">
        <title>Network Data</title>
        <p>
          We extract two di erent networks from our Twitter
collection of 8,748 tweets. The rst is the mention
network. In literature, it is suggested that
mentioning other usernames in a tweet represents a more
direct form of communication than what is obtained
from a network based on follower connections [
          <xref ref-type="bibr" rid="ref19">18</xref>
          ].
The second network that we create is the hashtag
co-occurrence network The frequency of use for a
hashtag indicates its popularity. In the 2019
Indonesian presidential election, there are certain hashtags
created to support or oppose certain gures, such as
#jokowiamin to support Joko Widodo, the incumbent,
and #2019gantipresiden (\2019 change the
president") to support Prabowo, the challenger.
        </p>
        <p>The mention network is a weighted directed network
where posting usernames are de ned as the source
and mentioned usernames are the target of a directed
link. Link weight is determined by how many times
the source username mentions the target username.
The hashtag co-occurrence network is a weighted
undirected network in which two hashtags are connected if
they occur together in a tweet. Link weight is
determined by counting how many times the tags co-occur.</p>
        <p>In our experiments, we visualize the two networks
to analyse how true news and false news spread in
presidential election settings. For the network data,
misleading news will be merged under false news to
keep it straightforward and to simplify the
contrasting visualization between true news and false news. In
doing so, we actually model both networks as a
multigraph in which two nodes can be connected based on
how often they communicate or co-occur in both true
and fake news.
4</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Methods</title>
      <p>In this section, we rst present our text classi cation
methods using three di erent content-based feature
sets (Section 4.1) and voting ensembles to combine the
feature representations. Next, we present the network
analysis features that we use to analyze the
dissemination of true and false news in the Twitter network
(Section 4.2).
4.1</p>
      <sec id="sec-5-1">
        <title>Text classi cation</title>
        <p>Features. For the content-based classi cation, we
compare three types of features: orthography features,
sentiment lexicon features, and word n-grams.</p>
        <p>
          Social media such as Twitter is a common
example wherein there the conventions of orthographies are
sometimes lacking [
          <xref ref-type="bibr" rid="ref7">6</xref>
          ]. Therefore, orthography
patterns are commonly used for social media analysis
[
          <xref ref-type="bibr" rid="ref18 ref9">8, 17</xref>
          ]. We de ne ve orthography features: counts
of exclamation marks (E), question marks (Q),
uppercase letters (U), lowercase letters (L), and emojis (M).
        </p>
        <p>
          For sentiment features, we use the Indonesian
Sentiment Lexicon (InSet) [
          <xref ref-type="bibr" rid="ref10">9</xref>
          ] which comprises 3,609
positive words and 6,609 negative words5. The
sentiment scores range from -5 to 5, where negative scores
indicate negative words and positive scores indicate
positive words. Words with score 0 are disregarded
since the lexicon excludes neutral category. Along with
InSet, we use an Indonesian abusive lexicon [
          <xref ref-type="bibr" rid="ref9">8</xref>
          ],which
comprises 126 words that are considered abusive.6
Thus, we have three sentiment lexicon features: the
positive word count (P), the negative word count (N),
and the abusive word count (A). Before applying the
sentiment lexicons, we apply stop words removal and
text normalization7. The stop words dictionary is
adopted from [
          <xref ref-type="bibr" rid="ref21">20</xref>
          ].8 The text normalization dictionary
comprises of 11,034 terms which are mapped to a
normalized form. The dictionary is a continuous,
collective work from researches [
          <xref ref-type="bibr" rid="ref1 ref17 ref9">1, 8, 16</xref>
          ] on the Indonesian
language. In addition to lemmatization, the
dictionary also facilitates Indonesian abbreviations, slangs,
misspelled words, and even political gures' names.
Therefore, the normalized form often consists of more
than one word.
        </p>
        <p>
          For the word n-gram features the text was
lowercased, and URLs and punctuation were removed. For
mentioned usernames and hashtags, we removed the
@ and # symbols while the usernames and the
hashtag words themselves were kept because both are
instrumental parts of tweets to be identi ed and
distinguished [
          <xref ref-type="bibr" rid="ref14 ref16">13, 15</xref>
          ]. Some of the usernames and hashtags
are also included in the text normalization dictionary
and therefore are normalized as well. We used six
subsets of word n-grams to create vocabularies: Unigram,
bigram, trigram, uni-bigram, bi-trigram, and
uni-bitrigram. In all n-gram feature sets we use tf-idf as
term weight.
        </p>
      </sec>
      <sec id="sec-5-2">
        <title>Classi cation models. We used the same clas</title>
        <p>
          si ers as prior work on Indonesian text classi cation
[
          <xref ref-type="bibr" rid="ref1 ref9">1, 8</xref>
          ]: Multinomial Naive Bayes (MNB), Support
Vector Machines (SVM) with SGD optimization [
          <xref ref-type="bibr" rid="ref26">25</xref>
          ], and
Random Forest (RF), all implemented in Scikit-learn.
We used the default hyperparameter settings for each
classi er. For SVM, this means that C = 1. For
RF, the number of estimators is 100 with no
maximum depth for the trees. The nal precision and
recall scores of each set of text feature are the average
scores of these three classi ers. Meanwhile, F1 scores
are calculated according to average precision and recall
scores.
        </p>
        <p>Voting ensembles. We assembled the results of
from each experiment with di erent text features. The
nal precision, recall, and F1 scores of each ensemble
follow the same approach with the text feature sets
after the voting ensemble is performed. We use majority
voting: the numbers for each label are compared and
the most voted label is selected. If there is not one
label with the most votes, the class will be determined
according to a text feature that has the best
performance. We construct two di erent ensembles:
Ensemble I is arranged from all combinations of each feature,
Ensemble II is arranged from the best combination of
each feature.
4.2</p>
      </sec>
      <sec id="sec-5-3">
        <title>Social network analysis</title>
        <p>
          We aim to analyse how true news and false news spread
between actors in the two networks described in
Section 3.3. For visualization, we use Gephi [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ], an
opensource tool for social network analysis. While we do
not directly model the precise di usion of the news
5https://github.com/fajri91/InSet as the network evolves, we do believe that these two
6https://github.com/okkyibrohim/ methods provide crucial insights in the reach of di
erid-multi-label-hate-speech-and-abusive-language-detection ent types of news and the network e ects involved in
7https://github.com/okkyibrohim/ the process.
id-multi-label-hate-speech-and-abusive-language-detection/
blob/master/new_kamusalay.csv Community detection is a method capable of
8https://github.com/stopwords-iso/ partitioning the network into communities (more
tightly connected groups with fewer connections to
other communities). Here, we use the well-known
Louvain modularity maximization algorithm to perceive
the potential of lter bubble e ects in a community [
          <xref ref-type="bibr" rid="ref8">7</xref>
          ].
        </p>
        <p>
          Filter bubbles are a phenomenon in which a person is
exposed to ideas, people, facts, or news that adhere to
or are consistent with a particular political or social
ideology, leaving alternative ideas unconsidered and in
some cases outrightly rejected [
          <xref ref-type="bibr" rid="ref13">12</xref>
          ]. We propose to
systematically identify every community to see the type
of news circulating in that community.
        </p>
        <p>
          Centrality measures assign a ranking to nodes
in a network based on their topological position in the
network. Here, we choose to use betweenness
centrality to identify the most in uential nodes. Betweenness
centrality measures for a particular node how many
other nodes are connected via a shortest path that
runs through that node. Therefore for the mention
network, the node or username acts as an important
hub in receiving and spreading information to other
nodes [
          <xref ref-type="bibr" rid="ref15">14</xref>
          ]. On an individual node level, betweenness
centrality captures information from neighboring users
who both consume and generate false news. For the
hashtag co-occurrence network, the hashtag is also an
important hub where it frequently co-occurs lot with
other hashtags.
5
        </p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Results and analysis</title>
      <p>We rst present results on the comparison of the e
ectiveness of the three di erent text feature types
(Section 5.1). After nding the most e ective text features,
we investigate the dissemination of true and false news,
using the network analysis metrics (Section 5.2).
5.1</p>
      <sec id="sec-6-1">
        <title>Results | text classi cation</title>
        <p>Experimental settings. We evaluate our classi ers
in two di erent types of experimental settings. The
rst setting is the data set with three classes, namely
True News, False News, and Misleading News. The
second setting is the data set with four classes: True,
False, Misleading, Other, and Unclear (where the three
annotators all assigned a di erent label) While the
3class setting is easier for the classi er to learn, the
5-class setting is more realistic because it includes the
tweets that are irrelevant but will occur in a real
Twitter stream as well. We used a xed random train{test
split of the data for evaluation of the models, with 20%
of the data for testing.</p>
        <p>Comparison of feature sets. We nd that in the
3-class classi cation, the best n-gram feature set is the
combination of unigrams and bigrams; in the 5-class
classi cation the best n-gram feature set is the use of
bigrams alone. The best orthography feature set for
the 3-class classi cation is the feature set with counts
of exclamation marks, question marks, lowercase
letters, and emojis; for the 5-class classi cation having
the uppercase letter count instead of the question mark
count is the most e ective set. Of the sentiment
lexicons, using a combination of positive and negative
sentiment words gives the best results for both
settings. The assemble of the best feature combinations
performed the best in the 3-class, while the assemble
from all feature combinations performed the best in
the 5-class. We compare the best feature combination
for each feature type in Table 2.</p>
        <p>The table shows that the n-gram features
outperform orthographies and sentiment lexicons in each
setting and each class. The ensemble methods are also
not able to improve over the n-gram features alone.
Nevertheless, the ensembling method allows
orthography and sentiment lexicons to be included as features
in text classi cation with better performance than
independently, especially from social media sphere.</p>
        <p>Final quality of text classi cation With the
best text features in the 5-class setting (which is more
di cult, but also more realistic than the 3-class
setting), we obtain precision scores of 55% for true news,
71% of false news, and 68% for misleading news.
Recall is 85% for true news, 62% for false news, and 26%
for misleading news. The low recall for misleading
news is caused by the small number of items in this
category.</p>
        <p>We analyzed the full collection of 8,784 tweets where
the unannotated data set (6,424 tweets) is labelled
by the SVM classi er with SGD optimization in the
5-class setting with the best-performing feature set
(word bigrams). We then do the social network
analysis on the automatically labelled dataset, which we
discuss in the next section.
5.2</p>
      </sec>
      <sec id="sec-6-2">
        <title>Results | social network analysis</title>
        <p>Table 3 shows the counts of nodes and edges (full
network, and for true and false news) in the labelled
Twitter networks. The last line of the table shows the
number of communities. For the 10 largest communities,
the distribution of true and false news by community
as well as the top 10 in uential actors are shown in
Figure 1 and 2 for the mention network and in Figure
5 and 6 for the hashtag co-occurrence network.</p>
        <p>The distributions are stacked column of true news
and false news, listing the number of nodes and edges
in each discovered community or actor (usernames for
the mention network work and hashtags for the
hashtag co-occurrence network). True news is de ned by
blue color while false news is de ned by orange color.</p>
        <p>In the visualization, communities are represented by
colours and betweenness centrality determined node
size, as shown in Figure 3 and 4 for the mentioned
net</p>
        <sec id="sec-6-2-1">
          <title>3 Classes</title>
        </sec>
        <sec id="sec-6-2-2">
          <title>5 Classes</title>
        </sec>
      </sec>
      <sec id="sec-6-3">
        <title>Features</title>
        <sec id="sec-6-3-1">
          <title>Uni-bigram</title>
          <p>EQLM
PN
Ensemble II
Bigram
EULM
PN
Ensemble I
P
work and Figure 7 and 8 for the hashtag co-occurrence
network. The visualization is formed by applying ego
network to the ego (determined username or hashtag)
within level 1 or its direct connection.</p>
          <p>Mention network Based on the analysis of the
mention network for the 2019 Indonesian presidential
elections on Twitter, we nd that:</p>
          <p>False news is more prevalent in the largest
communities and also being disseminated and received
more by top in uential usernames. However,
there are still more communities with a balanced
proportion between true news and false news.
Many news source accounts are found in these
bal</p>
          <p>While the proportions of true news and false news
are quite balanced in general, some usernames
show a very strong tendency towards false news
over true news, in particular a veri ed government
institution account bawaslu ri (shown in Figure
3 and 4) and two unveri ed accounts, caknur14
and hamaro id. One predominantly \true news"
username is cnnindonesia, which is a veri ed
news source account.</p>
          <p>Veri ed accounts tend to spread more false news
than true news, where three of the top four
inuential usernames disseminate more false news
than true news. The two largest, bawaslu ri9
(shown in Figure 3 and 4) and gunromli10, are
veri ed and politically-related account.</p>
          <p>One of the top \true news" in uential usernames
is divhumas polri11. This is to be expected since
9The o cial account of an Indonesian government
institution.</p>
          <p>10The o cial account of an Indonesian politician.
11The o cial account of Indonesian republic police force.
they have a cyber division dedicated to ght back
hoax.</p>
        </sec>
      </sec>
      <sec id="sec-6-4">
        <title>Hashtag co-occurrence network Based on the</title>
        <p>analysis of the hashtag co-occurrence network for the
2019 Indonesian presidential elections on Twitter, the
interesting ndings are:</p>
        <p>True news is more strongly associated with top
in uential hashtags.</p>
        <p>
          False news is more strongly associated with
sentiment-induced hashtags than with
hashtags about events or occurrences.
Examples are 2019gantipresiden (2019 change
the president, shown in Figure 7 and 8),
indonesianeedsprabowo and 01jokowilagi (01
Jokowi again), which show support for both
candidates. These results con rm the nding of
previous work [
          <xref ref-type="bibr" rid="ref6">5</xref>
          ] that emotions are important in
detecting false information.
        </p>
        <p>There is a community formed (Community 3)
where only false news circulate in it. This
community is lled with many slandering
hashtags towards the incumbent Jokowi, such as
jaekingoflies (Jae is one of derogatory title to
Jokowi), jaengibuldimanalagi (Where does Jae
lie again) and uninstalljaenow. However, none
of them is a hashtag with enough in uence.
The inclined \true news" in uential hashtags
are very general terms and not directly about
the presidential election, such as hoax and
Indonesia. Hashtag hoax is especially
noteworthy because any tweet which includes this hashtag
mostly warns that the topic is a hoax, therefore
ghting back hoax and is categorized as true news.
The particular case of this hashtag was also
outlined in the annotation guideline.</p>
        <p>The mentioned-based network shows that the in
uential users are not only receive more false news, but
also spread them as well. These usernames consists of
unveri ed and veri ed ones, with the top two in
uential usernames are veri ed and \false news" inclined.
This indicates that accounts with veri cation mark are
not always clean from hoaxes.</p>
        <p>Meanwhile, the hashtag-based network shows that
supportive or sentiment-induced hashtags tend to
relate more with false news, rather than more general
events or terms. This indicates that these hashtags
are more prone to information bias. Especially the
supportive hashtags for each candidate, where users
show fanatic support and attack the opposite
candidate as well, often with false information.</p>
        <p>As a reminder, these results illustrate the
circumstances of the 2019 Indonesian presidential election
event on Twitter. Furthermore, the news are selected
based on fact-checking websites, which con rming
circulating, trending topics on social media whether it is
true or false.
6</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>Conclusions</title>
      <p>In this paper we trained classi ers for detecting false
news on Twitter and we analysed its dissemination
related to the 2019 Indonesian presidential elections. We
created a labelled dataset for true, false, and
misleading news that we publish for use by other researchers.12</p>
      <p>We found that the most prominent text feature to
detect and distinguish true news, false news, and
misleading news is word n-grams, in particular unigrams
and bigrams. We also experimented with orthography
features and sentiment features, but those did not
improve the n-gram baseline. Nevertheless, the ensemble
method allows the possibility to include and further
re ne these two text features in the future research.</p>
      <p>From the social network analysis perspective, we
found that the largest communities with top in
uential usernames tend to have more false news circulating
rather than true news. Some of these in uential users
are also veri ed accounts. Regarding the hashtags,
12The URL of the data repository will be added after
anonymous peer review.
the hashtags that relate to explicit support of an
election candidate occur more in false news messages than
hashtags related to general events. These supportive
or favouring hashtags tend to contain names or have
strong sentiments.</p>
      <p>In the 2019 Indonesian presidential election case,
our results show that the combination of text features
with social network analysis can provide valuable
insights for the study of false news on social media.
Hopefully these ndings pave the way for not only
detecting but also preventing the dissemination of false
news in elections.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>I.</surname>
          </string-name>
          <article-title>Al na et al</article-title>
          .
          <article-title>Hate Speech Detection in The Indonesian Language: A Dataset</article-title>
          and
          <string-name>
            <given-names>Preliminary</given-names>
            <surname>Study</surname>
          </string-name>
          .
          <source>2017 International Conference on Advanced Computer Science and Information Systems</source>
          ,
          <volume>233</volume>
          {
          <fpage>238</fpage>
          ,
          <year>October 2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>H.</given-names>
            <surname>Allcott</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Gentzkow</surname>
          </string-name>
          .
          <source>Social Media and Fake News in The 2016 Election. Journal of Economic Perspectives</source>
          ,
          <volume>31</volume>
          (
          <issue>2</issue>
          ):
          <volume>211</volume>
          {
          <fpage>236</fpage>
          , May
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>M.</given-names>
            <surname>Bastian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Heymann</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Jacomy</surname>
          </string-name>
          .
          <article-title>Gephi: An Open Source Software for Exploring and Manipulating Networks</article-title>
          .
          <source>Third International AAAI Conference on Weblogs and Social Media</source>
          ,
          <year>March 2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Duan</surname>
            , Xinhuan, Elham Naghizade, Damiano Spina,
            <given-names>and Xiuzhen</given-names>
          </string-name>
          <string-name>
            <surname>Zhang</surname>
          </string-name>
          .
          <article-title>"RMIT at PAN-CLEF 2020: Pro ling Fake News Spreaders on Twitter."</article-title>
          <source>CLEF</source>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>X.</given-names>
            <surname>Duan</surname>
          </string-name>
          et al.
          <source>RMIT at PAN-CLEF</source>
          <year>2020</year>
          :
          <article-title>Pro ling Fake News Spreaders on Twitter</article-title>
          .
          <source>CLEF</source>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>B.</given-names>
            <surname>Ghanem</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Rosso</surname>
          </string-name>
          and
          <string-name>
            <given-names>F.</given-names>
            <surname>Rangel</surname>
          </string-name>
          .
          <article-title>An Emotional Analysis of False Information in Social Media and News Articles</article-title>
          .
          <source>ACM Transactions on Internet Technology (TOIT)</source>
          ,
          <volume>20</volume>
          (
          <issue>2</issue>
          ):1{
          <fpage>18</fpage>
          ,
          <string-name>
            <surname>April</surname>
          </string-name>
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>K.</given-names>
            <surname>Gimpel</surname>
          </string-name>
          et al.
          <article-title>Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments</article-title>
          .
          <source>Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies</source>
          ,
          <volume>42</volume>
          {
          <fpage>47</fpage>
          ,
          <year>June 2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Q.</given-names>
            <surname>Grossetti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. Du</given-names>
            <surname>Mouza</surname>
          </string-name>
          and
          <string-name>
            <given-names>N.</given-names>
            <surname>Travers</surname>
          </string-name>
          .
          <article-title>Community-Based Recommendations on Twitter: Avoiding the Filter Bubble</article-title>
          .
          <source>International Conference on Web Information Systems Engineering</source>
          ,
          <volume>212</volume>
          {
          <fpage>227</fpage>
          ,
          <string-name>
            <surname>November</surname>
          </string-name>
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M. O.</given-names>
            <surname>Ibrohim</surname>
          </string-name>
          and
          <string-name>
            <given-names>I.</given-names>
            <surname>Budi</surname>
          </string-name>
          <article-title>. Multi-label Hate Speech and Abusive Language Detection in Indonesian Twitter</article-title>
          .
          <source>Proceedings of The Third Workshop on Abusive Language Online</source>
          ,
          <volume>46</volume>
          {
          <fpage>57</fpage>
          ,
          <year>August 2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>F.</given-names>
            <surname>Koto</surname>
          </string-name>
          and
          <string-name>
            <given-names>G. Y.</given-names>
            <surname>Rahmaningtyas</surname>
          </string-name>
          . Inset Lexicon:
          <article-title>Evaluation of A Word List for Indonesian Sentiment Analysis in Microblogs</article-title>
          .
          <source>2017 International Conference on Asian Language Processing</source>
          ,
          <volume>391</volume>
          {
          <fpage>394</fpage>
          ,
          <year>December 2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>K.</given-names>
            <surname>Lamb</surname>
          </string-name>
          .
          <article-title>Fake News Spikes in Indonesia ahead of Elections. www</article-title>
          .theguardian.com/world/2019/mar/20/fake -news
          <article-title>-spikes-in-indonesia-ahead-of-electio ns</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>K.</given-names>
            <surname>Lamb</surname>
          </string-name>
          .
          <article-title>'I felt disgusted': Inside Indonesia's Fake Twitter Account Factories</article-title>
          . www.theguardian.com/world/2018/jul/23/indo nesias-fake
          <article-title>-twitter-account-factories-jaka rta-politic.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>N.</given-names>
            <surname>Lum</surname>
          </string-name>
          .
          <article-title>The Surprising Di erence between Filter Bubble and Echo Chamber. www.medium.com/@nicklum/the-surprising-diff erence-between-filter-bubble-and-echo-chamb er-b909ef2542cc.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>N.</given-names>
            <surname>Naveed</surname>
          </string-name>
          et al.
          <article-title>Bad News Travel Fast: A Content-based Analysis of Interestingness on Twitter</article-title>
          .
          <source>Proceedings of the 3rd International Web Science Conference</source>
          ,
          <volume>1</volume>
          {
          <fpage>7</fpage>
          ,
          <string-name>
            <surname>June</surname>
          </string-name>
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>A. T.</given-names>
            <surname>Olanrewaju</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Rahayu</surname>
          </string-name>
          .
          <article-title>Examining The Information Dissemination Process on Social Media during The Malaysia 2014 Floods Using Social Network Analysis</article-title>
          .
          <source>Journal of Information and Communication Technology</source>
          ,
          <volume>17</volume>
          (
          <issue>1</issue>
          ):
          <volume>141</volume>
          {
          <fpage>166</fpage>
          ,
          <year>January 2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Ruan</surname>
          </string-name>
          et al.
          <source>Prediction of Topic Volume on Twitter. Proceedings of the 4th International ACM Conference on Web Science</source>
          ,
          <volume>397</volume>
          {
          <fpage>402</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>N. A.</given-names>
            <surname>Salsabila</surname>
          </string-name>
          et al.
          <source>Colloquial Indonesian Lexicon</source>
          .
          <source>2018 International Conference on Asian Language Processing</source>
          ,
          <volume>226</volume>
          {
          <fpage>229</fpage>
          ,
          <string-name>
            <surname>November</surname>
          </string-name>
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>M. S.</given-names>
            <surname>Saputri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Mahendra</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Adriani</surname>
          </string-name>
          .
          <source>Emotion classi cation on indonesian Twitter Dataset</source>
          .
          <source>2018 International Conference on Asian Language Processing</source>
          ,
          <volume>90</volume>
          {
          <fpage>95</fpage>
          ,
          <string-name>
            <surname>November</surname>
          </string-name>
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>M.</given-names>
            <surname>Song</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. C.</given-names>
            <surname>Kim</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Y. K.</given-names>
            <surname>Jeong. Analyzing</surname>
          </string-name>
          <article-title>The Political Landscape of 2012 Korean Presidential Election in Twitter</article-title>
          .
          <source>IEEE Intelligent Systems</source>
          ,
          <volume>29</volume>
          (
          <issue>2</issue>
          ):
          <volume>18</volume>
          {
          <fpage>26</fpage>
          ,
          <year>June 2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>S.</given-names>
            <surname>Suraya</surname>
          </string-name>
          and
          <string-name>
            <given-names>F. E. D.</given-names>
            <surname>Kadju</surname>
          </string-name>
          .
          <article-title>Jokowi Versus Prabowo Presidential Race for 2019 General Election on Twitter</article-title>
          .
          <source>Saudi Journal of Humanities and Social Sciences</source>
          ,
          <volume>4</volume>
          (
          <issue>3</issue>
          ):
          <volume>198</volume>
          {
          <fpage>212</fpage>
          ,
          <string-name>
            <surname>April</surname>
          </string-name>
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>F. Z.</given-names>
            <surname>Tala</surname>
          </string-name>
          .
          <article-title>A Study of Stemming E ects on Information Retrieval in Bahasa Indonesia. Institute for Logic, Language</article-title>
          and Computation Universiteit van Amsterdam,
          <year>December 2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [21]
          <string-name>
            <surname>Tempo</surname>
          </string-name>
          . Metodologi. www.cekfakta.tempo.co/metodologi.
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>S.</given-names>
            <surname>Volkova</surname>
          </string-name>
          and
          <string-name>
            <given-names>J. Y.</given-names>
            <surname>Jang</surname>
          </string-name>
          .
          <article-title>Misleading or Falsication: Inferring Deceptive Strategies and Types in Online News</article-title>
          and
          <string-name>
            <given-names>Social</given-names>
            <surname>Media</surname>
          </string-name>
          .
          <source>Companion Proceedings of The Web Conference</source>
          <year>2018</year>
          ,
          <volume>29</volume>
          (
          <issue>2</issue>
          ):
          <volume>575</volume>
          {
          <fpage>583</fpage>
          ,
          <string-name>
            <surname>April</surname>
          </string-name>
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>S.</given-names>
            <surname>Vosoughi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Roy</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Aral</surname>
          </string-name>
          .
          <source>The Spread of True and False News Online. Science</source>
          ,
          <volume>359</volume>
          (
          <issue>6380</issue>
          ):
          <volume>1146</volume>
          {
          <fpage>1151</fpage>
          ,
          <string-name>
            <surname>March</surname>
          </string-name>
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          et al.
          <article-title>Using Hashtag Graph-based Topic Model to Connect Semantically-related Words without Co-occurrence in Microblogs</article-title>
          .
          <source>IEEE Transactions on Knowledge and Data Engineering</source>
          ,
          <volume>28</volume>
          (
          <issue>7</issue>
          ):
          <year>1919</year>
          {1933,
          <year>February 2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>R. G. J.</given-names>
            <surname>Wijnhoven and P. H. N. de With</surname>
          </string-name>
          .
          <article-title>Fast Training of Object Detection using Stochastic Gradient Descent</article-title>
          .
          <source>20th International Conference on Pattern Recognition</source>
          ,
          <volume>424</volume>
          {
          <fpage>427</fpage>
          ,
          <year>August 2010</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>