<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>CLEF</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Profiling fake news spreaders through stylometry and lexical features. UniOR NLP @PAN2020</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Raffaele Manna</string-name>
          <email>rmanna@unior.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Antonio Pascucci</string-name>
          <email>apascucci@unior.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Johanna Monti</string-name>
          <email>jmonti@unior.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>"L' Orientale" University of Naples - UNIOR NLP Research Group</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2020</year>
      </pub-date>
      <volume>22</volume>
      <fpage>22</fpage>
      <lpage>25</lpage>
      <abstract>
        <p>In this paper, we describe our approach to the Profiling Fake News Spreaders on Twitter task at PAN 2020. The aim of the task is to profile users who tend to spread fake news, whether consciously or unconsciously, in two languages, namely English and Spanish. We use different machine learning algorithms combined with strictly stylometric features, categories of emojis and a set of lexical features related to the vocabulary of fake news headlines. In the final official runs, our models achieve an accuracy of 72.50% for the Spanish sub-task (using the Logistic Regression algorithm) and an accuracy of 59.50% for the English sub-task (using the Random Forest algorithm).</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>
        The flow of information and news on social media is growing day by day. Social
media platforms are now the primary means by which people inform themselves about events and
facts of all kinds that happen around us in the real world. It could be said that
social media are what the agorà once was to the ancient Greeks, namely a crowded
place where people meet and exchange opinions and information on everyday events.
It also means that news is often not credible, because it can be shared by unreliable
sources. In fact, this type of news shows manipulative content and presents an
alternative version of the facts. In other words, it does not represent reality and tries to influence the
reader [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. Furthermore, the massive diffusion of fake news involves the polarization
of public opinion on certain debated issues, often increasing offensive attitudes and hate
speech towards other points of view and other groups of people [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. In this context,
cybersecurity techniques, digital forensics investigations and computational stylometry
are essential in monitoring and identifying the main sources of fake news. Moreover, a
second application scenario to counter the spread of fake news is the task of profiling
users based on their susceptibility to sharing texts with inaccurate information.
      </p>
      <p>The 2020 edition of the PAN Author Profiling task focuses on the classification of
potential fake news spreaders on Twitter, whether they propagate fake news
intentionally or unintentionally.</p>
      <p>
        In this paper, we propose a machine learning classification approach based on
stylometric features along with two lexical category features: persuasive words associated
with fake news [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] and words associated with subjectivity. The paper is organized as
follows: in Section 2 we present related work; in Section 3 we present the problem by
describing the author profiling task proposed at PAN 2020 and the dataset provided by
the shared task organizers; in Section 4 we focus on the features and the algorithms used.
In Section 5, we show the results obtained by our models and the evaluation framework
TIRA [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. Finally, in Section 6 we outline the conclusions.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2 Related Work</title>
      <p>Given the massive creation and rapid spread of fake news and its potential threat
to the opinions of users, in recent years particular attention has been paid to social media
and to how they act as breeding grounds for the spread of tendentious content, the
dissemination of partisan articles and, in general, news invented or modified to achieve particular
purposes.</p>
      <p>
        Scholars have shown how fake news can foster the creation of
so-called "echo chambers" and influence the opinions of users, as during the months
before the 2016 US presidential election [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Besides, other scholars have shown the role
of bots in the diffusion of fake news and misinformation on particular political events to
damage a politician [
        <xref ref-type="bibr" rid="ref18 ref2">2, 18</xref>
        ]. Starting from a categorization of concepts related to fake
news, Zhou and Zafarani focused on the different aspects of fake news and analyzed them, from the
false information conveyed up to the role played by users [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ]. Potthast et al. focused
on the hyper-partisan news writing style linked to fake news [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. The results show how
left-wing and right-wing writing styles are very similar and easily distinguishable from
the mainstream news. The same research reported some difficulties in the detection of
fake news based only on style features.
      </p>
      <p>
        Scholars have also investigated the role played by users in spreading fake news on
social media. Users often mix their personal content with fake news for either satirical
or malicious purposes, which makes the monitoring and classification of content and profiles
controversial. Ghanem et al. attempted to identify Twitter accounts suspected
of spreading fake news. Their approach is based on a Recurrent Neural Network (RNN)
along with semantic and stylistic features [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. The CheckerOrSpreader [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] is a system
based on a Convolutional Neural Network (CNN) that aims to differentiate between
checkers and spreaders. It consists of two components: a word embedding
component (based on the tweets posted on the users’ timelines) and a psycho-linguistic
component that represents style patterns and personality traits derived from the textual
content. Shu, Wang and Liu [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] focused on the correlation between user profiles and
fake/real news, showing that specific users are more likely to trust fake
news and that these users exhibit different features from users who are more likely to trust
real news.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3 Dataset</title>
      <p>
        The PAN event takes its name from the International Workshop on Plagiarism Analysis,
Authorship Identification, and Near-Duplicate Detection (PAN) [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] held in 2007. Over the
years, PAN has become the main event for computational stylometry scholars.
It can be described as a series of scientific events and shared tasks on issues
relating to digital forensics and computational stylometry, such as authorship analysis
(profiling and identification), computational ethics, and plagiarism detection.
      </p>
      <p>In this edition, four different shared tasks were presented: Authorship
Verification, Celebrity Profiling, Profiling Fake News Spreaders on Twitter and Style Change
Detection<sup>3</sup>.</p>
      <p>Our team (UniOR NLP) decided to take part in the Profiling Fake News Spreaders
on Twitter task. The aim is to build a model able to identify possible fake news spreaders
on social media from a multilingual perspective.
The dataset is made up of Twitter accounts for the two languages considered in the task,
Spanish and English. Each account is represented by an author feed of 100 concatenated
tweets.</p>
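      <table-wrap>
        <label>Table 1</label>
        <table>
          <thead>
            <tr><th>Language</th><th>Train</th><th>Test</th><th>Total</th></tr>
          </thead>
          <tbody>
            <tr><td>English</td><td>300</td><td>200</td><td>500</td></tr>
            <tr><td>Spanish</td><td>300</td><td>200</td><td>500</td></tr>
          </tbody>
        </table>
      </table-wrap>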
      <p>As shown in Table 1, the dataset for each language is divided into two sets, train and test,
for a total of 500 Twitter accounts per language.</p>
      <p>The train set (made available for download by the task organizers) consists of 300
XML files per language, each one containing the author feed and named with an
alphanumeric code identifying the author. Moreover, URLs, mentions and
hashtags in the tweets of each author feed have been replaced with generic tags.
The train set is balanced between the two classes: among the 300 XML files, it
contains 150 accounts belonging to the spreaders class and 150 accounts belonging to the
no-spreaders class.</p>
    </sec>
    <sec id="sec-4">
      <title>4 Methodology</title>
      <p>
        In order to identify and classify fake news spreaders, we include in our models two
categories of features. The first category is related specifically to stylometric features.
The second one focuses on lexical features divided into i) lexical elements expressing
personal opinion in online communications and ii) clickbait verbs and expressions in
fake news headlines [
        <xref ref-type="bibr" rid="ref12 ref13">12, 13</xref>
        ].
      </p>
      <sec id="sec-4-1">
        <p><sup>3</sup> https://pan.webis.de/clef20/pan20-web/index.html</p>
        <p>
          For these two groups of features, we used a set of features recognized as crucial
to identify fake news [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]. The features computed by our model for both languages are
listed and described below. For each Twitter account we used the following features:
          <list list-type="bullet">
            <list-item>
              <p>Emoji: We calculated the average number of emojis for each account divided by the
total number of emojis for each class. In addition, we added the average number of
emojis belonging to several emotional characteristics and different characters
represented by emoji. We considered the emojis contained in the Unicode Emoji List<sup>4</sup>.
From this list, we selected and used emoji characters related to face-affection,
face-tongue, face-neutral-skeptical, face-hand, face-concerned, emotions and
country-flag.</p>
            </list-item>
            <list-item>
              <p>Stylometric features: The average number of each stylistic feature of the tweets
divided by the total number of each stylistic feature for each class. These
characteristics are: URLs count; space count; words count; initial capital letter words
count; capital words count; digits count; punctuation marks count; operators count;
average text length; brackets count; question and exclamation marks count; slashes
count; retweet, hashtag and user tags count; quotes style count and ellipsis count.</p>
            </list-item>
            <list-item>
              <p>Lexical features: We designed and computed the average number of occurrences
of a series of lexical items, in both languages, related to:</p>
              <list list-type="order">
                <list-item><p>Groups of words expressing personal opinions, in addition to personal
pronouns;</p></list-item>
                <list-item><p>Verbs and expressions related to clickbait headlines.</p></list-item>
              </list>
            </list-item>
          </list>
        </p>
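        <p>The stylometric and lexical counts described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the placeholder strings for the generic tags, the function names, the choice of averaging per tweet and the whole-word, case-insensitive matching are all assumptions made for illustration. Only a subset of the listed features is covered, the lexical lists contain only the example items quoted in this section, and the per-class normalisation mentioned in the feature descriptions is left out.</p>
        <preformat>
```python
import re
import string
from statistics import mean

# Assumption: placeholder tokens for the generic tags; the paper does not
# give the exact strings used in the dataset.
URL_TAG, USER_TAG, HASHTAG_TAG = "#URL#", "#USER#", "#HASHTAG#"

# Example items quoted in this section; the authors' full lists are not given.
OPINION_ITEMS = ["mine", "myself", "I", "IMO", "IMHO", "yo", "tu", "personalmente"]
CLICKBAIT_ITEMS = ["videos", "link", "directa", "latest", "click", "últimas", "última hora"]


def count_items(text, items):
    """Count whole-word, case-insensitive matches of single- or multi-word items."""
    return sum(
        len(re.findall(r"\b" + re.escape(item) + r"\b", text, flags=re.IGNORECASE))
        for item in items
    )


def tweet_counts(tweet):
    """Count a subset of the stylometric and lexical markers for one tweet."""
    text = tweet
    for tag in (URL_TAG, USER_TAG, HASHTAG_TAG):
        text = text.replace(tag, " ")  # keep placeholders out of the word counts
    words = text.split()
    return {
        "urls": tweet.count(URL_TAG),
        "user_tags": tweet.count(USER_TAG),
        "hashtags": tweet.count(HASHTAG_TAG),
        "words": len(words),
        "initial_capital_words": sum(w[0].isupper() for w in words),
        "all_caps_words": sum(len(w) > 1 and w.isupper() for w in words),
        "digits": sum(ch.isdigit() for ch in text),
        "punctuation": sum(ch in string.punctuation for ch in text),
        "question_exclamation": len(re.findall(r"[?!]", text)),
        "ellipsis": len(re.findall(r"\.{3}|…", text)),
        "opinion_items": count_items(text, OPINION_ITEMS),
        "clickbait_items": count_items(text, CLICKBAIT_ITEMS),
    }


def account_features(tweets):
    """Average every per-tweet count over the tweets of one account."""
    per_tweet = [tweet_counts(t) for t in tweets]
    return {name: mean(c[name] for c in per_tweet) for name in per_tweet[0]}
```
        </preformat>
        <p>Calling the hypothetical <monospace>account_features</monospace> function on the list of tweets of one author feed returns one averaged value per feature, which could then serve as one row of the feature matrix passed to a classifier.</p>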
        <p>Examples of the words, groups of words and typing shortcuts used in online
communication for the two categories are, respectively: 1) "mine", "myself", "I", "IMO",
"IMHO", "yo", "tu", "personalmente", among others; 2) "videos", "link", "directa",
"latest", "click", "últimas", "última hora", among others.</p>
        <p>As a first step, machine learning algorithms combined with stylometric features,
categories of emojis and a set of lexical features were tested in order to find
the best-performing model. We ran different machine learning algorithms,
fed with the selected features, on the virtual machine assigned to us by the organizers.
Then, we chose the best-performing algorithm for each language on the basis of the
results obtained on the training set.</p>
        <p>
          During the development phase, we used well-known classifiers [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ], namely
Logistic Regression (LR) [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ], Random Forest (RF) [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ], Multinomial Naïve Bayes (MNB)
[
          <xref ref-type="bibr" rid="ref7">7</xref>
          ], Support Vector Machine (SVM) [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] and Gradient Boosting classifier (GBC) [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. All
these machine learning classifiers are provided by the Python Scikit-learn library<sup>5</sup>. We
decided to keep the default classifier hyper-parameters in order to evaluate the models only
on the basis of stylometric and lexical features.
        </p>
        <p>The submitted version of our model first classifies and predicts Twitter accounts in
English, then classifies and predicts the Spanish ones.</p>
      </sec>
      <sec id="sec-4-2">
        <p><sup>4</sup> https://unicode.org/emoji/charts/full-emoji-list.html</p>
        <p><sup>5</sup> https://scikit-learn.org/stable/</p>
        <p>In order to evaluate our selected classifiers, we created our own test set by splitting the
train set into 70% training data and 30% test data<sup>6</sup>.</p>
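        <p>The evaluation procedure described here, a 70%/30% split followed by a comparison of the five classifiers with default hyper-parameters, can be sketched as follows. This is an illustration under stated assumptions rather than the submitted software: the feature matrix is random synthetic data, so the printed accuracies carry no meaning, and the use of <monospace>SVC</monospace> for the SVM, of stratification and of a fixed <monospace>random_state</monospace> are choices not specified in the paper.</p>
        <preformat>
```python
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import MultinomialNB
from sklearn.svm import SVC

# Synthetic stand-in for the real feature matrix: 300 accounts, 20 features,
# balanced labels (150 spreaders, 150 non-spreaders). Values are non-negative
# because MultinomialNB rejects negative inputs.
rng = np.random.default_rng(0)
X = rng.random((300, 20))
y = np.array([0] * 150 + [1] * 150)

# 70% / 30% split of the train set; stratify and random_state are assumptions.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

# All classifiers keep their default hyper-parameters.
classifiers = {
    "LR": LogisticRegression(),
    "RF": RandomForestClassifier(),
    "MNB": MultinomialNB(),
    "SVM": SVC(),  # assumption: the paper does not name the SVM class used
    "GBC": GradientBoostingClassifier(),
}

scores = {}
for name, clf in classifiers.items():
    clf.fit(X_train, y_train)
    scores[name] = accuracy_score(y_test, clf.predict(X_test))

for name, score in scores.items():
    print(f"{name}: {score:.2f}")
```
        </preformat>
        <p>With 300 accounts per language, this split leaves 210 accounts for fitting and 90 for the internal evaluation.</p>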
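        <table-wrap>
          <label>Table 2</label>
          <table>
            <thead>
              <tr><th>Language</th><th>LR</th><th>RF</th><th>MNB</th><th>SVM</th><th>GBC</th></tr>
            </thead>
            <tbody>
              <tr><td>English</td><td>0.57</td><td>0.64</td><td>0.52</td><td>0.51</td><td>0.57</td></tr>
              <tr><td>Spanish</td><td>0.81</td><td>0.80</td><td>0.73</td><td>0.73</td><td>0.72</td></tr>
            </tbody>
          </table>
        </table-wrap>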
        <p>As shown in Table 2, the best performing algorithms are Random Forest for English
and Logistic Regression for Spanish. For each algorithm and for both languages, we
used the same set of features listed above.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5 Results</title>
      <p>
        For the final run on the blind test set, we set up a model based on the Logistic Regression
[
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] algorithm for the Spanish sub-task and a model based on the Random Forest [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]
algorithm for the English sub-task. To complete the submission of our software, we ran
our model on the TIRA [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] platform.
      </p>
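      <p>As a small worked example, the official accuracies reported in this paper (72.50% for Spanish and 59.50% for English) can be converted into approximate numbers of correctly classified accounts. The sketch assumes that the blind test set contains the 200 accounts per language listed in Table 1; the variable names are illustrative.</p>
      <preformat>
```python
# Assumption: 200 test accounts per language, as listed in Table 1.
TEST_ACCOUNTS_PER_LANGUAGE = 200

# Official accuracies reported in the paper.
reported_accuracy = {"Spanish": 0.7250, "English": 0.5950}

# Accuracy is the share of correctly classified accounts, so multiplying by
# the test set size gives the number of correct predictions.
correct = {
    language: round(accuracy * TEST_ACCOUNTS_PER_LANGUAGE)
    for language, accuracy in reported_accuracy.items()
}

for language, n_correct in correct.items():
    print(f"{language}: {n_correct} of {TEST_ACCOUNTS_PER_LANGUAGE} accounts")
```
      </preformat>
      <p>Under that assumption, the gap between the two sub-tasks corresponds to 26 accounts out of 200.</p>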
      <p>As shown in Tables 3 and 4, our model seems to profile and predict accounts of Spanish
users far better than Twitter accounts of English users. In addition, we
observe that the features used to profile users seem to be more present in Spanish
tweets, as they better discriminate the textual style of fake news spreader accounts on
Twitter.</p>
      <p><sup>6</sup> https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.train_test_split.html</p>
    </sec>
    <sec id="sec-6">
      <title>6 Conclusions</title>
      <p>
        In this paper, we have shown the results achieved by the UniOR NLP team for the
Profiling fake news spreaders task [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] at PAN 2020. Our approach is based on
stylometric features and two lexical category features: clickbait expressions associated with
fake news and words expressing personal opinions along with personal pronouns. Our
model achieved much better results in the Spanish sub-task (72.50% accuracy) than in
the English sub-task (59.50% accuracy).
      </p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgements</title>
      <p>This research has been carried out in the context of two innovative industrial PhD
projects in computational stylometry supported by the PON Ricerca e Innovazione
2014-20 and the POR Campania FSE 2014-2020 funds.</p>
      <p>We sincerely thank the PAN organizers for the work done in order to enable us to
submit our system.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Allcott</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gentzkow</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Social media and fake news in the 2016 election</article-title>
          .
          <source>Journal of Economic Perspectives 31(2)</source>
          ,
          <fpage>211</fpage>
          -
          <lpage>36</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Bessi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ferrara</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          :
          <article-title>Social bots distort the 2016 US presidential election online discussion</article-title>
          .
          <source>First Monday</source>
          <volume>21</volume>
          (
          <issue>11-7</issue>
          ) (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Cortes</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vapnik</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Support-vector networks</article-title>
          .
          <source>Machine Learning 20(3)</source>
          ,
          <fpage>273</fpage>
          -
          <lpage>297</lpage>
          (
          <year>1995</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Friedman</surname>
            ,
            <given-names>J.H.</given-names>
          </string-name>
          :
          <article-title>Greedy function approximation: a gradient boosting machine</article-title>
          .
          <source>Annals of Statistics</source>
          , pp.
          <fpage>1189</fpage>
          -
          <lpage>1232</lpage>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Ghanem</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ponzetto</surname>
            ,
            <given-names>S.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosso</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Factweet: profiling fake news twitter accounts</article-title>
          .
          <source>arXiv preprint arXiv:1910.06592</source>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Giachanou</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ríssola</surname>
            ,
            <given-names>E.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ghanem</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Crestani</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosso</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>The role of personality and linguistic patterns in discriminating between fake news spreaders and fact checkers</article-title>
          .
          <source>In: International Conference on Applications of Natural Language to Information Systems</source>
          . pp.
          <fpage>181</fpage>
          -
          <lpage>192</lpage>
          . Springer (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Granik</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mesyura</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Fake news detection using naive bayes classifier</article-title>
          .
          <source>In: 2017 IEEE First Ukraine Conference on Electrical and Computer Engineering (UKRCON)</source>
          . pp.
          <fpage>900</fpage>
          -
          <lpage>903</lpage>
          . IEEE (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Horne</surname>
            ,
            <given-names>B.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Adali</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>This just in: fake news packs a lot in title, uses simpler, repetitive content in text body, more similar to satire than real news</article-title>
          .
          <source>In: Eleventh International AAAI Conference on Web and Social Media</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Hosmer Jr</surname>
            ,
            <given-names>D.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lemeshow</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sturdivant</surname>
            ,
            <given-names>R.X.</given-names>
          </string-name>
          :
          <article-title>Applied logistic regression</article-title>
          , vol.
          <volume>398</volume>
          . John Wiley &amp; Sons (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Jain</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kasbe</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Fake news detection</article-title>
          .
          <source>In: 2018 IEEE International Students' Conference on Electrical, Electronics and Computer Science (SCEECS)</source>
          . pp.
          <fpage>1</fpage>
          -
          <lpage>5</lpage>
          . IEEE (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Liaw</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wiener</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Classification and regression by randomForest</article-title>
          .
          <source>R News</source>
          <volume>2</volume>
          (
          <issue>3</issue>
          ),
          <fpage>18</fpage>
          -
          <lpage>22</lpage>
          (
          <year>2002</year>
          ), https://CRAN.R-project.org/doc/Rnews/
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Pérez-Rosas</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kleinberg</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lefevre</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mihalcea</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          :
          <article-title>Automatic detection of fake news</article-title>
          .
          <source>arXiv preprint arXiv:1708.07104</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Piotrkowicz</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dimitrova</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Otterbacher</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Markert</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>The impact of news values and linguistic style on the popularity of headlines on twitter and facebook</article-title>
          .
          <source>In: Eleventh International AAAI Conference on Web and Social Media</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Popat</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mukherjee</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yates</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weikum</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          :
          <article-title>Declare: Debunking fake news and false claims using evidence-aware deep learning</article-title>
          .
          <source>arXiv preprint arXiv:1809.06416</source>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Potthast</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gollub</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wiegmann</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stein</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>TIRA Integrated Research Architecture</article-title>
          . In:
          <string-name>
            <surname>Ferro</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Peters</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          (eds.)
          <source>Information Retrieval Evaluation in a Changing World</source>
          . Springer (Sep
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Potthast</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kiesel</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reinartz</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bevendorff</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stein</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>A stylometric inquiry into hyperpartisan and fake news</article-title>
          .
          <source>arXiv preprint arXiv:1702.05638</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Rangel</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Giachanou</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ghanem</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosso</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Overview of the 8th Author Profiling Task at PAN 2020: Profiling Fake News Spreaders on Twitter</article-title>
          . In:
          <string-name>
            <surname>Cappellato</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eickhoff</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ferro</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Névéol</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (eds.)
          <source>CLEF 2020 Labs and Workshops, Notebook Papers</source>
          . CEUR Workshop Proceedings, CEUR-WS.org (Sep
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Rangel</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosso</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Overview of the 7th Author Profiling Task at PAN 2019: Bots and Gender Profiling in Twitter</article-title>
          .
          In:
          <source>Proceedings of the CEUR Workshop</source>
          , Lugano, Switzerland. pp.
          <fpage>1</fpage>
          -
          <lpage>36</lpage>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Rashkin</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Choi</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jang</surname>
            ,
            <given-names>J.Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Volkova</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Choi</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Truth of varying shades: Analyzing language in fake news and political fact-checking</article-title>
          .
          In:
          <source>Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing</source>
          . pp.
          <fpage>2931</fpage>
          -
          <lpage>2937</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Rosso</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Profiling bots, fake news spreaders and haters</article-title>
          .
          In:
          <source>Proceedings of the Workshop on Resources and Techniques for User and Author Profiling in Abusive Language</source>
          (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Shu</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          :
          <article-title>Understanding user profiles on social media for fake news detection</article-title>
          .
          In:
          <source>2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)</source>
          . pp.
          <fpage>430</fpage>
          -
          <lpage>435</lpage>
          . IEEE (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Stein</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koppel</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stamatatos</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          (eds.):
          <source>SIGIR 07 Workshop on Plagiarism Analysis, Authorship Identification, and Near-Duplicate Detection (PAN 07)</source>
          . CEUR-WS.org (
          <year>2007</year>
          ), http://ceur-ws.org/Vol-276
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Zhou</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zafarani</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          :
          <article-title>Fake news: A survey of research, detection methods, and opportunities</article-title>
          .
          <source>arXiv preprint arXiv:1812.00315</source>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>