<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Credibility and Transparency of News Sources: Data Collection and Feature Analysis</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ahmet Aker</string-name>
          <email>aker@is.inf.uni-due.de</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Vincentius Kevin</string-name>
          <email>vincentius.kevin@stud.uni-due.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kalina Bontcheva</string-name>
          <email>k.bontcheva@shefield.ac.uk</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Duisburg-Essen</institution>
          ,
          <addr-line>Duisburg</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Duisburg-Essen</institution>
          ,
          <addr-line>Duisburg, Germany and</addr-line>
          ,
          <institution>University of Shefield</institution>
          ,
          <addr-line>Shefield, England</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Shefield</institution>
          ,
          <addr-line>Shefield, England</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <abstract>
        <p>The ability to discern news sources based on their credibility and transparency is useful for users in making decisions about news consumption. In this paper, we release a dataset of 673 sources with credibility and transparency scores manually assigned. Upon acceptance we will make this dataset publicly available. Furthermore, we compared features which can be computed automatically and measured their correlation with credibility and transparency scores annotated by human experts. Our correlation analysis shows that there are indeed features which highly correlate with the manual judgments.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        The Web has never been as big as it is now. It
contains tremendous amount of information represented
in form of articles, videos, images, blog and social
media posts and many other entries. One of the
reasons for this massive growth is that it is not anymore
shaped only by few experts or professional people or
institutions but by everyone who has access. Although
this new style of contribution towards web content has
led to immense information richness and diverse views
however, it has also brought new challenges. It has
stripped the traditional information providers, such as
news media, from their gate-keeping role [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] and has
left the public in a jungle of web content with varying
quality from reliable and true information to
misinformation i.e., facts that are not true.
      </p>
      <p>
        Misinformation is interchangeably used with the
terms fake news. Douglas et al. refer to fake news
as a “deliberate publication of fictitious information,
hoaxes and propaganda” [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], and is similarly defined
by others [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. Furthermore, it is reported that the
veracity of the information is highly connected to the
publisher, i.e. the source of information [
        <xref ref-type="bibr" rid="ref4 ref6">6, 4</xref>
        ]. Thus
instead of performing judgment on e.g. article level
such as performed in [
        <xref ref-type="bibr" rid="ref12 ref14 ref8">12, 8, 14</xref>
        ] there are services to
assess the sources publishing online news. NewsGuard1
is one of such services. NewsGuard analyses manually
each news publishing source in terms of credibility and
transparency and provides detailed information such
as references and reasoning, and the persons
accountable behind each analysis. The results are made
available to the public via a browser plugin.
      </p>
      <p>In this paper we use NewsGuard to manually
collect analyses results of 673 news sources. For each
news source we manually record the overall credibility
and transparency scores but also detailed information
that led to those overall decisions. We plan to make
1www.newsguardtech.com
this dataset freely available.2 Next,we collect a rich
set of well known metrics/features used by e.g. search
engines to assess the popularity of a web-site and run
correlation analysis between the features and
manually assigned NewsGuard scores. Our analysis show
that there are features which highly correlate with the
NewsGuard scores. This suggests that the manual
process done by NewsGuard could be automated.
2
2.1</p>
    </sec>
    <sec id="sec-2">
      <title>Data Collection</title>
      <sec id="sec-2-1">
        <title>NewsGuard:</title>
        <p>parency Scores</p>
      </sec>
      <sec id="sec-2-2">
        <title>Credibility and</title>
      </sec>
      <sec id="sec-2-3">
        <title>Trans</title>
        <p>NewsGuard’s team manually reviewed thousands of
news agencies, which are mostly based in the US,
to label them with nine criteria. A news agency is
rewarded credibility and transparency scores for each
criterion it fulfills. The criteria are listed below.
Credibility criteria:
• Does not repeatedly publish false content (22
points)
• Gathers and presents information responsibly (18
points)
• Regularly corrects or clarifies errors (12.5 points)
• Handles the diference between news and opinion
responsibly (12.5 points)
• Avoids deceptive headlines (10 points)
Transparency criteria:
• Website discloses ownership and financing (7.5
points)
• Clearly labels advertising (7.5 points)
• Reveals who’s in charge (5 points)
• The site provides the names of content creators,
along with either contact information or
biographical information (10 points)</p>
        <p>The total of credibility and transparency scores is
100 at maximum, and a news website is considered
“safe” if it has at least 60 points.
2.2</p>
      </sec>
      <sec id="sec-2-4">
        <title>News Sources</title>
        <p>The list of news sources we used were taken from
Media Bias Fact Check (MBFC). MBFC aims to
categorize sources by political bias. The categories are as
follows, with some descriptions (partially) quoted from
their website3.</p>
        <p>2https://github.com/ahmetaker/sourceCredibility
3mediabiasfactcheck.com
• Left/Right: “moderately to strongly biased
towar” liberal/ conservative causes, may be
untrustworthy.
• Left-Center/Right-Center: slight to moderate
bias toward liberal/conservative causes.
• Center (Least Biased): minimal bias, most
credible media sources.
• Pro-Science: “These sources consist of legitimate
science or are evidence based through the use of
credible scientific sourcing. ...”
• Conspiracy-Pseudoscience: “Sources in the
Conspiracy- Pseudoscience category may publish
unverifiable information that is not always
supported by evidence. ..”
• Questionable Sources: “extreme bias, consistent
promotion of propaganda/conspiracies, poor or no
sourcing to credible information, a complete lack
of transparency and/or is fake news.”
• Satire: “... humor, irony, exaggeration, or ridicule
to expose and criticize people’s stupidity or vices,
... these sources are clear that they are satire and
do not attempt to deceive”
• Re-Evaluated Sources: these are sources which
have been updated by MBFC. They are
duplicates, so this category is removed from our
analysis.</p>
        <p>We used the sources (in total 2714) from MBFC to
run over the NewsGuard (see next Section).
2.3</p>
      </sec>
      <sec id="sec-2-5">
        <title>Collection Procedure</title>
        <p>To collect NewsGuard judgments on the sources
collected from MBFC we performed a manual process.
We installed the NewsGuard as a browser plugin and
visited each of the MBFC source. The results shown by
the plugin were recorded. For instance for BBC.com,
NewsGuard lists the results shown in Figure 1. For
this source we recorded the values for the individual
labels as well as the overall NewsGuard score (in this
case 95). If the results are unavailable because
NewsGuard has not analysed the source, the news source is
discarded.</p>
        <p>We performed this procedure for all 2714 news
sources available in the nine categories at the time.
NewsGuard scores were available only for 673 of them.
Most of the sources in the “Satire” category were
unavailable. The scores were found to agree with
MBFC’s description of each category - in general,
least biased and pro-science sources are the most
credible ones, while extremely biased and
conspiracy/pseudoscience sources can be unreliable. Table 1
shows the average score and standard deviation per
category. The counts show how many sources are
available on NewsGuard out of all that were listed in
MBFC.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Correlation Analysis</title>
      <p>In the correlation analysis the automatic features are
compared to the manually annotated credibility and
transparency scores to analyze the correlation and
predictive power of the features. We calculated
specifically the correlation between each automatic
feature against the combined score (3 credibility +
transparency) from NewsGuard4.</p>
      <p>In the followings we outline features we selected as
well as the metric used to perform the correlation.</p>
      <p>4https://www.newsguardtech.com/ratings/rating-processcriteria/</p>
      <sec id="sec-3-1">
        <title>CheckPageRank</title>
        <p>CheckPageRank5 (cPR) provides a free online tool
which can report page rank score, alexa rank, and a
few other domain analysis results for any given
website.</p>
        <p>The tool does not provide any exact definition or
information on how the scores are calculated. However,
cPR provides scores which seem to be taken from
nonfree services such as Moz SEO and Majestic SEO tools.
While these tools highly limits usage for free users to
ten queries per month and a few queries per day
respectively (as of 2019), cPR allows one query every
thirty seconds, although it does not provide the full
information available in the other tools.</p>
        <p>Below is the most likely explanation we found for
the features provided by cPR, either because the
feature name is self-explanatory or the supposed
underlying services give exact or very close scores compared
to what is displayed by cPR.</p>
        <p>
          • Google Page Rank: A score from 0 to 10 which
estimates the importance of the website based on
the quantity and quality of links to it from other
websites.
• cPR Score: This is shown visually as one of the
most important scores in checkpagerank.net,
albeit without any given definition. We presume
that ‘cPR’ simply stands for ‘checkPageRank’ and
cPR score is calculated with a proprietary formula
or algorithm.
• Citation Flow and Trust Flow: These two scores
are most probably from Majestic6, an SEO
(Search Engine Optimization) tool. According to
Majestic’s glossary7, citation flow focuses on the
quantity and influential power of links to the
website, while trust flow focuses on links from
manually reviewed trusted sites. Majestic seems to
have crawled over 600 billion URLs by 2014 [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ].
• Topic Value: this score also most likely comes
from Majestic. Majestic provides a “Topical Trust
Flow” score, which, according to their glossary
“shows the relative influence [...] in any given
topic or category.” It is a likely explanation that
cPR show only the topic for which the website
has the best Topical Trust Flow, since the topic
names and value range are exactly the same in
cPR and Majestic.
5checkpagerank.net
6majestic.com
7https://majestic.com/help/glossary
• Backlinks: External backlinks mean links from
other websites to the subject website. This
excludes internal links, which usually exist to let
users navigate within the same website.
• Referring domains: this is the number of domains
which contains backlink(s) to the subject website.
• EDU and GOV backlinks and domains: Majestic
also provides the counts of educational and
governmental backlinks and domains.
• Domain Authority and Page Authority: the Moz8
SEO tool describes these scores as “the
ranking potential in search engines based on an
algorithmic combination of all link metrics”. While
MozRank is not used directly by search engines,
it is similar and correlated to ranking of major
search engines [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]. We tested a few websites and
confirmed that cPR shows exactly the same scores
as Moz.
• Spam Score: This most likely represents the Moz
SEO spam flags explained in their website 9. The
lfags represent internal and external features of
websites that are indicative of ‘spam websites’ and
have been found to be penalized or banned by
Google.
• Alexa Rank: Alexa Rank is described as a
popularity measure which “is calculated using a
proprietary methodology that combines a site’s
estimated trafic and visitor engagement over the
past three months.”10
• Alexa Reach Rank: this score is based specifically
on the estimated number of people each website
is able to reach.
• Indexed URLs: This may be the number of URLs
indexed by Google, as is commonly provided in
SEO tools, but since there is no information
provided, this is only a guess.
3.1.2
        </p>
        <p>Twitter
• Number of followers: the number of users on
twitter.com who “subscribes” to the news’ Twitter
account. Posts made on Twitter will appear on the
followers’ home screen.
• Listed count: a Twitter user can make lists of
users to personally categorize other users. They
can keep the list private or publicly visible. Listed
count represents the number of public lists in
which the Twitter user appears.
8moz.com
9https://moz.com/blog/spam-score-mozs-new-metric-tomeasure-penalization-risk
10blog.alexa.com
3.1.3</p>
        <p>Facebook
• Page Likes: the number of Facebook users who
likes the Facebook page of the news source, by
simply clicking on the like button. Likes
information is publicly available.
• Page Followers: the number of Facebook users
who are following the page, which means any
posts by the page will be shown in the users’ home
screens. By default, when someone likes a page,
he automatically follows the page as well. The
user can then “Unfollow” while still keeping the
“Like”. It is also possible to follow a page without
liking it.
3.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Pearson Correlation</title>
      </sec>
      <sec id="sec-3-3">
        <title>Transformation with</title>
      </sec>
      <sec id="sec-3-4">
        <title>Logarithmic</title>
        <p>
          First, we measured the Pearson correlation [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ].
Pearson only measures linear relationships. This means if
there is no such relationship Pearson is not a good
choice to compute the correlation. However, one way
of overcome this limitation is to convert the data to
logarithm form. Therefore, we also applied a logarithm
(base 10) on the features before calculating the
Pearson correlation (with “add one” to avoid math error
for the logarithm of zero) to capture the correlations
which follow the power law rather than linear.
        </p>
        <p>
          We expected features such as backlink counts and
number of likes in social media to follow the power
law, under the assumption that website links and user
networks in social media follow the pattern of a
scalefree network (preferential attachment) [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ].
        </p>
        <p>We also expect behavior of ranking features (e.g.
Alexa Rank) to be non-linear. Although it is not
necessarily logarithmic, ratio would be a better measure
than rank diference. By applying a logarithm kernel,
only the ratio is now considered, i.e. the diference
between rank 10 and 20 is considered as significant as
the diference between ranks 1,000 and 2,000.
3.3</p>
      </sec>
      <sec id="sec-3-5">
        <title>Spearman and Kendall Tau Correlations</title>
        <p>Since Pearson correlation only measures linear
correlation, we have also computed the Spearman and
Kendall Tau correlation scores. This may give a
better insight on which variables are more predictive of
the news source quality.</p>
        <p>
          Both Spearman [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] and Kendall Tau [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] are
rankbased correlation measurement, thus they work well on
monotonous correlations. Spearman does not handle
tied ranks, which occurs very often in our dataset due
to NewsGuard’s scoring method. Therefore, Kendall
Tau seems to be the better measurement and has been
used to sort the rows in the following table. We have
used the tau-b implementation available in scipy 11.
4
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Correlation Results</title>
      <p>Feature</p>
      <p>
        11https://docs.scipy.org/doc/scipy0.15.1/reference/generated/scipy.stats.kendalltau.html
One unexpected result is the negative
correlation between Facebook likes/follows and NewsGuard
scores. This may be caused by the availability of paid
“like farms” to get fake likes on the platform, such as
BoostLikes and SocialFormula. Even legitimate
Facebook ad campaigns can result in significant amounts of
such fake likes [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. However, it requires further
analysis of the corresponding Facebook pages to confirm
this.
      </p>
      <p>One should note that since the dataset comes from
NewsGuard, it is possible for unpopular news sources
to be under-represented.
5</p>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>In this paper, we release a dataset of 673 sources with
credibility and transparency scores manually assigned.
The scores come from NewsGuard’s plugin. We
manually accessed the plugin for 2714 news sources
published by Media Bias Fact Check and recorded for
those 673 detailed scores about credibility and
transparency NewsGuard provides. For the remaining 2042
sources NewsGuard did not have judgments.</p>
      <p>We also extracted a rich set of features and
performed a correlation analysis. Our results show that
there are strong correlations between the NewsGuard
scores and features analysed in this work. This
indicates that the credibility and transparency scoring
could be automated.</p>
      <p>
        In our future work we aim to perform such a step
and create a regression model to automatically
predict the credibility and transparency scores. This will
allow to obtain credibility scores for any source that
is so far not judged by NewsGuard. Note since our
features are language independent this will allow us
to obtain credibility scores for any source reporting in
any language. We also plan to use the output of our
regression models as information nutrition label within
NewsScan12[
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
      </p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgements</title>
      <p>This work was partially supported by the European
Union under grant agreement No. 825297
WeVerify (http://weverify.eu) and the Deutsche
Forschungsgemeinschaft (DFG, German Research Foundation)
GRK 2167, Research Training Group “User-Centred
Social Media”.
12www.news-scan.com</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Baly</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Karadzhov</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Alexandrov</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Glass</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Nakov</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <article-title>Predicting factuality of reporting and bias of news media sources</article-title>
          . arXiv preprint arXiv:
          <year>1810</year>
          .
          <volume>01765</volume>
          (
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Barabási</surname>
            ,
            <given-names>A.-L.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Pósfai</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <article-title>Network science</article-title>
          . Cambridge University Press, Cambridge,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Benesty</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Cohen</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          <article-title>Pearson correlation coeficient</article-title>
          .
          <source>In Noise reduction in speech processing</source>
          . Springer,
          <year>2009</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>4</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Burgoon</surname>
            ,
            <given-names>J. K.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Hale</surname>
            ,
            <given-names>J. L.</given-names>
          </string-name>
          <article-title>The fundamental topoi of relational communication</article-title>
          .
          <source>Communication Monographs</source>
          <volume>51</volume>
          ,
          <issue>3</issue>
          (
          <year>1984</year>
          ),
          <fpage>193</fpage>
          -
          <lpage>214</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>De</given-names>
            <surname>Cristofaro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            ,
            <surname>Friedman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Jourjon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            ,
            <surname>Kaafar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            , and
            <surname>Shafiq</surname>
          </string-name>
          ,
          <string-name>
            <surname>M. Z.</surname>
          </string-name>
          <article-title>Paying for likes?: Understanding facebook like fraud using honeypots</article-title>
          .
          <source>In Proceedings of the 2014 Conference on Internet Measurement Conference</source>
          (New York, NY, USA,
          <year>2014</year>
          ), IMC '14, ACM, pp.
          <fpage>129</fpage>
          -
          <lpage>136</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Demchenko</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grosso</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>De Laat</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Membrey</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <article-title>Addressing big data issues in scientific data infrastructure</article-title>
          .
          <source>In Collaboration Technologies and Systems (CTS)</source>
          , 2013 International Conference on (
          <year>2013</year>
          ), IEEE, pp.
          <fpage>48</fpage>
          -
          <lpage>55</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Douglas</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ang</surname>
            ,
            <given-names>C. S.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Deravi</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <article-title>Farewell to truth? conspiracy theories and fake news on social media</article-title>
          .
          <source>The Psychologist</source>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Hardalov</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koychev</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Nakov</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <article-title>In search of credible news</article-title>
          .
          <source>In International Conference on Artificial Intelligence: Methodology</source>
          ,
          <string-name>
            <surname>Systems</surname>
          </string-name>
          , and
          <string-name>
            <surname>Applications</surname>
          </string-name>
          (
          <year>2016</year>
          ), Springer, pp.
          <fpage>172</fpage>
          -
          <lpage>180</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Kendall</surname>
            ,
            <given-names>M. G.</given-names>
          </string-name>
          <article-title>The treatment of ties in ranking problems</article-title>
          .
          <source>Biometrika</source>
          <volume>33</volume>
          ,
          <issue>3</issue>
          (
          <year>1945</year>
          ),
          <fpage>239</fpage>
          -
          <lpage>251</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Kevin</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Högden</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schwenger</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sahan</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Madan</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aggarwal</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bangaru</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Muradov</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Aker</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <article-title>Information nutrition labels: A plugin for online news evaluation</article-title>
          .
          <source>In Proceedings of the First Workshop on Fact Extraction and VERification (FEVER)</source>
          (Brussels, Belgium, Nov.
          <year>2018</year>
          ),
          <article-title>Association for Computational Linguistics</article-title>
          , pp.
          <fpage>28</fpage>
          -
          <lpage>33</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Klein</surname>
            ,
            <given-names>D. O.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Wueller</surname>
            ,
            <given-names>J. R.</given-names>
          </string-name>
          <article-title>Fake news: A legal perspective</article-title>
          .
          <source>Journal of Internet Law</source>
          <volume>20</volume>
          ,
          <issue>10</issue>
          (
          <year>2017</year>
          ),
          <fpage>6</fpage>
          -
          <lpage>13</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Markowitz</surname>
            ,
            <given-names>D. M.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Hancock</surname>
            ,
            <given-names>J. T.</given-names>
          </string-name>
          <article-title>Linguistic traces of a scientific fraud: The case of diederik stapel</article-title>
          .
          <source>PloS one 9</source>
          ,
          <issue>8</issue>
          (
          <year>2014</year>
          ),
          <year>e105937</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>Pardeep</given-names>
            <surname>Sud</surname>
          </string-name>
          ,
          <string-name>
            <surname>M. T.</surname>
          </string-name>
          <article-title>Linked title mentions: A new automated link search candidate</article-title>
          .
          <source>Scientometrics</source>
          <volume>101</volume>
          (
          <year>2014</year>
          ),
          <fpage>1831</fpage>
          -
          <lpage>1849</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Rashkin</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Choi</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jang</surname>
            ,
            <given-names>J. Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Volkova</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Choi</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          <article-title>Truth of varying shades: Analyzing language in fake news and political fact-checking</article-title>
          .
          <source>In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing</source>
          (
          <year>2017</year>
          ), pp.
          <fpage>2931</fpage>
          -
          <lpage>2937</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Spearman</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <article-title>The proof and measurement of association between two things</article-title>
          .
          <source>The American Journal of Psychology</source>
          <volume>15</volume>
          ,
          <issue>1</issue>
          (
          <year>1904</year>
          ),
          <fpage>72</fpage>
          -
          <lpage>101</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>Themistoklis</given-names>
            <surname>Mavridis</surname>
          </string-name>
          ,
          <string-name>
            <surname>A. L. S.</surname>
          </string-name>
          <article-title>Identifying valid search engine ranking factors in a web 2.0 and web 3.0 context for building eficient seo mechanisms</article-title>
          .
          <source>Engineering Applications of Artificial Intelligence</source>
          <volume>41</volume>
          (
          <year>2015</year>
          ),
          <fpage>75</fpage>
          -
          <lpage>91</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>