<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>FakeNews: Corona Virus and Conspiracies Multimedia Analysis Task at MediaEval 2021</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Konstantin Pogorelov</string-name>
          <email>konstantin@simula.no</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Daniel Thilo Schroeder</string-name>
          <email>daniels@simula.no</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Stefan Brenner</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Johannes Langguth</string-name>
          <email>langguth@simula.no</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Simula Metropolitan Center for Digital Engineering</institution>
          ,
          <country country="NO">Norway</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Simula Research Laboratory</institution>
          ,
          <country country="NO">Norway</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Stuttgart Media University</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Technical University of Berlin</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>University of Oslo</institution>
          ,
          <country country="NO">Norway</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2021</year>
      </pub-date>
      <fpage>13</fpage>
      <lpage>15</lpage>
      <abstract>
        <p>The FakeNews: Corona Virus and Conspiracies Multimedia Analysis task, running for the second time as part of MediaEval 2021, focuses on the classification of tweet texts aiming detection of fastspreading misinformation. Task of this year extends the number of target conspiracy theories and introduces new challenges in terms of analysis complexity of the imbalanced dataset. This paper describes the task, including use case and motivation, challenges, the dataset with ground truth, the required participant runs, and the evaluation metrics.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
      <p>
        During the development of the COVID-crisis, a lot of new
COVIDrelated conspiracy theories have arise. Despite eforts of the major
social networks, mass-spread fake facts, irrational theories and
news-like posts are widely presented in the online media sources.
Rumors and other fast-spreading inaccurate, counterfactual, or
intentionally misleading information can quickly permeate public
consciousness and have severe real-world implications. Public
attention to the problem have already allowed content moderation
and partial limitation of freedom of speech in order to prevent
manipulation of COVID-related public opinion. Thus, fake news and
intentional missinformation are still among the top global risks in
the 21st century [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Consequentially, we are particularly interested
in detecting content associated with the fake news and
COVIDrelated missinformation. We further diferentiate between content
that does not contain misinformation and content attributed to
other misinformation. Our task ofers three subtasks, all require
text-based tweets classification.
      </p>
      <p>
        Similar to text-only classification challenges, e.g., [
        <xref ref-type="bibr" rid="ref1 ref4 ref7">1, 4, 7</xref>
        ], we
expect to see NLP approaches for tweet text analysis, but we aim wider
set of conspiracy theories and diferent-level detection
methodologies. Furthermore, we ask for evaluation of diferent approaches
with respect to real-world imbalanced datasets [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>The task is intended to be of interest to researchers in the
areas of online news, social media, multimedia analysis, multimedia
information retrieval, natural language processing, and meaning
understanding and situational awareness.</p>
    </sec>
    <sec id="sec-2">
      <title>DATASET DETAILS</title>
      <p>Our datasets creation can roughly be divided into four steps. First,
We used Twitters’ search API between January 17, 2020 and Jun
made publicly available and they are sent to the members of the
research team via the direct emails.</p>
      <p>After the challenge, the annotated datasets containing only tweet
IDs, but not the tweet text itself will be made publicly available.
These publicly available datasets will be shufled and supplied by
the additional content to prevent linking to the full-text datasets
was used during the challenge by the researcher team. An additional
tweet content download script will be provided to obtain the tweets
from their ids via the corresponding Twitter API using a
usersupplied API access keys.
3</p>
    </sec>
    <sec id="sec-3">
      <title>EVALUATION METRICS AND SUBTASKS</title>
      <p>
        The oficially reported metric used for evaluating the multi-class
classification performance is the multi-class generalization of the
Matthews correlation coeficient (MCC, Rk-statistic) [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. This
metric provides an eficient and reliable comparison for multi-class
classifiers for both balanced and unbalanced datasets.
      </p>
      <p>In case of equal metric values, we use the timestamp of the
oficial run submission to rank the teams. For the evaluation, the
participants must submit at least one run for at least one subtask
defined below. Additionally, the participants optionally can submit
four more runs for any of the described subtasks, i.e., participants
can submit up to 15 runs in total.</p>
      <sec id="sec-3-1">
        <title>Text-Based Misinformation Detection: In this subtask, the</title>
        <p>participants receive a dataset consisting of tweet text blocks in
English related to COVID-19 and various conspiracy theories. The
participants are encouraged to build a multi-class classifier that can
lfag whether a tweet promotes/supports or discusses at least one
(or many) of the conspiracy theories. In the case if the particular
tweet promotes/supports one conspiracy theory and just discusses
another, the result of the detection for the particular tweet is
expected to be equal to "stronger" class: promote/support in the given
sample.</p>
      </sec>
      <sec id="sec-3-2">
        <title>Text-Based Conspiracy Theories Recognition: In this sub</title>
        <p>task, the participants receive a dataset consisting of tweet text
blocks in English related to COVID-19 and various conspiracy
theories. The main goal of this subtask is to build a detector that can
detect whether a text in any form mentions or refers to any of the
predefined conspiracy topics.</p>
      </sec>
      <sec id="sec-3-3">
        <title>Text-Based Combined Misinformation and Conspiracies</title>
        <p>Detection: In this subtask, the participants receive a dataset
consisting of tweet text blocks in English related to COVID-19 and
various conspiracy theories. The goal of this subtask is to build
a complex multi-labelling multi-class detector that for each topic
from a list of predefined conspiracy topics can predict whether a
tweet promotes/supports or just discusses that particular topic.</p>
        <p>All the subtask, in which the team has decided to participate,
requires one mandatory and four optional runs to be submitted.
The required mandatory run implements a pure NLP classification
of tweets based only on tweet text content without using any
additional sources of data. Optional runs gradually extend the amount
and types of allowed additional information by implementing
classification based on tweet text analysis in combination with
pretrained models and classification using any automatically scraped
data from any external sources. Manual annotation of tweets or
any externally scraped data is not allowed in any run.</p>
        <p>
          In the submitted runs participants are allowed to use an
additional Cannot Determine class. This additional class represents
cases, when the output of the classifier is not reliable. This
additional class is important for evaluation of multi-class classifiers. The
efect of using Cannot Determine class is described in the related
literature [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]. In-short, marking a sample that classifier cannot
reliable classify as an unknown class afects the resulting classification
performance less negatively than marking the sample with a wrong
class label, exactly as it expected to be implemented in a real-world
classification tasks.
        </p>
        <p>With respect to the subtasks evaluation, the following
methodology is used. Text-Based Misinformation Detection subtask
is evaluated with Rk-statistic directly. Text-Based Conspiracy</p>
      </sec>
      <sec id="sec-3-4">
        <title>Theories Recognition and Text-Based Combined Misinfor</title>
        <p>mation and Conspiracies Detection subtasks are evaluated with
the two-steps evaluation procedure. First, evaluation of each
conspiracy theory individually and independently is performed using
Rk-statistic. Then all the computed Rk-statistic values across all
the conspiracy theories are averaged and the resulting averaged
value is used to compare results of diferent teams. Finally, results in
each conspiracy theory group are evaluated independently, but this
evaluation is auxiliary and do not afect the final teams ranking.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>DISCUSSION AND OUTLOOK</title>
      <p>The task itself can be seen as very atypical and challenging due to a
fairly limited amount of information available to support the tweet
classification process. This reflects the real-world conditions in
which online social media analysis systems are deployed. Thus, this
task is a practical attempt to make a step towards building a usable
multi-modal social network analysis system that is able to combine
isolated data source properties with inter-source relations. Due to
the importance of the use case, we hope to motivate researchers
from diferent research fields to present their approaches, thereby
performing research that can help society to fight against malicious
manipulations of social networks and threats to society in general.
We hope that the FakeNews task can help to raise awareness of the
topic, but also provide an interesting and meaningful use case to
researchers interested in this application.</p>
    </sec>
    <sec id="sec-5">
      <title>ACKNOWLEDGMENTS</title>
      <p>This work was funded by the Norwegian Research Council under
contracts #272019 and #303404 and has benefited from the
Experimental Infrastructure for Exploration of Exascale Computing (eX3),
which is financially supported by the Research Council of Norway
under contract 270053. We also acknowledge support from Michael
Kreil in the collection of Twitter data.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <fpage>2018</fpage>
          .
          <article-title>Toxic Comment Classification Challenge - Identify and classify toxic online comments</article-title>
          . (
          <year>2018</year>
          ). https://www.kaggle.com/c/ jigsaw-toxic
          <article-title>-comment-classification-challenge/</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Sabri</given-names>
            <surname>Boughorbel</surname>
          </string-name>
          , Fethi Jarray, and
          <string-name>
            <surname>Mohammed</surname>
          </string-name>
          El-Anbari.
          <year>2017</year>
          .
          <article-title>Optimal classifier for imbalanced data using Matthews Correlation Coeficient metric</article-title>
          .
          <source>PloS one 12</source>
          ,
          <issue>6</issue>
          (
          <year>2017</year>
          ),
          <year>e0177678</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Nitesh</surname>
            <given-names>V Chawla</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Nathalie</given-names>
            <surname>Japkowicz</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Aleksander</given-names>
            <surname>Kotcz</surname>
          </string-name>
          .
          <year>2004</year>
          .
          <article-title>Special issue on learning from imbalanced data sets</article-title>
          .
          <source>ACM SIGKDD explorations newsletter 6</source>
          ,
          <issue>1</issue>
          (
          <year>2004</year>
          ),
          <fpage>1</fpage>
          -
          <lpage>6</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Quan</given-names>
            <surname>Do</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Jigsaw Unintended Bias in Toxicity Classification</article-title>
          . (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Jan</given-names>
            <surname>Gorodkin</surname>
          </string-name>
          .
          <year>2004</year>
          .
          <article-title>Comparing two K-category assignments by a Kcategory correlation coeficient</article-title>
          .
          <source>Computational biology and chemistry 28</source>
          ,
          <fpage>5</fpage>
          -
          <lpage>6</lpage>
          (
          <year>2004</year>
          ),
          <fpage>367</fpage>
          -
          <lpage>374</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Lee</given-names>
            <surname>Howell</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Digital Wildfires in a Hyperconnected World</article-title>
          . https: //bit.ly/2GiEF4f. (
          <year>2013</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Akshay</given-names>
            <surname>Mungekar</surname>
          </string-name>
          , Nikita Parab, Prateek Nima, and
          <string-name>
            <given-names>Sanchit</given-names>
            <surname>Pereira</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Quora insincere question classification</article-title>
          .
          <source>National College of Ireland</source>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Konstantin</given-names>
            <surname>Pogorelov</surname>
          </string-name>
          , Daniel Thilo Schroeder, Petra Filkuková, Stefan Brenner, and
          <string-name>
            <given-names>Johannes</given-names>
            <surname>Langguth</surname>
          </string-name>
          .
          <year>2021</year>
          .
          <article-title>WICO Text: A Labeled Dataset of Conspiracy Theory and 5G-Corona Misinformation Tweets</article-title>
          .
          <source>In Proceedings of the 2021 Workshop on Open Challenges in Online Social Networks</source>
          .
          <fpage>21</fpage>
          -
          <lpage>25</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>