<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>MediaEval 2020: An Ensemble-based Multimodal Approach for Coronavirus and 5G Conspiracy Tweet Detection</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Chahat Raj</string-name>
          <email>chahatraj58@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mihir P Mehta</string-name>
          <email>mihirm3795@gmail.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Delhi Technological University</institution>
          ,
          <addr-line>Delhi</addr-line>
          ,
          <country country="IN">India</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Indian Institute of Management Raipur</institution>
          ,
          <addr-line>Chhattisgarh</addr-line>
          ,
          <country country="IN">India</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2020</year>
      </pub-date>
      <fpage>14</fpage>
      <lpage>15</lpage>
      <abstract>
        <p>In the wake of ongoing COVID-19 pandemic, a parallel stream of misinformation and conspiracies rises on the internet. People around the world are being flooded with texts and visuals claiming false statements linked with coronavirus disease. This paper presents a multi-modal fake news detection system that uses text and image features to detect conspiracy tweets. This research has been performed in context with the FakeNews: Coronavirus and 5G Conspiracy task of MediaEval 2020. The NLP subtask we have performed utilizes an ensemble of machine learning and deep learning algorithms for the analysis of textualvisual data. We demonstrate the performances of experiments performed for each modality and results obtained after their fusion.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 INTRODUCTION</title>
      <p>Scientists, Economics, Mathematicians, Analysts and many other
professionals have made their claim by formulating theories on
origination and spread of the Coronavirus Disease 2019
(COVID19). Research and Investment are made both on cure and tracing
the cause of the origination of pandemic. And along with the rising
number of these theories, the spread of misinformation related to
COVID-19, termed as ‘Infodemic’ has been on the rise too, a lot of
times from internet users, public figures and potentially trusted
sources. Messages and media carrying such misinformation are
spread both intentionally and unintentionally. Several times, they
have been linked with existing theories that make them sound
true despite not involving either substantial proof or logic. People
also get amused by the superficial texts and images carried by the
misinformation and tend not to verify the credibility that it
carries. Moreover, they pass it further to their friends and families
whom they are trusted by and ultimately the misinformation
manages to convince a large group of audience that is connected
via this network and thereby impacting the habit and lifestyle of
the people that accept it. These changes can have an adverse effect
or tend to be of no use and consume time and other material
resources of people. Hence it becomes necessary to identify,
evaluate and share the authenticity of every information,
especially those involving conspiracy claims.</p>
      <p>One such misinformation that has impacted the thoughts and
lifestyle of people and the emergence of technology and
revenue of several brands is 5G Corona Conspiracy. This
conspiracy has played its significant path to impact the minds
of consumers by creating ambiguity about the safety of using
5G communication technology.</p>
      <p>
        To fight the ongoing misinformation wave amidst the
pandemic, our NLP subtask at MediaEval 2020 uses ensemble
technique with multiple ML and DL models to identify 5G
related coronavirus conspiracies prevalent on Twitter. Detailed
overview of the task and dataset has been described by
Pogorelov et al. [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>APPROACH</title>
      <p>We adopt an ensembling approach incorporating several machine
learning and deep learning-based text and image classifiers. We
divide our approach into three routines: text-based classification,
image-based classification and fusion of text and image models.</p>
      <p>The proposed architecture uses a combination of features
obtained from multiple classifiers. We experimented with
several text classifiers on the development dataset and decided
to use a fixed subset of them based on the results obtained on
each one of them separately. We have used Support Vector
Machine (SVM), Naïve Bayes (NB), K-Nearest Neighbour (KNN),
LSTM (Long-Short Term Memory) and Bi-LSTM
(BidirectionalLSTM) for the NLP classification task. Each tweet undergoes
preprocessing steps before being passed to these classifiers.
These include URL removal, punctation removal, lowercasing,
tokenization, stopword removal, stemming/lemmatization and
padding. We incorporate LSTM and Bi-LSTM with series of
Dense layers and setting Dropout value to 0.5. RMSprop
optimizer has been used while training LSTM and Bi-LSTM
models for 15 epochs each with a batch size equal to 64. For
text-based approach, classification results obtained from SVM,
NB, KNN, LSTM and Bi-LSTM are used for majority voting to
obtain final predictions.</p>
      <p>
        For visual classification, we filtered tweets containing
images and obtained 171 images with the label 5G Coronavirus
Conspiracy, 118 belonging to Other Conspiracy class and the
rest 791 were Non-Conspiracy tweet images. The test set
consisted of 617 images. We fine-tune and use three deep
learning models namely, VGG16 [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], Xception [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and
InceptionV3 [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] for classifying images and use their results for
majority voting to make final predictions.
      </p>
      <p>These models have been pre-trained previously and we
fine-tune them setting the dropout value to 0.5 and an added
batch normalization layer after the dropout layer. We have
used sigmoid activation in the last layer for binary predictions
and softmax for multiclass predictions. We have used Adam
optimizer for all visual classification models and trained them
for 15 epochs each setting batch size to 64.</p>
      <p>For multi-modal classification, we ensemble all text and
image-based classifiers utilized and employ max-voting for
final classification. Figure 1 demonstrates the ensembling
architecture. The class with the highest number of votes is
selected as the predicted class for each tweet. Development on
all runs has been performed by splitting the dataset into 7:3
ratio for training and validation. We provide the details of
models used and results obtained on validation in Table 1 and
Table 2.
3</p>
    </sec>
    <sec id="sec-3">
      <title>RESULTS AND ANALYSIS</title>
      <p>The MediaEval 2020 FakeNews: Coronavirus and 5G Conspiracy
NLP subtask requires classification of tweets related to
coronavirus and 5G conspiracy from other conspiracy and
nonconspiracy tweets. Table 2 and Table 3 show the classification
results on the development set and test set respectively. We
perform five runs on the given task which include three-class
classification and coarse two-class classification wherein
nonconspiracy and other conspiracy tweets are combined into a single
class. Our first run performs ternary classification using text
classifiers only. The second run combines text and image modality
classification results and return results based on both combined.
The third, fourth and fifth runs are coarse two-class classifiers
performing text-based, image-based and classification based on
text and image features combined, respectively.</p>
      <p>Observing the trend of results obtained in development and
training phases, we observe that binary classifier performed
better than three-class classifier. Our binary text classifier
achieved third highest score (0.4179) in the challenge. This
demonstrates that our model finds it easier to distinguish 5G
coronavirus conspiracies from all other conspiracies and real
tweets. Ternary text-based classification achieved a score of
0.3408. Image-based detection quality can be further improved
significantly. Low scores of models using image modality owe
to the small size of visual data. Proposed method with larger
dataset would perform eminently. We suggest the use of data
augmentation techniques for better performance.</p>
    </sec>
    <sec id="sec-4">
      <title>DISCUSSION</title>
      <p>In this paper, we employ machine learning and deep
learningbased ensembling technique that uses majority voting to deduce
predictions if a tweet is related to 5G Coronavirus conspiracy or
not. We perform a multimodal analysis utilizing text-based NLP
features from the tweet and visual features from the images posted
along with those tweets. We build a fusion model that incorporates
both textual and visual features and generates prediction based on
each modality separately and their combination. Our classification
approach plays with both binary and ternary classifiers to
experiment with the efficiency of the ensemble models. The
limitation we encounter is the lack of sufficient training data and
propose to fix it in future works using data augmentation
techniques on both text and image data to receive better
performance and healthier conspiracy detection.</p>
      <p>FakeNews: Corona virus and 5G conspiracy</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Pogorelov</surname>
            , Konstantin, Daniel Thilo Schroeder, Luk Burchard,
            <given-names>Johannes</given-names>
            Moe, Stefan Brenner, Petra Filkukova and Johannes
          </string-name>
          <string-name>
            <surname>Langguth</surname>
          </string-name>
          .
          <article-title>"FakeNews: Corona Virus and 5G Conspiracy Task at MediaEval</article-title>
          <year>2020</year>
          .
          <article-title>"</article-title>
          <source>Proc. of the MediaEval 2020 Workshop</source>
          , Online,
          <fpage>14</fpage>
          -
          <lpage>15</lpage>
          December
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2] Schroeder, Daniel Thilo, Konstantin Pogorelov, and
          <string-name>
            <given-names>Johannes</given-names>
            <surname>Langguth</surname>
          </string-name>
          .
          <article-title>"FACT: a Framework for Analysis and Capture of Twitter Graphs."</article-title>
          <source>In 2019 Sixth International Conference on Social Networks Analysis, Management and Security (SNAMS)</source>
          , pp.
          <fpage>134</fpage>
          -
          <lpage>141</lpage>
          . IEEE,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Karen</given-names>
            <surname>Simonyan</surname>
          </string-name>
          and
          <string-name>
            <given-names>Andrew</given-names>
            <surname>Zisserman</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Very Deep Convolutional Networks for Large-Scale Image Recognition</article-title>
          . In International Conference on Learning Representations
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Chollet</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Xception: Deep learning with depthwise separable convolutions</article-title>
          .
          <source>In Proceedings of the IEEE conference on computer vision and pattern recognition</source>
          (pp.
          <fpage>1251</fpage>
          -
          <lpage>1258</lpage>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Szegedy</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jia</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sermanet</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reed</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Anguelov</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , ... &amp;
          <string-name>
            <surname>Rabinovich</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2015</year>
          ).
          <article-title>Going deeper with convolutions</article-title>
          .
          <source>In Proceedings of the IEEE conference on computer vision and pattern recognition</source>
          (pp.
          <fpage>1</fpage>
          -
          <lpage>9</lpage>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>