<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Bidirectional Encoder Representations from Transformers for the COVID-19 vaccine stance classification</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Abhinav Kumar</string-name>
          <email>abhinavanand05@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Pradeep Kumar Roy</string-name>
          <email>pradeep.roy@iiitsurat.ac.in</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jyoti Prakash Singh</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Workshop Proceedings</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science &amp; Engineering, Siksha 'O' Anusandhan Deemed to be University</institution>
          ,
          <addr-line>Bhubaneswar</addr-line>
          ,
          <country country="IN">India</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Indian Institute of Information Technology Surat</institution>
          ,
          <addr-line>Gujarat</addr-line>
          ,
          <country country="IN">India</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>National Institute of Technology Patna</institution>
          ,
          <country country="IN">India</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Vaccine-related information is awash on social media platforms like Twitter and Facebook. One party supports vaccination, while the other opposes vaccination and promotes misconceptions and misleading information about the risks of vaccination. The analysis of social media posts can give significant information into public opinion on vaccines, which can help government authorities in decision-making. In this work, an ensemble-based BERT model has been proposed for the classification of COVID19 vaccine-related tweets into AntiVax, ProVax, and neural sentiment classes. The proposed model performed significantly well with a micro  1-score of 0.532 and an accuracy of 0.532 and achieved the second rank in the shared competition.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        The COVID-19 outbreak shows no signs of slowing down, and vaccination looks to be the only
long-term cure. People all around the globe began to share their thoughts about the vaccination
when the first vaccine with a 90 percent eficacy rate was revealed on November 9, 2020 [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>
        Many people, however, are sceptical of vaccinations for a variety of reasons. Social media sites
like Twitter and Facebook are inundated with vaccine-related information [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. One group of
individuals is in favour of vaccination, while another opposes vaccination and spreads myths
and false information about the dangers of vaccination. The study of social media posts can
provide valuable insight of public opinion on vaccinations, which can aid government agencies
in making decisions about their future steps.
      </p>
      <p>
        User-generated social media data has been used in the past during disasters to inform others
about the situation and assist victims [
        <xref ref-type="bibr" rid="ref3 ref4 ref5 ref6 ref7">3, 4, 5, 6, 7</xref>
        ]. Singh et al. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] extracted several textual
features from the flood related tweets to classify it into informative and not-informative tweets.
Kumar et al. [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] proposed deep multi-modal neural network by combining imagery and textual
contents of the disaster-related Twitter posts to classify it into informative and not-informative
contents. Kumar el al. [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] utilized earthquake related tweets to identify vocation names
mentioned in the tweets. Baweja et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] proposed a machine learning-based model to first
identify the need and availability of various resources during the disaster and then extract
tweets of individuals expressing the same need and availability.
      </p>
      <p>
        A few work [
        <xref ref-type="bibr" rid="ref10 ref11 ref12 ref8 ref9">8, 9, 10, 11, 12</xref>
        ] have been reported by the researchers where they tried to
identify COVID-19 fake news from the social media. Glazkova et al. [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] developed a fine-tuned
ensemble-based model composed of three parallel BERT models pre-trained on COVID-19
social media postings in order to detect COVID-19 fake news. Wani et al. [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] experimented
with a number of deep learning models, including CNN, LSTM, and several BERT versions
whereas [
        <xref ref-type="bibr" rid="ref10 ref11 ref12">10, 11, 12</xref>
        ] used various conventional machine learning classifiers such as Decision
Tree, Gradient Boosting, Support Vector Machine, Random Forest, Logistic Regression, and
Naive Bayes to identify COVID-19 fake news. Recently, Cotfas et al. [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] proposed several
conventional machine learning classifiers such as Naive Bayes, Random Forest, Support Vector
Machine and deep learning models such as Bi-LSTM, CNN, and BERT to classify tweets into
neural, against, and in favour of vaccination classes.
      </p>
      <p>This work proposes an ensemble-based BERT (Bidirectional Encoder Representations from
Transformers) model to classify COVID-19 vaccination related tweets into AntiVax (“the tweet
is against the use of vaccines”), ProVax (“the tweet supports / promotes the use of vaccines”),
and Neural (“the tweet does not have any discernible sentiment expressed towards vaccines
or is not related to vaccines”) classes. The proposed model is validated with the shared task
IRMiDis FIRE-2021.1</p>
      <p>The rest of the sections are organized as follows: section 2 discusses the proposed
ensemblebased BERT model in detail. Section 3 list the finding of the proposed system and finally the
paper is concluded in section 4.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Methodology</title>
      <p>The systematic diagram for the proposed ensemble-based BERT model can be seen in Figure
1. The proposed model is validated by the dataset published in the IRMiDis FIRE-2021 shared
task.2. The dataset contains 1,010 tweets of neural , 991 tweets of ProVax, and 791 tweets of
AntiVax classes.</p>
      <p>
        The proposed model uses CT-BERT (COVID-Twitter-BERT) [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] to fine-tuned with the of
COVID-19 vaccination stance classification tasks. The CT-BERT model was trained using
160 million tweets from January 12 to April 16, 2020, all of which included at least one of
the keywords “wuhan,” “ncov,” “coronavirus,” “covid,” or “sars-cov-2.” These tweets were then
ifltered and preprocessed, yielding a total training sample of 22.5 million tweets (containing
40.7 million sentences and 633 million tokens). The detail description of the CT-BERT model
can be seen in Müller et al. [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ].
      </p>
      <p>First, the CT-BERT model is fine-tuned with the three diferent validation split. To split the
provided training set into train and validation, 42, 10, and 20 are used as the random state (RS).
To fined-tuned the CT-BERT on our dataset, a maximum length of the tweets is used as 30,
1https://sites.google.com/view/irmidisfire2021/home?authuser=0
2https://sites.google.com/view/irmidisfire2021/home?authuser=0
iecn lepo ad
c
av ep en
9 g g
id1 in A
cov fakm itlaon</p>
      <p>u
enw tco pop</p>
      <p>e e
teh ffe #D
ife ied t?e
n s il
iIagm saah ifrn
e</p>
      <sec id="sec-2-1">
        <title>BERT (CT-BERT)</title>
        <p>(Validation split (RS-42))</p>
      </sec>
      <sec id="sec-2-2">
        <title>BERT (CT-BERT)</title>
        <p>(Validation split (RS-10))</p>
      </sec>
      <sec id="sec-2-3">
        <title>BERT (CT-BERT)</title>
        <p>(Validation split (RS-20))</p>
        <sec id="sec-2-3-1">
          <title>P1(Neutral)</title>
        </sec>
        <sec id="sec-2-3-2">
          <title>P1(ProVax)</title>
        </sec>
        <sec id="sec-2-3-3">
          <title>P2(AntiVax)</title>
        </sec>
        <sec id="sec-2-3-4">
          <title>P2(Neutral)</title>
        </sec>
        <sec id="sec-2-3-5">
          <title>P2(ProVax)</title>
        </sec>
        <sec id="sec-2-3-6">
          <title>P3(AntiVax)</title>
        </sec>
        <sec id="sec-2-3-7">
          <title>P3(Neutral)</title>
        </sec>
        <sec id="sec-2-3-8">
          <title>P3(ProVax) Avg.</title>
        </sec>
        <sec id="sec-2-3-9">
          <title>P(AntiVax) Avg.</title>
        </sec>
        <sec id="sec-2-3-10">
          <title>P(ProVax) Avg.</title>
        </sec>
        <sec id="sec-2-3-11">
          <title>P(Neutral)</title>
        </sec>
        <sec id="sec-2-3-12">
          <title>AntiVAx</title>
        </sec>
        <sec id="sec-2-3-13">
          <title>ProVAx</title>
        </sec>
        <sec id="sec-2-3-14">
          <title>Neural</title>
          <p>batch size of 32, a learning rate of 2 −5 and we trained the model for 20 epochs. After training
three diferent models with diferent validation split, the class probability is then predicted
for provided testing samples. The class-wise probability form all the three trained models is
then averaged to get the final probability for each class AntiVax, ProVax, and Neutral (as can be
seen in Figure 1. Finally, the test sample belonging to that class which has the highest average
probability.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Results</title>
      <p>The result of the proposed model is measured in terms of macro  1-score and accuracy. Along
with this, class-wise precision, recall, and  1-score for the validation data are also shown to
understand the class-wise performance of the model on the validation set. The results for the
three diferent CT-BERT models fine-tuned on the diferent validation sets are listed in Table 1.</p>
      <p>The CT-BERT (RS-42) achieved both macro  1-score and an accuracy of 0.85. Similarly,
CTBERT (RS-20) achieved macro  1-score and accuracy of 0.86 whereas CT-BERT (RS-10) model
achieved a macro  1-score of 0.86 and an accuracy of 0.85.</p>
      <p>Three diferent models (i) Ensemble-based CT-BERT model, (ii) CT-BERT (RS-42), and (iii)
CT-BERT (RS-10) were submitted for the final evaluation on the private testing dataset provided
by the organizer. The result of the proposed ensemble-based CT-BERT model is listed in Table
2. The proposed ensemble-based CT-BERT model achieved a micro  1-score of 0.556 and an
accuracy of 0.555. The CT-BERT (RS-42) achieved a macro  1-score of 0.548 and an accuracy of
0.549. The CT-BERT achieved a macro  1-score of 0.532 and an accuracy of 0.532.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Conclusion</title>
      <p>During the COVID-19 pandemic, social media such as Twitter and Facebook are flooded with
COVID-related information. A significant amount of fake information and myths are also
posted by the people on the vaccination. In this work, we have proposed an ensemble-based
model that classified COVID-19 vaccination-related tweets into three categories such as AntiVax,
ProVax, and Neutral. The proposed model is performed significantly well in the shared task
and achieved a macro  1-score of 0.532 and an accuracy of 0.532. In the future, a more robust
system can be made by integrating linguistic, character-level, and word-level features together
with the ensemble-based model.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>L.-A.</given-names>
            <surname>Cotfas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Delcea</surname>
          </string-name>
          , I. Roxin,
          <string-name>
            <given-names>C.</given-names>
            <surname>Ioanăş</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. S.</given-names>
            <surname>Gherai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Tajariol</surname>
          </string-name>
          ,
          <article-title>The longest month: Analyzing covid-19 vaccination opinions dynamics from tweets in the month following the first vaccine announcement</article-title>
          ,
          <source>IEEE Access 9</source>
          (
          <year>2021</year>
          )
          <fpage>33203</fpage>
          -
          <lpage>33223</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>M. M. Müller</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Salathé</surname>
          </string-name>
          ,
          <article-title>Crowdbreaks: tracking health trends using public social media data and crowdsourcing, Frontiers in public health 7 (</article-title>
          <year>2019</year>
          )
          <fpage>81</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y. K.</given-names>
            <surname>Dwivedi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. P.</given-names>
            <surname>Rana</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. K.</given-names>
            <surname>Kapoor</surname>
          </string-name>
          ,
          <article-title>Event classification and location prediction from tweets during disasters</article-title>
          ,
          <source>Annals of Operations Research</source>
          <volume>283</volume>
          (
          <year>2019</year>
          )
          <fpage>737</fpage>
          -
          <lpage>757</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>A.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y. K.</given-names>
            <surname>Dwivedi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. P.</given-names>
            <surname>Rana</surname>
          </string-name>
          ,
          <article-title>A deep multi-modal neural network for informative twitter content classification during emergencies</article-title>
          ,
          <source>Annals of Operations Research</source>
          (
          <year>2020</year>
          )
          <fpage>1</fpage>
          -
          <lpage>32</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>A.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <article-title>Location reference identification from tweets during emergencies: A deep learning approach</article-title>
          ,
          <source>International journal of disaster risk reduction 33</source>
          (
          <year>2019</year>
          )
          <fpage>365</fpage>
          -
          <lpage>375</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>A.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Saumya</surname>
          </string-name>
          ,
          <article-title>A comparative analysis of machine learning techniques for disaster-related tweet classification</article-title>
          ,
          <source>in: 2019 IEEE R10 Humanitarian Technology Conference (R10-HTC)</source>
          (
          <volume>47129</volume>
          ), IEEE,
          <year>2019</year>
          , pp.
          <fpage>222</fpage>
          -
          <lpage>227</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>S.</given-names>
            <surname>Baweja</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Aggarwal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Goyal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Mehta</surname>
          </string-name>
          ,
          <article-title>Automatic retrieval of actionable information from disaster-related microblogs</article-title>
          .,
          <source>in: FIRE (Working Notes)</source>
          ,
          <year>2017</year>
          , pp.
          <fpage>43</fpage>
          -
          <lpage>45</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>A.</given-names>
            <surname>Glazkova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Glazkov</surname>
          </string-name>
          , T. Trifonov,
          <article-title>g2tmn at constraint@ aaai2021: exploiting ct-bert and ensembling learning for covid-19 fake news detection</article-title>
          , arXiv preprint arXiv:
          <year>2012</year>
          .
          <volume>11967</volume>
          (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>A.</given-names>
            <surname>Wani</surname>
          </string-name>
          , I. Joshi,
          <string-name>
            <given-names>S.</given-names>
            <surname>Khandve</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Wagh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Joshi</surname>
          </string-name>
          ,
          <article-title>Evaluating deep learning approaches for covid19 fake news detection</article-title>
          ,
          <source>in: Combating Online Hostile Posts in Regional Languages during Emergency Situation: First International Workshop</source>
          , CONSTRAINT 2021,
          <article-title>Collocated with AAAI 2021</article-title>
          ,
          <string-name>
            <given-names>Virtual</given-names>
            <surname>Event</surname>
          </string-name>
          ,
          <source>February</source>
          <volume>8</volume>
          ,
          <year>2021</year>
          , Revised Selected Papers, Springer Nature,
          <year>2021</year>
          , p.
          <fpage>153</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>P.</given-names>
            <surname>Patwa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Sharma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Pykl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Guptha</surname>
          </string-name>
          , G. Kumari,
          <string-name>
            <given-names>M. S.</given-names>
            <surname>Akhtar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ekbal</surname>
          </string-name>
          ,
          <string-name>
            <surname>A. Das</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Chakraborty</surname>
          </string-name>
          ,
          <article-title>Fighting an infodemic: Covid-19 fake news dataset</article-title>
          ,
          <source>Communications in Computer and Information Science</source>
          <volume>1402</volume>
          (
          <year>2021</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>A. R.</given-names>
            <surname>Mahlous</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Al-Laith</surname>
          </string-name>
          ,
          <article-title>Fake news detection in arabic tweets during the covid-19 pandemic</article-title>
          ,
          <source>International Journal of Advanced Computer Science and Applications</source>
          <volume>12</volume>
          (
          <year>2021</year>
          ). URL: http://dx.doi.org/10.14569/IJACSA.
          <year>2021</year>
          .
          <volume>0120691</volume>
          .
          <source>doi:1 0 . 1 4</source>
          <volume>5 6</volume>
          <fpage>9</fpage>
          <string-name>
            <surname>/ I J A C S</surname>
          </string-name>
          <article-title>A</article-title>
          .
          <volume>2 0 2 1 . 0 1 2 0 6 9 1 .</volume>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>T.</given-names>
            <surname>Felber</surname>
          </string-name>
          ,
          <year>Constraint 2021</year>
          :
          <article-title>Machine learning models for covid-19 fake news detection shared task</article-title>
          ,
          <source>arXiv preprint arXiv:2101.03717</source>
          (
          <year>2021</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>M.</given-names>
            <surname>Müller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Salathé</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. E.</given-names>
            <surname>Kummervold</surname>
          </string-name>
          ,
          <article-title>Covid-twitter-bert: A natural language processing model to analyse covid-19 content on twitter</article-title>
          , arXiv preprint arXiv:
          <year>2005</year>
          .
          <volume>07503</volume>
          (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>