<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>COVID-19 MEASURES SENTIMENT ANALYSIS BASED ON A SOCIAL NETWORK DATASET</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Kateryna Kononova</string-name>
          <email>kateryna.kononova@karazin.ua</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rostyslav Lutsenko</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>D.Sc., professor, Karazin Kharkiv National University</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>M.S., Karazin Kharkiv National University</institution>
        </aff>
      </contrib-group>
      <fpage>8</fpage>
      <lpage>17</lpage>
      <abstract>
        <p>The paper discusses the prototype of a public opinion monitoring system toward Covid-19 measures using machine learning and sentiment analysis, which includes: 1) data collection; 2) data preprocessing and statistical analysis of a created corpus; 3) semantic analysis of the corpus; 4) sentiment analysis of the data. The sample of posts, which was collected for the period April-May 2020 using 360 unique search queries, includes 6726 publications of Ukrainian Facebook users. The assessment obtained by the proposed methodology is confirmed by the results of the survey about supporting the government's Covid activities, according to which only about 10% of respondents are positive about the government's actions, and more than 60% are negative. Thus, the proposed system, developed as a set of Python and SQL scripts, can be recommended for implementation.</p>
      </abstract>
      <kwd-group>
        <kwd>coronavirus</kwd>
        <kwd>government</kwd>
        <kwd>public</kwd>
        <kwd>survey</kwd>
        <kwd>machine learning</kwd>
        <kwd>semantic analysis</kwd>
        <kwd>sentiment analysis</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>The SARS-COV-2 pandemic has been going on already for more than a year. It is
characterized by rapid spread, high mortality, and lack of specific treatment. The
pandemic has become a factor that forced the governments of many countries to
reconsider economic and social policies and formulate new development priorities.</p>
      <p>
        The disease, which started in December 2019 in Wuhan (Hubei Province, China), was
recognized by the WHO as a pandemic on March 1
        <xref ref-type="bibr" rid="ref1">1, 2020</xref>
        <xref ref-type="bibr" rid="ref13 ref16 ref17 ref2 ref3 ref4 ref5">(World Health Organization,
2020)</xref>
        . Now the pandemic spread to almost all countries (Fig. 1). After the Covid outbreak
in March-April 2020, the number of infected reached the plateau. However, in late
August, the second wave of the disease began.
      </p>
      <p>
        In Ukraine, the first case of the coronavirus was registered in the Chernivtsi region on
March 3, 2020; the active phase of the pandemic began on March 25
        <xref ref-type="bibr" rid="ref12 ref13 ref14 ref15 ref16 ref17 ref2 ref3 ref4 ref5 ref7">(Public Health
Center of the Ministry of Health of Ukraine, 2020)</xref>
        . The introduction of a lockdown in
March-April delayed the spread of the coronavirus. However, after the weakening of the
quarantine regime, the number of patients started growing again
        <xref ref-type="bibr" rid="ref12 ref13 ref2 ref3 ref4 ref5 ref6">(World Data Center for
Geoinformatics and Sustainable Development, 2020)</xref>
        . In May-July 2020, there was some
stabilization, but just after the summer season, the number of infected people began to rise
sharply. In autumn-winter 2020, the epidemical situation in Ukraine deteriorated, leading
to the second lockdown in January 2021 (World Center data on geoinformatics and
sustainable development, 2021).
      </p>
      <p>Covid-19 has an extremely negative impact on the global economy: according to the
World Bank, a global GDP contraction of 5.2% was expected in 2020 (Table 1). The
income per capita might fall 3.6%, leaving millions of people in poverty.</p>
      <p>Governments are eager to find and implement proper actions to combat the coronavirus
spread, as well as to overcome the negative economic consequences of the pandemic. The
key to success in overcoming the pandemic is the consolidation of trust between
governments and people: the effectiveness of Covid-19 control measures depends on the
conscious attitude and responsibility of each citizen.</p>
      <p>This work aims to develop a prototype of a public opinion monitoring system toward
Covid-19 measures. This could be done in a form of a regular survey or by applying
machine learning and sentiment analysis to the data retrieved from social media.
1 percent change from the previous year
2 estimate
3 forecast</p>
    </sec>
    <sec id="sec-2">
      <title>A survey approach</title>
      <p>In a pandemic, Gallup International experts have studied the issue of the government’s
actions public support4. According to a three-wave survey in June 2020, the highest
support5 was observed in Georgia (94%), Malaysia (94%), and the Republic of Korea
(85%). The lowest levels were in Japan (34%), Bosnia and Herzegovina (35%), the United
Kingdom (38%), and the United States (40%). The results in the dynamics are also quite
interesting (Fig. 2): the data shows that support may increase (Kazakhstan, Malaysia,
Republic of Korea, and Switzerland), decrease (Austria, Italy, Macedonia, and the United
Kingdom), or fluctuate (India, Pakistan, Philippines, Russia, and the United States).
4 The question was: to what extent do you agree or disagree with the following statement – ‘I think the
government is doing well with the coronavirus’
5 Strongly agree and agree with the statement above</p>
      <p>Unfortunately, Ukraine has not participated in this survey. To measure Ukrainians’
sentiments towards the Covid measures, we conducted an online survey6, in which 1600
respondents took part. The results showed that only about 10% of Ukrainians are positive
about the government's actions, while more than 60% are negative about them (Fig. 3).</p>
    </sec>
    <sec id="sec-3">
      <title>3. A sentiment analysis approach</title>
      <p>
        To develop a prototype of a public opinion monitoring system toward Covid-19
measures we proposed applying machine learning and sentiment analysis to the content of
the social network7. As a platform for the analysis, Facebook was chosen, because this
social network is the largest social media used by more than two billion people every
month. According to Research &amp; Branding Group, the number of active users in Ukraine
is about 63% of the population, which is approximately 20.8 million people
        <xref ref-type="bibr" rid="ref8">(FutureNow,
2020)</xref>
        . Moreover, Facebook has features that could enrich the results of the semantic
analysis: a quantity of ‘likes’ allows assessing the content relevance, the types of reactions
show user's attitude to the post, the number of reposts reflects its distribution among other
users.
      </p>
      <p>The development of the system includes the following steps, which are implemented in
the form of SQL and Python scripts:</p>
      <p>I. Data collection.</p>
      <p>II. Data preprocessing and statistical analysis of a created corpus.</p>
      <p>III. Semantic analysis of the corpus.</p>
      <p>IV. Sentiment analysis of the data.
6 Poll link (https://docs.google.com/forms/d/e/1FAIpQLSe3pk8AONnLr6PB1VJCBAHWzF7
tBhnS2wk8VKkEoL-SVHiwDQ/viewform) was posted on Telegram channels
7 According to the Institute of Sociology NAS of Ukraine, most Ukrainians receive information about the
pandemic from the media and social networks</p>
      <p>At the first stage, raw data was downloaded from Facebook using the DATA365 API.
To identify posts discussing the Covid measures, more than 360 unique search queries
have been created. The sample, which was collected for the period April-May 2020,
includes 6726 publications of Ukrainian users (Fig. 4).
5).
At the second stage, the texts were pre-processed:
1. Lowercase letters formatting to avoid repeating words;
2. Redundant characters removing (numbers, punctuation, etc.);
3. Stop words deleting (such as ‘this’, ‘and’, etc.);
4. Stemming (reducing words to the root form).</p>
      <p>After the corpus had been created, the list of the most used words was analyzed (Fig.</p>
      <p>
        Semantic analysis of the corpus (stage III) was performed using Word2Vec tools. The
Word2Vec model is a two-layer neural network that learns to reconstruct the linguistic
context of words. As a result, words that have a common context in the corpus are placed
close to each other in the vector space. This makes it possible to analyze the main
8 The data includes id, time, text, language, tags, reaction_count, comment_count, share_count,
reaction_like_count, reaction_love_count, reaction_haha_count, reaction_wow_count, reaction_sad_count,
reaction_angry_count
narratives. Using the Python word2vec library
        <xref ref-type="bibr" rid="ref10">(Python word2vec Library, 2020)</xref>
        , we
calculated the distances between the words (Table 2).
      </p>
      <p>CORONAVIR
USstate budget
epidemic
complications
pandemic
accident
outbreak
measures
morbidity</p>
      <p>GOVERNME
NTresistance
infographics
ministry
continue
extraordinary
attenuated
strengthen</p>
      <p>As we can see from Table 2, posts about the coronavirus most often discuss the state
budget and the Covid-19 control measures; in the context of the lockdown people talk
about its duration, penalties, and violations, when talking about the government, it is
about pandemic resistance and infographics on anti-epidemical measures.</p>
      <p>
        At the fourth stage, dictionaries of positive and negative words were constructed
        <xref ref-type="bibr" rid="ref9">(Text
Analysis API, 2020)</xref>
        . In the positive dictionary (Fig. 6) one can see the words that are
important for people in difficult times (‘life’, ‘help’, etc.). The words ‘death’, ‘crisis’, and
‘restrictions’ are on the top in the negative dictionary. Dictionaries comparing shows that
on average, words with a positive connotation are used 30% more often than pessimistic
ones. Even the words ‘life’ and ‘death’, which are at the top of both dictionaries, are
correlated as 1.7 to 1.
      </p>
      <p>
        Analysis of fig. 7 shows that the word ‘coronavirus’ causes very controversial emotions
– ‘laughter’ and ‘anger’ are almost on the same level. The mention of the words
‘government’ and ‘lockdown’ often causes ‘anger’ and ‘sadness’; ‘mask’ – ‘love’ and
‘anger’, ‘president’ and ‘economy’ – ‘laughter’ and ‘anger’. According to Plutchik’s
wheel of emotions
        <xref ref-type="bibr" rid="ref11">(Bunyak, 2017)</xref>
        , we see that ‘anger’ in combination with other
emotions causes complex negative sentiments: aggression, contempt, pity, and
disapproval. In particular, the sentiments about the President and Lockdown = ‘laughter’ +
‘anger’ = ‘aggression’, the Government = ‘anger’ + ‘sadness’ = ‘contempt’.
      </p>
      <p>These results are in line with the survey results that allow to recommend implementing
the public opinion monitoring system toward Covid-19 measures based on the social
network's content using machine learning and sentiment analysis.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Conclusions</title>
      <p>The paper discusses the prototype of the public opinion monitoring system toward
Covid19 measures based on the social network's content using machine learning and sentiment
analysis, which includes: 1) Data collection; 2) Data preprocessing and statistical analysis
of a created corpus; 3) Semantic analysis of the corpus; 4) Sentiment analysis of the data.
The sample of posts, which was collected for the period April-May 2020 using 360 unique
search queries, includes 6726 publications of Ukrainian Facebook users.</p>
      <p>The most frequently used words of the retrieved corpus were ‘coronavirus’,
‘pandemic’, ‘lockdown’, ‘mask’, ‘power’, ‘state’, ‘president’. Semantic analysis of the
corpus using Word2Vec tools showed that posts about coronavirus most often discussed
the state budget and the Covid measures; in the context of the lockdown people talk about
its duration, penalties and violations, when talking about the government, it is about
pandemic resistance and infographics on anti-epidemical measures.</p>
      <p>To analyze public sentiments, dictionaries of positive and negative words were created.
Dictionaries comparing shows that on average, words with a positive connotation are used
30% more often than pessimistic ones. Analysis of reactions to the posts showed that the
word ‘coronavirus’ causes very controversial emotions – ‘laughter’ and ‘anger’. At the
same time, the mention of the words ‘government’ and ‘lockdown’ most often causes
‘anger’ and ‘sadness’, ‘president’ and ‘economy’ – ‘laughter’ and ‘anger’ (‘contempt’ and
‘aggression’ according to Plutchik's methodology).</p>
      <p>The assessment obtained by the proposed methodology is confirmed by the results of
the survey about supporting the policy responses to Covid. According to this survey, only
about 10% of respondents are positive about the government's actions, and more than 60%
are negative.</p>
      <p>Thus, the proposed public opinion monitoring system toward Covid-19 measures based
on the social network's content using machine learning and sentiment analysis can be
recommended for implementation.
5.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>Gallup</given-names>
            <surname>International</surname>
          </string-name>
          . (
          <year>2020</year>
          ). Retrieved from https://www.gallup-international.com/wpcontent/uploads/2020/08/Covid-19Wave
          <string-name>
            <surname>-</surname>
          </string-name>
          3
          <article-title>-1</article-title>
          .pdf
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2. The world bank. (
          <year>2020</year>
          ).
          <article-title>Global economic prospects Retrieved from https://www</article-title>
          .worldbank.org/en/publication/global-economic-prospects
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3. World Health Organization. (
          <year>2020</year>
          ). Retrieved from https://Covid19.who.int/
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>State</given-names>
            <surname>Institution</surname>
          </string-name>
          <article-title>"Public Health Center of the Ministry of Health of Ukraine"</article-title>
          . (
          <year>2020</year>
          ). Retrieved from https://www.phc.org.ua/
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <article-title>World Data Center for Geoinformatics and Sustainable Development</article-title>
          . (
          <year>2020</year>
          ). Retrieved from http://wdc.org.ua/uk/Covid19-second
          <string-name>
            <surname>-</surname>
          </string-name>
          wave-growth; http://wdc.org.ua/uk/Covid19-transformation
          <article-title>-after-pandemic-europe</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Pandemic</surname>
            <given-names>COVID</given-names>
          </string-name>
          -19
          <article-title>and its consequences in the field of tourism in Ukraine</article-title>
          . (
          <year>2020</year>
          ). Retrieved from http://www.ntoukraine.org/assets/files/EBRD-COVID19
          <string-name>
            <surname>-</surname>
          </string-name>
          Report-UKR.pdf
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7. Kyiv International Institute of Sociology. (
          <year>2020</year>
          ).
          <article-title>Sociological and marketing research Retrieved from http://kiis</article-title>
          .com.ua/?lang=
          <source>ukr&amp;cat=reports&amp;id=964&amp;page=3</source>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8. FutureNow. (
          <year>2020</year>
          ).
          <article-title>Future is here! The future is already with us! Retrieved from https://futurenow</article-title>
          .com.ua/21-najpopulyarnishyh
          <article-title>-sotsialnyh-merezh-svitu-ta-rejtyng-ukrayiny/</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>Text</given-names>
            <surname>Analysis</surname>
          </string-name>
          <string-name>
            <surname>API</surname>
          </string-name>
          . (
          <year>2020</year>
          ). Retrieved from https://neticle.com/textanalysisapi/en/analysismethod
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Python</surname>
          </string-name>
          <article-title>word2vec libraries</article-title>
          . (
          <year>2020</year>
          ). Retrieved from https://papers.nips.cc/paper/2013/file/9aa42b31882ec039965f3c4923ce 901b-Paper.pdf
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Bunyak</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>General psychology: lectures (part II)</article-title>
          .
          <article-title>Ternopil: published by TNTU</article-title>
          . I. Pulyuya
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <article-title>Influence of COVID-19 and quarantine restrictions on the economy of Ukraine. (</article-title>
          <year>2020</year>
          ). Retrieved from https://www.me.gov.ua/Documents/Download?id=
          <fpage>bc5d2c61</fpage>
          -1a7f
          <string-name>
            <surname>-</surname>
          </string-name>
          4ec7-
          <fpage>8071</fpage>
          - b996f2ad2b5a
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <article-title>World Data Center for Geoinfomatics and Sustainable Development</article-title>
          . (
          <year>2020</year>
          ).
          <article-title>COVID-19 in Ukraine: analysis and forecast</article-title>
          . Retrieved from https://wdc-ukraine.maps.arcgis.com/apps/ opsdashboard/index.html#/ 07b38b42264d4a2ea5a05d578e235559
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <article-title>Official website of the Ministry of Economy, Trade and Agriculture of Ukraine</article-title>
          . (
          <year>2020</year>
          ). Retrieved from https://www.me.gov.ua/Documents/List?lang=uk-UA&amp;tag=Konsensusprognoz
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <article-title>Official information portal of the Cabinet of Ministers of Ukraine “Coronavirus in Ukraine”</article-title>
          . (
          <year>2020</year>
          ). Retrieved from https://Covid19.gov.ua/
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <article-title>Official site of the Ministry of Health of Ukraine</article-title>
          . (
          <year>2020</year>
          ). Retrieved from https://moz.gov.ua/; https://moz.gov.ua/koronavirus-2019-ncov
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <article-title>Official site of the National Health Service of Ukraine</article-title>
          . (
          <year>2020</year>
          ). Retrieved from https://nszu.gov.ua/
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>