<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>ORCID:</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>COVID-19 Future Forecasting based on Time-series statistical analysis using Machine Learning Model</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jaspreet Kaur</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Prabhpreet Kaur</string-name>
          <email>prabhpreet.cst@gndu.ac.in</email>
        </contrib>
      </contrib-group>
      <pub-date>
        <year>2021</year>
      </pub-date>
      <volume>000</volume>
      <fpage>0</fpage>
      <lpage>0002</lpage>
      <abstract>
        <p>The epidemic COVID-19 has shaken the globe through its cruelty, and its spread rate continues to rise daily. This paper highlights the clinical stance in the COVID-19 research studies, where time-series statistical analysis has been performed by using Prophet Model. It is widely used to understand the trend of the current epidemic after 2nd May 2020 with data at the worldwide state. The prophet model is an open-source model obtained by the data science panel on Facebook for performing predicting operations. It assists to make fast and accurate predictions for existing data samples. The Prophet model is simple to implement because its open authorized repository exists on GitHub. The time-series data analysis refers to the confirmed, recovered, and death rates for the time of 2nd May 2021 to 17th January 2022. The statistical validation strategy is followed by the implementation of a T-test on the evaluated time-series data. The expected data generated by the predictive model can be further used by the official authorities, medical departments of various countries. Moreover, the model is used to provide new graphical insights into past, present, and future trends.</p>
      </abstract>
      <kwd-group>
        <kwd>1 COVID-19</kwd>
        <kwd>Prophet</kwd>
        <kwd>Statistical analysis</kwd>
        <kwd>time-series forecasting system</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>The coronavirus emerged from Wuhan city, China in December 2019. Originally, an unidentified
infected case was reported which was examined by respiratory experts affirmed as pneumonia.
Afterward, it was stated by WHO (World Health Organization) as COVID-19 [1]. It is at the seventh
number of the coronavirus family, collectively with MERS (Middle-East Respiratory Syndrome) [2]
and SARS (Severe Acute Respiratory Syndrome) [3] which can transmit to humans [4]. The
spreading rate of the coronavirus rapidly increases worldwide. There are 236,624,144 confirmed
cases, 213,744,952 recovered cases, and 4,832,164 death cases globally on 6th October 2021 [5]. The
COVID-19 affecting individuals worldwide with mild to moderate symptoms as cough, fever, and
fatigue. The infection can cause serious medical problems such as heart complications, lung
infections, blood clotting, and severe kidney damage, bacterial and viral infection. The incubation
phase of COVID-19 can proceed for 2 weeks or be extended. The data showed that the virus gets
widespread from one individual to another in a limit of six feet or two meters [6]. Presently, the
governments are considering preventive measures like sanitization, social distancing, strict
lockdowns, etc. The study employ with time-series-based statistical framework model as “Prophet”,
which shown the accurate results in forecasting both short and long-term prediction measures. It
allows evaluating the considerable trends, seasonality, cyclic effect, and abnormality measures. The
main purpose of the article is to represent the approximate 8.5 months (260 days) predict the trends
for confirmed, recovered, and death cases for different countries. Figure 1 determines the map of
countries showing confirmed cases from the sample dataset.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Literature Survey</title>
      <p>Anastassopoulou et al. [7] performed the experimental analysis on the COVID-19 data by using
the SIDR (Susceptible-Infectious-Recovered-Dead) framework. This model provides approximations
of the basic infectious reproduction number (R0). It is used to predict the growth rate for up to three
weeks. Alsaeedy et al. [8] conducted work based on recognition of areas, which are highly prone to
spreading COVID-19 utilizing a wireless network. In this study, they employed an end-user machine
(UM) that connected with a wireless cellular network mechanism for better inference of regions. M.
B. Jamshidi et al. [9] conducted a DL (Deep learning) approach for the detection of COVID-19
infection. They performed AI-based strategies to diagnose the coronavirus infection by using GAN
(Generalized Adversarial Network), LSTM (Long Short-Term Memory), and ELM (Extreme
Learning Machine). AL-Rousan et al. [10] conducted the COVID-19 analysis, which describes the
exponential growth of the infected cases in South Korea. Lutz et al. [11] conducted the infectious
disease forecasting framework, using various mathematical models. In this study, the research has
been done on human beings and the way how it interrelated with infectious ailments with handling
approaches. Guo et.al [12] performed a forecasting model by using Prophet for (MPD)
MaximumPower Demand along with adaptive Kalman filter. Chae et al. [13] conducted a forecasting system
based on deep learning and a big data approach for time analysis of COVID-19. In Figure 2, the
diagrammatical representation of the COVID-19 prediction framework for the next 260 days.</p>
      <sec id="sec-2-1">
        <title>COVID-19 Dataset</title>
        <p>Data Pre-processing</p>
        <p>Clean the data
Remove missing values</p>
      </sec>
      <sec id="sec-2-2">
        <title>Validation Process</title>
        <p>Hypothetical Testing
“T-Test analysis”
Prediction / Forecasting
Parameters Setting
Fit Prophet Model
Apply Algorithms</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Data Description and Materials</title>
      <p>The Prophet framework is a practice for time-series-based forecasting of the COVID-19 data. It is
depending on the additive modeling where non-linear data trends correspond to year, biweekly,
datewise. The information is highly recommended to evaluate seasonal results and must have considering
several seasons of past historical patterns. This model is completely automated which assists to get
logical forecasting on unarranged data without manual support [14]. The Prophet model is accessible
in Python and R language and used a similar standard code for the fitting of the model. It rapidly
detects the changes in the linear, exponential, or logistic growth patterns by selecting switch points
from the analyzed data. Figure 3 corresponds to the components plot of the Prophet Model, which
gives information about the model that is fitted.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Proposed Methodology</title>
      <p>In the proposed study, the framework is used to extract and collect COVID-19 samples
related data from multiple sources. The data is based on time-series analysis, which is meant
to be constantly changing of data with time. In this study, the prophet model is fit the existing
data including confirmed, recovered, death cases on the mentioned days. The
time-seriesbased predictive trend curve indicates that the COVID-19 cases will increase or decrease over
time in the future 260 days.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Results and Discussion</title>
      <p>In this study, the plotted graphs for the confirmed, death, and recovered cases from the
chosen sample dataset and compare these circumstances relating to various countries.
Moreover, the visual inferences and statistical validation measures are performed by
employing the “T-Test” in SPSS (Statistical Package for Social Sciences). The overall
analysis of confirmed and recovered cases has been shown in Figure 4. Figure 5 depicts the
graphs of the corresponding confirmed, recovered, and death cases concerning their
observation dates.
(a) Confirmed Cases
(b) Deaths Cases
(c) Recovered Cases</p>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusion</title>
      <p>In the situation of COVID-19, Predictive analysis strategies are followed to reduce the spreading
rate of the pandemic. The prophet model is the most profitable way to forecast the future in a very
efficient and accurate manner. This model framework is designed to recognize the infectious points
from which the trend is highly variable and tackle outliers entirely. The statistical analysis will be of
great significance to the official authorities, health departments, and medical organizations to produce
drugs more quickly. This research provides a straightforward way to track the COVID-19 cases for
forthcoming days at the global level. The paper describes the overall impact of lockdown extensions,
social distancing, etc. to flatten the curve. One disadvantage of this approach, if the data have
biweekly or quarterly time-series, then the framework will be inflexible to predict the future. To
conquer this problem, all the suitable parameters must be configured manually.</p>
      <p>References
1. WORLDOMETER (2020) COVID-19 coronavirus pandemic (2020). In: WHO.</p>
      <p>https://www.worldometers.info/coronavirus/. Accessed 5 Jan 2021
2. MERS-CoV (2020) WHO (Middle East respiratory syndrome coronavirus).
https://www.who.int/health-topics/middle-east-respiratory-syndrome-coronavirusmers. Accessed 9 Oct 2021
3. SARS-CoV (2020) WHO (Severe Acute Respiratory Syndrome).
https://www.who.int/health-topics/severe-acute-respiratory-syndrome#tab=tab_1.</p>
      <p>Accessed 9 Oct 2021
4. CDC (2020) Coronavirus (Human Coronavirus Types).</p>
      <p>https://www.cdc.gov/coronavirus/types.html. Accessed 9 Oct 2021
5. WHO (2020) WHO Coronavirus Disease (COVID-19) Pandemic (2020). In: WHO.
https://www.who.int/emergencies/diseases/novel-coronavirus-2019. Accessed 30 Jan
2021
6. WHO (2020) WHO Coronavirus disease (COVID-19) dashboard (2020). In: WHO.</p>
      <p>https://covid19.who.int/. Accessed 29 Jan 2021
7. Anastassopoulou C, Russo L, Tsakris A, Siettos C (2020) Data-based analysis,</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <article-title>modeling and forecasting of the COVID-19 outbreak</article-title>
          .
          <source>PLoS One</source>
          <volume>15</volume>
          :
          <fpage>1</fpage>
          -
          <lpage>21</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          https://doi.org/10.1371/journal.pone.0230405
          <string-name>
            <surname>Alsaeedy</surname>
            <given-names>AAR</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chong</surname>
            <given-names>EKP</given-names>
          </string-name>
          (
          <year>2020</year>
          )
          <article-title>Detecting Regions At Risk for Spreading COVID19 Using Existing Cellular Wireless Network Functionalities</article-title>
          .
          <source>IEEE Open J Eng Med Biol</source>
          <volume>1</volume>
          :
          <fpage>187</fpage>
          -
          <lpage>189</lpage>
          . https://doi.org/10.1109/ojemb.
          <year>2020</year>
          .3002447
          <string-name>
            <surname>Jamshidi</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lalbakhsh</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Talla</surname>
            <given-names>J</given-names>
          </string-name>
          , et al (
          <year>2020</year>
          )
          <article-title>Artificial Intelligence</article-title>
          and COVID-
          <volume>19</volume>
          :
          <article-title>Deep Learning Approaches for Diagnosis and Treatment</article-title>
          .
          <source>IEEE Access</source>
          <volume>8</volume>
          :
          <fpage>109581</fpage>
          -
          <lpage>109595</lpage>
          . https://doi.org/10.1109/ACCESS.
          <year>2020</year>
          .3001973 AL
          <string-name>
            <surname>-Rousan</surname>
            <given-names>N</given-names>
          </string-name>
          ,
          <string-name>
            <surname>AL-Najjar</surname>
            <given-names>H</given-names>
          </string-name>
          (
          <year>2020</year>
          )
          <article-title>Data analysis of coronavirus COVID-19 epidemic in South Korea based on recovered and death cases</article-title>
          .
          <source>J Med Virol</source>
          <volume>92</volume>
          :
          <fpage>1603</fpage>
          -
          <lpage>1608</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          https://doi.org/10.1002/jmv.25850
          <string-name>
            <surname>Lutz</surname>
            <given-names>CS</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huynh</surname>
            <given-names>MP</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schroeder</surname>
            <given-names>M</given-names>
          </string-name>
          , et al (
          <year>2019</year>
          )
          <article-title>Applying infectious disease forecasting to public health: A path forward using influenza forecasting examples</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <source>BMC Public Health</source>
          <volume>19</volume>
          :
          <fpage>1</fpage>
          -
          <lpage>12</lpage>
          . https://doi.org/10.1186/s12889-019-7966-8
          <string-name>
            <surname>Guo</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ge</surname>
            <given-names>Q</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jiang</surname>
            <given-names>H</given-names>
          </string-name>
          , et al (
          <year>2020</year>
          )
          <article-title>Maximum Power Demand Prediction Using Fbprophet with Adaptive Kalman Filtering</article-title>
          .
          <source>IEEE Access</source>
          <volume>8</volume>
          :
          <fpage>19236</fpage>
          -
          <lpage>19247</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          https://doi.org/10.1109/ACCESS.
          <year>2020</year>
          .2968101 Chae S,
          <string-name>
            <surname>Kwon</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            <given-names>D</given-names>
          </string-name>
          (
          <year>2018</year>
          )
          <article-title>Predicting infectious disease using deep learning and big data</article-title>
          .
          <source>Int J Environ Res Public Health</source>
          <volume>15</volume>
          : https://doi.org/10.3390/ijerph15081596 Facebook Research (
          <year>2020</year>
          )
          <article-title>Prophet: forecasting at scale</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          https://research.fb.com/blog/2017/02/prophetforecasting-%
          <source>0Aat-scale. Accessed 9 Oct 2021</source>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>