<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Direct Multi-Step Forecasting with Multiple Time Series Using XGBoost: Projecting COVID-19 Positive Hospitalization Census for a Southern Idaho Health System</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Drake Anshutz</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andrew Crisp</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>James Ford</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Onur Torusoglu</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Justin Smith</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Digital</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Analytics: Advanced Analytics</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>St. Luke's Health System</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Boise</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Digital</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Analytics</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>St. Luke's Health System</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Boise</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>anshutzd@slhs.org</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>smitjust@slhs.org</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>COVID-</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Background</string-name>
        </contrib>
      </contrib-group>
      <abstract>
        <p>Forecasting hospitalization census for the novel COVID-19 virus is a challenging task for numerous reasons including many unknowns, limited historical data, and other issues related to model misspecification. Most modeling techniques aimed at predicting hospitalization census for respiratory epidemics often create contradictory projections from a wide variety of scenarios. This often creates massive confidence intervals for projections as most models are based on manually adjusted assumptions which ultimately provide inconsistent, unreliable results. This case-study introduces a machine learning approach that helps overcome limited historical data while adjusting for model misspecification and creating consistent, easily understood results. This model has been deployed and automated with daily updates within a large health system for executive use and is reliably forecasting a one-month projection within an acceptable margin of error as determined by executive leadership.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        COVID-19 is a highly infectious novel disease that was
declared a pandemic by the World Health Organization on
March 11th, 2020
        <xref ref-type="bibr" rid="ref14 ref2">(Meehan et al, 2020; Baloch, 2020; Roda
et al 2020)</xref>
        . Modeling for the novel severe acute respiratory
syndrome coronavirus 2 (SARS-CoV-2) which causes the
coronavirus disease (COVID-19) has challenged data
scientists and statisticians across the world for numerous reasons,
to include but not limited to a novel disease with many
unknowns, limited historical data, and policy shifts affecting
the trajectory of the d
        <xref ref-type="bibr" rid="ref13">isease (Roda et al, 2020</xref>
        ; Wang, 2020).
Accurately predicting an outcome of COVID-19 positive
patient hospitalization census has become an extreme task
in this regard as rapidly changing policy enactments, shifts
in human behavior, and other events such as masking
ordinances and masking compliance strongly influence an
outcome such as hospital
        <xref ref-type="bibr" rid="ref13">ization census (Roda et al, 2020</xref>
        ;
E
        <xref ref-type="bibr" rid="ref13">ikenberry, 2020</xref>
        ). A strong modeling system that accounts for
such factors is needed to inform future policies such as
organizational decision making at an execut
        <xref ref-type="bibr" rid="ref13">ive level
(McBryde, 2020</xref>
        ).
      </p>
      <p>
        Early indications of COVID-19 hospitalization census are
known to display cyclic tendencies due to influxes and
outflows of patients over an extended t
        <xref ref-type="bibr" rid="ref13">ime-period (Roda et al,
2020</xref>
        ; F
        <xref ref-type="bibr" rid="ref13">iore et al, 2020</xref>
        ). Traditional epidemiologic forecasts
used to study disease behavior, such as basic SEIR
(Susceptible Exposed Infectious Recovered) models, generally do
not account for these cyclic tendencies and assume a
smooth, normally distributed projection generally for a large
scale population which creates discrepancies for localized
clinical projections such as a singular hospital
        <xref ref-type="bibr" rid="ref1 ref6">(Anirudh,
2020; Chen et al, 2020)</xref>
        . If a traditional model is aimed at
adjusting for an influx or outflow of patients for a particular
population, the modeler can manually adjust certain
assumptions, such as transmissibility, susceptible patients, and
hospitalization rate within the model but such assumptions
are highly prone to inaccuracies in future projections
        <xref ref-type="bibr" rid="ref7">(Eksin,
Paarporn, Weitz, 2019; Huppert and Katrield. 2013)</xref>
        . This
not only creates inaccuracies in the distribution of patients
over time but competing assumptions within various models
often contradict each other resulting in unknown accuracy
across modeling techniques
        <xref ref-type="bibr" rid="ref21">(Wang et al, 2016)</xref>
        .
      </p>
    </sec>
    <sec id="sec-2">
      <title>Model Selection</title>
      <p>
        Direct Multi-Step Forecasting with Multiple Time Series is
a method that directly projects a continuous outcome
specific to each time-step
        <xref ref-type="bibr" rid="ref17 ref3 ref9">(Redell, 2020; Guillaume and
Chevillon, 2005; Taieb and Atiya, 2016)</xref>
        . Other forecasting
methodologies generally predict in a recursive nature to follow
the trajectory of a projection
        <xref ref-type="bibr" rid="ref9">(Guillaume and Chevillon,
2005)</xref>
        . Direct Forecasting specifies a target date and uses
lagged variables and dynamic predictive features across
forecasting horizons to predict certain outcomes at a
des
        <xref ref-type="bibr" rid="ref13">ignated point in time (Redell, 2020</xref>
        ). Strengths to this
forecasting methodology include robustness to policy enactments,
the ability to train on less data eliminating the cold-start
problem, addresses a concept called model misspecification
within a reasonable time-period and provides consistent
model results
        <xref ref-type="bibr" rid="ref17 ref6 ref9">(Redell, 2020; Guillaume and Chevillon,
2005; Chen et al, 2020)</xref>
        .
      </p>
      <p>
        XGBoost is an ensemble learning method that follows a
gradient boosted tree format and is frequently used in other
healthcare machine learning models because of its
performance
        <xref ref-type="bibr" rid="ref11 ref23 ref6">(Chen et al, 2020; Xu et al, 2019; Liu et al, 2018)</xref>
        .
This algorithm is run for a continuous outcome (XGBoost
for regression) and is particularly useful in analyzing
variable importance eliminating the “black-box” issue common
among other machine learning techniques
        <xref ref-type="bibr" rid="ref5">(Chen and Carlos,
2016)</xref>
        .
      </p>
    </sec>
    <sec id="sec-3">
      <title>Sample and Data Sources</title>
      <p>The dataset used is an automated blending of two data
sources. The first is a dataset updated daily from an internal
data management team that provides a refreshed dataset on
hospitalization census, including admissions and discharges
from prior mid-night census. The patient volumes for this
model are refreshed daily and the groupings for
hospitalization census include all COVID-19 positive patients within
four separate hospitals and intensive care unit census split
between two regions (Two intensive care units for one
region and four for the other region).</p>
      <p>
        An additional data source is collected from the Johns
Hopkins University Center for Systems Science and
Engineering using the R package covid19.analytics. This dataset
measures the total of new positive COVID-19 cases as
reported by the State of
        <xref ref-type="bibr" rid="ref13">Idaho (Ponce, 2020</xref>
        ). New positive
COVID-19 cases are defined by county and allocated to
relevant hospitals within the determined service region of each
hospital (i.e. if a county is located within a hospitals service
region, the positive cases from that county as reported by the
covid19.analytics package will be attributed to that
particular hospital). State reporting was ultimately chosen to depict
new cases of COVID-19 as internal data sources for testing
remain unstandardized in terms of shifting market shares for
testing across competing health systems. Counties and
hospital locations are specifically withheld from this report
to ensure patient privacy.
      </p>
      <p>
        Additionally, Idaho State Reopening Phases (i.e.
mandates that require certain businesses such as bars and
restaurant’s to close or remain partially open for a specific period
of time) were incorporated as well as holiday weekends and
both were included as binary dynamic features
        <xref ref-type="bibr" rid="ref20 ref8">(State of
Idaho, 2020)</xref>
        . Day numbers and week numbers were also
included as dynamic features.
      </p>
      <p>The final dataset ultimately includes two separate types
of features or predictor variables. The continuous lagged
features are total hospitalization census, total discharges,
total admissions, and total positive cases. The dynamic
features included are Idaho State Reopening Phases, day
number, week number, and holiday weekends.</p>
    </sec>
    <sec id="sec-4">
      <title>HIPAA Compliance</title>
      <p>
        All patient data has been aggregated and anonymously
displayed within de-identified hospitals. No individual patient
data was used for analysis and the project followed the
Privacy Rule as stated by the Health Insurance Portability and
Accountability Act for de-identifying all 18 elements of
identity
        <xref ref-type="bibr" rid="ref1 ref20 ref8">(US Department of Health and Human Services,
2020)</xref>
        . In addition, no demographics were reported to ensure
patient privacy.
      </p>
    </sec>
    <sec id="sec-5">
      <title>Analyses</title>
      <p>
        R was used for all analyses
        <xref ref-type="bibr" rid="ref16">(R Core Team, 2013)</xref>
        . Direct
Multi-Step Forecasting with Multiple Time Series using the
Machine Learning Algorithm XGBoost was employed as
the model to forecast hospitalization mid-night census and
intensive care unit mid-night census. The R package used
for analys
        <xref ref-type="bibr" rid="ref13">is was forecastML (Redell, 2020</xref>
        ). The
parameters used for the two outcomes of hospitalization census and
intensive care unit census are as follows; Lookback: 140
days for both Hospital and ICU, Horizons: 1, 14, and
30days for both Hospital and ICU, Frequency: 1-day for both
Hospital and ICU.
      </p>
      <p>XGBoost was employed to project the value per
timestep. Regression for squared error was chosen as the
objective. Default settings within the XGBoost parameters and
function were employed. The validation metric used is
Mean Absolute Error. Prediction confidence intervals of
+/2 were used.</p>
    </sec>
    <sec id="sec-6">
      <title>Validation Process</title>
      <p>
        The validation process follows a nested cross-validation
setup
        <xref ref-type="bibr" rid="ref17 ref4">(Redell, 2020; Bergmeir, Hyndman, Koo, 2017)</xref>
        .
Validation is examined by extracting validation windows and
analyzes model performance with those validation windows
across selected horizons. The amount of time (in this report,
days) within the validation window ultimately serves as the
testing set for model performance. So, if a validation
window of 9 days is selected, 9 projections will be created for
each of those days within the validation window. The
differences between the projection and actual will serve as the
validation metric and this report has selected mean absolute
error to represent the metric for model performance. This
report evaluates the validation window selected for three
separate horizons (1-day, 14-day, and 30-day) and combines the
results from all validation windows and horizons to provide
a global mean absolute error. The report also examines
validation windows across time, separated by model horizons
to depict accuracy of the model horizon performance
throughout time.
      </p>
    </sec>
    <sec id="sec-7">
      <title>Results</title>
      <p>The global mean absolute error (GMAE) for both models
(Hospitalization and Intensive Care Units) at the window
and skip ratio of 9-day windows and 21-day skips was used
to assess model performance. Validation was also analyzed
over-time via mean absolute error (MAE) to assess the
accuracy of the model’s performance at the same 9-day
window and 21-day skip ratio.</p>
      <sec id="sec-7-1">
        <title>Hospital</title>
        <p>Hospital 1
Hospital 2
Hospital 3
Hospital 4
ICU 1
ICU 2
Global Mean Absolute Error
1.43
2.37
2.00
1.22
1.21
1.08</p>
        <sec id="sec-7-1-1">
          <title>Global Mean Absolute Error</title>
          <p>The forecasting GMAE stayed consistent across Hospitals
and Intensive Care Units with highest GMAE found at 2.37
which is displayed in Table 1. Other validation window and
skip ratios were evaluated and the highest GMAE found was
at a 30-day window and 30-skip with a value of 3.55. The
ranges for the 30-day window and 30-day skip were 1.26 –
3.55 across both the hospitalization and intensive care unit
models.</p>
        </sec>
        <sec id="sec-7-1-2">
          <title>Mean Absolute Error Across Windows and Horizons</title>
          <p>The forecasting MAE displayed in Figures 1 and 2 display
the fluctuations across forecasting windows which remain
consistent across time with a maximum MAE of 5.96 for
Hospitalizations and 6.17 for ICU. The other validation ratio
of 30-day windows and 30-day skips found a maximum
MAE of 9.61 with a range of 0.63 – 9.61 for
Hospitalizations. The same 30-day window and 30-day skip for ICU’s
found a maximum MAE of 5.38 with a range of 0.90 – 5.38.</p>
        </sec>
        <sec id="sec-7-1-3">
          <title>Variable Importance Assessment</title>
          <p>Variable importance was analyzed and assessed daily. Most
variable importance gain was identified in total
hospitalizations lag variables, discharge and admissions lagged
variables and positive testing lagged variables. The consensus of
variable importance indicates the model was dependent on
which model horizon the model was assessing. For instance,
the one-day projection was relatively dependent on the
previous weeks’ total hospitalizations, admissions, discharges,
and positive testing. However, a thirty-day forecast would
utilize much different variables that were dependent on
longer term historical data. Additionally, Idaho State Phases
were rarely identified in the model importance. However, a
holiday weekend would indicate a high value in gain if a
holiday were within a horizon.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>Discussion</title>
      <p>
        This paper describes a novel approach toward predicting
COVID-19 positive patient hospitalization census that has
not been seen in recent literature as of current date. The
model is currently in use and projecting within a reasonable
mean absolute error and most training validations were
performed with less than 6-months’ worth of historical data.
The value of this predictive model also includes the
introduction of future policy decisions of which can be
automated to be included in future iterat
        <xref ref-type="bibr" rid="ref13">ions of the model
(Redell, 2020</xref>
        ).
      </p>
      <p>
        The models forecast also adjusts for the forecasting
horizons for the final projection delivered to executive leaders.
This ultimately lowers error across forecasting horizons as
the model uses the one-day forecast for the first prediction
in the final projection, the fourteen-day forecast for days
2:14, and the thirty-day forecast for days 15:30 (Sel
        <xref ref-type="bibr" rid="ref13">im et al,
2020</xref>
        ; Ta
        <xref ref-type="bibr" rid="ref13">ieb et al, 2020</xref>
        ). After evaluating the results of the
forecasting error validation windows, this will start at
generally the strongest model projection and lead into slightly
weaker forecasting projections.
      </p>
      <p>Variable importance depicted within the model help
illustrate why the model is predicting what it is predicting and
uncovering the “black-box” of the algorithm. Understanding
variable importance often allows a decision-maker to
understand potential interventions that could limit a negative
outcome (i.e. if a decision-maker is able to understand a future
negative-outcome relationship with a modifiable predictor
variable, the decision-maker may be able to adjust that
predictor variable to establish a potential positive outcome).
Unfortunately, most lagged features within this model will
not be able to be influenced as the occurrence has already
happened. However, a dynamic feature such as State
Reopening Phases may be altered if a decision-maker chooses
to assess and enact potential interventions.</p>
      <sec id="sec-8-1">
        <title>Limitations</title>
        <p>
          This model is still evolving and being tuned during a highly
unpredictable time-period. This methodology can also be
prone to issues described as a broken-curve and is slightly
prone to overfitting h
          <xref ref-type="bibr" rid="ref13">istorical data (Selim et al, 2020</xref>
          ).
        </p>
        <p>At this point, the model is strongly recommended to
assess the trajectory of hospitalization census and not be used
for individual day decision-making. The current models’
projection, in comparison to actuals, tends to project
influxes and outflows consistently with actual data. However,
the specific date of the influx or outflow tends to occur
within one to three days of the actual result. An actual influx
of hospitalization census may occur on a Wednesday, but
the projection may predict the influx would happen on the
prior Tuesday or the following Friday.</p>
      </sec>
      <sec id="sec-8-2">
        <title>Strengths</title>
        <p>
          The model is currently validated within reasonable margin
of error as determined by executive leadership for business
decision making. Given most time-series models require an
abundant data source with multiple years to project future
outcomes, this model can assess a future projection w
          <xref ref-type="bibr" rid="ref13">ith a
limited dataset (Redell, 2020</xref>
          ). The addition of future
policies may also be incorporated to help predict future
interact
          <xref ref-type="bibr" rid="ref13">ions within the model (Redell, 2020</xref>
          ). The model is also
intended to gain strength in predictions as more data is
incorporated into the models. Added robustness towards model
misspecification is an additional strength within direct
forecasting
          <xref ref-type="bibr" rid="ref12">(Marcellino, Stock, Watson. 2006)</xref>
          . Other strengths
include that the model is currently refreshed daily to provide
more accurate results as the future dates occur. A robust
internal dataset is also an asset as most organizations outside
of health systems are not granted access to live data sources
with a large quantity of potential predictive features.
Ultimately, the model can help determine future trends to assist
executive leadership with resource utilization across a
health system.
        </p>
      </sec>
    </sec>
    <sec id="sec-9">
      <title>Acknowledgments</title>
      <sec id="sec-9-1">
        <title>Trevor Wilford, MBA. Keegan Gunderson. St. Luke’s Digital and Analytics Department: Advanced Analytics.</title>
        <p>St. Luke’s Digital and Analytics Department: Data
Management and Business Intelligence.</p>
        <p>St. Luke’s Health System.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Anirudh A.</surname>
          </string-name>
          <year>2020</year>
          .
          <article-title>Mathematical modeling and the transmission dynamics in predicting the Covid-19 - What next in combating the pandemic</article-title>
          .
          <source>Infectious Disease Modelling</source>
          ,
          <volume>5</volume>
          ,
          <fpage>366</fpage>
          -
          <lpage>374</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Baloch</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baloch</surname>
            ,
            <given-names>M. A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zheng</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Pei</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          <year>2020</year>
          .
          <article-title>The Coronavirus Disease 2019 (COVID-19) Pandemic</article-title>
          .
          <source>The Tohoku journal of experimental medicine</source>
          ,
          <volume>250</volume>
          (
          <issue>4</issue>
          ),
          <fpage>271</fpage>
          -
          <lpage>278</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <given-names>Ben</given-names>
            <surname>Taieb</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            , &amp;
            <surname>Atiya</surname>
          </string-name>
          ,
          <string-name>
            <surname>A. F.</surname>
          </string-name>
          <year>2016</year>
          .
          <article-title>A Bias and Variance Analysis for Multistep-Ahead Time Series Forecasting</article-title>
          .
          <source>IEEE transactions on neural networks and learning systems</source>
          ,
          <volume>27</volume>
          (
          <issue>1</issue>
          ),
          <fpage>62</fpage>
          -
          <lpage>76</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Bergmeir</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Hyndman</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Koo</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <year>2017</year>
          .
          <article-title>A Note on the Validity of Cross-Validation for Evaluating Autoregressive Time Series Predictions</article-title>
          . Elsevier. robjhyndman.com/papers/cv-wp.pdf Chen,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Robinson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Janies</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            , &amp;
            <surname>Dulin</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <year>2020</year>
          .
          <article-title>Four Challenges Associated With Current Mathematical Modeling Paradigm of Infectious Diseases and Call for a Shift</article-title>
          .
          <source>Open forum infectious diseases</source>
          ,
          <volume>7</volume>
          (
          <issue>8</issue>
          ),
          <year>ofaa333</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Carlos</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          <year>2016</year>
          .
          <article-title>XGBoost: A Scalable Tree Boosting System</article-title>
          .
          <article-title>Association for Computing machinery</article-title>
          , KDD '
          <volume>16</volume>
          ,
          <fpage>785</fpage>
          -
          <lpage>794</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Benesty</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Khotilovich</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tang</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cho</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            , K., Mitchell,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cano</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhou</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xie</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Geng</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          <year>2020</year>
          .
          <article-title>xgboost: Extreme Gradient Boosting</article-title>
          .
          <source>R package version 1.1.1</source>
          .1. https://CRAN.R-project.org/package=xgboost Eikenberry,
          <string-name>
            <given-names>S. E.</given-names>
            ,
            <surname>Mancuso</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Iboi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            ,
            <surname>Phan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Eikenberry</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Kuang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            ,
            <surname>Kostelich</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            , &amp;
            <surname>Gumel</surname>
          </string-name>
          ,
          <string-name>
            <surname>A. B.</surname>
          </string-name>
          <year>2020</year>
          .
          <article-title>To mask or not to mask: Modeling the potential for face mask use by the general public to curtail the COVID-19 pandemic</article-title>
          .
          <source>Infectious Disease Modelling</source>
          ,
          <volume>5</volume>
          ,
          <fpage>293</fpage>
          -
          <lpage>308</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Eksin</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Paarporn</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Weitz</surname>
            ,
            <given-names>J. S.</given-names>
          </string-name>
          <year>2019</year>
          .
          <article-title>Systematic biases in disease forecasting - The role of behavior change</article-title>
          .
          <source>Epidemics</source>
          ,
          <volume>27</volume>
          ,
          <fpage>96</fpage>
          -
          <lpage>105</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          2020.
          <article-title>Containment of future waves of COVID-19: simulating the impact of different policies and testing capacities for contact tracing, testing, and isolation. medRxiv : the preprint server for health sciences</article-title>
          ,
          <year>2020</year>
          .
          <volume>06</volume>
          .05.20123372.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Guillaume</surname>
          </string-name>
          , &amp;
          <string-name>
            <surname>Chevillon</surname>
          </string-name>
          .
          <year>2005</year>
          .
          <article-title>DIRECT MULTI-STEP ESTIMATION</article-title>
          AND FORECASTING N °
          <year>2005</year>
          -
          <fpage>10</fpage>
          Juillet
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          https://doi.org/10.1111/j.1467-
          <fpage>6419</fpage>
          .
          <year>2007</year>
          .
          <volume>00518</volume>
          .
          <string-name>
            <surname>x Huppert</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Katriel</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          <year>2013</year>
          .
          <article-title>Mathematical modelling and prediction in infectious disease epidemiology. Clinical microbiology and infection: the official publication of the European Society of Clinical Microbiology</article-title>
          and Infectious Diseases,
          <volume>19</volume>
          (
          <issue>11</issue>
          ),
          <fpage>999</fpage>
          -
          <lpage>1005</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yu</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fei</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>F. X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>H. D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pan</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <year>2018</year>
          .
          <article-title>An interpretable boosting model to predict side effects of analgesics for osteoarthritis</article-title>
          .
          <source>BMC systems biology, 12(Suppl 6)</source>
          ,
          <fpage>105</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Marcellino</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stock</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Watson</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <year>2006</year>
          .
          <article-title>A Comparison of Direct and Iterated Multistep AR Methods for Forecasting Macroeconomic Time Series</article-title>
          .
          <source>Journal of Econometrics</source>
          .
          <year>2006</year>
          ,
          <volume>135</volume>
          :
          <fpage>499</fpage>
          -
          <lpage>526</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <given-names>I.</given-names>
            ,
            <surname>Caldwell</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. M.</given-names>
            ,
            <surname>Pak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Rojas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. P.</given-names>
            ,
            <surname>Williams</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. M.</given-names>
            , &amp;
            <surname>Trauer</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. M.</surname>
          </string-name>
          <year>2020</year>
          .
          <article-title>Role of modelling in COVID-19 policy development</article-title>
          .
          <source>Paediatric respiratory reviews</source>
          ,
          <volume>35</volume>
          ,
          <fpage>57</fpage>
          -
          <lpage>60</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <surname>Meehan</surname>
            ,
            <given-names>M. T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rojas</surname>
            ,
            <given-names>D. P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Adekunle</surname>
            ,
            <given-names>A. I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Adegboye</surname>
            ,
            <given-names>O. A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Caldwell</surname>
            ,
            <given-names>J. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Turek</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Williams</surname>
            ,
            <given-names>B. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Marais</surname>
            ,
            <given-names>B. J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Trauer</surname>
            ,
            <given-names>J. M.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>McBryde</surname>
            ,
            <given-names>E. S.</given-names>
          </string-name>
          <year>2020</year>
          .
          <article-title>Modelling insights into the COVID19 pandemic</article-title>
          .
          <source>Paediatric respiratory reviews</source>
          ,
          <volume>35</volume>
          ,
          <fpage>64</fpage>
          -
          <lpage>69</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>Ponce</surname>
            <given-names>M.</given-names>
          </string-name>
          <year>2020</year>
          .
          <article-title>covid19.analytics: Load and Analyze Live Data from the CoViD-</article-title>
          19 Pandemic.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <given-names>R Core</given-names>
            <surname>Team</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>R: A language and environment for statistical computing</article-title>
          .
          <source>R Foundation for Statistical Computing</source>
          , Vienna, Austria.
          <source>ISBN 3-900051-07-0</source>
          , URL: http://www.R-project.org/.
          <source>R package version 1.1</source>
          .1. https://CRAN.R-project.
          <source>org/package=covid19.analytics.</source>
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <surname>Redell N.</surname>
          </string-name>
          <year>2020</year>
          .
          <article-title>forecastML: Time Series Forecasting with Machine Learning Methods</article-title>
          .
          <source>R package version 0.9.0.</source>
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          https://CRAN.R-project.org/package=forecastML Roda,
          <string-name>
            <given-names>W. C.</given-names>
            ,
            <surname>Varughese</surname>
          </string-name>
          , M. B.,
          <string-name>
            <surname>Han</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>M. Y.</given-names>
          </string-name>
          <year>2020</year>
          .
          <article-title>Why is it difficult to accurately predict the COVID-</article-title>
          19
          <source>epidemic? Infectious Disease Modelling</source>
          ,
          <volume>5</volume>
          ,
          <fpage>271</fpage>
          -
          <lpage>281</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <surname>Selim</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhou</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Feng</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Alam</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          <year>2020</year>
          .
          <article-title>Reducing error propagation for long term energy forecasting using multivariate prediction</article-title>
          .
          <source>EPiC Series in Computing. 69</source>
          ,
          <fpage>161</fpage>
          -
          <lpage>169</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <source>State of Idaho</source>
          .
          <year>2020</year>
          .
          <article-title>Idaho Rebound: Path to Prosperity. Idaho Official Government Website</article-title>
          . Retrieved from https://rebound.idaho.gov/stages-of-reopening/ Taieb, S,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Bontempi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Atiya</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            , &amp;
            <surname>Sorjamaa</surname>
          </string-name>
          ,
          <string-name>
            <surname>A.</surname>
          </string-name>
          <year>2011</year>
          .
          <article-title>A review and comparison of strategies for multi-step ahead time series forecasting based on the NN5 forecasting competition</article-title>
          .
          <source>Expert Systems with Applications</source>
          . URL: http://souhaib-bentaieb.com/pdf/2012_esa_review.
          <source>pdf US Department of Health and Human Services</source>
          .
          <year>2020</year>
          .
          <article-title>HIPAA Privacy Rule: Information for Researchers</article-title>
          . Retrieved from https://privacyruleandresearch.nih.gov/ Wang J.
          <year>2020</year>
          .
          <article-title>Mathematical models for COVID-19: applications, limitations, and potentials</article-title>
          .
          <source>Journal of public health and emergency, 4</source>
          ,
          <fpage>9</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          <string-name>
            <surname>Zhong</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Tang</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Gao</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Stanly</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <article-title>Predicting the epidemic threshold of the susceptible-infected-recovered model</article-title>
          .
          <source>Sci Rep</source>
          ,
          <volume>6</volume>
          (
          <issue>24676</issue>
          ),
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <surname>Xu</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Peng</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ge</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xiong</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Yi</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          <year>2019</year>
          .
          <article-title>Extreme Gradient Boosting Model Has a Better Performance in Predicting the Risk of 90-Day Readmissions in Patients with Ischaemic Stroke</article-title>
          .
          <source>Journal of stroke and cerebrovascular diseases: the official journal of National Stroke Association</source>
          ,
          <volume>28</volume>
          (
          <issue>12</issue>
          ),
          <fpage>104441</fpage>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>