=Paper=
{{Paper
|id=Vol-3892/paper9
|storemode=property
|title=Simulation of Influenza Dynamics With Lstm Deep Learning Model
|pdfUrl=https://ceur-ws.org/Vol-3892/paper9.pdf
|volume=Vol-3892
|authors=Mykola Butkevych,Ievgen Meniailov,Kseniia Bazilevych,Yurii Parfeniuk,Dmytro Chumachenko,Sebastian Górecki,Wiktoria Duszczyk,Zuzanna Huda,Andrzej Faryna,Aleksandra Tatka,Mohamed Adel,Mohamed Aborizka,Yurii Oliinyk,Mariia Kapshuk,Leonid Oliinyk,Inna Rozlomii,Andrii Yarmilko,Serhii Naumenko
|dblpUrl=https://dblp.org/rec/conf/iddm/ButkevychMBPC24
}}
==Simulation of Influenza Dynamics With Lstm Deep Learning Model==
Simulation of influenza dynamics with LSTM deep
learning model ⋆
Mykola Butkevycha,†, Ievgen Meniailovb,†, Kseniia Bazilevycha,†, Yurii Parfeniukb,†
and Dmytro Chumachenkoa,†,*
a
National Aerospace University “Kharkiv Aviation Institute”, Vadym Manko str., 17, Kharkiv, 61070, Ukraine
b
V.N. Karazin Kharkiv National University, Freedom sq., 4, Kharkiv, 61022, Ukraine
Abstract
Influenza remains a significant public health concern worldwide, especially in regions affected by conflict,
where healthcare systems are compromised. This study aims to develop a forecasting model for influenza in
the war-affected Kharkiv Oblast of Ukraine, leveraging a Long Short-Term Memory (LSTM) deep learning
approach. The LSTM model was trained using monthly influenza incidence data from January 2013 to April
2024 to predict short-term outbreak dynamics. Hyperparameters such as LSTM units, dropout rates, and
optimizers were fine-tuned to optimize the model. The model’s performance was evaluated using mean
squared error (MSE) and mean absolute percentage error (MAPE), achieving a best MSE of 0.0255 and a
MAPE of 6.3976. The study contributes both scientifically and practically by applying machine learning
techniques to influenza forecasting in a conflict zone, a context not extensively studied before. The LSTM
model demonstrated robustness in handling incomplete and irregular data, offering reliable short-term
forecasts critical for public health interventions in regions with disrupted healthcare systems. The findings
of this study provide a framework that can be applied to other conflict-affected regions facing similar public
health challenges.
Keywords
Epidemic model, LSTM, deep learning, infectious disease simulation
1. Introduction
Influenza is a widespread viral infection that affects the upper respiratory system, caused by the
influenza virus. It is highly contagious and is transmitted through respiratory droplets when infected
individuals cough, sneeze, or talk [1]. The symptoms of influenza include fever, cough, sore throat,
body aches, headaches, fatigue, and sometimes gastrointestinal issues such as nausea and diarrhea
[2]. While influenza typically results in mild to moderate symptoms, it can become severe and even
fatal in high-risk populations, such as young children, the elderly, pregnant women, and individuals
with pre-existing medical conditions [3]. The seasonal nature of influenza, with peaks typically
occurring during the colder months and its propensity for antigenic variation through mutation and
reassortment, requires ongoing public health surveillance and interventions, including annual
vaccination campaigns to match circulating strains and reduce the impact of outbreaks [4].
On a global scale, influenza remains a persistent public health threat, causing significant morbidity
and mortality each year. The World Health Organization (WHO) estimates that seasonal influenza
epidemics result in 3 to 5 million cases of severe illness and up to 650,000 respiratory deaths
worldwide annually [5]. These figures underline the substantial burden influenza places on healthcare
systems, especially during peak flu season when hospitals and clinics often face surges in admissions
and demand for medical resources. Influenza is a dynamic virus that can mutate rapidly, requiring
health authorities to adjust vaccine formulations annually to keep pace with evolving strains. Despite
IDDM’24: 7th International Conference on Informatics & Data-Driven Medicine, November 14 - 16, 2024, Birmingham, UK
∗
Corresponding author.
†
These authors contributed equally.
m.v.butkevych@khai.edu (M. Butkevych); evgenii.menyailov@gmail.com (I. Meniailov); ksenia.bazilevich@gmail.com
(K. Bazilevych); parfuriy.l@gmail.com (Y. Parfeniuk); dichumachenko@gmail.com (D. Chumachenko)
0000-0001-8189-631X (M. Butkevych); 0000-0002-9440-8378 (I. Meniailov); 0000-0001-5332-9545 (K. Bazilevych); 0000-
0001-5357-1868 (Y. Parfeniuk); 0000-0003-2623-3294 (D. Chumachenko)
© 2023 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
CEUR
ceur-ws.org
Workshop ISSN 1613-0073
Proceedings
the availability of vaccines, the flu continues to spread due to varying vaccine coverage rates, vaccine
hesitancy, and the potential mismatch between the vaccine and circulating virus strains [6].
International collaboration through institutions such as WHO and the Global Influenza Surveillance
and Response System is vital in monitoring flu trends, updating vaccines, and coordinating response
efforts to mitigate the virus's impact [7].
In the context of Ukraine, influenza has become an increasingly pressing issue due to the Russian
full-scale invasion that began in 2022. The invasion has severely disrupted Ukraine’s healthcare
system, affecting the delivery of medical services, including routine vaccinations against influenza
[8]. With millions of people displaced both within and outside the country, overcrowded living
conditions in refugee camps, shelters, and temporary housing have increased the transmission of
infectious diseases, including influenza [9]. Furthermore, the disruption of healthcare infrastructure
has limited access to essential medicines and medical care, particularly in conflict zones. Vulnerable
groups, such as the elderly and children, face heightened risks in this precarious situation, as
overcrowding and insufficient medical attention create optimal conditions for the virus to spread
rapidly [10]. The situation is further compounded by psychological stress and poor nutrition, which
can weaken immune systems and increase susceptibility to infections [11]. Effective influenza
management in Ukraine under these conditions requires innovative public health approaches,
including disease modeling and simulation, to predict outbreaks and allocate resources where they
are most needed.
The field of data-driven healthcare, specifically public health informatics, has seen rapid
advancement in recent years, driven by the increasing availability of healthcare data and advances in
machine learning and artificial intelligence (AI) [12]. Public health informatics focuses on collecting,
managing, and analysing health data to inform decision-making and improve health outcomes. In this
context, machine learning models are increasingly critical in predicting disease outbreaks, monitoring
health trends, and guiding public health responses [13]. By leveraging large datasets that include
historical health records, vaccination coverage, population density, and environmental factors,
machine learning models can accurately uncover hidden patterns and predict disease spread [14].
These models have been particularly useful in influenza research, where predictive modelling helps
public health officials forecast flu seasons, identify at-risk populations, and develop vaccine
distribution and outbreak prevention strategies [15]. Furthermore, machine learning models enable
real-time analysis, allowing health authorities to quickly adjust interventions based on evolving data,
which is crucial in managing fast-spreading infectious diseases like influenza [16].
Simulating the spread of infectious diseases, such as influenza, is essential for understanding
transmission dynamics and planning effective interventions. Infectious disease simulation models
allow researchers to test various scenarios and predict how diseases might spread under different
conditions, such as changes in vaccination rates, population movements, or healthcare interventions
[17]. These models are important tools in epidemiology, as they can provide insights into how a virus
like influenza spreads through populations over time. Models can be constructed to account for
various variables, including population density, age distribution, social behaviours, contact patterns,
vaccination coverage, and the effects of public health interventions such as quarantines or school
closures [18]. Among these simulation models, Long Short-Term Memory (LSTM) networks, a type
of recurrent neural network (RNN), have shown significant promise in predicting the spread of
diseases due to their ability to capture temporal dependencies in sequential data [19]. LSTM networks
are particularly useful for modelling time-series data, which is crucial in understanding how diseases
progress over time and space. By training LSTM models on historical influenza data, researchers can
simulate the spread of the virus and forecast future outbreaks. These simulations are vital for public
health officials to plan interventions, allocate resources, and make data-driven decisions responding
to potential flu outbreaks. In complex situations, such as Ukraine's ongoing conflict, these simulations
can offer valuable insights into how the influenza virus might spread among displaced populations
and help health authorities prioritize vaccination and other public health measures to control the
disease's impact.
The aim of the study is to develop the forecasting model of influenza in war-affected Kharkiv
Oblast of Ukraine based on LSTM deep learning approach.
Current research is a part of the comprehensive information system for assessing the impact of
emergencies on the spread of infectious diseases described in [20].
2. Current Research Analysis
Machine learning methods have become increasingly valuable tools for simulating and predicting the
spread of influenza due to their ability to process large datasets and identify complex patterns in
disease transmission. Various models, such as decision trees, support vector machines (SVMs), and
deep learning algorithms, have been applied to influenza forecasting with promising results. These
models can analyze multiple factors that influence the spread of the virus, including environmental
conditions, population mobility, and vaccination coverage. Among these approaches, recurrent RNNs,
particularly LSTM models, have gained attention for their effectiveness in capturing temporal
dependencies in sequential data, such as disease progression over time. Machine learning models can
offer accurate predictions critical for public health planning and timely interventions by utilizing
historical data on influenza outbreaks.
The paper [21] presents a framework for predicting influenza-like illness (ILI) trends using
multiple machine learning models. The study developed and validated four models: ARIMA, random
forest, support vector regression, and extreme gradient boosting, and then combined their outputs
using an ensemble method called stacking. The models were trained on historical influenza data from
2008 to 2014 and tested on data from 2015 to 2017. The ensemble approach provided accurate real-
time predictions of ILI visits up to 4 weeks in advance, improving public health preparedness. A
notable limitation is the model’s decreased accuracy over longer time horizons, and while it performs
well for short-term forecasting, it struggles with predicting longer-term trends.
The study [22] explores how human mobility data can be used to predict the spread of influenza.
By leveraging anonymized mobility data from smartphones, the authors developed a metapopulation
model to forecast influenza outbreaks in the U.S. and Australia. Their approach showed that mobility
data enhances forecasting performance, especially compared to traditional models based on static
data like commuter surveys. The study demonstrates the utility of real-time mobility data for
predicting disease transmission across geographic boundaries. However, the work has some
limitations. The accuracy of predictions depends heavily on the quality and timeliness of mobility
data, and the model struggles to account for variations in population dwell times and transient
populations, such as tourists.
Research [23] explores the effectiveness of using ensemble forecasting to predict seasonal
influenza outbreaks. The authors combined 21 forecasting models using a stacking ensemble
technique and tested them in real-time during the 2017-2018 influenza season. The multi-model
ensemble outperformed individual models and a simple averaging approach by incorporating past
performance to weight the models. This approach was particularly successful during one of the most
severe flu seasons in recent history, improving the accuracy and reducing variability across regions.
A limitation of the study is that the performance of the ensemble models relied heavily on historical
data, making it less adaptable for novel, unexpected scenarios, such as pandemics without historical
precedence.
The paper [24] evaluates using an LSTM deep learning model for predicting short-term ILI trends.
The model was trained using historical influenza, climate, and population data from the Centers for
Disease Control and Prevention (CDC), the National Center for Environmental Information (NCEI),
and the U.S. Census Bureau. Results indicated that temperature and precipitation were the most
significant predictors of ILI rates, with the LSTM model outperforming traditional regression models
like random forests and ARIMA. The LSTM model demonstrated a notable reduction in prediction
errors for one-week forecasts, making it highly effective for short-term influenza forecasting.
However, its performance declined for longer-term predictions, indicating limitations in accurately
predicting flu trends beyond five to ten weeks.
The study [25] proposes a novel method for improving long-term influenza forecasts by leveraging
worldwide surveillance data. The authors introduce a two-stage data selection approach, which first
evaluates the predictive power of foreign surveillance data and then incrementally adds relevant data
to improve the accuracy of influenza forecasts for a target country. This method, tested across
multiple countries using six machine learning models, demonstrates that well-selected foreign data
can enhance long-term forecasting performance. However, the study has limitations, particularly the
dependency on the quality and consistency of surveillance data from various countries. Additionally,
the proposed scheme requires validation set evaluation, which may not be robust if significant pattern
differences exist between the validation and testing sets. This could impact the model’s
generalizability across diverse real-world settings.
Research [26] explores the development of a predictive model for influenza outbreaks in 12 French
regions using a variety of data sources. The authors combine electronic health records (EHRs), web
searches, Twitter data, and climate information to enhance the accuracy of real-time and short-term
flu forecasts. Using an ensemble machine learning approach, the study finds that integrating these
diverse data sources improves the model’s performance in predicting ILI at the regional level. The
ensemble model, called ARGONet, outperforms individual models, particularly for one- and two-
week-ahead forecasts. However, the paper notes limitations, including the model’s dependence on
timely and high-quality data from all sources.
The paper [27] compares the performance of various time series forecasting models, XGBoost,
ARIMA, and SARIMA, for predicting monthly influenza cases in Saudi Arabia. The study
demonstrates that XGBoost outperforms both ARIMA and SARIMA models in terms of accuracy,
particularly in capturing the nonlinear relationships in the data. By utilizing a comprehensive dataset
from 2017 to 2022, the authors show that XGBoost provides superior forecasting with lower errors in
metrics like MSE, MAE, and RMSE. However, a limitation of the study is its reliance on past seasonal
data, which may not account for anomalies or significant changes in influenza trends, such as those
caused by pandemics or environmental factors.
The study [28] introduces a novel deep learning model, SAIFlu-Net, designed for regional influenza
forecasting. The model combines LSTM networks, which effectively capture time-series patterns,
with a self-attention mechanism to identify similarities between regions’ influenza occurrence
patterns. This hybrid approach allows the model to improve both short- and long-term forecasting
accuracy. The authors validate the model’s performance on datasets from the U.S. and Japan,
demonstrating that SAIFlu-Net outperforms several existing forecasting models, such as CNNRNN
and Cola-GNN, regarding RMSE and Pearson correlation coefficient. Despite its strengths, the model
shows reduced accuracy for longer-term forecasts, and the authors acknowledge that using only
historical influenza data, without incorporating external factors like weather or social data, may limit
its predictive power.
Research [29] examines the utility of combining deep learning models with traditional statistical
approaches to improve influenza forecasting. The study leverages various models, including beta
regression, ARIMA, LASSO, and multivariate adaptive regression splines (MARS), alongside a deep
learning feedforward neural network (FNN). By employing Bayesian model averaging (BMA) to fuse
these models, the authors aimed to enhance the accuracy of both point and probabilistic forecasts for
ILI in Dallas County. The results show that the ensemble approach using BMA with continuous
ranked probability score as weights outperforms individual models, particularly in one- and two-
week ahead predictions. However, a limitation of this study is its reliance on historical data and the
exclusion of spatial factors, which may reduce its generalizability across different regions.
The paper [30] proposes a framework for improving the accuracy of influenza forecasts by
combining multiple data sources, including public health surveillance data, EHR, internet search
traffic, and social media activity. The authors employ a hierarchical framework using multi-linear
regression and greedy optimization to select the most predictive combination of data sources. The
results show that combining public health surveillance data with EHR data significantly enhances
forecast accuracy. In contrast, though abundant, internet and social media data do not contribute as
much to predictive performance. One key limitation of the study is the exclusion of real-time social
media and search data in the final model, which may miss timely signals during influenza outbreaks.
In light of the analysis of recent studies, it is clear that machine learning models, especially when
combined with diverse data sources, offer substantial improvements in influenza forecasting
accuracy. However, many existing approaches, such as those leveraging social media, search engine
queries, or electronic health records, face limitations regarding data quality, real-time availability,
and generalizability across regions. Our study focuses on developing a forecasting model specifically
tailored to the unique conditions in the war-affected Kharkiv Oblast of Ukraine. The challenges posed
by disrupted healthcare systems, displaced populations, and altered social dynamics require a model
that can accurately predict influenza trends in such a complex environment. While previous studies
have highlighted the potential of machine learning models, particularly LSTM, for forecasting
influenza, their applications have been largely limited to stable environments with consistent data.
Our research aims to address this gap by utilizing the LSTM deep learning approach, which excels at
capturing temporal dependencies, to forecast influenza outbreaks in a setting where traditional data
sources may be unreliable or incomplete. This focus on a conflict zone not only fills a critical gap in
the current body of research but also offers insights into how advanced machine learning techniques
can support public health efforts in crises.
3. Materials and Methods
The LSTM network is a type of RNN designed to capture long-term dependencies in sequential data,
making it ideal for time-series forecasting, such as influenza incidence prediction. LSTMs address the
limitations of traditional RNNs, particularly issues with vanishing or exploding gradients, by using
specialized gates, forget, input, and output gates, that control the network’s information flow [31].
This allows the model to selectively remember, update, or discard information at each time step.
The forget gate determines which information from previous time steps should be discarded. It
allows the model to remove irrelevant data while retaining important historical information
necessary for future predictions.
The input gate decides which information should be added to the cell state. This ensures that only
relevant new data is incorporated into the network’s memory at each time step.
The output gate controls which part of the cell state will be used as the output for the current time
step, ensuring that the most relevant information is passed on for prediction.
For this study, we trained the LSTM model using monthly influenza incidence data from January
2013 to April 2024 in the Kharkiv Oblast. The data was split into training, validation, and testing sets,
ensuring the model was evaluated on unseen data to assess its generalization capabilities.
We implemented dropout as a regularization technique to reduce overfitting, a common issue in
deep neural networks. Dropout involves randomly “deactivating” a proportion of neurons and their
connections during each training iteration. By disabling different neurons in each iteration, the
network is forced to learn more generalized data patterns instead of memorizing the training set.
After training, all neurons are reactivated for prediction. Dropout was controlled by a parameter that
sets the probability of neuron deactivation, ensuring the model avoids overfitting while maintaining
robust performance.
The model was trained using the Adamax optimizer, a variant of the Adam optimizer, which is
particularly well-suited for large, high-dimensional datasets. Adamax improves on Adam’s adaptive
learning rate capabilities, allowing for more efficient convergence during training. During the
training process, we experimented with several configurations of hyperparameters, including batch
size, number of epochs, and LSTM units. Specifically, we tested LSTM units of 4, 8, 16, 32, and 64, and
ran the model for 400, 450, and 500 epochs. These hyperparameters were fine-tuned using an
empirical approach tailored to the specific characteristics of the dataset.
We used mean squared error (MSE) as the loss function to optimize performance, measuring the
difference between the predicted and actual influenza incidence values. Regular backpropagation
through time (BPTT) was used to update the network’s weights, allowing the model to learn from
previous prediction errors and adjust its parameters accordingly.
The combination of LSTM’s sequential learning capability, dropout regularization, and the
Adamax optimizer enabled the model to efficiently learn and predict influenza trends in the war-
affected Kharkiv Oblast, focusing on minimizing overfitting and maximizing generalizability across
different periods.
For this experimental study, we utilized monthly influenza incidence data from January 2013 to
April 2024 in the Kharkiv Oblast, sourced from the Kharkiv Oblast Centre for Disease Control and
Prevention. Figure 1 presents a histogram of the distribution of new influenza cases in the Kharkiv
oblast.
Figure 1: Histogram of new influenza cases in Kharkiv oblast.
Figure 2 presents the local peaks of new cases of influenza in the Kharkiv oblast.
Figure 2: The local peaks of new influenza cases in Kharkiv oblast.
4. Results
Figure 3 presents the forecasted dynamics of influenza in Kharkiv oblast. The blue line represents the
incidence data, while the red line shows the predicted values. The model demonstrates an overall
ability to capture the patterns of influenza outbreaks, particularly during peak incidence periods.
There is a noticeable alignment between the predicted and actual values, especially during the sharp
increases and decreases in case numbers. However, some discrepancies are observed during the peaks,
where the predicted values slightly underestimate the actual data.
Figure 3: The forecast of new influenza cases in Kharkiv oblast.
Table 1 shows the model’s best hyperparameters. These parameters were fine-tuned empirically
to achieve optimal performance.
Table 2
The model’s best hyperparameters
Hyperparameter Value
Units 4
Dropout 250
Optimizer Adamax
Table 2 shows the model performance.
Table 2
The model’s performance
Metric Value
Best MSE 0.0255
Best MAPE 6.3976
Best MAPE inv 27.2086
5. Discussion
This study aimed to develop an LSTM-based model for forecasting influenza outbreaks in the war-
affected Kharkiv Oblast, Ukraine, where traditional healthcare systems have been severely disrupted
due to the ongoing conflict. The study's results highlight the LSTM model's effectiveness in predicting
the short-term dynamics of influenza incidence, particularly during peak transmission periods. The
model’s ability to capture temporal dependencies in the data is reflected in the close alignment
between predicted and actual case numbers, especially during sudden surges in influenza incidence.
The model achieved a best Mean Squared Error (MSE) of 0.0255 and a Mean Absolute Percentage
Error (MAPE) of 6.3976, demonstrating high accuracy in forecasting short-term influenza trends.
These results suggest that the LSTM model has strong potential for providing real-time predictions
that can inform public health interventions.
The scientific novelty of this study lies in its application of the LSTM model within a conflict-
affected region. This context has not been extensively explored in previous research on influenza
forecasting. Most prior studies in this field have focused on stable regions where healthcare data
collection systems function consistently. However, healthcare systems are under immense strain in
war-affected areas like Kharkiv Oblast, and data collection may be irregular or incomplete. This study
demonstrates the adaptability of LSTM models to handle such challenges, providing accurate
predictions even when traditional data collection methods are disrupted. The application of machine
learning in a volatile and crisis-stricken environment highlights the robustness of the LSTM
approach, which can process fragmented data while still producing reliable forecasts.
Another innovative aspect of this study is its focus on forecasting influenza in a region
experiencing large-scale population displacement, overcrowded shelters, and limited access to
healthcare services. Including data from a war zone adds complexity to the model's predictions, as
influenza transmission patterns in such regions may differ significantly from those in more stable
environments. The ability of the LSTM model to account for and predict outbreaks in this challenging
setting represents a significant contribution to the field of infectious disease modelling, particularly
for regions affected by conflict or other crises.
The practical novelty of this research is its potential to provide actionable insights for public health
authorities operating in conflict-affected regions. By offering real-time and accurate forecasts of
influenza outbreaks, the LSTM model can assist public health officials in making informed decisions
regarding resource allocation, vaccination campaigns, and healthcare planning. In a region like
Kharkiv Oblast, where healthcare infrastructure is compromised, such predictive models are crucial
for optimizing the limited resources available to address public health needs. Accurate short-term
forecasts can help ensure that medical supplies, vaccines, and healthcare personnel are effectively
deployed, particularly during peak influenza transmission periods.
Moreover, the model’s application extends beyond Ukraine, offering valuable lessons for other
regions facing similar challenges. For instance, conflict zones in the Middle East, Africa, and other
parts of the world, where healthcare systems are similarly disrupted, could benefit from implementing
machine learning models like LSTM to forecast outbreaks of infectious diseases. Using such models
in conflict or crisis scenarios represents a shift towards data-driven decision-making, even when data
is fragmented or unreliable.
Despite its promising results, the study has several limitations that should be addressed in future
research. One significant limitation is the model’s reduced accuracy over longer time horizons. Like
many machine learning models used for time-series forecasting, the LSTM model performed well in
short-term predictions but struggled to maintain accuracy for forecasts extending several weeks into
the future. This limitation is consistent with findings from other studies that have employed LSTM
for disease forecasting. Future research could explore hybrid models that combine LSTM with other
machine learning or statistical approaches to improve long-term forecasting accuracy.
Another limitation of the study is its reliance on the quality and completeness of the available
data. While the LSTM model is robust in handling sequential data, its performance still depends on
the availability of accurate and timely data inputs. In conflict zones like Kharkiv Oblast of Ukraine,
data collection is often disrupted, leading to potential gaps in the dataset. Incomplete data can reduce
the model's effectiveness and lead to unreliable forecasts. This limitation could be addressed by
incorporating real-time data sources, such as mobility data from mobile phones or satellite data,
which could provide additional insights into population movement patterns and their impact on
influenza transmission.
6. Conclusions
This study developed an LSTM-based model to forecast influenza outbreaks in the war-affected
Kharkiv Oblast, Ukraine, where the ongoing Russian full-scale invasion has severely impacted the
healthcare infrastructure. The model’s ability to provide accurate short-term forecasts, as evidenced
by its low MSE and MAPE values, underscores its potential for application in regions where timely
and reliable public health data is scarce or incomplete. This research demonstrated that LSTM
networks, which are well-suited for handling sequential data, can be effectively applied to predict
disease transmission dynamics even in highly disrupted environments.
A key contribution of this study is its novel application of LSTM in a conflict zone, expanding the
scope of machine learning in public health forecasting to settings where data collection systems are
often unreliable. The model’s successful adaptation to the challenges posed by such a region provides
a robust framework for predicting influenza and potentially other infectious diseases in crises. This
capability is crucial for public health decision-making, particularly in regions facing complex social
and healthcare challenges.
Looking forward, future research should address the limitations of this study by improving long-
term forecasting accuracy. This could involve combining the LSTM model with other forecasting
methods or integrating additional data sources such as real-time mobility, environmental, or social
data to capture better the full spectrum of factors influencing influenza transmission. Moreover,
extending the application of this model to other infectious diseases in similarly affected regions would
enhance its utility in public health management. Further validation of the model in different settings,
particularly those experiencing large-scale humanitarian crises, will also be essential for confirming
its generalizability and broadening its global relevance.
This study significantly contributes to disease modelling by demonstrating the practical utility of
LSTM networks for influenza forecasting in a war-affected region. The results open the door to
further innovations in machine learning applications for public health, particularly in environments
where traditional healthcare data and infrastructure are compromised.
7. Acknowledgements
The study was funded by the National Research Foundation of Ukraine in the framework of the
research project 2023.03/0197 on the topic “Multidisciplinary study of the impact of emergency
situations on the infectious diseases spreading to support management decision-making in the field
of population biosafety”.
8. Declaration on Generative AI
During the preparation of this paper, the authors used ChatGPT and Grammarly in order to identity
and correct grammatical errors, typos, and writing mistakes. After using these tools, the authors
reviewed and edited the content as needed and take full responsibility for the publication’s content.
9. References
[1] Y. Bai and X. Tao, “Comparison of COVID-19 and influenza characteristics,” Journal of Zhejiang
University-SCIENCE B, vol. 22, no. 2, pp. 87–98, Feb. 2021, doi: 10.1631/jzus.b2000479.
[2] M. Javanian, M. Barary, S. Ghebrehewet, V. Koppolu, V. Vasigala, and S. Ebrahimpour, “A brief
review of influenza virus infection,” Journal of Medical Virology, vol. 93, no. 8, pp. 4638–4646,
Apr. 2021, doi: 10.1002/jmv.26990.
[3] Z. Dai et al., “Risk factors for influenza B virus–associated pneumonia in adults,” American
Journal of Infection Control, vol. 48, no. 2, pp. 194–198, Aug. 2019, doi: 10.1016/j.ajic.2019.07.010.
[4] L. J. Keilman, “Seasonal Influenza (Flu),” Nursing Clinics of North America, vol. 54, no. 2, pp. 227–
243, Jun. 2019, doi: 10.1016/j.cnur.2019.02.009.
[5] World Health Organization, “Influenza (Seasonal),” Who.int, Oct. 03, 2023.
https://www.who.int/news-room/fact-sheets/detail/influenza-(seasonal) (accessed Sep. 30, 2024).
[6] A. R. Tanner, R. B. Dorey, N. J. Brendish, and T. W. Clark, “Influenza vaccination: protecting the
most vulnerable,” European Respiratory Review, vol. 30, no. 159, p. 200258, Mar. 2021, doi:
10.1183/16000617.0258-2020.
[7] A. J. Hay and J. W. McCauley, “The WHO global influenza surveillance and response system
(GISRS)—A future perspective,” Influenza and Other Respiratory Viruses, vol. 12, no. 5, pp. 551–
557, Sep. 2018, doi: 10.1111/irv.12565.
[8] N. Bushkovska, “Vaccination in the war in Ukraine,” BMJ, vol. 381, p. p963, Jun. 2023, doi:
10.1136/bmj.p963.
[9] Jeff Clyde Corpuz, “War in Ukraine: Humanitarian Interventions,” Disaster Medicine and Public
Health Preparedness, vol. 17, p. e458, Jan. 2023, doi: 10.1017/dmp.2023.123.
[10] W. A. Awuah et al., “Inside the Ukraine war: health and humanity,” Postgraduate Medical Journal,
vol. 98, no. 1160, p. postgradmedj-2022-141801, Apr. 2022, doi: 10.1136/postgradmedj-2022-
141801.
[11] R. Goto, I. Pinchuk, O. Kolodezhny, N. Pimenova, Y. Kano, and N. Skokauskas, “Mental Health
of Adolescents Exposed to the War in Ukraine,” JAMA Pediatrics, vol. 98, no. 1160, pp. 408–410,
Mar. 2024, doi: 10.1001/jamapediatrics.2024.0295.
[12] A. S. Nechyporenko, R. Radutny, V. V. Alekseeva, G. Titova, and V. V. Gargin, “Complex
Automatic Determination of Morphological Parameters for Bone Tissue in Human Paranasal
Sinuses,” The Open Bioinformatics Journal, vol. 14, no. 1, pp. 130–137, Nov. 2021, doi:
10.2174/18750362021140100130.
[13] I. Izonin, R. Tkachenko, K. Yemets, and M. Havryliuk, “An interpretable ensemble structure with
a non-iterative training algorithm to improve the predictive accuracy of healthcare data
analysis,” Scientific Reports, vol. 14, no. 1, p. 12947, Jun. 2024, doi: 10.1038/s41598-024-61776-y.
[14] L. Mochurad and R. Panto, “A Parallel Algorithm for the Detection of Eye Disease,” Lecture notes
on data engineering and communications technologies, vol. 158, pp. 111–125, Jan. 2023, doi:
10.1007/978-3-031-24475-9_10.
[15] A. Kozak, S. Fedushko, and K. Szymaniec-Mlicka, “The Role of Information and Communication
Technology in the Social and Healthcare Services Co-production,” Lecture notes on data
engineering and communications technologies, vol. 208, pp. 125–140, Jan. 2024, doi: 10.1007/978-3-
031-59131-0_6.
[16] V. M. Babaiev, I. M. Kadykova, Yu. Yu. Husieva, and I. V. Chumachenko, “The method of
adaptation of a project-oriented organization’s strategy to exogenous changes,” Naukovyi Visnyk
Natsionalnoho Hirnychoho Universytetu, vol. 2, pp. 134–140, 2017.
[17] D. Pogozhykh et al., “Experimental Evaluation of Food-Grade Semi-Refined Carrageenan
Toxicity,” International journal of molecular sciences (Online), vol. 22, no. 20, pp. 11178–11178,
Oct. 2021, doi: 10.3390/ijms222011178.
[18] M. T. Sofonea, S. Cauchemez, and P.-Y. Boëlle, “Epidemic models: why and how to use
them,” Anaesthesia Critical Care & Pain Medicine, vol. 41, no. 2, p. 101048, Apr. 2022, doi:
10.1016/j.accpm.2022.101048.
[19] M. U. Tariq and S. B. Ismail, “Deep learning in public health: Comparative predictive models for
COVID-19 case forecasting,” PloS one, vol. 19, no. 3, p. e0294289, Mar. 2024, doi:
10.1371/journal.pone.0294289.
[20] D. Chumachenko et al., “Methodology for assessing the impact of emergencies on the spread of
infectious diseases,” Radioelectronic and Computer Systems, vol. 3, no. 111, pp. 6-26, 2024, doi:
10.32620/reks.2024.3.01.
[21] H.-Y. Cheng et al., “Applying Machine Learning Models with An Ensemble Approach for
Accurate Real-Time Influenza Forecasting in Taiwan: Development and Validation
Study,” Journal of Medical Internet Research, vol. 22, no. 8, p. e15394, Aug. 2020, doi:
10.2196/15394.
[22] S. Venkatramanan et al., “Forecasting influenza activity using machine-learned mobility
map,” Nature Communications, vol. 12, no. 1, p. 726, Feb. 2021, doi: 10.1038/s41467-021-21018-5.
[23] N. G. Reich et al., “Accuracy of real-time multi-model ensemble forecasts for seasonal influenza
in the U.S.,” PLOS Computational Biology, vol. 15, no. 11, p. e1007486, Nov. 2019, doi:
10.1371/journal.pcbi.1007486.
[24] A. B. Amendolara, D. Sant, H. G. Rotstein, and E. Fortune, “LSTM-based recurrent neural network
provides effective short term flu forecasting,” BMC Public Health, vol. 23, no. 1, p. 1788, Sep. 2023,
doi: 10.1186/s12889-023-16720-6.
[25] J. Moon, S. Jung, S. Park, and E. Hwang, “Machine Learning-Based Two-Stage Data Selection
Scheme for Long-Term Influenza Forecasting,” Computers, Materials & Continua, vol. 68, no. 3,
pp. 2945–2959, 2021, doi: 10.32604/cmc.2021.017435.
[26] C. Poirier et al., “Influenza forecasting for French regions combining EHR, web and climatic data
sources with a machine learning ensemble approach,” PLOS ONE, vol. 16, no. 5, p. e0250890, May
2021, doi: 10.1371/journal.pone.0250890.
[27] S. M. Alzahrani and F. E. Guma, “Improving Seasonal Influenza Forecasting Using Time Series
Machine Learning Techniques,” Journal of Information Systems Engineering and Management,
vol. 9, no. 4, p. 30195, Sep. 2024, doi: 10.55267/iadt.07.15132.
[28] S. Jung, J. Moon, S. Park, and E. Hwang, “Self-Attention-Based Deep Learning Network for
Regional Influenza Forecasting,” IEEE Journal of Biomedical and Health Informatics, vol. 26, no.
2, pp. 922–933, Jul. 2021, doi: 10.1109/jbhi.2021.3093897.
[29] M. Soliman, V. Lyubchich, and Y. R. Gel, “Complementing the power of deep learning with
statistical model fusion: Probabilistic forecasting of influenza in Dallas County, Texas,
USA,” Epidemics, vol. 28, p. 100345, Sep. 2019, doi: 10.1016/j.epidem.2019.05.004.
[30] Z. Ertem, D. Raymond, and L. A. Meyers, “Optimal multi-source forecasting of seasonal
influenza,” PLOS Computational Biology, vol. 14, no. 9, p. e1006236, Sep. 2018, doi:
10.1371/journal.pcbi.1006236.
[31] S. Hochreiter and J. Schmidhuber, “Long Short-Term Memory,” Neural Computation, vol. 9, no.
8, pp. 1735–1780, Nov. 1997, doi: 10.1162/neco.1997.9.8.1735.