<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>H. Khudov, O. Makoveychuk, I. Butko, I. Khizhnyak. A model for prediction of geospatial data
in systems for processing geospatial information. Systems of Arms and Military Equipment</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Temperature Forecasting with LSTM: A Case Study on Kyiv Weather Data*</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Oleksandr</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Makoveichuk</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Golubenko</string-name>
          <email>o.golubenko@istu.edu.ua</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kukhtyk</string-name>
          <email>stan.kukhtyk@istu.edu.ua</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Artem</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Antonenko</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Viktoriia Bereznychenko</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andrii Iatsyshyn</string-name>
          <email>iatsyshyn.andriy@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Center for Information-Analytical and Technical Support of Nuclear Power Facilities Monitoring, National Academy of Sciences of Ukraine</institution>
          ,
          <addr-line>Akademika Palladina 34a, 03142 Kyiv</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Institute of Electrodynamics of NAS of Ukraine</institution>
          ,
          <addr-line>Beresteyskyi 56, 03057 Kyiv</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>National University of Life and Environmental Sciences of Ukraine</institution>
          ,
          <addr-line>Heroiv oborony 15, 03041 Kyiv</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Yuriy Bugay International Scientific and Technical University</institution>
          ,
          <addr-line>Volodymyrska 7, 04025 Kyiv</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <volume>128</volume>
      <issue>2021</issue>
      <fpage>123</fpage>
      <lpage>128</lpage>
      <abstract>
        <p>Accurate temperature forecasting is essential for urban planning, energy management, and environmental monitoring in smart cities. This study evaluates the use of long short-term memory (LSTM) networks for weekly average temperature (TAVG) prediction in Kyiv, using ARIMA as a baseline model. Historical temperature time series from 1960 onward were employed, and multiple look-back windows were tested to capture seasonal and long-term dependencies. Forecast performance was assessed using RMSE and MAE metrics, showing that LSTM provides more accurate predictions, while ARIMA effectively captures trends and seasonal components. Forecast errors were further analyzed via normal distribution fitting to compare model characteristics. The study emphasizes the importance of rigorous model comparison, including alternatives such as Prophet, and highlights opportunities for long-term analysis to investigate climate trends, global warming effects, or anomalies linked to astronomical, climatic, or anthropogenic factors. These findings demonstrate the potential of deep learning approaches to support data-driven decisionmaking and sustainable urban management.</p>
      </abstract>
      <kwd-group>
        <kwd>time series forecasting</kwd>
        <kwd>LSTM</kwd>
        <kwd>ARIMA</kwd>
        <kwd>seasonal trends</kwd>
        <kwd>climate data analysis 1</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Time series analysis is a statistical methodology that involves the systematic collection of
observations at fixed time intervals to detect underlying patterns and long-term trends. This
approach is widely employed to support informed decision-making and to construct accurate
forecasts based on historical data. Time series forecasting, in turn, refers to the process of predicting
future values by analyzing past trends and latent regularities. It encompasses a wide spectrum of
approaches, ranging from classical statistical models to contemporary deep learning techniques.
Each class of models is grounded in a distinct mathematical framework, reflecting different
assumptions about the structure of temporal data.</p>
      <p>In the broader context of smart cities, time series forecasting underpins numerous domains where
continuous data collection allows proactive and adaptive urban management. Forecasting models
support transportation and mobility optimization by anticipating traffic congestion and public transit
demand; enable efficient energy and utility distribution through load prediction; and contribute to
environmental monitoring by</p>
      <p>
        modeling air quality, temperature, and noise variations. In
infrastructure management, predictive analytics facilitates maintenance scheduling and enhances
the reliability of critical systems. Moreover, forecasting aids public safety by anticipating demand
for emergency services and supports evidence-based decision-making in economic planning and
healthcare management. Thus, time series forecasting serves as a cornerstone of smart city
intelligence, ensuring sustainable, resilient, and responsive urban operations [
        <xref ref-type="bibr" rid="ref1 ref2 ref3 ref4 ref5">1–5</xref>
        ].
      </p>
      <p>
        Air temperature forecasting is particularly essential for smart city applications, as it directly
influences urban planning, energy management, and public safety. Predictive insights enable the
optimized operation of heating, ventilation, and air conditioning systems, support the efficient
allocation of energy resources, and guide real-time responses to extreme weather events.
Furthermore, accurate temperature forecasts facilitate environmental monitoring, traffic
management, and health-related interventions, making them a critical component of data-driven
urban infrastructure [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Classical and Modern Forecasting Methods</title>
      <p>Forecasting time series encompasses a broad spectrum of approaches, ranging from classical
statistical models to modern deep learning techniques. Each class of models is grounded in a distinct
mathematical framework, reflecting different assumptions about the structure and dynamics of
temporal data.</p>
      <p>
        The classical linear family of models begins with the Autoregressive Integrated Moving Average
(ARIMA) model [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. ARIMA seeks to capture three key phenomena in time series: autoregression
(dependence on previous values), differencing (to remove non-stationarity), and moving averages
(dependence on past forecast errors). Formally, an ARIMA( ,  ,  ) model is expressed as
  ( )(1 −  )   =   ( )  ,
(1)
where   denotes the observed value of the time series at time  ,   ( ) = 1 −  1 −  2 2 − ⋯ −
    is the autoregressive polynomial,   ( ) = 1 −  1 −  2 2 − ⋯ −     is the
movingaverage polynomial,  is the backshift operator (   =   −1, i.e.,     =   − ),  is the order of
differencing, and   is white noise. A diagram illustrating the ARIMA model is shown in Figure 1.
      </p>
      <p>This compact structure makes ARIMA particularly powerful for capturing linear correlations
across time.</p>
      <p>
        An extension of this framework, the Seasonal ARIMA (SARIMA) model [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], introduces seasonal
components into the ARIMA structure. A SARIMA( ,  ,  )( ,  ,  ) process is given by
  (  )  ( )(1 −  ) (1 −   )   =   (  )  ( )  ,
(2)
where  denotes the seasonal period (e.g., 12 for monthly data with annual seasonality). Here,
  (  ) and   (  ) represent seasonal autoregressive and moving-average polynomials, while
(1 −   ) models seasonal differencing. This allows SARIMA to explicitly encode recurrent seasonal
fluctuations, a central feature of many economic and meteorological datasets [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
      </p>
      <p>
        The advantage of SARIMA models over traditional ARIMA models lies in their inherent ability to
explicitly account for seasonality. This feature is crucial when the time series exhibits recurring
patterns over fixed intervals, such as monthly sales, quarterly financial data, or annual climatic
cycles. While ARIMA models effectively handle various time series through their autoregressive,
differencing, and moving average components, they may struggle to capture periodic seasonal
patterns, potentially reducing forecast accuracy [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
      </p>
      <p>
        While ARIMA-type models are linear, more recent frameworks adopt nonlinear and additive
formulations. A widely used example is Prophet, developed by Facebook (Meta) [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. Prophet assumes
that a time series can be decomposed into interpretable components — trend, seasonality, holiday
effects, and noise — via an additive model:
      </p>
      <p>=   +   + ℎ +   ,
default, defined as
holiday or event-driven shocks, and</p>
      <p>is the error term.
where   captures long-term growth,   represents periodic seasonal fluctuations, ℎ accounts for
The trend component   can take two primary forms. The piecewise linear model is Prophet’s
  = {  +  ,
 ≤    +  +  ( −  ),
 &gt;  ,
(3)
(4)
(5)
(6)
events such as public holidays or sales campaigns.
where 
denotes the seasonal period (e.g., 365.25 for yearly effects) and 
is the number of
harmonics. This flexible basis expansion allows Prophet to capture complex periodic patterns beyond
simple sinusoids. Weekly seasonality, by contrast, is often represented by dummy variables
indicating weekdays, while holidays ℎ are introduced as binary regressors marking known calendar</p>
      <p>
        Despite its simplicity of form, Prophet relies on Bayesian estimation to identify optimal trend
change points, regularize parameters, and quantify forecast uncertainty. As such, it provides a
balance between interpretability and automation, making it suitable for business and policy
applications [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
      </p>
      <p>Beyond additive statistical models, the field has increasingly turned to deep learning
architectures, which can approximate highly nonlinear dynamics. A Recurrent Neural Network (RNN)
is a class of artificial neural networks designed to process sequential data by maintaining a hidden
state that captures information from previous time steps. Unlike feedforward networks, which
assume independence between inputs, RNNs incorporate temporal dependencies by recurrently
where  is the slope,  the offset,  a change point, and  the change in slope.</p>
      <p>Alternatively, the logistic growth model describes nonlinear growth saturating at a carrying
capacity  :</p>
      <p>=</p>
      <p>1 +  − ( − ),
with growth rate  and offset  . This is suitable for scenarios where growth slows as a limit is
approached, such as product adoption or population dynamics.</p>
      <p>
        The seasonality component   is modeled using a truncated Fourier series:

 =1
  =
     
+   
,
updating their hidden state based on both the current input and the prior hidden state. This structure
makes RNNs particularly suitable for modeling time series, natural language, and other sequential
data, as they can, in principle, learn long-term dependencies within sequences [
        <xref ref-type="bibr" rid="ref13 ref14">13, 14</xref>
        ].
      </p>
      <p>RNN is the foundational structure, defined recursively as
  =  (    −1 +     +  ),
  =     +  ,
(7)
(8)
where   is the hidden state,   the input at time  ,  a nonlinear activation function,  ,  biases, and
  ,   ,   weight matrices. A schematic representation of the RNN model is presented in Figure 2.</p>
      <p>
        Long Short-Term Memory (LSTM) networks are an extension of Recurrent Neural Networks
(RNNs) specifically designed to overcome the vanishing and exploding gradient problems that limit
the ability of standard RNNs to capture long-term dependencies. LSTMs introduce a memory cell
along with gating mechanisms—input, forget, and output gates—that regulate the flow of
information. This architecture enables the network to selectively retain, update, or discard
information across long sequences, making LSTMs particularly effective for complex time series
forecasting, speech recognition, and natural language processing tasks [
        <xref ref-type="bibr" rid="ref15 ref16">15, 16</xref>
        ].
      </p>
      <p>LSTM networks can be described by the following set of equations:
  =  (    +     −1 +   ),
  =  (    +     −1 +   ),
  =  (    +     −1 +   ),
 ̃ =  (    +     −1 +   ),
  =   ⨀   −1 +   ⨀  ̃ −1,
ℎ =   ⨀</p>
      <p>
        ℎ   ℎ   ,
where the symbols denote:
  — the input vector at time step  ;
  −1 — the hidden state from the previous time step  − 1;
(9)
(10)
(11)
(12)
(13)
(14)
  — the memory cell state at time step  ;

 — the input gate, controlling how much new information is added to the memory;
  — the forget gate, regulating which portion of the previous memory is retained;
  — the output gate, determining which part of the memory contributes to the output;
 ̃ — the candidate memory cell, representing new information to be potentially incorporated;
  ,   ,   ,   — weight matrices applied to the input   ;
  ,   ,   ,   — weight matrices applied to the previous hidden state   −1
;
  — the hidden state at time step  , also serving as the output and input to the next time step;
  ,   ,   ,   — bias vectors;
 () — the sigmoid activation function, constraining values to the range [
        <xref ref-type="bibr" rid="ref1">0,1</xref>
        ];
 
ℎ
      </p>
      <p>
        ℎ () — the hyperbolic tangent function, mapping values to [
        <xref ref-type="bibr" rid="ref1">-1,1</xref>
        ];
⊙— element-wise (Hadamard) multiplication, applied to vectors of equal dimension, e.g.,
 ⨀  = [ 1 1,  2 2,  3 3, ⋯ ,     ].
(15)
This element-wise multiplication allows the gating vectors  ,
   , and  
to selectively modulate
the memory and hidden states, enabling LSTMs to retain, update, or output information in a
controlled manner. A diagram illustrating the LSTM model is shown in Figure 3.
domain], via Wikimedia Commons (By Guillaume Chevalier - File:The_LSTM_Cell.svg, CC BY-SA
      </p>
      <p>
        Variants such as Bidirectional LSTMs (BLSTM) process sequences in both forward and backward
directions, while Gated Recurrent Units (GRUs) simplify the gating mechanism yet maintain
competitive performance in modeling sequential dependencies [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ].
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Experimental validation</title>
      <p>
        The dataset comprises historical daily temperature observations for Kyiv, which is part of the Kaggle
dataset covering the period 1881–2017 [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. For demonstration purposes, only the subset starting
from January 1, 2001, is considered. Although the original dataset includes minimum (TMIN),
maximum (TMAX), and average (TAVG) temperatures, this study focuses exclusively on the weekly
average temperature (TAVG). The daily data were aggregated to a weekly frequency, resulting in
consecutive weekly averages spanning several decades. For model evaluation, the training period
includes data prior to January 1, 2010, while all subsequent observations constitute the test period.
      </p>
      <p>Forecasting experiments were conducted using multiple look-back windows, which determine
the number of preceding weekly observations used as input for the model. Specifically, four
lookback windows were considered: 4 weeks (approximately one month), 13 weeks (one quarter), 52
weeks (one year), and 104 weeks (two years). These windows were chosen to capture short-term,
seasonal, and long-term temporal dependencies, allowing the models to learn patterns ranging from
monthly variability to multi-year trends.</p>
      <p>The dataset provides sufficient temporal coverage and variability, enabling robust evaluation of
short- and medium-term temperature forecasting performance. Representative weekly TAVG
profiles during the test period illustrate both seasonal fluctuations and interannual variability,
demonstrating the suitability of the dataset for developing predictive models. Figure 4 shows the
Kyiv temperature time series divided into training and testing subsets. This structured data serves
as the input for the LSTM-based forecasting model, which leverages the sequential nature of the
observations to capture temporal dependencies and generate accurate future temperature
predictions.</p>
      <p>The predictive model is based on a LSTM neural network, designed to capture temporal
dependencies in TAVG series. The input layer receives sequences of length equal to the chosen
lookback window, with a single feature per time step corresponding to the temperature values. This is
followed by an LSTM layer comprising 64 units, which incorporates a recurrent dropout rate of 0.2
to mitigate overfitting by randomly deactivating recurrent connections during training. A
subsequent Dropout layer with a rate of 0.2 further regularizes the network by reducing
coadaptation between neurons. Finally, a fully connected Dense layer with a single neuron produces
the forecasted temperature for the subsequent week. The model is compiled using the mean squared
error (MSE) loss function and optimized with the Adam algorithm. Training is performed over 50
epochs with a batch size of 16, and 20% of the training data is reserved for validation. The network
learns to map past weekly temperature patterns to future values, capturing both seasonal
fluctuations and long-term trends.</p>
      <p>Training was performed over 50 epochs with a batch size of 16, using 20% of the training data for
validation to monitor generalization performance. The resulting training history, including both
training and validation losses, was retained for subsequent analysis and visualization, allowing
evaluation of convergence and overfitting. This architecture and training protocol facilitate accurate
one-step-ahead temperature forecasting by leveraging the temporal structure of the input sequences.</p>
      <p>As a baseline, the ARIMA model was implemented to provide a comparative reference against the
deep learning approach. To determine the optimal model configuration, a systematic grid search was
performed over all parameter combinations ( ,  ,  )ranging from (0,0,1)to (4,1,4), excluding the
trivial case (0,0,0). For each candidate model, forecasts were generated using the training subset,
and the corresponding predictive performance was evaluated on the test set. The evaluation metric
employed was the root mean squared error (RMSE), which quantifies relative deviations between
predicted and observed values and facilitates consistent comparison across datasets. The ARIMA
model yielding the minimum RMSE was selected as the optimal baseline configuration for
subsequent analysis [19].</p>
      <p>As shown in Figure 5, LSTM forecasts more closely match the observed TAVG values than ARIMA
forecasts (RMSE = 3.74 vs. 4.15), whereas ARIMA exhibits a smoother and slightly shifted profile.
Given the pronounced seasonal patterns, ARIMA could alternatively capture the underlying trend
and seasonal components. Forecast errors over the test period (Figure 6) yield MAE = 2.93 for LSTM
and MAE = 3.40 for ARIMA.</p>
      <p>To further investigate the forecast performance, the distribution of forecast errors for TAVG was
analyzed for both LSTM and ARIMA models. Histograms of the errors were constructed, spanning
the interval from –15 °C to 15 °C, and overlaid with fitted normal distributions to approximate the
underlying error patterns. For each model, the mean (μ), standard deviation (σ), and the location of
the distribution peak were estimated, providing insights into the bias and dispersion of the
predictions. Figure 7 presents the error distributions for LSTM and ARIMA forecasts, enabling a
direct comparison of their predictive characteristics.</p>
      <p>The LSTM errors are centered closer to zero (μ = 0.29) with a narrower spread (σ = 3.7 7) and a
peak near zero (peak = 0.34), indicating minimal bias and more consistent predictions. In contrast,
the ARIMA errors exhibit a larger mean deviation from zero (μ = 0.73), a wider spread (σ = 4.09), and
a peak shifted further from zero (peak = 0.83), reflecting a smoother and phase-shifted forecast
profile. These statistics confirm that LSTM more accurately captures short-term fluctuations and
seasonal patterns in the temperature time series, providing more reliable and less biased forecasts
compared to the ARIMA baseline.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Conclusion</title>
      <p>This study has investigated the feasibility and performance of deep learning–based forecasting for
weekly average temperatures (TAVG) in Kyiv, using LSTM networks as the primary predictive model
and ARIMA as a baseline. The analysis demonstrated that the LSTM model outperforms ARIMA in
terms of both RMSE and MAE, providing forecasts that more closely align with actual observations,
while ARIMA offers smoother predictions that can capture underlying trends and seasonal patterns.
Forecast errors were further analyzed using normal approximation, allowing characterization of the
error distributions and highlighting differences between the two modeling approaches.</p>
      <p>The results indicate that weekly temperature series exhibit pronounced seasonal behavior, and
multi-week look-back windows enable the LSTM model to effectively learn both short- and
mediumterm dependencies. These findings highlight the potential of LSTM-based forecasting to support
climate-related decision-making, urban planning, and environmental monitoring in a smart city
context. To ensure robust evaluation, it is essential to compare LSTM forecasts with other models,
such as ARIMA and Prophet, to contextualize performance and assess the advantages of deep
learning relative to classical statistical approaches. Future work may explore hybrid modeling
approaches that combine LSTM and statistical models to improve trend and seasonal component
estimation, extend forecasts to longer horizons, and incorporate additional meteorological or
environmental variables to enhance prediction accuracy. Overall, the study provides a robust
framework for accurate, data-driven temperature forecasting and demonstrates the practical
advantages of deep learning methods over traditional time series models. Additionally, long-term
analysis of historical temperature data could reveal effects of global warming or other anomalies.
Such analyses could further explore correlations with astronomical factors (e.g., solar activity),
climatic phenomena (e.g., El Niño and La Niña), and anthropogenic influences (e.g., urbanization,
reservoir water levels, agricultural engineering), providing deeper insight into the drivers of
temperature variability.</p>
    </sec>
    <sec id="sec-5">
      <title>Declaration on Generative AI</title>
      <p>The authors have not employed any Generative AI tools.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ren</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Qian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Duffield</surname>
          </string-name>
          .
          <source>Towards Invariant Time Series Forecasting in Smart Cities. arXiv preprint arXiv:2405</source>
          .
          <fpage>05430</fpage>
          . (
          <year>2024</year>
          ). https://arxiv.org/abs/2405.05430
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Q.</given-names>
            <surname>Shao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Piao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Yao</surname>
          </string-name>
          , et al.
          <article-title>An adaptive composite time series forecasting model for short - term traffic flow</article-title>
          .
          <source>Journal of Big Data</source>
          ,
          <volume>11</volume>
          ,
          <issue>102</issue>
          (
          <year>2024</year>
          ). https://doi.org/10.1186/s40537-024-00967-w
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>S.</given-names>
            <surname>Liang</surname>
          </string-name>
          .
          <article-title>Predicting short-term urban bike sharing demand in a smart city</article-title>
          .
          <source>Computers, Environment and Urban Systems</source>
          ,
          <volume>88</volume>
          ,
          <issue>101703</issue>
          (
          <year>2026</year>
          ). https://doi.org/10.1016/j.compenvurbsys.
          <year>2021</year>
          .101703
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>T. González</given-names>
            <surname>Grandón</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Schwenzer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Steens</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Breuing</surname>
          </string-name>
          .
          <article-title>Electricity demand forecasting with hybrid classical statistical and machine learning algorithms</article-title>
          <source>: Case study of Ukraine. Applied Energy</source>
          ,
          <volume>355</volume>
          (C),
          <volume>122249</volume>
          (
          <year>2024</year>
          ). https://doi.org/10.1016/j.apenergy.
          <year>2023</year>
          .122249
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>I.</given-names>
            <surname>Didych</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Mykytyshyn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Stanko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mytnyk</surname>
          </string-name>
          .
          <article-title>Application of machine learning methods to the prediction of NO₂ concentration in the air environment</article-title>
          .
          <source>CEUR Workshop Proceedings</source>
          ,
          <volume>3896</volume>
          ,
          <fpage>15</fpage>
          -
          <lpage>18</lpage>
          (
          <year>2023</year>
          ). Available at: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>3896</volume>
          /short15.pdf
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <article-title>[6] Real-Time Temperature Prediction Models for Smart City Applications</article-title>
          . https://www.researchgate.net/publication/388194104_RealTime_Temperature_Prediction_Models_for_Smart_City_Applications
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>G. E. P.</given-names>
            <surname>Box</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. M.</given-names>
            <surname>Jenkins</surname>
          </string-name>
          .
          <source>Time Series Analysis: Forecasting and Control</source>
          . Prentice
          <string-name>
            <surname>Hall</surname>
          </string-name>
          (
          <year>1970</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>R. J.</given-names>
            <surname>Hyndman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Athanasopoulos</surname>
          </string-name>
          . Forecasting:
          <article-title>Principles and Practice</article-title>
          .
          <source>OTexts</source>
          (
          <year>2021</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>V. S.</given-names>
            <surname>Yavuz</surname>
          </string-name>
          .
          <article-title>Forecasting monthly rainfall and temperature patterns in Van Province, Türkiye using ARIMA and SARIMA models</article-title>
          .
          <source>Journal of Water and Climate Change</source>
          (
          <year>2025</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <article-title>GeeksforGeeks</article-title>
          .
          <article-title>ARIMA vs. SARIMA model</article-title>
          . Available at: https://www.geeksforgeeks.
          <article-title>org/machine-learning/arima-vs-sarima-model/</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>S. J.</given-names>
            <surname>Taylor</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Letham</surname>
          </string-name>
          . Prophet: forecasting at scale.
          <source>PeerJ Preprints</source>
          ,
          <volume>6</volume>
          (
          <year>2018</year>
          ). https://doi.org/10.7287/peerj.preprints.3190v2
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>L.</given-names>
            <surname>Menculini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Marini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Proietti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Moretti</surname>
          </string-name>
          .
          <article-title>Comparing Prophet and Deep Learning to ARIMA in Forecasting Wholesale Food Prices</article-title>
          . arXiv:
          <volume>2107</volume>
          .12770 (
          <year>2021</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>J. L.</given-names>
            <surname>Elman</surname>
          </string-name>
          .
          <article-title>Finding structure in time</article-title>
          .
          <source>Cognitive Science</source>
          ,
          <volume>14</volume>
          (
          <issue>2</issue>
          ),
          <fpage>179</fpage>
          -
          <lpage>211</lpage>
          (
          <year>1990</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>I.</given-names>
            <surname>Goodfellow</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Bengio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Courville</surname>
          </string-name>
          . Deep Learning. MIT Press (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>S.</given-names>
            <surname>Hochreiter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Schmidhuber</surname>
          </string-name>
          .
          <article-title>Long short-term memory</article-title>
          .
          <source>Neural Computation</source>
          ,
          <volume>9</volume>
          (
          <issue>8</issue>
          ),
          <fpage>1735</fpage>
          -
          <lpage>1780</lpage>
          (
          <year>1997</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>K.</given-names>
            <surname>Greff</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. K.</given-names>
            <surname>Srivastava</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Koutník</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. R.</given-names>
            <surname>Steunebrink</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Schmidhuber</surname>
          </string-name>
          .
          <article-title>LSTM: A search space odyssey</article-title>
          .
          <source>IEEE Transactions on Neural Networks and Learning Systems</source>
          ,
          <volume>28</volume>
          (
          <issue>10</issue>
          ),
          <fpage>2222</fpage>
          -
          <lpage>2232</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <article-title>GeeksforGeeks. Bidirectional LSTM in NLP</article-title>
          . Available at: https://www.geeksforgeeks.org/nlp/bidirectional
          <article-title>-lstm-in-nl</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <surname>Kyiv</surname>
          </string-name>
          , Ukraine Weather Statistics Dataset. Available at: https://www.kaggle.com/datasets/thedevastator/kyiv-ukraine
          <string-name>
            <surname>-</surname>
          </string-name>
          weather-statistics
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>