<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Investigation of Hybrid Deep Learning Networks in Forecasting Energy Supply</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Yuriy Zaychenko</string-name>
          <email>zaychenkoyuri@ukr.net</email>
          <xref ref-type="aff" rid="aff0">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Helen Zaichenko</string-name>
          <xref ref-type="aff" rid="aff0">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Oleksii Kuzmenko</string-name>
          <email>oleksii.kuzmenko@ukr.net</email>
          <xref ref-type="aff" rid="aff0">1</xref>
        </contrib>
        <aff id="aff0">
          <label>1</label>
          <institution>National Technical University of Ukraine “Igor Sikorsky Kyiv Polytechnic Institute”, Institute for Applied System Analysis</institution>
          ,
          <addr-line>Prospect Beresteiskyi, 37, Kyiv, 03056</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this paper intelligent methods for solving the problem of short- and middle-term forecasting of electricity sales to ultimate customers were investigated. Experimental studies of hybrid GMDH-neo-fuzzy networks, GMDH and ARIMA for short- and middle-term forecasting have been conducted. The accuracy of the obtained forecasts was compared, and the expediency of applying the investigated artificial intelligence methods for the considered forecasting intervals is substantiated.</p>
      </abstract>
      <kwd-group>
        <kwd>Hybrid DL network</kwd>
        <kwd>GMDH</kwd>
        <kwd>ARIMA</kwd>
        <kwd>short-term</kwd>
        <kwd>middle-term forecasting</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-2">
      <title>1. Introduction</title>
      <p>2023 Copyright for this paper by its authors.</p>
      <p>Problems of forecasting non-stationary time series and market indexes at stock exchanges
attract great attention of managers of enterprises and of scientific researchers. For their solution,
powerful statistical methods, first of all ARIMA [1, 2], were developed and have long been applied.
In recent years various intelligent methods and technologies, such as fuzzy logic systems and neural
networks, have also been proposed and widely used for forecasting in economics and technology.
Based on the two previously studied classes of hybrid DL networks, their effectiveness for
forecasting in the financial sphere, including the overall training time, was compared.</p>
      <p>In previous papers these methods were applied and investigated in the problem of forecasting
the market indices Dow Jones Industrial Average and NASDAQ in the financial sphere. It is therefore
interesting to investigate the effectiveness of ARIMA, GMDH and hybrid DL networks in
other areas, such as technology and production, specifically in short- and middle-term forecasting
tasks. The goal of this paper is to investigate the accuracy of the intelligent methods (hybrid DL
networks, GMDH and ARIMA) on the problem of forecasting Electricity Sales to Ultimate Customers,
Residential (USA; data retrieved June 7, 2023) at different forecasting intervals (short-term and middle-term),
to compare their efficiency and to determine which computational intelligence methods are the most
promising for forecasting in the economy and technology.</p>
    </sec>
    <sec id="sec-3">
      <title>2. A review of the evolving hybrid GMDH-neo-fuzzy network</title>
      <p>The architecture of the evolving hybrid DL network is shown in Fig. 1. The input of the system
accepts an (n × 1)-dimensional vector of input signals. This signal is fed to the first hidden layer,
which contains N₁ = C²ₙ nodes, each of which has exactly two inputs.</p>
      <p>
        The outputs ŷ[1] of the first hidden layer form the output signals that are transmitted to the
selection block located after the first hidden layer. Among the output signals ŷ[1] it selects the
n₁* (where n₁* = F is the so-called freedom of choice) most precise signals by some chosen
criterion (mostly by the mean squared error σ²[1]).
      </p>
      <p>
        From these n₁* best outputs of the first hidden layer ŷ[1]*, C²F pairwise combinations
(ŷᵢ[1]*, ŷⱼ[1]*) are formed. These signals are fed to the second hidden layer, which is formed by
neurons N[2]. After training these neurons, the output signals of this layer ŷ[2] are transferred to
the selection block SB[2], which chooses the F best neurons by accuracy (e.g. by the value of
σ²[2]) if the best signal of the second layer is better than the best signal of the first hidden
layer ŷ₁[1]*.
      </p>
      <p>
        The other hidden layers work in a similar way. The evolution of the system continues until the
best signal of the selection block SB[s+1] is worse than the best signal received at the previous
s-th layer. After that, one returns to the previous layer to select the best node neuron N[s], which
has some output signal ŷ[s]. The sequential movement from this neuron (node) back along its
connections passes through all previous layers, which makes it possible to build the resulting
structure of the GMDH-neo-fuzzy network.
      </p>
      <p>
        As a result, due to the GMDH algorithm, it is possible to obtain a well-trained network with an
optimal structure that is synthesized automatically. High-dimensionality problems, as well as
vanishing or exploding gradients, are avoided because the learning is performed sequentially,
layer by layer.
      </p>
    </sec>
    <sec id="sec-4">
        <title>2.1. The role of the Neo-fuzzy neuron in the hybrid GMDH system</title>
      <p>
        Fig. 2 shows the architecture of the node selected as the basic element of the proposed GMDH
system. This is a neo-fuzzy neuron (NFN) proposed by Takeshi Yamakawa et al. in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Its learning criterion can be minimized using a conventional stochastic gradient descent
algorithm.
      </p>
      <p>
        In the case of a predefined dataset, the training process can be performed in a single epoch in batch
mode. For this purpose the conventional least squares method is used [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. The fuzzy inference is realized in the form: if xᵢ is Xᵢⱼ, then the output is wᵢⱼ, where wᵢⱼ is the
synaptic weight in the consequent and Xᵢⱼ is a fuzzy set whose membership function is μᵢⱼ [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ].
      </p>
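A minimal sketch of the neo-fuzzy neuron's inference under the rule "if xᵢ is Xᵢⱼ then the output is wᵢⱼ". It assumes the standard choice of complementary triangular membership functions on a uniform grid; the grid, number of terms and weight values below are illustrative, not taken from the paper.

```python
import numpy as np

def triangular_memberships(x, centers):
    # Complementary triangular membership functions on a uniform grid:
    # for any x inside the grid exactly two neighbours fire and sum to 1.
    x = float(np.clip(x, centers[0], centers[-1]))
    mu = np.zeros(len(centers))
    j = int(np.searchsorted(centers, x))
    if j == 0:
        mu[0] = 1.0
    else:
        left, right = centers[j - 1], centers[j]
        mu[j] = (x - left) / (right - left)
        mu[j - 1] = 1.0 - mu[j]
    return mu

def neo_fuzzy_output(xs, centers, weights):
    # y_hat = sum_i f_i(x_i), with nonlinear synapses
    # f_i(x_i) = sum_j w_ij * mu_ij(x_i)
    return sum(w @ triangular_memberships(x, centers)
               for x, w in zip(xs, weights))

centers = np.linspace(0.0, 1.0, 5)                # h = 5 linguistic terms
weights = [np.array([0.0, 0.2, 0.5, 0.8, 1.0]),   # synapse f_1 (illustrative)
           np.array([1.0, 0.5, 0.0, 0.5, 1.0])]   # synapse f_2 (illustrative)
y_hat = neo_fuzzy_output([0.3, 0.9], centers, weights)
```

Because at most two memberships are non-zero per input, each synapse output is a piecewise-linear interpolation between two neighbouring weights, which is what makes the neuron both nonlinear and cheap to train.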
    </sec>
    <sec id="sec-5">
      <title>2.2. Neo-fuzzy neuron training algorithm</title>
      <p>
        The standard local quadratic error function is used as the goal function (i.e., the learning criterion):
E(k) = ½ (y(k) − ŷ(k))² = ½ e²(k) = ½ (y(k) − ∑ᵢ ∑ⱼ wᵢⱼ μᵢⱼ(xᵢ(k)))².   (1)
The neo-fuzzy neuron is a nonlinear system with one output and several inputs; in the proposed
GMDH system neo-fuzzy neurons with only two inputs are used. They implement the mapping
ŷ(k) = ∑ᵢ fᵢ(xᵢ(k)) = ∑ᵢ ∑ⱼ wᵢⱼ μᵢⱼ(xᵢ(k)),   (2)
where ŷ is the output of the system and xᵢ is the input signal (i = 1, 2, …, n). The nonlinear
synapses fᵢ are the building blocks of a neo-fuzzy neuron; their task is to convert the input signal
according to (2).
      </p>
      <p>
        For a predefined dataset the weights can be estimated in batch mode by the least squares method:
w = (∑ₖ μ(k) μᵀ(k))⁺ ∑ₖ μ(k) y(k),   (3)
where (•)⁺ denotes the Moore–Penrose pseudo-inverse and y(k) is assumed to be the real value of
the external reference signal.
      </p>
      <p>
        With the sequential receipt of training observations, i.e., in the online mode, the recurrent form
of the algorithm can be represented as
w(k) = w(k − 1) + P(k − 1) (y(k) − wᵀ(k − 1) μ(x(k))) μ(x(k)) / (1 + μᵀ(x(k)) P(k − 1) μ(x(k))),   (4)
P(k) = P(k − 1) − P(k − 1) μ(x(k)) μᵀ(x(k)) P(k − 1) / (1 + μᵀ(x(k)) P(k − 1) μ(x(k))).   (5)
      </p>
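The recurrent (online) least-squares update of the weight vector can be written directly in code. This is a minimal sketch under the usual assumptions: μ is the vector of membership values, P is initialised as a large multiple of the identity, and the data below are synthetic stand-ins, not the paper's series.

```python
import numpy as np

def rls_step(w, P, mu, y):
    # w(k) = w(k-1) + P(k-1)(y(k) - w'(k-1)mu) mu / (1 + mu' P(k-1) mu)
    # P(k) = P(k-1) - P(k-1) mu mu' P(k-1) / (1 + mu' P(k-1) mu)
    Pmu = P @ mu
    denom = 1.0 + mu @ Pmu
    w = w + Pmu * (y - w @ mu) / denom
    P = P - np.outer(Pmu, Pmu) / denom
    return w, P

rng = np.random.default_rng(1)
w_true = np.array([0.5, -1.0, 2.0])   # unknown weights to recover
w = np.zeros(3)
P = 1e3 * np.eye(3)                   # large initial "covariance"
for _ in range(200):
    mu = rng.normal(size=3)           # stand-in for membership values mu(x(k))
    y = w_true @ mu                   # noise-free reference signal
    w, P = rls_step(w, P, mu, y)
print(w)                              # converges toward w_true
```

Each step costs O(h²) for h membership functions, so the online mode avoids both storing the whole sample and inverting any matrix.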
    </sec>
    <sec id="sec-5b">
      <title>3. Data set</title>
      <p>As the data set for forecasting, monthly Electricity Sales to Ultimate Customers, Residential
(USA) from 01-2002 till 01-2023 were taken. The whole sample consisted of 251 instances and was
divided into training and test subsamples. The dynamics of the monthly energy supply to ultimate
residential customers (USA) is shown in Fig. 3. Each value is strongly correlated with the preceding
and succeeding values, and the process is periodic with a period of 6 months. The autocorrelation
function (ACF) of this power-supply process is shown in Fig. 5. The check for stationarity of the
process was performed using the Dickey-Fuller test.</p>
      <p>P-value: 0.5117527467140699 &gt; 0.05. As follows from this test, the initial time series is not
stationary. Using differencing, the time series was transformed to a stationary one, which is confirmed
by the Dickey-Fuller test with P-value: 1.3594288749888985e-14 &lt; 0.05.</p>
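The two preprocessing steps used here, the sample ACF and first differencing, are elementary to reproduce. The sketch below uses a synthetic trend-plus-6-month-cycle series standing in for the real sales data (which are not reproduced in this excerpt); in practice `adfuller` from `statsmodels.tsa.stattools` would supply the Dickey-Fuller p-values quoted above.

```python
import numpy as np

def acf(x, max_lag):
    # Sample autocorrelation function, as plotted in Fig. 5
    x = x - x.mean()
    denom = (x * x).sum()
    return np.array([(x[:len(x) - k] * x[k:]).sum() / denom
                     for k in range(max_lag + 1)])

# Synthetic stand-in: linear trend + 6-month periodicity + noise
k = np.arange(251)
series = (0.5 * k + 10 * np.sin(2 * np.pi * k / 6)
          + np.random.default_rng(2).normal(size=251))

diff = np.diff(series)        # first differencing removes the trend
print(acf(diff, 12).round(2)) # peaks near lags 6 and 12 reveal the period
```

On the differenced series the trend is gone (its mean is just the trend slope), while the seasonal structure remains visible as large ACF values at multiples of the period.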
    </sec>
    <sec id="sec-6">
      <title>4. Experimental investigations</title>
      <p>The investigations explored the forecasting accuracy of hybrid DL neo-fuzzy networks at
various forecasting intervals: short-term forecasting with intervals of 1, 3, 5 and 7 days and
middle-term forecasting with intervals of 10 and 20 days. At the first step the variable experimental
parameters of the hybrid network were chosen; they are presented in Table 1.</p>
      <p>The optimization of these parameters was performed; as a result the following optimal values
were determined: inputs: 3; linguistic variables: 3; training/test ratio: 0.7.</p>
      <p>After that, the structure of the hybrid DL neo-fuzzy network was optimized using the
GMDH method. The process of structure generation is presented in Table 2.</p>
      <p>As a result, the optimal structure of three layers was determined: 3 inputs at the first layer, two
neurons at the second layer, and one output neuron at the third layer.</p>
      <p>Further, the training of the best hybrid network was carried out using SGD (stochastic
gradient descent) with a variable step. The flow chart of forecasting results for the interval of 1 day
is presented in Fig. 6. The values of MSE and MAPE for this experiment are shown in Table 3.</p>
      <p>Fig. 6 also shows the flow chart of MAPE values for the best model of the hybrid network.</p>
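The MSE and MAPE criteria reported in Tables 3-6 have their standard definitions; a small sketch (the arrays are illustrative, not the paper's results):

```python
import numpy as np

def mse(y_true, y_pred):
    # Mean squared error
    return np.mean((y_true - y_pred) ** 2)

def mape(y_true, y_pred):
    # Mean absolute percentage error, in percent; assumes y_true != 0
    return 100.0 * np.mean(np.abs((y_true - y_pred) / y_true))

y_true = np.array([120.0, 110.0, 130.0, 125.0])   # illustrative actuals
y_pred = np.array([118.0, 112.0, 127.0, 126.0])   # illustrative forecasts
print(mse(y_true, y_pred), mape(y_true, y_pred))
```

MAPE is scale-free, which is why it is the natural criterion for comparing methods across forecasting intervals, while MSE penalizes large errors more heavily.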
      <p>Further, similar experiments with the hybrid network were performed with forecasting intervals
of 3, 5, 10 and 20 days. After optimization of the parameters and structure, the hybrid network was
trained on the training subsample. The forecasting accuracy on the test sample for the interval of
3 days is presented in Table 4. In the succeeding experiments the forecasting accuracy of the hybrid
neo-fuzzy network was investigated with forecasting intervals of 5, 10 and 20 days.</p>
      <p>Table 5 presents the forecasting accuracy (Min, Avg and Max values of the criteria) of the
hybrid NFN optimal structure with a forecasting interval of 10 days, and Table 6 with a forecasting
interval of 20 days.</p>
      <p>To estimate the forecasting accuracy of the hybrid DL network, it was compared with alternative
methods: ARIMA and GMDH. The forecasting accuracy of GMDH for the interval of 1 day is shown
in Table 7 and for 5 days in Table 8. The flowchart of forecasting results for the interval of 5 days is
shown in Fig. 7 and for 20 days in Fig. 8.</p>
      <p>In the next experiments the forecasting efficiency of the ARIMA method was investigated and
analyzed. After preliminary investigations the optimal parameters for ARIMA were found and used
in the following experiments. The forecasting accuracy of ARIMA for the interval of 1 day is
presented in Table 9 and for the interval of 5 days in Table 10. The flowchart of real and forecast
values for ARIMA with the interval of 20 days is shown in Fig. 9.</p>
      <p>Comparative experiments were performed in which the accuracy of forecasting by the hybrid DL
network, GMDH and ARIMA at the different forecasting intervals was estimated and compared. The
corresponding results are presented in Tables 11, 12 and Fig. 10, 11.</p>
      <p>Analyzing the results presented in Fig. 10 and 11, one may conclude that the GMDH method
appears to be the best at short-term forecasting (1, 3 days), which complies with the theory.</p>
      <p>Hybrid deep learning neo-fuzzy networks are the best at middle-term forecasting (5, 7, 10,
20 days). ARIMA appeared to be the worst by accuracy as compared with the intelligent methods:
hybrid DL networks and GMDH.</p>
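ARIMA(p, d, q) fits an ARMA model to the d-times differenced series and then integrates the forecasts back. The paper does not state the orders or implementation used, so the following is a deliberately stripped-down stand-in, an AR(p) on first differences, i.e. ARIMA(p, 1, 0), fitted by ordinary least squares on a synthetic seasonal series:

```python
import numpy as np

def fit_ar(x, p):
    # OLS fit of x[k] ~ c + a1*x[k-1] + ... + ap*x[k-p]
    n = len(x)
    X = np.column_stack([np.ones(n - p)] +
                        [x[p - j:n - j] for j in range(1, p + 1)])
    coef, *_ = np.linalg.lstsq(X, x[p:], rcond=None)
    return coef

def forecast_arima_p10(series, p, steps):
    # ARIMA(p,1,0): model the first differences with an AR(p), then
    # integrate the predicted differences back to the original level.
    d = np.diff(series)
    coef = fit_ar(d, p)
    hist = list(d)
    level = series[-1]
    preds = []
    for _ in range(steps):
        lags = hist[-p:][::-1]                 # d[k-1], ..., d[k-p]
        d_next = coef[0] + np.dot(coef[1:], lags)
        hist.append(d_next)
        level += d_next
        preds.append(level)
    return np.array(preds)

# Synthetic stand-in with trend and 6-step periodicity, like the sales data
k = np.arange(120)
series = 100 + 0.3 * k + 8 * np.sin(2 * np.pi * k / 6)
pred = forecast_arima_p10(series[:110], p=6, steps=10)  # p covers one period
```

Setting p to the seasonal period lets the AR part track the cycle after differencing has removed the trend; a full ARIMA implementation would add the MA(q) term and choose (p, d, q) by an information criterion.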
    </sec>
    <sec id="sec-7">
      <title>5. Conclusion</title>
      <p>In this paper investigations of artificial intelligence methods (hybrid deep learning networks,
GMDH and ARIMA) were carried out on the problem of forecasting Electricity Sales to Ultimate
Customers, Residential (USA) from 01-2002 till 01-2023.</p>
      <p>During the experiments the optimal structure and the optimal parameters of the hybrid
neo-fuzzy networks (number of inputs, number of linguistic values, training/test ratio) were
determined.</p>
      <p>After optimization of the hybrid neo-fuzzy networks and of the parameters of the GMDH
method, experiments on forecasting Electricity Sales to Ultimate Customers were performed at
different intervals: 1, 3, 5, 7 days (short-term forecast) and 10, 20 days (middle-term forecast).</p>
      <p>The accuracy of forecasting by Hybrid DL networks was compared with alternative methods –
GMDH and ARIMA.</p>
      <p>The analysis of the obtained results has shown that the GMDH method is the best at short-term
forecasting (1, 3 days), while hybrid deep learning neo-fuzzy networks are the best at middle-term
forecasting (7, 10, 20 days). ARIMA appeared to be the worst by accuracy as compared with the
intelligent methods: hybrid DL networks and GMDH.</p>
    </sec>
    <sec id="sec-8">
      <title>6. References</title>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Brockwell</surname>
            ,
            <given-names>Peter J</given-names>
          </string-name>
          . Introduction to time series and forecasting / Peter J. Brockwell and
          <string-name>
            <given-names>Richard A.</given-names>
            <surname>Davis</surname>
          </string-name>
          . - 2nd ed. Springer,
          <year>2002</year>
          . - 429 p.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Robert H.</given-names>
            <surname>Shumway</surname>
          </string-name>
          ,
          <string-name>
            <given-names>David S.</given-names>
            <surname>Stoffer</surname>
          </string-name>
          .
          <source>Time Series Analysis and Its Applications With R Examples. 4 th edition</source>
          . Springer,
          <year>2017</year>
          . - 562 p.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A. G.</given-names>
            <surname>Ivakhnenko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. A.</given-names>
            <surname>Ivakhnenko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Mueller</surname>
          </string-name>
          ,
          <article-title>Self-organization of the neural networks with active neurons</article-title>
          ,
          <source>Pattern Recognition and Image Analysis</source>
          ,
          <year>1994</year>
          ,
          <volume>4</volume>
          , №2, pp.
          <fpage>177</fpage>
          <lpage>188</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>A. G.</given-names>
            <surname>Ivakhnenko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Wuensch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. A.</given-names>
            <surname>Ivakhnenko</surname>
          </string-name>
          ,
          <article-title>Inductive sorting-out GMDH algorithms with polynomial complexity for active neurons of neural networks</article-title>
          ,
          <source>Neural Networks</source>
          ,
          <year>1999</year>
          , 2, pp.
          <fpage>1169</fpage>
          <lpage>1173</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>S. S.</given-names>
            <surname>Haykin</surname>
          </string-name>
          ,
          <source>Neural networks: a comprehensive foundation</source>
          , 2nd ed. Upper Saddle River, N.J.: Prentice Hall,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Osowski</surname>
            <given-names>S.</given-names>
          </string-name>
          <article-title>Neural networks for processing information</article-title>
          . - Publishing House of the University of Technology Warszawska, Warszawa, (in Polish) -
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Wang</surname>
            <given-names>F</given-names>
          </string-name>
          .
          <source>Neural Networks Genetic Algorithms and Fuzzy Logic for Forecasting // Proc. Intern. Conf. Advanced Trading Technologies</source>
          . - New York,
          <year>1992</year>
          . - pp.
          <fpage>504</fpage>
          <lpage>532</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Yamakawa</surname>
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Uchino</surname>
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Miki</surname>
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kusanagi H</surname>
          </string-name>
          .
          <article-title>A neo-fuzzy neuron and its applications to system identification and prediction of the system behavior //</article-title>
          <source>Proc. 2nd Intеrn. Conf. Fuzzy Logic and Neural Networks «LIZUKA-92». - Lizuka</source>
          ,
          <year>1992</year>
          . - pp.
          <fpage>477</fpage>
          <lpage>483</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>I.</given-names>
            <surname>Goodfellow</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Bengio</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Courville</surname>
          </string-name>
          , Deep Learning, MIT PRESS,
          <year>2016</year>
          . URL: http://www.deeplearningbook.org.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>Yuriy</given-names>
            <surname>Zaychenko</surname>
          </string-name>
          , Yevgeniy Bodyanskiy, Oleksii Tyshchenko, Olena Boiko,
          <string-name>
            <given-names>Galib</given-names>
            <surname>Hamidov</surname>
          </string-name>
          .
          <article-title>Hybrid GMDH-neuro-fuzzy system and its training scheme</article-title>
          .
          <source>Int. Journal Information theories and Applications</source>
          ,
          <year>2018</year>
          . vol.
          <volume>24</volume>
          , Number 2. - pp.
          <fpage>156</fpage>
          <lpage>172</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>Evgeniy</given-names>
            <surname>Bodyanskiy</surname>
          </string-name>
          , Yuriy Zaychenko, Olena Boiko, Galib Hamidov,
          <string-name>
            <given-names>Anna</given-names>
            <surname>Zelikman</surname>
          </string-name>
          .
          <article-title>Structure Optimization and Investigations of Hybrid GMDH-Neo-fuzzy Neural Networks in Forecasting Problems</article-title>
          .
          <source>System Analysis &amp; Intelligent Computing. Ed. Michael Zgurovsky</source>
          ,
          <string-name>
            <given-names>Natalia</given-names>
            <surname>Pankratova</surname>
          </string-name>
          .
          <source>Book Studies in Computational Intelligence, SCI</source>
          , vol.
          <volume>1022</volume>
          . Springer,
          <year>2022</year>
          . - pp.
          <fpage>209</fpage>
          <lpage>228</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>