<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>CEUR Workshop
Proceedings (CEUR-WS.org</journal-title>
      </journal-title-group>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.26483/ijarcs.v9i1.5279</article-id>
      <title-group>
        <article-title>Improving the Prediction Quality for a Multi-Section Transport Conveyor Model Based on a Neural Network</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Oleh Pihnastyi</string-name>
          <email>pihnastyi@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Olha Ivanovska</string-name>
          <email>o.ivanovska@khai.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>National Aerospace University "Kharkiv Aviation Institute"</institution>
          ,
          <addr-line>17 Chkalova, Kharkiv, 61070</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>National Technical University " Kharkiv Polytechnic Institute"</institution>
          ,
          <addr-line>2 Kyrpychova, Kharkiv, 61022</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2021</year>
      </pub-date>
      <volume>2608</volume>
      <fpage>01</fpage>
      <lpage>03</lpage>
      <abstract>
        <p>The multi-section transport conveyor model based on the neural network for predicting the output flow parameters is considered. The expediency of using sequential and batch modes of training of a neural network in a model of a multi-section transport conveyor has been investigated. The quality сriterion of predicting the output flow parameters of the transport system is written. Comparative analysis of sequential and batch modes of neural network training is carried out. The convergence of the neural network training process for different sizes of the training batch is studied. The effect of the batch size on the convergence rate of the neural network learning process is estimated. The results of predicting the output flow parameters of a multi-section transport system for models based on a neural network that was learned using training batches of different sizes are presented. A nonlinear relationship between the batch size and the convergence rate of the neural network learning process is demonstrated. The recommendations are given on the choice of learning modes for a neural network in the model of a multi-section transport conveyor. The choice of the initialization value of the node participating in the formation of the bias value is investigated. The qualitative regularities characterizing the influence of the choice of the node initialization value on the forecasting accuracy of the output flow parameters of the transport system are studied. Multi-section conveyor, distributed transport system, conveyor belt, belt speed control,</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Multi-Section Transport
accumulation bunker, bias, sequential mode, batch mode</p>
    </sec>
    <sec id="sec-2">
      <title>1. Introduction</title>
      <p>To design control systems for the speed of the belt [1, 2] and the flow of material entering the
conveyor input from the accumulating bunker [3, 4], models of the transport conveyor are used, the
foundation of which is the aggregated equation of state of the transport system flow parameters [5], the
equations of system dynamics [6], finite element method [7]. The use of the finite element method
makes it possible to take into account the uneven distribution of material along the transportation route
[8, 9] when designing a control system for the flow parameters of a conveyor system. Taking into
account the uneven distribution of material allows you to reduce the cost of transporting material [10]
due to the optimal control of the speed of the belt or the flow of material entering the conveyor input
from the accumulating bunker. When designing control systems for flow parameters of a multi-section
transport system [11, 12], the use of a model based on the finite element method becomes difficult due
to the high cost of computing resources. In this case, along with the use of the analytical PiKh-model
of the transport conveyor [13], it is advisable to use linear regression equations [14, 15, 16] and models
based on the neural network [17, 18, 19] to predict the state of the flow parameters of the transport
system. The relevance of using models based on a neural network increases with an increase in the
number of sections of the transport system. For a modern transport conveyor [14, 20, 21], containing
several dozen separate sections, the use of models based on a neural network is of particular importance.</p>
      <p>2022 Copyright for this paper by its authors.
Indeed, the construction of an optimal control algorithm for the flow parameters of the transport
conveyor based on the analytical PiKh-model of the transport conveyor [13] already for several sections
leads to cumbersome analytical calculations with a fairly simple control quality criterion [22, 23]. With
an increase in the number of sections, the complexity of the algorithm increases nonlinearly. This is
due to the fact that the transport conveyor is a complex dynamic distributed system, the output flow
parameters of the sections of which contain a time-variable transport delay. The use of neural networks
for modeling the state of the output flow parameters of the transport conveyor allows obtaining fairly
simple approximate models, which makes it possible to design control systems for the flow parameters
of the transport conveyor with satisfactory accuracy. In this regard, the issue of analyzing the factors
affecting the accuracy of models of transport conveyors based on a neural network becomes relevant.</p>
    </sec>
    <sec id="sec-3">
      <title>2. Problem statement</title>
      <p>When constructing a model of a multi-section conveyor based on a neural network (multilayer
perceptron), it is required to select the neural network architecture, activation function, neural network
training method, neural network training rate, initial initialization method, weight coefficients. This list
can be significantly expanded and detailed.</p>
      <p>A sufficient number of papers are had in which a model of a transport system based on neural
networks of different architectures is considered. The architecture of the neural network 13-5-1 (13
nodes in the input layer, 4 nodes in the hidden layer, and one node in the output layer) is used in a model
for diagnosing the level of wear of a conveyor belt [17]. To develop a control system for the speed of a
transport conveyor belt, a model based on a neural network with architecture 3-4-3 [24] and a model
based on a neural network of three levels [25] are considered. Despite the fact that the choice of the
neural network architecture has a significant impact on the learning rate and the accuracy of the
predicted model of the explained parameters [26, 27, 28] when constructing the transport conveyor
model, this issue is not given due attention. The choice of the neural network architecture, like the other
parameters of the model listed above, is quite often made by the developer heuristically. In this regard,
it becomes necessary to analyze the accuracy of the model depending on the value of the selected factor.</p>
      <p>This study analyzes the impact of
a) selection of the bias-node value in the hidden layer of the neural network;
b) selection of the training mode (sequential or batch);
on the learning rate of the neural network and the accuracy of predicting the flow parameters of the
transport conveyor model based on the neural network.</p>
      <p>The choice was made on the specified meta-parameters of the model for the reason that, in
comparison with the issue of choosing the architecture of a neural network, the issues of choosing these
parameters when constructing a model of a transport system can have a significant impact on the
accuracy of forecasting and are less studied.</p>
    </sec>
    <sec id="sec-4">
      <title>3. Main material 3.1.</title>
    </sec>
    <sec id="sec-5">
      <title>General description of the transport system model</title>
      <p>Let us analyze how the choice of the bias-node value in the hidden layer of the neural network and
the choice of the training mode affect the accuracy of predicting the flow parameters of a
conveyortype transport system of a multi-section transport conveyor, the structural diagram of which is shown
in Figure 1 [29, 30]. The structural diagram of the conveyor proposed for analysis and the corresponding
data set were used in a number of works for a qualitative analysis of the parameters of the transport
conveyor. A detailed analysis of the dataset that will be used to train the neural network in this study is
presented in [30]. Each individual m –th section m  1..M  in the transport conveyor model has
parameters: the section transport route length  m , the material flow at the section input m ( ) , the
conveyor belt speed gm ( ) и the output material flow from the section1( , m ) . The parameters of
the model are dimensionless, which makes it possible to use the theory of the similarity of production
systems in the analysis of the transport conveyor. The technique for constructing a dataset is presented
in the paper [30]. The data set corresponds to the case of a transport system functioning with
dynamically changing flows of incoming material (Figure 2) entering the transport system, and
dynamically regulable belt speed for each separate section of the conveyor (Figure 3).The flow of
material from the 6-th bunker is divided into sections 7 and 8 in the ratio 7( ) / 8( )  2 / 3 . The
length
 m
of
the
individual
sections
is
different,
represented
by
proportions
1 : 2...8  1: 0,5 : 0,7 : 0,8 : 1,5 : 1: 1,5 : 0,6.
conveyor</p>
      <p>To analyze the influence of the choice of the bias-node value in the hidden layer of the neural
network and the choice of the training mode on the accuracy of predicting the flow parameters of the
conveyor-type transport system, a neural network with the architecture 12-15-2 will be used, the input
and output nodes of which are numbered as follows (Figure 5):</p>
      <p>x3m2   m ( )  x3m1  gm ( )  x3m   m  y1 17( ,7 )  y2 18( ,8) </p>
      <p>A similar approach to the numbering of input parameters and output parameters of the transport
conveyor was proposed in paper [31]. The number of nodes in the hidden layer is set in accordance
with the recommendations given in [26, 27, 32, 33]. When forming a neural network, a logistic</p>
      <p>The values of the output material flow 1( , m ) from the training dataset are characterized by the
distribution function shown in Figure 4 and are determined through the values of the input material flow
 m ( ) and the belt speed gm ( ) of a separate section:
1m( ,1)  1  H 1  G( ) m (   )</p>
      <p>m gm ( )  H(1  Gm ( )) m (1  Gm ( )) gm ( ) ,
gm (   )
m</p>
      <p>
        0
Equation (
        <xref ref-type="bibr" rid="ref1">1</xref>
        ) calculates the output flow taking into account the initial distribution of material along the
transport route. The initial distribution of the material  m ( m ) does not depend on the parameters
 m ( ) , gm ( ) , therefore the initial distribution of the material introduces an error in forecasting [30].
In this regard, the data set for training, the characteristic of the parameters of which is shown in Figure
4 was obtained under the condition Gm ( )   m . Solving the equations
allows you to calculate the amount of transport delay  m ( ) or a separate section and the conveyor
as a whole.
and linear activation function
f ( x) 
      </p>
      <p>
        a
1  exp( bx)
 a  4  b  1 
(
        <xref ref-type="bibr" rid="ref1">1</xref>
        )
(
        <xref ref-type="bibr" rid="ref2">2</xref>
        )
(
        <xref ref-type="bibr" rid="ref3">3</xref>
        )
(
        <xref ref-type="bibr" rid="ref4">4</xref>
        )
was used. The parameters of the activation function are determined as a result of subsequent tuning,
which ensures the maximum quality of forecasting the streaming parameters of the transport system.
      </p>
      <p>The sequential neural network training method is to sequentially provide examples of epochs from
the training dataset, updating the weights after each training example. For each learning epoch,
examples from a set can come in a given order or in random order. The learning error for the n- th
example (n=1..N) is determined by the expression</p>
      <p>The training data set contains Ne  104 examples of the sequential state of the transport system.
The training of the neural network has been carried out for 10 4 э epochs. The criterion for the quality of
training was the value mean squared error (MSE)</p>
      <p>MSE 
1
N
e n1</p>
      <p>Ne
  y1n  y1t n 2   y2n  y2 n 2 
3.2.</p>
    </sec>
    <sec id="sec-6">
      <title>Sequential and batch modes of neural network training</title>
      <p>En 
1 VL
2
vL 1
eLvLn </p>
      <p>2
eLvLn  dvLn  yLvLn 
where dv n is the value of the output neuron in the training set; yLvLn is the generated value by the</p>
      <p>L
output neuron; VL is number of nodes in the output layer. Let's assume that the neural network contains
L 1 layers, each of which consists of Vl nodes, l  0..L . The input layer corresponds to the value
l  0 , the output layer corresponds to the value l  L . The value of the vl  th node vl  0..Vl  in the
l  th layer for the n  th training example is determined by the expression
 Vl1

ylvln  fl 
 vl1 0


wlvlvl1 y(l1)vl1n    lvln 
</p>
      <p>
        Vl1
vl10
 wlvlvl1 y(l1)vl1n 
(
        <xref ref-type="bibr" rid="ref5">5</xref>
        )
(
        <xref ref-type="bibr" rid="ref6">6</xref>
        )
(
        <xref ref-type="bibr" rid="ref7">7</xref>
        )
(
        <xref ref-type="bibr" rid="ref8">8</xref>
        )
(
        <xref ref-type="bibr" rid="ref9">9</xref>
        )
defined as
To change the weights, let's use the gradient descent method [34]
yl0n  ylb  const l  0..L1
wlvlvl1   En 
      </p>
      <p>wlvlvl1
which allows us to determine a direction vector in the weight space wlvlvl1
, that reduces the error value
En . The parameter  is the learning rate of the error back propagation algorithm. Let's use the
previously performed transformations in accordance with the notation Figure 6:</p>
      <p>
         Vl1  (
        <xref ref-type="bibr" rid="ref12">12</xref>
        )
wylvlvlvlnl1  fl vl10wwlvllvvllv1ly1(l1)vl1n   fllvllvnln  wlvlvlvlnl1  fllvllvnln  y(l1)vl1n 
(
        <xref ref-type="bibr" rid="ref10">10</xref>
        )
(
        <xref ref-type="bibr" rid="ref11">11</xref>
        )
(
        <xref ref-type="bibr" rid="ref13">13</xref>
        )
(
        <xref ref-type="bibr" rid="ref14">14</xref>
        )
(
        <xref ref-type="bibr" rid="ref15">15</xref>
        )
      </p>
      <p>V eLvLn  eLvLn
En   1 L eLvLn2  eLvLn yLvln
yLvln yLvln 2 vL1
dvLn  yLvLn  eLvLn </p>
      <p>yLvln
wlvlvl1
yl20n  0 l2 1..L1l2  l </p>
      <p>Vl2 yl2vl2n yl21vl21n </p>
      <p>VL1 yl2vl2n yl21vl21n  
wyll2vvlvl2ln1  vL10 yl21vl21n wlvlvl1 vl21yl21vl21n wlvlvl1
el21vl21n  vVl2l21el2vl2n yly2l21vlv2ln21n  vVl2l21el2vl2n fl2l2vl2l2vnl2nyl2 l21vlv2ln21n  
fl2 l2vl2nwl21vl2vl21n  el2vl2n fl2l2vl2l2vnl2nwl21vl2vl21n </p>
      <p>l2vl2n vl2 1</p>
      <p>
        Vl2 Vl2 (
        <xref ref-type="bibr" rid="ref16">16</xref>
        )
 el2vl2n
      </p>
      <p>vl2 1
and obtain an expression for determining the gradient of the function Enwlvlvl1  expressed through
the weight coefficients wlvlvl1</p>
      <p>wlEvlnvl1  vVLL1yELvnLn wylLvvlvLln1  vVLL1eLvLn wylLvvlvLln1  vVLL1eLvLnvVLL111yLyL1vlvnL1n ywLlv1lvvlL11n  
 vVLL111ywLlv1lvvlL11n vLL1eLvLn yLyL1vvlnL1n  vL11eL1vL1n wlvlvl1 vL21</p>
      <p>V VL1 yL1vL1n    eL2vL2n
VL2
yL2vL2n  
wlvlvl1
Thus, the gradient of the function En wlvlvl1</p>
      <p> has the form
En
and can be determined, if the activation function fl   for l  th layer, the values of the nodes of the
previous layer y(l 1)vl1n , the values of the weight coefficients wlvlvl1
and the errors elvln for the nodes
l  th layer are known
1 N VL
N
n1 vL 1
yLvln
yL1vL1n
En
wlvlvl1
</p>
      <p>En
yLvLn wlvlvl1</p>
      <p>N VL
yLvLn   N1 n1 vL1 eLvLn wylLvvlvLln1  
 </p>
      <p>eLvLn yL1vL1n
n1 vL1 1 wlvlvl1 vL 1</p>
      <p>yLvln
VL
 
 
1</p>
      <p>
N</p>
      <p>N VL1
n1 vL11
 eL1vL1n</p>
      <p>wlvlvl1
yL1vL1n   1</p>
      <p>N
vL2 1
VL2
 eL2vL2n
En wlvlvl1</p>
      <p> or the batch training mode:
By analogy with the sequential training mode, let's define the gradient of the function
(17)
(18)
fl  lvln 
 
1
N</p>
      <p>N
elvln
n1
fl  lvln 
 lvl n
y(l 1)vl1n  elvl n </p>
      <p>
        Vl1
 el 1vl1n
is similar to the expression for the sequential training mode (
        <xref ref-type="bibr" rid="ref11">11</xref>
        ).
      </p>
      <p>Averaging over the number of training examples, on the one hand, makes it possible to give a more
accurate estimate of the gradient vector, and, accordingly, simplifies the process of obtaining
convergence conditions for the neural network training algorithm, on the other hand, it requires
additional computational memory to store the error calculation results elvln and the node value
y(l 1)vl1n or each test case as part of a separate package. Another disadvantage associated with the
considered batch learning mode algorithm is the higher probability of stopping the learning algorithm
at a local minimum point. When using batch training mode, preprocessing of the training set is required
in order to eliminate duplicate training examples. The advantage of the batch mode algorithm is the
ability to parallelize calculations since, for each training example, the values of the weight coefficients
remain constant.</p>
    </sec>
    <sec id="sec-7">
      <title>4. Analysis of results 4.1. Comparative analysis of the results of predicting the output flow parameters of the transport conveyor using sequential mode and batch mode of neural network training</title>
      <p>The results of training a neural network for a batch training mode with one, two, four, and eight
examples in a batch are shown in Figure 7. The calculation of the gradient was carried out in accordance
with expression (19). The learning rate for each mode is selected from the condition
  0,01Nbatch ,
(20)
where Nbatch is the number of examples in one batch. Thus, for a numerical experiment with a batch
consisting of eight elements, the learning rate was set eight times higher   0,08 , than for a
numerical experiment with a package consisting of one element   0,01 .</p>
      <p>The analysis of the results of the numerical experiment presented in Figure 7 shows that despite the
fact that with an increase in the batch size Nbatch , the learning rate increases proportionally  (20), the
number of epochs to achieve the same value MSE increases nonlinearly. This is clearly shown in Figure</p>
      <p>MSE  f Nbatch, lg10(epoch). Analysis of the results of
8 in the form of equipotential lines
computational experiments shows the inefficiency of using the batch mode of training a neural network
in a distributed model of the transport conveyor, confirming the conclusions made in the works about
the general inefficiency of batch training for gradient descent learning [35, 36].</p>
      <p>So, for example, the value MSE  0,2 for batch size Nbatch  1   0,01Nbatch  0,01 is reached
at 12th learning epoch, for Nbatch  2   0,02 is reached at 19th learning epoch, for Nbatch  4
  0,04 is reached at 113th learning epoch, and finally, for Nbatch  8   0,08 are required 104
learning epoch.
one, two, four, and eight examples with a learning rate  0,01Nbatch .</p>
      <p>When parallelizing the training process for the batch mode with the number of elements in the batch
Nbatch  8 , the number of training epochs is required Nb3atch ~ 103 times more than in the sequential
training mode. In both cases, the same MSE  0,2 is achieved.</p>
      <p>An increase in the convergence of the learning process due to a proportional increase in the learning
rate   0,01Nb4atch  leads to the development of instability in the learning process and does not give
the desired results.</p>
      <p>Thus, a numerical experiment shows that when training a transport system model based on a neural
network, the batch training mode has slow convergence at a given training rate proportional to the
number of elements Nbatch in the batch.</p>
      <p>
        Figure 9 and Figure 10 show the results of predicting the output flow17( ,7 ) and 18( ,8 )
depending on the number of training epochs. The prediction quality results are determined by the MSE
(
        <xref ref-type="bibr" rid="ref6">6</xref>
        ), which corresponds to 0,617; 0,197 and 0,045 after one, twelve, and one hundred and eighty epochs
of learning. The sequential neural network training mode demonstrates satisfactory convergence of the
neural network training process and high training accuracy.
4.2. Analysis of the choice of the initial value of the node participating in the
formation of the bias
      </p>
      <p>
        The value of the hidden node is determined by expressions (
        <xref ref-type="bibr" rid="ref9">9</xref>
        ) and (
        <xref ref-type="bibr" rid="ref10">10</xref>
        ). The value of the node
participating in the formation of the bias value, as a rule, is taken equal to one: yl0n  1 (
        <xref ref-type="bibr" rid="ref10">10</xref>
        ). This
approach is common, despite the fact that the choice of value yl0n can have a significant impact on the
convergence of the neural network learning process.
      </p>
      <p>
        The values of the nodes of the neural network are calculated during the forward pass based on the
known values of the weight coefficients (
        <xref ref-type="bibr" rid="ref9">9</xref>
        ).
      </p>
      <p>
        In the reverse pass, the values of the weight coefficients are calculated taking into account the values
of the nodes of the neural network (
        <xref ref-type="bibr" rid="ref11">11</xref>
        )-(19). A self-consistent change in the weight coefficients and
values of the neural network nodes is observed. If the learning process is convergent, then the values of
the neural network nodes and the values of the weight coefficients converge to their steady-state values.
For hidden layers, the node value yl0n is constant, not formed as a result of a direct pass of the neural
network.
trainable set; 2: after one learning epoch; 3: after twelve epochs of learning; 4: after one hundred
and eighty eras of learning
trainable set; 2: after one learning epoch; 3: after twelve epochs of learning; 4: after one hundred and
eighty eras of learning
      </p>
      <p>However, the node value yl0n participates in the process of forming the value of other nodes during
the forward pass and in the process of forming the values of the weight coefficients during the reverse
pass. If, as a result of training, the values of the hidden nodes y(l1)vl1n are small</p>
      <p>yl0n  1 y(l1)vl1n  1 vl 1  0 
then with the same order of values of the weight coefficients wlvl 0  wlvlvl1 the calculated value of the
node is determined by the bias value</p>
      <p>Vl1</p>
      <p> wlvlvl1 y(l 1)vl1n  wlvl 0 y(l 1)0n  wlvl 0 </p>
      <p>
        The experiments carried out for the analyzed neural network with the architecture 12-15-2 (
        <xref ref-type="bibr" rid="ref3">3</xref>
        )
confirm the influence of the value of the node participating in the formation of the bias value on the
learning process of the neural network in the transport conveyor model. The analysis was carried out
for a neural network with logistic activation function and linear activation function. The analysis results
are presented in Table 1. Figures 11 and 12 demonstrate the convergence of the learning process for a
neural network with logistic activation function and linear activation function with the same architecture
of the 12-15-2 neural network (
        <xref ref-type="bibr" rid="ref3">3</xref>
        ). In these figures, the solid line represents the learning process for
positive values y(l 1)0n and the dashed line for negative values y(l 1)0n . It should be noted that the
dashed line MSE y(l 1)0n , y(l 1)0n  0 with the logistic activation function passes in the vicinity of
the solid line MSE y(l 1)0n , y(l 1)0n  0 . The dependence MSE y(l 1)0n  for the logistic activation
function is shown in Figure 13.
(21)
0,398227
0,515637
0,494928
0,490902
0,498925
0,490904
0,494926
0,515766
0,397910
      </p>
      <p>4
771
567
729
519
731
592
757
4
0,516921
0,516921
0,496836
0,492584
0,500923
0,492578
0,496836
0,511629
0,511629</p>
      <p>For the linear activation function, the solid and dashed lines coincide and are visually
indistinguishable in Figure 12. The dependence MSE y(l 1)0n  for the logistic activation function is
shown in Figure 14. For a nonlinear activation function, with an increase in the number of learning
epochs, the sensitivity of the quality of the learning process to the bias-node value decreases (Figure
13). For a linear function, the decrease in sensitivity is much slower (Figure 14). For two cases, the
symmetry of the function MSE y(l 1)0n  with respect to the zero value is observed. The minimum
value of the function MSE y(l 1)0n  is reached at a point different from the value y(l 1)0n  0 . This
indicates that the presence of a nonzero bias improves the accuracy of predicting the output material
flow.
y(l 1)0n  0;1;2;5;10 (logistic activation function and 12-15-2 neural network architecture)
y(l 1)0n  0;1;2;5;10. (linear activation function and 12-15-2 neural network architecture)
12-15-2 neural network architecture)
4.3.</p>
    </sec>
    <sec id="sec-8">
      <title>Conclusion</title>
      <p>In this work, we analyzed the choice of the neural network training mode, which is used in the
transport conveyor model. For comparative analysis, a neural network with architecture 12-15-2 was
used. The experiment was carried out for a neural network with a logistic activation function and a
linear activation function of its nodes.
12-15-2 neural network architecture)</p>
      <p>The learning rate for the analyzed batches is chosen proportional to the size of the training batch.
For example, for a batch learning mode of size eight, the learning rate is chosen to be eight times faster
than the sequential learning mode. With the same number of epochs, a linear increase in the MSE
value is observed depending on the size of the training batch. Thus, at the same learning rate, there is
a quadratic dependence of the MSE value on the size of the training batch for the selected neural
network architecture. Switching to batch training mode significantly reduces the efficiency of neural
network training. It should be noted that there is a quasi-linear relationship between the size of the
training batch and the lg10(epoch), value, which characterizes the number of training epochs. In
addition, attention is paid to the choice of the value of the node of the hidden layer participating in the
formation of the bias. Numerical experiments demonstrate the need to use bias to improve the accuracy
of predicting the output flow parameters of the transport system.</p>
      <p>Ensuring the convergence of the learning process is one of the important problems in the design of
transport conveyor models based on a neural network. The use of the batch mode guarantees the
convergence of the algorithm to a local minimum under fairly simple conditions, which explains its
use in proving the convergence of the back propagation algorithm. Convergence is increased by
parallelizing the learning process. However, as shown in this work, the use of the batch learning mode
is ineffective when constructing models for predicting the output flow of material in the multi-section
transport conveyor. This is explained by the fact that the multi-section transport conveyor is a complex
dynamic distributed system, at the input of which material flow arrives, the value of which is of a
stochastic nature. The superposition of flows of individual sections with a transport delay leads to the
presence of characteristic peaks in the value of the output flow. Under such conditions, the use of the
batch mode of training the neural network leads to a strong smoothing of the peaks, decreasing the
value in proportion to the number of examples in the batch, and as a consequence, significantly
increases the MSE value, reducing the quality of the prediction of the values of the output flow
parameters of the transport system. The peculiarity of the multi-section transport system associated
with the presence of peak values as a result of the superposition of material flows of individual sections
limits the possibility of using the batch mode of training the neural network when constructing models
of the transport conveyor. The same reason can be accepted as an explanation for the low efficiency
of the use of a linear activation function in the formation of a neural network. The scientific novelty
of the results was obtained in the fact that for the first time the problem of increasing the efficiency of
the learning process of a neural network used to build a model of a multi-section transport conveyor
was posed. An estimate is given for the decrease in the convergence rate of the neural network learning
process depending on the increase in the number of examples in the training batch. Recommendations
on the choice of the activation function and bias value for the neural network nodes, as well as the
neural network training mode, are presented. These recommendations are valid for multi-section
transport systems with ten or more sections. With a small number of sections, the peaks in the output
stream are not so pronounced, which allows you to select both sequential learning mode and batch
learning mode.</p>
      <p>The prospect of further research is the analysis of the choice of the training mode for neural network
used in transport conveyor models, depending on its architecture. Of particular interest is the case of
using multilayer neural network in models of transport systems. Also, an interesting direction for
future research is the influence of the type of activation function used in the neural network on the
quality of predicting the flow parameters of the transport system.</p>
    </sec>
    <sec id="sec-9">
      <title>5. References</title>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M.</given-names>
            <surname>Bebic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Ristic</surname>
          </string-name>
          ,
          <article-title>Speed controlled belt conveyors: drives and mechanical considerations</article-title>
          ,
          <source>Advances in Electrical and Computer Engineering</source>
          ,
          <volume>18</volume>
          (
          <issue>1</issue>
          ) (
          <year>2018</year>
          )
          <fpage>51</fpage>
          -
          <lpage>60</lpage>
          . http//dx.doi.org/10.4316/AECE.
          <year>2018</year>
          .01007
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>I.</given-names>
            <surname>Halepoto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Uqaili</surname>
          </string-name>
          ,
          <article-title>Design and implementation of intelligent energy efficient conveyor system model based on variable speed drive control and physical modeling</article-title>
          ,
          <source>International Journal of Control and Automation</source>
          ,
          <volume>9</volume>
          (
          <issue>6</issue>
          ) (
          <year>2016</year>
          )
          <fpage>379</fpage>
          -
          <lpage>388</lpage>
          . http//dx.doi.org/10.14257/ijca.
          <year>2016</year>
          .
          <volume>9</volume>
          .6.
          <fpage>36</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>O.</given-names>
            <surname>Pihnastyi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Kozhevnikov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Khodusov</surname>
          </string-name>
          ,
          <article-title>Conveyor model with input and output accumulating bunker</article-title>
          ,
          <source>in: Proceedings of 11th International Conference on Dependable Systems, Services and Technologies</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>253</fpage>
          -
          <lpage>258</lpage>
          . http://dx.doi.org/10.1109/DESSERT50317.
          <year>2020</year>
          .9124996
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>P.</given-names>
            <surname>Bardzinski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Walker</surname>
          </string-name>
          , W. Kawalec,
          <article-title>Simulation of random tagged ore flow through the bunker in a belt conveying system</article-title>
          ,
          <source>International Journal of Simulation Modelling</source>
          <volume>4</volume>
          (
          <year>2018</year>
          )
          <fpage>597</fpage>
          -
          <lpage>608</lpage>
          . https://doi.org/10.2507/
          <issue>IJSIMM17</issue>
          (4)
          <fpage>445</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>A.</given-names>
            <surname>Reutov</surname>
          </string-name>
          ,
          <article-title>Simulation of load traffic and steeped speed control of conveyor</article-title>
          ,
          <source>in: Proceedings of IOP Conf. Series: Earth and Environmental</source>
          ,
          <year>2017</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>4</lpage>
          . https://doi.org/10.1088/
          <fpage>1755</fpage>
          - 1315/87/8/082041
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>E.</given-names>
            <surname>Wolstenholm</surname>
          </string-name>
          ,
          <article-title>Designing and assessing the benefits of control policies for conveyor belt systems in underground mines</article-title>
          ,
          <source>Dynamica</source>
          <volume>6</volume>
          (
          <issue>2</issue>
          ) (
          <year>1980</year>
          )
          <fpage>25</fpage>
          -
          <lpage>35</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>B.</given-names>
            <surname>Karolewski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Ligocki</surname>
          </string-name>
          ,
          <article-title>Modelling of long belt conveyors</article-title>
          ,
          <source>Maintenance and reliability 16</source>
          (
          <issue>2</issue>
          ), (
          <year>2014</year>
          )
          <fpage>179</fpage>
          -
          <lpage>187</lpage>
          . http://yadda.icm.edu.pl/yadda/element/bwmeta1.element.
          <source>baztech-ce355084-3e77-4e6b-b4b5-ff6131e77b30</source>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>O.</given-names>
            <surname>Pihnastyi</surname>
          </string-name>
          ,
          <article-title>Control of the belt speed at unbalanced loading of the conveyor</article-title>
          ,
          <source>Scientific bulletin of National Mining University</source>
          <volume>6</volume>
          (
          <year>2019</year>
          )
          <fpage>122</fpage>
          -
          <lpage>129</lpage>
          . https://doi.org/10.29202/nvngu/2019-6/
          <fpage>18</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>A.</given-names>
            <surname>Semenchenko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Stadnik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Belitsky</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Semenchenko</surname>
          </string-name>
          ,
          <string-name>
            <surname>O. Stepanenko,</surname>
          </string-name>
          <article-title>The impact of an uneven loading of a belt conveyor on the loading of drive motors and energy consumption in transportation</article-title>
          ,
          <source>Eastern-European Journal of Enterprise Technologies (82)</source>
          (
          <year>2016</year>
          )
          <fpage>42</fpage>
          -
          <lpage>51</lpage>
          . https://doi.org/10.15587/
          <fpage>1729</fpage>
          ­
          <lpage>4061</lpage>
          .
          <year>2016</year>
          .75936
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>R</given-names>
            <surname>Kiriia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Shyrin</surname>
          </string-name>
          ,
          <article-title>Reducing the energy consumption of the conveyor transport system of mining enterprises</article-title>
          .
          <source>in: Proceedings of international conference essays of mining science and practice</source>
          ,
          <volume>109</volume>
          (
          <year>2019</year>
          ). https://doi.org/10.1051/e3sconf/201910900036
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Siemens</surname>
          </string-name>
          .
          <article-title>Innovative solutions for the mining industry</article-title>
          ,
          <year>2021</year>
          . URL: www.siemens.com/mining
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>M.</given-names>
            <surname>Koman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Laska</surname>
          </string-name>
          ,
          <article-title>The constructional solution of conveyor system for reverse and bifurcation of the ore flow, Rudna mine</article-title>
          KGHM Polska
          <string-name>
            <surname>Miedź</surname>
            <given-names>SA</given-names>
          </string-name>
          , CUPRUM
          <volume>3</volume>
          (
          <issue>72</issue>
          ) (
          <year>2014</year>
          )
          <fpage>69</fpage>
          -
          <lpage>82</lpage>
          . http://www.czasopismo.cuprum.wroc.pl/journal-articles/download/113
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>O.</given-names>
            <surname>Pihnastyi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Khodusov</surname>
          </string-name>
          ,
          <article-title>Calculation of the parameters of the composite conveyor line with a constant speed of movement of subjects of labour</article-title>
          ,
          <source>Scientific bulletin of National Mining University</source>
          <volume>4</volume>
          (
          <issue>166</issue>
          ) (
          <year>2018</year>
          )
          <fpage>138</fpage>
          -
          <lpage>146</lpage>
          . https://doi.org/10.29202/nvngu/2018-4/
          <fpage>18</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>M.</given-names>
            <surname>Andrejiova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Marasova</surname>
          </string-name>
          ,
          <article-title>Using the classical linear regression model in analysis of the dependences of conveyor belt life</article-title>
          ,
          <source>Acta Montanistica Slovaca</source>
          <volume>18</volume>
          (
          <issue>2</issue>
          ) (
          <year>2013</year>
          )
          <fpage>77</fpage>
          -
          <lpage>84</lpage>
          . https://actamont.tuke.sk/pdf/2013/n2/2andrejiova.pdf
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Lu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <article-title>A regression model for prediction of idler rotational resistance on belt conveyor</article-title>
          ,
          <source>Measurement and Control</source>
          <volume>52</volume>
          (
          <issue>5</issue>
          ) (
          <year>2019</year>
          )
          <fpage>441</fpage>
          -
          <lpage>448</lpage>
          . https://doi.org/10.1177/0020294019840723
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>B.</given-names>
            <surname>Karolewski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Marasova</surname>
          </string-name>
          ,
          <article-title>Experimental research and mathematical modelling as an effective tool of assessing failure of conveyor belts</article-title>
          ,
          <source>Maintenance and reliability 16(2)</source>
          (
          <year>2014</year>
          )
          <fpage>229</fpage>
          -
          <lpage>235</lpage>
          . http://www.ein.org.pl/sites/default/files/2014-02-09.pdf
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>