<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Application of Artificial Neural Networks to Forecast Technological Process Parameters in Aluminum Production*</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Anton Mikh</string-name>
          <email>asmikhalev@yandex.ru</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lugov</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Institute of computational modelling of the Siberian Branch of the Russian Academy of Sciences</institution>
          ,
          <addr-line>50/44 Akademgorodok, Krasnoyarsk, 660036</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Siberian Federal University</institution>
          ,
          <addr-line>26, Kirenskogo str., Krasnoyarsk, 660074</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
      </contrib-group>
      <fpage>99</fpage>
      <lpage>107</lpage>
      <abstract>
        <p>The study is aimed at methods of machine learning as it relates to forecasting technological process parameters. The forecasting tools are developed in two main stages: analysis and preprocessing of input data, elaboration of a math model and validation of the solution. Forecasting relies on recurrent neural networks. The method of maximum accuracy was used to elicit the neural network architecture, and calculate the metrics of MSE, MAPE, the coefficient of determination and Theil coefficient. The results obtained in the tests run on the suggested model of forecasting the cell voltage are deemed acceptable in terms of predicting the technological process indicators. The identified errors will ensure that preventive measures are taken in a timely manner to avoid process disruptions and increase overall efficiency of aluminum production.</p>
      </abstract>
      <kwd-group>
        <kwd>Neural Network</kwd>
        <kwd>Forecasting</kwd>
        <kwd>Process Disruptions</kwd>
        <kwd>Technological Process Parameters</kwd>
        <kwd>Voltage</kwd>
        <kwd>Aluminum Production</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Of all non-ferrous metal industries, aluminum production has the world’s biggest
share in manufacturing and consumption [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The industry develops within the lines
of enhancing productivity of the main unit, electrolysis cell, therefore one of the key
tasks is to control low-duty cells. Some of such cells are easily identifiable (shutdown
cells, those under localized repairs), so they are controlled based on the current
technical condition. Other are harder to identify, as deterioration in technology does
not manifest itself directly and can only be determined through indirect parameters.
Their number varies depending on supplied raw materials, occurring troubles,
operational activities, etc., which may cumulatively lead to a greater number of cells
* Copyright c 2020 for this paper by its authors. Use permitted under Creative Commons
License Attribution 4.0 International (CC BY 4.0).
operating at lower capacities and consequently to a considerable decrease in technical
and economic indexes [
        <xref ref-type="bibr" rid="ref2 ref3">2, 3</xref>
        ]. Timely detection of errors in the technological process
can be ensured in case the performance parameters of the complex of aluminum
production are analyzed using modern intelligent technologies.
      </p>
      <p>The technical condition of cells is controlled across a number of parameters that
are continuously measured and stored in the data base of the computer-aided process
control system: cell voltage, anode current, modes of automatic alumina consumption,
adjustable anode block position. Making sure that these parameters are properly
controlled and identified is critical in timely detection of process disruptions in the
course of cell operation.</p>
      <p>
        Values of parameters that need to be predicted are predominantly described as time
series, that is, in sequences of values taken at certain instants of time. Forecasting
time series normally entails using regression and autoregression methods, exponential
smoothing, neural networks, etc. [
        <xref ref-type="bibr" rid="ref4 ref5">4-5</xref>
        ]. The forecasting model in this study is
represented by artificial neural networks [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. This technology has the following key
strengths: solving problems with unknown patterns, resistance to noises in input data,
and potential high-speed response. Neural network topologies are selected depending
on the input data and type of tasks to be solved. This study looks at the application of
artificial neural networks to forecast one of the most crucial among the controllable
parameters – cell voltage. Recurrent neural networks (RNNs) were chosen for the
purpose. The elements in RNNs form a directed graph which allows for processing
series of events in time or consecutive spatial sequences. Unlike multilayer
perceptrons, RNNs can use their internal memory to process variable length
sequences of inputs.
      </p>
      <p>The predictive tools are developed in two stages: 1) analysis and preprocessing of
input data; 2) elaboration of a math model and validation of the solutions. The main
body of the article is structured based on this logic. Section 2 spells out the objectives
for time series forecasting. Section 3 describes the inputs. Section 4 elaborates on the
applied methods of preprocessing of input data. Section 5 presents the result of
selecting an optimal neural network architecture. Section 6 gives the results of voltage
forecasting.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Research Objective</title>
      <p>The aim of the time series forecasting is set as follows. Let us assume that the values
of the time series are the following:
 = { ( ),  ∈  ,  ( ) ∈  },  = {1,2, … ,  }
(4)
where  ( ) is the value of the analyzed parameter registered at a given instant in time.</p>
      <p>Based on the values of the analyzed parameter at preceding moments in time
 ( ),  ( − 1),  ( − 2), …  ( −  + 1),  ≤  we must predict (assess the values
with highest precision) the analyzed parameter as it should appear at points in time
 + 1,  + 2, … ,  +  , i.e. build a sequence of forecasted values:</p>
      <p>= { ( + 1),  ( + 2), … ,  ( +  )}
To calculate the values in the time series at future moments in time, we must
determine the functional relationship that shows the connection between the past and
future values of this time:</p>
      <p>( +  ) =   ( −  + 1),  ( −  + 2), … ,  ( +  − 1)
The presented functional relationship (3) represents the prediction model.</p>
      <p>Therefore, the task of time series forecasting is fulfilled through creating a
forecasting model that will satisfy the relevant criteria of forecasting quality control.
(5)
(6)
(7)
(8)
(9)
In addition, apart from the given evaluation characteristics, this study estimates the
accuracy of forecasts made to the elaborated prediction model using the coefficient of
determination and Theil inequality coefficient:
─
the coefficient of determination:

=
∑
∑
( ( ) )
( ( ) )</p>
      <p>Currently the accuracy of time series modelling is commonly estimated using the
following two indicators:
─ mean squared error, MSE:

=
∑</p>
      <p>( ) −  ( )
─ mean absolute percentage error, MAPE, mean average percentage deviation (mean
relative forecast error):

= 
∑
| ( ) ( )|
( )
∗ 100%
The coefficient of determination characterizes the strength of association of inputs
and forecasts, so the closer it gets to 1, the better is the quality of the prediction
model.
─ Theil inequality coefficient:
(10)
 =</p>
      <p>∑
∑
( )
( ) ( )
∑
( )
The Theil index shows the strength of association in time series, so the closer it is to
zero, the more strongly associated the series are that are compared.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Description of Inputs</title>
      <p>The basic time series presents the data on the cell voltage registered by system
detectors in the experimental area of the Khakas aluminum smelter. The voltage time
series contain three-minute values of voltage for the period from January 3, 2020 to
January 31, 2020. The time series parameters are demonstrated in Table 1.</p>
      <sec id="sec-3-1">
        <title>Cell</title>
        <p>No.1
No.2
No.3
No.4
No.5
No.6
No.7
No.8
The overall sample volume contains about 115,000 entries. To set up the prediction
model and evaluate the quality of the model itself, the sample volume was broken
down into three parts: training (voltage at cells No.1-6), validating (voltage at cell No.
7), and testing (voltage at cell No.7).
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Preprocessing of Inputs</title>
      <p>The stage of building a prediction model is preceded by the stage of analysis and
preprocessing of the time series. The preprocessing of the time series entails
identifying outliers and smoothing the series. Certain discrepancies in the quality of
measurements occur in various time series of data characterizing the production
process. The outliers may be caused by technical errors in data collection, processing,
and transfer.</p>
      <p>
        Sifting out the outliers from the rest of data is a specific mechanism to identify
and delete obvious discrepancies and other possible errors in inputs and make sure
further forecasts are accurate. In the study, outliers were isolated by the isolation
forest algorithm [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. The isolation forest is a method to detect outliers that is mainly
centered around constructing a forest of decision trees during training and forecast
output. When it comes to detecting outliers, this method relies on the fact that outliers
have values that are decidedly different from the norm and only make up a small
proportion of the whole set of data. The results of detected outliers for voltage in cell
No. 1 are presented in Figure 2.
      </p>
      <p>
        Detected outliers are removed from the set and the resulting gaps in the data are
recovered by the interpolation technique [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. The view after the removal of outliers
for the cell voltage can be seen in Figure 3.
The efficiency of solving time series forecasting tasks that feature artificial neural
networks is defined by their hyperparameters. The main hyperparameters underlying
60
70
80
90
100
      </p>
      <sec id="sec-4-1">
        <title>Number of neurons/Number of layers 50</title>
        <p>an artificial neural network are the number of layers and the number of neurons in
each of the layers.</p>
        <p>The neural network architecture was selected by iterating over the values of the
number of layers/neurons. The number of LSTM-layers ranged from 1 to 3, whereas
the number of neurons in each layer varied from 50 to 100 with the step size of 10.
The Dropout technique was used to combat overfitting.</p>
        <p>The results of the neural network architecture selection are presented in Table 2.
The training data showed a similar result for all possible architectures. The eventually
selected architecture consisted of 1 LSTM-layers with 50 neurons and one fully
connected layer.</p>
        <p>Other hyperparameters of the model were set using the random-walk method with
cross-validation. The parameters for model construction were selected based on the
principle of maximum accuracy (Table 3).</p>
      </sec>
      <sec id="sec-4-2">
        <title>Parameter name Optimizer</title>
      </sec>
      <sec id="sec-4-3">
        <title>Loss function Metrics Number of epochs Mini-batch size</title>
        <p>Process disruptions in the retention cell operation build up over time and undetected
errors may spiral into serious accidents. Timely detection of deviation will entail
long-term forecasting.</p>
        <p>The long-term voltage forecasting is carried out through the iterative approach.
The iterative approach in forecasting involves a few forecasting runs performed one
step ahead, though using the values in the preceding stage. The general diagram of
long-term forecasting is given in Figure 4.</p>
        <p>The forecasting sequence is chosen to have a length of 10. The forecasts were fulfilled
10 steps ahead, which translates into 30 minutes. The forecasting results for the test
sets are presented in Figure 5.</p>
        <p>It can be derived from the resulting graph that long-term forecasting performed with
the iterative approach entails a value in every step that will differ from the real one,
i.e. there will always be a certain error that will be growing with every new step. In its
turn, the resulting prediction model makes it possible to reveal a tendency in how the
controlled parameter is changing and identify the process disruptions in a timely
manner.
7</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>The paper presents the results of artificial neural networks as they were applied to
forecast the values of the technological process parameters in aluminum production. It
looks at the mechanics of the prediction model construction aimed at one of the key
controllable process parameters, namely the retention cell voltage. The elaboration of
the forecasting tools is carried out in two main stages: analysis and preprocessing of
inputs, construction of a math model and validation of the solution. Forecasting was
chosen to be performed using recurrent neural networks. The method of maximum
accuracy was used in the selection of an optimal neural network architecture,
calculation of MSE, MAPE metrics, determination coefficient, and Theil coefficient.
As it can be derived from the values of the selected metrics, the accuracy of the
suggested model may be deemed appropriate.</p>
      <p>The results obtained in the testing process are acceptable in terms of forecasting
values of the process parameters. Timely detection of deviations in the forecasted
parameter will allow for a quick response to prevent any process disruptions and thus
increase aluminum production efficiency.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Abubakar</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Alyuminiyevaya promyshlennost' v sovremennom mire [The aluminum industry in the modern world]</article-title>
          .
          <source>Iinternational student research bulletin. 4-4</source>
          .
          <fpage>542</fpage>
          -
          <lpage>545</lpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Puzanov</surname>
            ,
            <given-names>I.I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zavadyak</surname>
            ,
            <given-names>A.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Klykov</surname>
            ,
            <given-names>V.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Makeev</surname>
            ,
            <given-names>A.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Plotnikov</surname>
            ,
            <given-names>V.N.</given-names>
          </string-name>
          :
          <article-title>Continuous monitoring of information on anode current distribution as means of improving the process of controlling and forecasting process disturbances</article-title>
          .
          <source>J. Sib. Fed. Univ. Eng. technol. 9</source>
          (
          <issue>6</issue>
          ).
          <fpage>788</fpage>
          -
          <lpage>801</lpage>
          (
          <year>2016</year>
          ).
          <source>doi: 10.17516/1999-494X-2016-9-6-788-801</source>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Zavadyak</surname>
            ,
            <given-names>A.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Puzanov</surname>
            ,
            <given-names>I.I.</given-names>
          </string-name>
          , Tretyakov,
          <string-name>
            <given-names>Ya.A.</given-names>
            ,
            <surname>Morozov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.M.</given-names>
            ,
            <surname>Makeev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.V.</given-names>
            ,
            <surname>Pianykh</surname>
          </string-name>
          ,
          <string-name>
            <surname>A.A.</surname>
          </string-name>
          :
          <article-title>Mathematical modeling of the impact of anode bottom problems of the anode current distribution high current electrolyzer</article-title>
          .
          <source>J. Sib. Fed. Univ. Eng. technol</source>
          .
          <volume>10</volume>
          (
          <issue>7</issue>
          ).
          <fpage>862</fpage>
          -
          <lpage>873</lpage>
          (
          <year>2017</year>
          ). doi:
          <volume>10</volume>
          .17516/1999-494X-2017
          <source>-10-7-862-873</source>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Montgomery</surname>
            ,
            <given-names>D.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jennings</surname>
            ,
            <given-names>C.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kulahci</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Introduction to Time Series Analysis and Forecasting</article-title>
          . New Jersey: John Wiley and Sons (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Hyndman</surname>
            ,
            <given-names>R.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Athanasopoulos</surname>
          </string-name>
          , G.:
          <article-title>Forecasting: Principles and Practice</article-title>
          . Australia: OTexts (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Goodfellow</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bengio</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Courville</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Deep learning</article-title>
          . Cambridge: MIT press (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>F.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ting</surname>
            ,
            <given-names>K.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhou Z</surname>
          </string-name>
          .-H.:
          <article-title>Isolation forest</article-title>
          .
          <source>In: Proceedings of the 2008 Eighth IEEE International Conference on Data Mining</source>
          . pp.
          <fpage>413</fpage>
          -
          <lpage>422</lpage>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <article-title>Method for interpolating the Pandas library</article-title>
          . https://pandas.pydata.org/ pandasdocs/stable/reference/api/pandas.DataFrame.interpolate.html
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Kolmykov</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>The comparative analysis of the statistical model and neural network of the backpropagation in a forecasting problem</article-title>
          .
          <source>Applied Computer Science</source>
          <volume>6</volume>
          (
          <issue>30</issue>
          ),
          <fpage>111</fpage>
          -
          <lpage>119</lpage>
          (
          <year>2010</year>
          )
          <article-title>(in Russian)</article-title>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>