<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Metaheuristic algorithms for identification of the convection velocity in the convection-diffusion transport model</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>A V Tsyganov</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yu V Tsyganova</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>A N Kuvshinova</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>H R Tapia Garza</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Mathematics, Information and Aviation Technology, Ulyanovsk State University</institution>
          ,
          <addr-line>Ulyanovsk, Russian Federation</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Mathematics, Physics and Technology Education, Ulyanovsk State Pedagogical University named after I.N. Ulyanov</institution>
          ,
          <addr-line>Ulyanovsk, Russian Federation</addr-line>
        </aff>
      </contrib-group>
      <fpage>188</fpage>
      <lpage>196</lpage>
      <abstract>
        <p>This paper considers an application of metaheuristic algorithms for solving the problem of convection velocity identification in the convection-diffusion transport model. Algorithms based on numerical minimization of the parameter identification criterion for a discrete linear stochastic model using the simulated annealing and a genetic algorithm are proposed. The log-likelihood function is used as the identification criterion. Numerical experiments were conducted to compare the computational properties of the proposed algorithms.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction and problem statement</title>
      <p>Convection-diffusion transport models are an indispensable tool for describing various natural and
anthropogenic processes [1], [2]. These models contain parameters that must be specified to uniquely
determine the solution of boundary value problems, but in practice, situations often arise where some of
these parameters are unknown or given approximately and they need to be determined or refined. Such
problems belong to the class of inverse problems for the models of matter transfer.</p>
      <p>
        In the simplest one-dimensional case, the convection-diffusion transport model can be described by
equation (
        <xref ref-type="bibr" rid="ref1">1</xref>
        ) with initial condition (
        <xref ref-type="bibr" rid="ref2">2</xref>
        ) and boundary conditions (
        <xref ref-type="bibr" rid="ref3">3</xref>
        ):
where is the function of interest (for example, the concentration of the pollutant), is the
spatial coordinate, is the time, is the convection velocity, is the diffusion coefficient, , ,
are given functions, , are boundaries of the considered segment.
      </p>
      <p>
        When solving a wide range of problems in ecology, geophysics, seismology, and other areas, the
problem of determining (identifying) the convection velocity in the convection-diffusion transport
model often arises. Depending on the equation under consideration and the boundary conditions,
various methods can be used to solve this problem. In [3], [4], to find the parameters of equation (
        <xref ref-type="bibr" rid="ref1">1</xref>
        ), time
series analysis methods based on the method of least squares, extended Kalman filter and their
combination are used.
      </p>
      <p>
        In this paper, we propose the use of metaheuristic algorithms for numerical optimization to find the
optimal estimate of a parameter of a discrete linear stochastic model with respect to a given criterion
of identification quality. As in [4], this model with noisy measurements is constructed from the
original model (
        <xref ref-type="bibr" rid="ref1">1</xref>
        )–(
        <xref ref-type="bibr" rid="ref3">3</xref>
        ) using a two-layer finite-difference scheme, but unlike [4] it uses a grid not with
three but with an arbitrary number of nodes on the coordinate and the conventional, rather than the
extended Kalman filter.
      </p>
      <p>The choice of metaheuristic algorithms to optimize a parameter identification criterion is due to the
fact that deterministic numerical methods applicability is guaranteed under the conditions of
convergence theorems [5]. The convergence of numerical methods is influenced by various factors, including
a good choice of the initial approximation.</p>
      <p>If the initial approximation is not chosen correctly, the exact algorithm for finding the estimates of
the parameters may diverge, which means that it is impossible to solve the identification problem.
Metaheuristic algorithms can be used both directly for solving the problem of minimizing the
identification criterion, and for finding a good initial approximation for exact methods. A similar approach
was applied by the authors in [6–9] and proved its efficiency.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Discrete linear stochastic model</title>
      <p>
        We want to move from the model (
        <xref ref-type="bibr" rid="ref1">1</xref>
        )–(
        <xref ref-type="bibr" rid="ref3">3</xref>
        ) to a discrete linear stochastic model, whose equations
generally have the following form:
where is the system state vector, is the control vector, is the measurements
vector, noises and form independent normally distributed sequences with zero
mean and covariance matrices and respectively, matrices ,
, , , , can depend on an unknown
parameter .
      </p>
      <p>
        In computational practice, for the numerical solution of non-stationary problems for
convectiondiffusion equations, two- and three-layer finite-difference schemes are most often used. To obtain a
discrete linear stochastic model, consider in the plane a regular grid (
        <xref ref-type="bibr" rid="ref5">5</xref>
        ) with spatial step and
time step :
      </p>
      <p>
        Let us denote
difference scheme for (
        <xref ref-type="bibr" rid="ref1">1</xref>
        )–(
        <xref ref-type="bibr" rid="ref3">3</xref>
        ):
,
,
,
and write down the
finite(
        <xref ref-type="bibr" rid="ref4">4</xref>
        )
(
        <xref ref-type="bibr" rid="ref5">5</xref>
        )
(
        <xref ref-type="bibr" rid="ref6">6</xref>
        )
(
        <xref ref-type="bibr" rid="ref7">7</xref>
        )
(
        <xref ref-type="bibr" rid="ref8">8</xref>
        )
(
        <xref ref-type="bibr" rid="ref9">9</xref>
        )
(10)
It follows from equation (
        <xref ref-type="bibr" rid="ref6">6</xref>
        ) that the value of the required function at the internal node points of the
-th time series can be found through its values at the nodal points of the -th time series as
follows:
where
,
      </p>
      <p>
        . Let us write (
        <xref ref-type="bibr" rid="ref9">9</xref>
        ) in the form
where
      </p>
      <p>
        The desired discrete linear stochastic model can be represented in the following form:
Note that the first equation in model (12) is deterministic, the initial value of the state vector is (
        <xref ref-type="bibr" rid="ref7">7</xref>
        ),
and the boundary conditions (
        <xref ref-type="bibr" rid="ref8">8</xref>
        ) act as control parameters.
      </p>
      <p>Suppose that the diffusion coefficient and the characteristics of the noise in the measurer are
known, and the steps of the space-time grid and are given, then the unknown parameter of
model (12) to be determined is the convection velocity , on which coefficients and of equation (10)
and, consequently, matrix depend. Briefly, the model (12) can be written in the form:
where</p>
      <p>.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Metaheuristic algorithms for parameter identification</title>
      <p>Let us consider the problem of parameter identification of model (13) from noisy measurements data
for estimating an unknown parameter. The parameter identification problem consists in finding an
unknown parameter from known input signals and the output observation
data in accordance with the chosen identification criterion . In this
case, the problem of estimating an unknown parameter requires solving the nonlinear programming
problem with constraints
(12)
(13)
(14)
where (the domain of ).</p>
      <p>In this paper, we use metaheuristic algorithms to solve problem (14). Metaheuristic is a high-level
search strategy for finding solutions, applicable to a wide range of optimization tasks. Metaheuristics
have the following properties: they are based on fairly simple ideas, for example, imitating biological
or physical processes, they are problem-independent, practically all of them are nondeterministic.</p>
      <p>Most metaheuristic optimization algorithms can be divided into two large groups according to the
method of obtaining a solution: trajectory and population-based ones. In trajectory algorithms, the
process of finding a solution can be considered as a movement between individual solutions of a
problem while in population-based algorithms a group of solutions called population changes in the
process of finding the solution.</p>
      <p>One of the most popular trajectory algorithms used in solving global optimization problems is the
simulated annealing (SA) method. A key feature of the method is the use of a control parameter– a
temperature, which allows controlling the nondeterministic process of solution search. As a rule, the
temperature decreases during the operation of the algorithm according to a certain law, starting with
some initial value. At each iteration of the algorithm, the randomly generated new solution from the
neighborhood of the current solution is taken with probability 1 if it improves it, and with probability
less than 1, if worsens, and the probability of making the worst decision decreases with a decreasing
temperature. The quality of decisions is estimated using a cost function (integer or real).</p>
      <p>Simulated Annealing (SA)</p>
      <p>The genetic algorithm (GA) is a popular version of the so-called evolutionary optimization
algorithms based on the simulation of natural selection processes. In evolutionary algorithms, the quality
of solutions is estimated using the fitness function, and the main idea of algorithms is that solutions
with the best values of a given function “survive” in the course of evolution.</p>
      <p>In GA, at each iteration of the evolutionary process, a new population is obtained from the current
population using one or more genetic operators successively. The most common genetic operators are
the crossover (recombination) which is used to generate the descendant solutions from the parent
solutions and the mutation – an accidental change in the solution.</p>
      <p>Genetic Algorithm (GA)
1: Population InitialPopulation()
2: for all Population do
3: EvaluateFitness( )
4: end for
5: while not StopCondition() do
6: Parents SelectParents(Population)
7: Offspring Crossover(Parents)
8: Offspring Mutation(Offspring)
9: for all Offspring do
10: EvaluateFitness( )
11: end for
12: Population
13: end while
14: Solution ChooseBestOf(Population)
Output: Solution.</p>
      <p>UpdatePopulation(Population</p>
      <p>Offspring)
The algorithms SA and GA are discussed in more detail, for example, in [10].</p>
      <p>Since (13) is a discrete linear stochastic model with Gaussian noise, it is advisable to select the
identification criterion (14) in the form of a negative logarithmic likelihood function as the cost/fitness
function for implementing metaheuristic algorithms [11]
where the residual vector and its covariance matrix
are calculated from the known Kalman filter equations [12]:</p>
      <p>A. Time update.</p>
      <p>For the Kalman filter computes extrapolated estimates
through the temporal update from to as
with
and the covariance matrices
for a given value of the parameter</p>
      <p>
        (15)
in (
        <xref ref-type="bibr" rid="ref4">4</xref>
        )
for
. They are obtained
where .
      </p>
      <p>B. Measurement update.</p>
      <p>For the Kalman filter computes the so-called filtered (i.e., measurement updated)
estimates . They are obtained through the measurement update using with noise covariance
, as
,
,
with filter gain
and filtered estimates covariance matrices</p>
      <p>At present, when solving practical problems with the use of a computer, it is preferable to apply
square-root and UD-implementations that are numerically stable against machine round-off errors
instead of the conventional form of the Kalman algorithm [12, Chapter 6].</p>
      <p>The maximum likelihood method consists in optimizing criterion (15) with respect to the system
parameter . It is often used in practice to solve parameter identification problems of discrete linear
stochastic systems [12, 13].</p>
    </sec>
    <sec id="sec-4">
      <title>4. Numerical experiments</title>
      <p>Consider the following problem:
where is the concentration of the pollutant in one-dimensional flow, (17) is the initial
distribution of the pollutant and boundary conditions (18) correspond to the case of two absorbing walls.</p>
      <p>The exact solution of (16)–(18) has the form:</p>
      <p>Suppose that the value of parameter in equation (16) is known and it is required to determine the
value of parameter , provided that noisy measurements from sensors located at nodes of some regular
(16)
(17)
(18)
(19)
grid are available. Let a grid with 10 nodes (
) be given on the
axis, and the time step
.
,
. The plot of the exact solution for this case is shown in figure 1.</p>
      <p>To simulate noisy measurements we add random errors (white noise) to the values of the exact
solution (19) at the grid nodes. The covariance matrix of the noise is , where is the identity
matrix and is the known variance. An example of a noisy solution is shown in figure 2.</p>
      <p>Figure 3 demonstrates the averaged plot of criterion (15), obtained from the results of 100
experiments. To minimize it, we have used functions simulannealbnd() and ga() from the Global
Optimization Toolbox of the MATLAB system. Numerical experiments were conducted on a
hardwaresoftware platform: Intel Core 2 Quad Q6600 @ 2.40 GHz, 4 Gb RAM, Microsoft Windows 10 Pro
x64, MATLAB R2017a.</p>
      <p>For each of the algorithms, the corresponding cost/fitness and output functions were written. Basic
settings of the algorithms are given in table 1. As a stopping criterion for both algorithms, a time limit
of 5 seconds was used. The search for solutions was carried out on the interval [0; 5], the number of
steps in time (i.e., the number of time series) is .
Table 2 shows the results of computational experiments for different values of the noise variance
. For each value of , a series of 100 experiments was conducted and for each series, the following
values were calculated: mean value of the identified parameter (Mean), mean absolute percentage
error (MAPE) and root mean squared error (RMSE). The obtained results show that, with the selected
settings, both algorithms allow identifying the convection velocity with an acceptable accuracy. At
high noise values, the quality of parameter identification is approximately the same for both
algorithms, with a low noise level, the simulated annealing method yields smaller MAPE and RMSE
errors.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion</title>
      <p>This paper demonstrates the practical applicability of metaheuristic algorithms for solving the problem
of parameter identification in the model of convection-diffusion transport. The convection velocity
was considered as an unknown parameter of the model. The problem solution was obtained with the
use of the maximum likelihood method, in which the simulated annealing method and the genetic
algorithm were used to numerically minimize the negative logarithmic likelihood function.</p>
      <p>Numerical experiments were conducted in MATLAB. The results obtained make it possible to
conclude that the application of metaheuristic algorithms is expedient since it allows to obtain acceptable
estimates of the parameter for different levels of noisy measurements.</p>
      <p>Further research will focus on the construction and software implementation of new exact and
hybrid algorithms for parameter identification of convection-diffusion transport models and their
application to real life problems.
[10] Hromkovič J. 2002 Algorithmics for Hard Problems: Introduction to Combinatorial
Optimization, Randomization, Approximation and Heuristics ed 2 (Springer-Verlag Berlin, Heidelberg)
p 557
[11] Åström K J 1980 Maximum Likelihood and Prediction Error Methods Automatica 16 5 pp
551574
[12] Grewal M S and Andrews A P 2001 Kalman Filtering: Theory and Practice Using MATLAB
ed 2 (New Jersey: Prentice Hall) p 401
[13] Gibbs V P 2011 Advanced Kalman Filtering, Least-squares and Modeling: Practical Handbook
(Hoboken, New Jersey: John Wiley &amp; Sons, Inc.) p 632</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Leontiev</surname>
            <given-names>A I</given-names>
          </string-name>
          <year>1997</year>
          <article-title>The Theory of Heat</article-title>
          and Mass Transfer ed 2 (Moscow: Bauman Moscow State Technical University) p
          <fpage>683</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Farlow S J 1982 Partial</surname>
          </string-name>
          <article-title>Differential Equations for Scientists and Engineers</article-title>
          (New York: Wiley) p
          <fpage>402</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Matveev</surname>
            <given-names>M G</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sirota</surname>
            <given-names>E A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Semenov</surname>
            <given-names>M E</given-names>
          </string-name>
          and
          <string-name>
            <surname>Kopytin</surname>
            <given-names>A V</given-names>
          </string-name>
          <year>2017</year>
          <article-title>Verification of the Convective Diffusion Process Based on the Analysis of Multidimensional Time Series Proc. of the DAMDID/ RCDL'2017. XIX International Conference on Data Analytics and Management in Data Intensive Domains (Moscow: Federal research center “Informatics and Control” of the Russian Academy</article-title>
          of Sciences) pp
          <fpage>433</fpage>
          -
          <lpage>437</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Matveev</surname>
            <given-names>M G</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kopytin</surname>
            <given-names>A V</given-names>
          </string-name>
          and
          <string-name>
            <surname>Sirota</surname>
            <given-names>E A</given-names>
          </string-name>
          <year>2018</year>
          <article-title>Combined method for identifying the parameters of a distributed dynamic model Proc. of the ITNT-2018</article-title>
          . IV International Conference on Information Technology and
          <string-name>
            <surname>Nanotechnology (SAMARA</surname>
          </string-name>
          : New Technique) pp
          <fpage>1651</fpage>
          -
          <lpage>1656</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Vasilyev</surname>
            <given-names>V P</given-names>
          </string-name>
          <year>1982</year>
          <article-title>Numerical Methods for Solving Extreme Problems</article-title>
          (Moscow: Mir) p
          <fpage>372</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Tsyganova</given-names>
            <surname>Yu</surname>
          </string-name>
          <string-name>
            <given-names>V</given-names>
            and
            <surname>Tsyganov</surname>
          </string-name>
          <string-name>
            <surname>A V</surname>
          </string-name>
          <year>2010</year>
          <article-title>Simulation normalization in the problem of identification of parameters of stochastic linear system Stochastic Optimization in Informatics 6</article-title>
          1 pp
          <fpage>147</fpage>
          -
          <lpage>159</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Tsyganov</surname>
            <given-names>A V</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bulychov</surname>
            <given-names>O I</given-names>
          </string-name>
          and
          <article-title>Tsyganova Yu V 2011 Parallel hybrid algorithms for the problem of parameter identification in stochastic linear systems Vector of Sciences</article-title>
          .
          <source>Togliatti State University</source>
          <volume>3</volume>
          (
          <issue>17</issue>
          ) pp
          <fpage>45</fpage>
          -
          <lpage>49</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Tsyganova</given-names>
            <surname>Yu</surname>
          </string-name>
          <string-name>
            <given-names>V</given-names>
            and
            <surname>Tsyganov</surname>
          </string-name>
          <string-name>
            <surname>A V</surname>
          </string-name>
          <year>2013</year>
          <article-title>Program for parameter identification in stochastic linear systems ISLSP v.1.1 The Certificate of State Registration of Computer Programs no 2013612686</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Tsyganov</surname>
            <given-names>A V</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Semushin</surname>
            <given-names>I V</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tsyganova Yu</surname>
            <given-names>V</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Golubkov</surname>
            <given-names>A V</given-names>
          </string-name>
          and
          <string-name>
            <surname>Vinokurov S D 2017</surname>
          </string-name>
          <article-title>Metaheuristic algorithms in the issue of identification of the moving object mathematical model parameters</article-title>
          <source>Automation of Control Processes</source>
          <volume>1</volume>
          pp
          <fpage>16</fpage>
          -
          <lpage>23</lpage>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>