<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <article-id pub-id-type="doi">10.18287/1613-0073-2016-1638-593-599</article-id>
      <title-group>
        <article-title>MULTI-CRITERIA OPTIMIZATION BASED ON THE REGRESSION EQUATION SYSTEMS IDENTIFICATION</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>A.P. Kotenko</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>D.A. Pshenina</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Samara National Research University</institution>
          ,
          <addr-line>Samara</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Samara State Technical University</institution>
          ,
          <addr-line>Samara</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2016</year>
      </pub-date>
      <fpage>593</fpage>
      <lpage>599</lpage>
      <abstract>
        <p>Consider the problem of multi-criteria optimization with conflicting criteria. An example is the complex chemical production with random parameters. We describe using regression equations dependence of targets from control actions. Investigation of the systems of interdependent regression equations requires description of all possible variants of their properties. The most productive case is exact identification of the system parameters. It allows to find the optimal values of the control parameters for manufacturing of quality assured production.</p>
      </abstract>
      <kwd-group>
        <kwd>multivariate optimization</kwd>
        <kwd>statistical parameters</kwd>
        <kwd>system of linear regressions</kwd>
        <kwd>conflicting criteria</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Basic notation system</title>
      <p>
        Let dependence of optimization criteria y1, y2,…, yn from the controlling factors x1,
x2,…, xm is expressed by a system of linear regressions. In the structural form of the
model (SFM) [1,2]
(
        <xref ref-type="bibr" rid="ref1">1</xref>
        )
~y1  11x1  12 x2    1m xm , ~y2  21x1  22 x2    2m xm , …
    
with regression values ~yi  ~yi1, ~yi2 ,, ~yiN T   N and the reduced coefficients βij,
found by the Ordinary Least Squares (OLS) method.
      </p>
      <p>At the same time vectors of the regression values of endogenous variables are
obtained by orthogonal projection Pr of vectors of observations of endogenous variables
to the linear span [1] Lx1, x2 ,, xm   N of dimension</p>
      <p> 
dim Lx1, x2 ,, xm   m</p>
      <p> 
of observed values vectors of the predefined variables
~yi  Pryi ; Lx1, x2 ,, xm  Lx1, x2 ,, xm .</p>
      <p>
            
(
        <xref ref-type="bibr" rid="ref2">2</xref>
        )
(
        <xref ref-type="bibr" rid="ref3">3</xref>
        )
(
        <xref ref-type="bibr" rid="ref4">4</xref>
        )
Since the structure coefficients are functions of the solutions of the reduced system of
linear algebraic equations (SLAE), the solvability of SFM identification problem (
        <xref ref-type="bibr" rid="ref1">1</xref>
        )
is determined by the ranks of the system’s matrix and the extended matrix of the
system.
      </p>
      <p>Accordingly, there may be the following cases:
 one solution (accurate identifiability of the system’s equation by indirect</p>
      <p>
        OLS);
 no solution (superidentifiable equation when two-step OLS is applicable);
 infinite many solutions (unidentifiable equation of the SFM system).
Replacing the sample values of the endogenous variablesofSFM (
        <xref ref-type="bibr" rid="ref1">1</xref>
        ) by the
regression values from PFM (
        <xref ref-type="bibr" rid="ref2">2</xref>
        ), we obtain in the linear span Lx1, x2 ,, xm  of system (
        <xref ref-type="bibr" rid="ref1">1</xref>
        )
predefined variables’ sample values the following independent linear equations,
which enab~le us to identify the ~structural factors: 
~y1  a12 y2  a13 ~y3    a1n yn  b11x1  b12x2    b1m xm ,
~y2  a21~y1  a23 ~y3    a2n ~yn  b21x1  b22 x2    b2m xm ,

~   
~yn  an1 ~y1  an2 ~y2    an,n1 yn1  bn1x1  bn2 x2    bnm xm .
      </p>
      <p>
        From (
        <xref ref-type="bibr" rid="ref1">1</xref>
        )-(
        <xref ref-type="bibr" rid="ref3">3</xref>
        ), we obtain the system of m linear equations to identify structural
coefficients of i-th equation:
      </p>
      <p>
        m  n m  n  m   m 
~yi  ik xk   aij ~y j   bik xk   aij   jk xk    bik xk 
k1 j1; ji k1 j1; ji  k1  k1
 i1  bi1 
n
 j1aij , i2  bi2 
j1; ji
n
 j2aij , …, im  bim 
j1; ji
n
 jmaij
j1; ji
Let us consider the linear span L by combining multiple regression vector of
endogenous variables values from the right hand side of the i-th equation of the system (
        <xref ref-type="bibr" rid="ref4">4</xref>
        )
and the set of vectors of sampled values of predefined variables.
Identifiability of structure coefficients of the i-th equation is determined by a
combination of the following factors: the fact of its left hand side belonging (not belonging)
to the linear span L of the SFM’s right hand side vectors and by their linear
(in)dependence.
      </p>
      <p>
        In this case the two-step OLS result coincides with indirect OLS result if the
considered system (
        <xref ref-type="bibr" rid="ref1">1</xref>
        ) equation can be exactly identified.
      </p>
    </sec>
    <sec id="sec-2">
      <title>Setting up a problem</title>
      <p>The content of bitumen in the physical volume of asphalt is only 5-7%, but it has the
most significant influence on the quality and durability of the road surface. Bitumen
production is one of the most energy-intensive. Energy costs for production of
oxidized bitumen consist of steam, fuel and electricity consumptions. [3]
Residual bitumen production technology is based on the concentration of heavy
petroleum residues of tar vacuum distillation. Tars characteristics affect the quality of road
asphalts, in turn, these characteristics can be influenced by changing the composition
of the hydrocarbon composition of the feedstock.</p>
      <p>The main factors of the residual bitumen production process are the depth of the
vacuum, steam flow and distillation temperature. Physical and chemical properties of
bitumen depend on the hydrocarbon composition of raw materials. Thus, there is an
inverse problem as follows: to obtain the desired properties of the product we need to
determine the characteristics of raw materials composition and production process
parameters.</p>
    </sec>
    <sec id="sec-3">
      <title>Mathematical model</title>
      <p>For this purpose, it proposed to use the methods of mathematical modeling, in
particularly, to set up the system of regression equations where the input parameters are the
parameters of the approved standards, and the results of calculations are the
characteristics of the bitumen.</p>
      <p>Input variables are as follows:
х1 – penetration depth of the needle at 250C,
х2 – penetration depth of the needle at 00C,
х3 – extensibility at 250C,
х4 – dynamic viscosity at 600C,
х5 – softening temperature change after heating,
х6 – extensibility after warming up.</p>
      <p>Calculations will enable us to determine the following values:
у1 – sulfur in %%,
у2 – paraffins and naphtha of tar in %%,
у3 – heating temperature,
у4 – air consumption,
у5 – warming-up duration,
у6 – oil tar in %%,
у7 – VU80 tar index,
у8 – light aromatics of tar in %%.
65 observations have been processed. [4]
Setting up each of the regression equation was carried out in the frameworks of a
posteriori approach, implying the inclusion of all feasible variables and sequential
exclusion of insignificant ones.</p>
      <p>At the first step let us set up regression equations with six variables:
у1=4,05+0,014х1–0,08х2+0,003х3–0,005х4–0,11х5–0,0014х6, R2=0,745399.
Analysis of the Student’s coefficients revealed the following significant variables:
у1=5,17–0,075х2–0,0054х4–0,117х5, R2=0,72951.</p>
      <p>Similarly,
у2=2,61+0,17х1+0,27х2–0,043х3+0,002х4+0,89х5–0,011х6, R2=0,759902; 
y2=3,34+0,23х1–0,05х3+1,35х5, R2=0,74412;
у3=251,98–0,16х1–0,58х2–0,076х3+0,001х4+4,07х5–0,03х6, R2=0,219015; 
у3=222,17+3,69х5, R2=0,191535;
у4=7,15–0,02х1–0,02х2–0,003х3+0,0004х4–0,02х5–0,006х6, R2=0,234903; 
y4=5,24–0,002х6, R2=0,05344;
у5=–6,73+0,13х1+0,13х2–0,01х3+0,0075х4+0,38х5–0,0075х6, R2=0,743188; 
у5=–10,73+0,18х1+0,0064х4+0,764х5, R2=0,705288;
у6=11,72+0,0096х1–0,16х2–0,009х3–0,0008х4+0,18х5+0,009х6, R2=0,121453; 
у6=12,12–0,16х2+0,18х5, R2=0,061457;
у7=543,24–4,75х1-2,35х2–0,51х3–0,1х4–12,16х5+0,94х6, R2=0,677759; 
у7=321,05–4,59х1+1,024х6, R2=0,61394;
у8=13,21–0,066х1+0,1х2+0,0079х3–0,024х4+0,597х5–0,0088х6, R2=0,502146; 
у8=14,96–0,0235х4, R2=0,393982.</p>
      <p>Let us set up a system of equations out of significant regression equations:
у1=5,17–0,075х2–0,0054х4–0,117х5;
y2=3,34+0,23х1–0,05х3+1,35х5;
у3=222,17+3,69х5;
y4=5,24–0,002х6;
у5=–10,73+0,18х1+0,0064х4+0,764х5;
у7=321,05–4,59х1+1,024х6;
у8=14,96–0,0235х4.</p>
      <p>The parameters provided by the RF Standard are considered to be the given variables.
They are as close as possible to Euro-requirements establishing quality standards for
EU road bitumen:
х1=60; х2=13; х3=80; х4=320; х5=3; х6=40.</p>
      <p>Hence,
у1=1,104; у2=21,85; у3=11,07; у4=–0,08; у5=15,14; у7=–234,4; у8=7,52.
The results show the statistical unreliability of production results when production
process is carried out in accordance with current technologies under the specified
Standard values.</p>
    </sec>
    <sec id="sec-4">
      <title>Forecasts confidence intervals</title>
      <p>On the basis of 10 samples there were calculated correlations (Table 1), indicating a
strong linear relationship between the following parameters [5]:
X –softening temperature,
Y1 – dynamic viscosity at 600C,
Y2 – needle penetration depth at 250C,
Y3 – extensibility after warming up.</p>
      <p>This allows to find linear regressions among all pairs of the indices.</p>
      <p>As a general regressor there was chosen a softening temperature X, for which the
measurement can be made in the most accurate way.
Significant regressions are obtained:
Y1=–2038,63+45,29X, R2=0,9674; Y3=868,37–15,97X, R2=0,9456.</p>
      <p>The high values of determination coefficients indicate the adequacy of the proposed
linear models to experimental data. The equations’ parameters are conservatively
significant according to Fisher’s test at the significance level of α=0,01–0,05–0,10.
The link between Y2 and X has not been studied, as it is co-directional to the link
between Y3 and X. There were found the forecast confidence intervals for Yregr1 (Table 2)
and Yregr3 (Table 3).</p>
      <p>Regressor X values and Y1 and Y3 forecast confidence intervals boundaries satisfying
Standard ( X  49 , Y1  250 , Y3  80 ) are marked in the Tables 2,3 by italics. Thus,
in Table 2 only the rows with T  50 allow the values of Y1 to satisfy the Standard,
while in Table 3 these are only the rows with T  50,4 .</p>
      <p>However, at significance level of   0,05 there is only the line with T=50 in both
tables, which is suitable. Y3 index observed value is sufficiently shifted to the left
boundaries of confidence intervals (both at α=0,05 and at α=0,01), which do not meet
the Standard.</p>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>Standard boundaries are contradictory for indexes X, Y1 and Y3, which means the
necessity of technological upgrade of production process if it is aimed at European
bitumen standards.</p>
      <p>This example illustrates the possibility of using identifiable systems of regression
equations. They help to define the boundaries of control factors of production to
ensure product quality at a given conflicting criteria and random perturbations.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Kotenko</surname>
            <given-names>AP</given-names>
          </string-name>
          .
          <article-title>Geometry of Systems of Econometric Equations. Control of organizational and economic systems</article-title>
          . Samara State Aerospace University,
          <year>2012</year>
          ;
          <volume>9</volume>
          :
          <fpage>35</fpage>
          -
          <lpage>41</lpage>
          . [In Russian]
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Kotenko</surname>
            <given-names>AP</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bukarenko</surname>
            <given-names>MB</given-names>
          </string-name>
          .
          <source>Geometry of Systems of Linear Regression Equations. News of Samara Science Center or Russian Academy of Science</source>
          ,
          <year>2013</year>
          ; Vol.
          <volume>15</volume>
          ;
          <issue>6</issue>
          (
          <issue>3</issue>
          ):
          <fpage>820</fpage>
          -
          <lpage>823</lpage>
          . [In Russian]
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Kotenko</surname>
            <given-names>AP</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kuznetsova</surname>
            <given-names>OA</given-names>
          </string-name>
          .
          <article-title>The Use of Multivariate Regression Analysis for Optimizing of the Production of Bitumen of Standardized Specifications</article-title>
          .
          <source>Modern Information Technology and IT-Education, Proc. of 10th Conf</source>
          ., Moscow State University,
          <year>2015</year>
          :
          <fpage>356</fpage>
          -
          <lpage>359</lpage>
          . [In Russian]
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Dokuchaev</surname>
            <given-names>AV</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kotenko</surname>
            <given-names>AA</given-names>
          </string-name>
          .
          <article-title>Software for Multi-criteria Optimization of Production with the Help of Systems of Regression Equations</article-title>
          .
          <source>Rospatent. Certificate of registration No</source>
          <volume>2016612606</volume>
          . [In Russian]
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Tukilina</surname>
            <given-names>PM</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Melnikov</surname>
            <given-names>VN</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tyschenko</surname>
            <given-names>VA</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ermakov</surname>
            <given-names>VV</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pimenov</surname>
            <given-names>AA</given-names>
          </string-name>
          .
          <article-title>The Use of Multivariate Data Analysis Method in the Development of Production Technology of Highquality Road Bitumen</article-title>
          .
          <source>Chemistry and Technology of Fuels and Oils</source>
          ,
          <year>2015</year>
          ;
          <volume>5</volume>
          :
          <fpage>18</fpage>
          -
          <lpage>23</lpage>
          . [In Russian]
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>