<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Prediction based on TSLP</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Oleh Pihnastyi</string-name>
          <email>pihnastyi@gmail.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Olha Kozhyna</string-name>
          <email>olga.kozhyna.s@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kostiantyn Voloshyn</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Kharkiv National Medical University</institution>
          ,
          <addr-line>4 Nauky Avenue, Kharkiv, 61022</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>National Technical University "Kharkiv Polytechnic Institute"</institution>
          ,
          <addr-line>2 Kyrpychova, Kharkiv, 61002</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>V.N. Karazin Kharkiv National University</institution>
          ,
          <addr-line>4 Svobody Sq., Kharkiv, 61022</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Bronchial asthma is a heterogeneous disease affecting more than 300 million people, and its occurrence increases every year. Severity of disease is characterized by interaction of genetic factors and numerous environmental factors. The technique of regression model construction is used to determine significance and dependence between factors. The data of 70 children with diagnosed bronchial asthma and 20 children of a control group were used in the study. 142 factors were studied and level of thymic stromal lymphopoietin (TSLP) in blood serum was taken as a value of a referable variable to construct regression models. Prediction model parameters were assessed. Errors distribution law was defined on the basis of a ten-factor regression model analysis. In the absence of large deviations from normal distribution law, validity coefficients are close to the errors under normal distribution law. A comparative study of histograms is provided to distribute model's regressors values. Bronchial asthma, child, regression model, residual plot, TSLP Bronchial asthma is a chronic pulmonary inflammatory disease [1]. Development of approaches in the disease treatment as well as new medications creation did not allow consolidating control over bronchial asthma. For this reason, the search of inflammation biomarkers is the most pressing challenge Thymic stromal lymphopoietin can be used as one of inflammatory markers [4, 5]; that is hematopoetin cytokine isolated from mouse thymus epithelium culture [6, 7]. When studying the role of thymic stromal lymphopoietin its direct involvement in initiation and support of allergic inflammation in bronchial asthma is confirmed [8, 9]. Asthma phenotyping is based on diversity of disease clinical signs and heterogeneity [10, 11]. Special attention is paid to study of severe uncontrolled asthma in children [12, 13].</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>2023 Copyright for this paper by its authors.</p>
      <sec id="sec-1-1">
        <title>Regression models are a common way to predict a course of bronchial asthma in children [19, 20].</title>
        <p>Regression model defines a relationship between the observed value Yi and factors X K ( k = 1...K )
characterizing bronchial asthma. It can be represented as follows</p>
        <p>
          Yi = F (X1i , X 2i ,..., X ki,..., X Ki ) + i ,
(
          <xref ref-type="bibr" rid="ref1">1</xref>
          )
here Yi is the value of a referable variable for the i -th test, i = 1...n ; X ki is the predictor’s
(regressor’s) value for the i –th test; i is the prediction error for the value of Yi of the referable
variable in the i –th test having the following properties:
        </p>
        <p>
          E(i ) = 0, 2 (i , i ) = 2 , 2 (i ,  j ) = 0, i  j. (
          <xref ref-type="bibr" rid="ref2">2</xref>
          )
        </p>
      </sec>
      <sec id="sec-1-2">
        <title>The study purpose is research of the regression model features of thymic stromal lymphopoietin level in the blood serum.</title>
      </sec>
      <sec id="sec-1-3">
        <title>The regression model (1) shows the trend of the referable variable change</title>
        <p>Yˆ = F (X1, X 2 ,..., X K ),</p>
        <p>
          E(Y ) = Yˆ,
(
          <xref ref-type="bibr" rid="ref3">3</xref>
          )
depending on the change of predictors’ X k values, which characterize bronchial asthma, and the range
of Yi values around the value of F (X1, X 2 ,..., X K ) . The range of values for each level of predictors
X k follows probability distribution with mathematical expectation of E(i ) = 0 and mean square
deviation of (i ) =  .
        </p>
        <sec id="sec-1-3-1">
          <title>Definition of a set of predictors X k for the referable variable Y analyzing is the central task of</title>
          <p>
            regression model construction, which defines connection of factors characterizing bronchial asthma.
Consistent increase of the number of predictors in the model is one of the methods (
            <xref ref-type="bibr" rid="ref1">1</xref>
            ). It is supposed
that an addition of a new predictor to an existing set of predictors helps to decrease the 2 (i ) value.
Selection of predictors X k starts in the virtue of the regressor as a cause factor importance. The
referable variable of Y depends on it. Selection of the type of functional dependence (
            <xref ref-type="bibr" rid="ref3">3</xref>
            ), as well as
selection of the X k factors, is one of the main problems of severe bronchial asthma study. In the most
of the cases the dependence is defined as a result of a definite number of successive approximations.
Model of linear regression is used as an initial approximation during functional dependence
constructing [21]. Also special attention is to be paid to causal relationships of Y and X k , given that
statistical relationship of the referable variable Y and predicate X k doesn’t mean that the value of the
referable variable Y causally depends on the value of predicate X k . All the above questions will be
studied in details in this paper.
2. Materials and methods
          </p>
          <p>90 children aged 6 to 18 years were involved into the study of bronchial asthma in children in view
of the phenotype associated with thymic stromal lymphopoietin. The main group amounted to 70
children with diagnosed bronchial asthma and the control group amounted to 20 children. The average
age of children with bronchial asthma was 11 years. For every patient, information on 142 factors that
could be the cause of bronchial asthma was gathered, processed and analyzed. The study was conducted
with respect for human rights and in accordance with international ethical requirements; it doesn't
violate any scientific ethical standards and standards of biomedical research [22]. Parents were
questioned about symptoms characteristic to bronchial asthma in patients as well as the patients’disease
anamnesis. The results of the questioning were added to patient’s materials. Clinical features of disease
and results of laboratory tests were studied. Level of thymic stromal lymphopoietin in patients’ blood
serum was defined with enzyme immunoassay using a commercial test system manufactured by
Biotechne (ELISA, USA) on the immune-enzyme analyzer “Labline-90” (Austria). Enzyme immunoassay
is based on sandwich-type technique which is characterized by cobinding of biotin-labeled antibodies
to the analyzed analyte. Amount of thymic stromal lymphopoietin in a batch is defined on an analytical
curve in accordance with routine practice for such experiments. Amount of thymic stromal
lymphopoietin is measured in picograms per 1 ml of serum (pcg / ml) [23]. Indicators “Severe” and
“TSLP” were taken as a prediction parameter Y that defines severity of bronchial asthma in children.
Negative effect of thymic stromal lymphopoietin cytokine on the course of disease was proved in [24,
25]. All the above questions will be studied in details in this paper.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>3. Basic materials. Prediction model of bronchial asthma severity</title>
    </sec>
    <sec id="sec-3">
      <title>3.1. Assessment of prediction model parameters</title>
      <sec id="sec-3-1">
        <title>For linear model of multiple regression with model parameters of 0 , k :</title>
        <p>
          constant additive Yˆi and random additive i values. As defined for an error i (
          <xref ref-type="bibr" rid="ref2">2</xref>
          ), the observed variable
Yi assessment follows:
        </p>
        <p> K  K K
E(Yi ) = E0 + k X ki + i  = 0 + k X ki + E(i ) = 0 + k X ki,</p>
        <p> k=1  k=1 k=1
Model parameters of 0 , k have an assessment:</p>
        <p>E(b0 ) = 0,</p>
        <p>E(bk ) = k ,</p>
        <p>mxk =</p>
        <p>K
b0 = my − mxk bk ,
k=1
1 n</p>
        <p> X ki,
n i=1
bk =</p>
        <p>Kx y</p>
        <p>k ,
Kxk xk
my =
1 n</p>
        <p>Yi ,
n i=1
Kxk xk =
1 n(X ki − mxk )2,
n i=1</p>
        <p>Kxk y =
1 n
(X ki − m k )(Yi − my ).</p>
        <p>x
n i=1</p>
        <p>
          Since, when predicting the value of the observed Yi , the values of regressors X ki are known,
unknown values of 0 , k with assessment of their mathematical expectation (
          <xref ref-type="bibr" rid="ref6">6</xref>
          ) are defined by a
linear combination of random values of Yi of the observed variable. To estimate the mean square
deviation of (Yi ) let’s use the definition of i error (
          <xref ref-type="bibr" rid="ref2">2</xref>
          )
        </p>
        <p>
           K 
2 0 + k X ki + i  = 2 (i ), 2 (i ,  j ) = 0. (
          <xref ref-type="bibr" rid="ref8">8</xref>
          )
        </p>
        <p> k=1 </p>
        <p>
          In the  = const assumption, the value of a random variable of Y is defined by probability
distribution with mathematical expectation (
          <xref ref-type="bibr" rid="ref5">5</xref>
          ) and the mean square deviation of  = const which does
not depend on predictor X value. Assessment of 2 value can be defined as follows:
n
2 = E(MSE ), MSE = , SSE = (Yi − Yˆi )2,
        </p>
        <p>
          SSE
n − m
i=1
here m = (K + 1) is the number of constraints, it can be defined by a number of equations of (
          <xref ref-type="bibr" rid="ref7">7</xref>
          ) type.
Retaining connections of b0 = B0 (Y1,Y2 ,...,Yn ) , bk = Bk (Y1,Y2 ,...,Yn ) (
          <xref ref-type="bibr" rid="ref7">7</xref>
          ) correspond to the regression
model (
          <xref ref-type="bibr" rid="ref4">4</xref>
          ). Regardless, (n − m) of independent values Yi can be defined. The rest m of Yi values are
defined from the system of equations (
          <xref ref-type="bibr" rid="ref7">7</xref>
          ). Expressions (
          <xref ref-type="bibr" rid="ref6">6</xref>
          ) and (
          <xref ref-type="bibr" rid="ref9">9</xref>
          ) give assessment for coefficients of
the regression model and mean square deviation of  . Defining of b0 , bk coefficients according to the
ordinary least squares technique, regardless the function of error rate distribution i , makes it possible
(
          <xref ref-type="bibr" rid="ref5">5</xref>
          )
(
          <xref ref-type="bibr" rid="ref6">6</xref>
          )
(
          <xref ref-type="bibr" rid="ref7">7</xref>
          )
(
          <xref ref-type="bibr" rid="ref9">9</xref>
          )
to calculate the unbiased point estimations for b0 , bk , that have minimal dispersion. However, to define
the space of b0 , bk parameters, assumption on the error rate i distribution law is required.
3.2.
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Assessment of prediction error distribution law</title>
      <p>
        When studying the factors characterizing bronchial asthma severity, let us introduce an assumption
of a normal error rate distribution i and define when the conditions of the assumption are met. For a
normal law of error rate distribution i with the conditions (
        <xref ref-type="bibr" rid="ref2">2</xref>
        ), designation N (0, 2 ) is used. It is
known that bronchial asthma severity is determined by a large number of factors that are weakly
interconnected. When investigating severity of bronchial asthma in paper [26], more than 100 factors
are being analyzed with correlation index rxk xv :
rxk xv =
      </p>
      <p>Kxk xv
Dxk Dxv
,
xk = Dxk ,</p>
      <p>Dxk =
n1 i=n1 (X k i − mxk )2 ,
k, v = 1...K ,
between factors X k , X v that satisfies inequality rxk xv  0.15 . A weak relation is observed not only
between factors that are used as the linear model regressors, but also between the referable Y value,
characterizing bronchial asthma severity, and the regressor X k . Since value of correlation index ry xk :
ry xk =</p>
      <p>Kxk y
, ,
 y =</p>
      <p>Dy ,</p>
      <p>Dy =</p>
      <p>1 n(Yi − my )2 , k , v = 1...K ,</p>
      <p>
        Dy Dxk n i=1
between the referable value Y and factor X k is small, it is necessary for the model (
        <xref ref-type="bibr" rid="ref1">1</xref>
        ) to have enough
number of regressors to provide required accuracy of referable value prediction. If the error rate  of
the referable value Y is formed under the influence of a large number of X k factors, which are weakly
dependent against each other and have similar rates of referable value Y formation, then the error rate
distribution  can be moved into proximity with normal distribution. If the number of factors which
are weakly dependent on each other is more than ten, we can assume that the error rate distribution i
of the referable value for the linear regression model (
        <xref ref-type="bibr" rid="ref4">4</xref>
        ) is not different from the normal law of errors.
Thus, error rate distribution i follows the normal law of errors for the regression model (
        <xref ref-type="bibr" rid="ref4">4</xref>
        ). Indeed,
if the model (
        <xref ref-type="bibr" rid="ref4">4</xref>
        ) has summands whose effect on the sum (16) dispersion is overbearingly large compared
to other summands, then we can ignore the summands whose effect is vanishingly small. If the number
of factors in resulting model is small, then the assumption of the normal law of errors i should be
discarded. Thus, due to the fact that the result of predicting of bronchial asthma severity depends on
sufficiently large number of weakly dependent on each other X k factors, having approximately similar
rates of the referable variable Y value forming, prediction error i distribution for a number of
regressors K  10 tends to the normal law of errors. In this study, let us use the model of linear
regression (
        <xref ref-type="bibr" rid="ref4">4</xref>
        ) with ten regressors to justify our assumption about the normal law of errors. Additionally,
let us mention that procedures of model parameters estimation based on the t-distribution are as a rule
sensitive only to large deviations of errors from normal distribution. If there are no large deviations
from normal law of errors, validity coefficients for any error are close to an error following the normal
law of errors.
(
        <xref ref-type="bibr" rid="ref10">10</xref>
        )
(
        <xref ref-type="bibr" rid="ref11">11</xref>
        )
3.3.
      </p>
    </sec>
    <sec id="sec-5">
      <title>Assessment of factors using distribution histogram</title>
      <p>Schematic prediction of regression factor of X k is one the main criteria of model regressors
selection. This analysis will allow us to obtain diagnostic information about a predictor variable by
determining outlying values of X k factor that has an effect on selection of regression function factors.
Diagnostic information about a range and concentration of X levels in the study helps to adjust the range
of regression analysis certainty. For schematic prediction, when choosing model parameters, we use a
distribution histogram of model regressors qualitative values. Distribution histograms of model
regressors qualitative values are given in Fig.1.</p>
      <p>This analysis will allow us to correct the experimental data by excluding the data containing
measurement errors. Comparative study of distribution histograms of model regressors values presented
in various research papers is used to estimate the quality of experiment. The value for each regressor is
defined by the value of a qualification factor, which characterizes the degree of regressor’s effect on
bronchial asthma severity (0 – no effect, and 1-,2-,3-,4- degree of regressor’s effect on bronchial asthma
manifestation in a child). The regressors “Atopic dermatitis”, “Allergic rhinitis” and “BA in relatives
of second generation” reproduces the results in papers [27]. The results of the regressors “Sheep wool”,
“Rabbit hair”, “Dog hair”, and “Pillow feather” are correspondent to the paper [28]. The rest of the
factors correspond to statistical average data for the disease.
3.4.</p>
    </sec>
    <sec id="sec-6">
      <title>Model construction</title>
      <p>Р1 – comparison between the clinical sign presence and absence groups’
Р2 – comparison with the control group</p>
      <sec id="sec-6-1">
        <title>Analysis of difference of thymic stromal lymphopoietin levels depending on the presence or absence</title>
        <p>
          of asthma clinical signs sets only one possible difference – the value of negative heredity for allergy
but not separately for asthma (table 2). The findings confirm the study of Gilda Varicchi, which shows
that the gene of thymic stromal lymphopoietin is located on the 5q22.1 chromosome next to the “atopic
cytokine” cluster on 5q31 and is responsible for allergy manifestations [29].
CD25 10*3 cells
100.739
here the calculation of ry xm , rxmxv is done in accordance with the formulas (
          <xref ref-type="bibr" rid="ref10">10</xref>
          ), (
          <xref ref-type="bibr" rid="ref11">11</xref>
          ). Raw experimental
data for the factors selection are given in [30]. Numerical characteristics of the set of factors in
accordance with the criterion (12) are given in Table 2. The selected factors are used to construct a
linear regression model in this study (
          <xref ref-type="bibr" rid="ref4">4</xref>
          ).
        </p>
        <p>The values of correlation coefficients between the factors of the rxm xv model, as well as between the
model factors and the observed value ry xm are given in Table 3.</p>
      </sec>
      <sec id="sec-6-2">
        <title>In Table 3, there are twelve factors for construction of a linear regression model, containing ten</title>
        <p>factors. The number of techniques which are used to select 10 regressors for the model (16) out of
twelve factors is defined by the following expression</p>
        <p>12!
2!(12 − 2)!</p>
      </sec>
      <sec id="sec-6-3">
        <title>Thus, 132 models are studied in this paper: 66 models with the above mentioned regressors for the observed value of Severe and 66 models with the above mentioned regressors for the observed value of TSLP. In Tables 4 and 5 four models with ten regressors for the Severe and TSLP values prediction are (13)</title>
        <p>given. The value of</p>
        <p>
          MSE (
          <xref ref-type="bibr" rid="ref9">9</xref>
          ) was the selection criterion of models for TSLP prediction. The models
are placed in the descending order of the
        </p>
        <p>MSE value. The models to predict the Severe value contain
the same regressors as the model to predict the TSLP. The value of
MSE for the 66 TSLP prediction
models corresponds to the inequality 22,97  MSE  26,44 . It follows the assumtion that the
regressors given in Table 2 have approximately similar rates of the referable value Y formation. If there
is a predominant factor in a set of regressors, the value of
MSE in the TSLP prediction models under
is present.
- 6
the absence of this factor would be significantly larger than in the model where this predominant factor</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>4.1. Diagnostic assessment of the model using Residual Plot</title>
      <p>
        Let us consider the residual
ei = Yi − Yˆi
(14)
as an observed value during diagnostic assessment of linear regression model. The observed residuals
e show the properties of the  error (
        <xref ref-type="bibr" rid="ref4">4</xref>
        ) that is fundamental for linear regression model assessment
i i
7
6
8
2
,
0
8
3
1
3
,
0
7
3
7
3
,
0
8
8
7
3
,
0
9
0
,
6
4
3
,
9
1
2
3
6
0
,
0
5
3
0
,
0
4
1
3
0
,
0
9
6
,
4
8
6
,
3
2
,
1
8
5
,
1
- 9
      </p>
      <p>- 9
me = n1 i=n1 ei = 0, s2 = n −1 m i=n1 (ei − me )2 = n −1 m i=n1 ei2 = MSE. (15)
The number of freedom degrees for the linear regression model with ten regressors is m = 11. The
value of
the dependences of residuals e = f (zvalue ) and e = f (Yˆ ) (Fig. 2) for the model diagnostic assessment.</p>
      <p>MSE acts as an assessment of mean square deviation  . We will use a residual plot showing
The value of zvalue j is defined by equation
if z j = zvalue j . To calculate cumulative probability Pj for the sorted residual of e j , ( j = 1..n p ) let us
use the formula</p>
      <p>Pj =
1
2
z j
−</p>
      <p> x22 dx,
 exp −


Pj = (2 j − 1) / 2n p.
(16)
(17)</p>
      <sec id="sec-7-1">
        <title>The emphasis should be on assessment of experimental data outliers. In Fig. 2, two outliers in</title>
        <p>experimental data used for prediction of TSLP in the linear model No 101 (Table 2) with ten regressors
are circled. This model contains the factors given in Table 2, excluding X 4 , X 6 . Model No 101 with
the specified ten factors has the minimum value of s = 22.9728 among analyzed 66 models of linear
regression for the observed TSLP.</p>
        <p>An outlier for the residual e j can be connected with the measurement error of both the factor being
predicted and one of its regressors. Let us construct M models with (M-1) regressors to define the source
of the outlier in the M-factor model. If the outlier of e j remains in each of M (M-1)-factor models
constructed, hence the Yi value is the source of the outlier. If for one of M models the outlier e j
disappears, then the factor which is absent in the model is the source of the outlier. This technique will
be called M-regressors technique. On each step of the M-regressors technique application we remove
several outliers specifying the source of their appearance (Table 6). The values of data for definite
sources of outliers will be replaced with empty values. The criterion for outlier definition is as follows:
experimental selections with minimum and maximum deviation of the actual value of e j from the
expected value of e j (deviation of the actual value of e j from the straight line the expected values are
placed on) are taken. If absolute value of one of the values is significantly larger than of the other one,
the iteration should be made for the maximum absolute value of e j . Only the experiments with existing
model regressors values were used to define coefficients of the linear regression model. In Table 6
iterative approximants to define the outliers for the selection with number of elements n p = 61 with
the total number of experiments of 90 are given. In 29 cases of experimental data, the model regressors
contain missed values. As the first iteration, residuals e j for patients number 22 and 26 (Fig. 3-6) were
assessed. The sources of outliers ( Yi , X m i ) were defined based on the outliers M in the models (M-1)
study.
are the source for the outlier 81, and the seven factors X 1 , X 2 , X 3 , X 5 , X 7 , X 8 , TSLP are the
source for the outlier 39. If there is no factor X 7 in a model, then there is no outlier as well. Along with
this, factor X 7 is the source for both outlier 81 and 39.</p>
      </sec>
      <sec id="sec-7-2">
        <title>Since several factors are the source for each outlier, then it is supposed that the TSLP factor also</title>
        <p>takes an effect on the e j value of the outlier. The consecutive removal of outliers (Table 6, Fig. 4, Fig.
5) resulted to the change of MSE value for the ten-factor model.</p>
        <p>Table 4 shows the consecutive change of MSE value on each step of the outliers removal. After
the outliers removal, the MSE value decreased by 30%: from MSE =22.97 to MSE =15.89 under
11 restrictions applied by linear regression model's coefficients. For the four values of the error e j in
Fig. 6 zvalue  −1,5 and for the four values of the error e j when zvalue  1,5 . Error e j probability can
be defined from expression (16), where z j = e j /
MSE . Calculations show that z j  1,5 . This
corresponds to the error value probability of P( z  1,5) = Pj  0,1 . In Fig. 7 residual plot of e = f (Yˆ )
depending on the predicted value of Yˆ for the selection values of zvalue  1,5 up to the value of MSE
=10.51 for 45 patients in the selection and under 11 restrictions applied by linear regression model’s
coefficients is shown.
4.2.</p>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>Asessment of regression model coefficients</title>
      <p>Let us consider the test for m coefficients</p>
      <p>H 0 : m = 0, H  : m  0. (23)
The case of m = 0 corresponds to assessment for Y</p>
      <p>E(Y ) = 0 + 1 X 1 + ... + m−1 X m−1 + 0  X m + m+1 X m+1 + ... +  M X M .</p>
      <p>For the error i normal distribution this condition means not only the fact that there is no linear
relationship between Y and X m . In fact, it means that there is no relationship between Y and X m at
all, because for the model under studying only linear relationship between Y and X m is supposed. The
value of the bm coefficient can be represented as</p>
      <p>n
bm = ryxm xym + 0(rxv xm )  KKxxmmxym = i=1 (X m i − mxm )(Yi − m y ), mxm = n1 i=n1 X m i.</p>
      <p>n
 (X mi − mxm
i=1
)2
For regression model with normal error i distribution
n n n
 (X m i − mxm )Yi −  (X m i − mxm )m y  (X m i − mxm )Yi n
bm  i=1 i=n1 (X mi −i=m1xm )2 = i =i=n11 (X mi − mxm )2 = i=1 km iYi .</p>
      <p>Coefficient km i is a determinate function of X m i regressor having the following properties
n
nkm i = in=1 (X mi − mxm ) = 0, km i = n X m i − mxm ,
i=1  (X mi − mxm )2  (X mi − mxm )2
i=1 i=1
(24)
(25)
n n
i=n1 km i X mi = i=i1=n(1X(Xmim−i−mmxmxm)X)2mi = 1, i=n1 km2 i =  i=ni1=1( X(Xmmii−−mmxmxm)2)2 = i=n1 (X mi1− mxm )2 .</p>
      <sec id="sec-8-1">
        <title>Coefficient bm normal distribution law results from the fact that coefficient bm is a linear</title>
        <p>combination of random value of Yi . To assess the regression model coefficient bm , let us consider the
following expressions:</p>
        <p> n  n n  M  n n M
E(bm )  E km iYi  = km i E(Yi ) = km i 0 + v X v i  = 0 km i + km i v X v i =
 i=1  i=1 i=1  v=1  i=1 i=1 v=1
= n km i Mv X v i = Mv n km i X v i = m + Mv n km i X v i = m + 0(rxv xm ), (26)
i=1 v=1 v=1 i=1 vv=1m, i=1
2 (bm )  2  nkm iYi  = nk m2 i2 (Yi ) = 2 nk m2 i + 0(rxv xm ) = n 2
 i=1  i=1 i=1 (X mi − mxm )2
i=1
+ 0(rxv xm ).</p>
        <p>We will use them to calculate the value of Student’s t-test [31]
tm value = s(bbmm ) , s(bm ) = (bm ) 
n (X mi − mxm )2 (27)
i=1
for coefficient bm of the model when the number of items in selection is n=45 (Fig. 7) and the number
of restrictions is M = 11. Critical value of t=0.05,r=34  2.032 corresponds to the statistics (27) when
signification factor is  = 0.05 and the number of freedom degrees is r = (45 − 11) = 34 . Since
distribution of bm / s(bm ) is the t-distribution, then confidence interval for bm coefficients can be
defined by relationship</p>
        <p>P(bm − s(bm ) t=0.05,r=34  m  bm + s(bm ) t=0.05,r=34 ) = 1 − , (28)

,
hence</p>
        <p>m = bm  bm , bm = s(bm ) t=0.05,r=34.</p>
        <sec id="sec-8-1-1">
          <title>Results of the confidence interval calculations for several factors are given in Table 7. (29)</title>
        </sec>
      </sec>
    </sec>
    <sec id="sec-9">
      <title>5. Conclusions</title>
      <p>The technique of multifactor regression model construction is developed to predict bronchial asthma
severity in children, when thymic stromal lymphopoietin level in blood serum is taken as a referable
variable. Whereas disease severity is defined by a large number of factors that are weakly dependent
on each other, assumption on normal error distribution was suggested. In preparation of experimental
data comparative analysis of distribution histograms for model regressors values was used. The
comparative study of experimental data and the data given in research papers on bronchial asthma
severity was done to assess the quality of raw data. The results of histograms analysis were used to
correct initial values of raw experimental data. During the analysis the data containing measurement
error were excluded. The technique of multifactor linear regression model construction was developed
to analyze bronchial asthma severity in children. The model of linear regression with ten regressors was
studied in details. More than 100 factors effecting on bronchial asthma severity in children were
analyzed to form a set of model’s regressors. The criteria which allow forming an optimal set of model’s
regressors were developed. As a result of statistical analysis it was revealed that a confidence interval
for coefficients of linear regression model is limited rather widely. It proves the necessity of the model
further improvement. Further research perspectives involve model construction techniques based on
nonlinear regression equations or neural network application.
6. References
[12] I. Pavord, N. Hanania, Controversies in Allergy: Should Severe Asthma with Eosinophilic</p>
      <sec id="sec-9-1">
        <title>Phenotype Always Be Treated with Anti-IL-5 Therapies, The Journal of Allergy and Clinical</title>
        <p>Immunology: In Practice, 7 5 2019, 1430–1436. doi: 10.1016/j.jaip.2019.03.010
[13] T. Bittar, S. Yousem, S. Wenzel, Pathobiology of severe asthma, Annual Review of</p>
      </sec>
      <sec id="sec-9-2">
        <title>Pathology: Mechanisms of Disease, 10 (2015) 511–545.</title>
        <p>[14] S. Ramratnam, L. Bacharier, T. Guilbert, Severe asthma in children, J Allergy Clin Immunol</p>
        <p>Pract, 5 (2017) 889-898. doi: 10.1016/j.jaip.2017.04.031.
[15] A. Fitzpatrick, W. Moore, Severe Asthma Phenotypes – how should they guide evaluation
and treatment? J Allergy Clin Immunol Pract, 5 4 (2017) 901-908. doi:
10.1016/j.jaip.2017.05.015.
[16] G. Collins, K. Moons, Reporting of artificial intelligence prediction models, Lancet, 393
(2019)1577-1579. doi:10.1016/S0140-6736(19)30037-6.
[17] A. Ray, J. Das, SE. Wenzel, Determining asthma endotypes and outcomes: Complementing
existing clinical practice with modern machine learning, Cell reports. Medicine, 3 12 (2022)
10085. doi:10.1016/j.xcrm.2022.100857.
[18] L. Guilleminault, H. Ouksel, C.Belleguic, Y.Le Guen, P. Germaud, E. Desfleurs, Personalised
medicine in asthma: from curative to preventive medicine, European Respiratory Review, 26
143 (2017) 160010. doi:10.1183/16000617.0010-2016.
[19] D. M Kothalawala, L. Kadalayil, V. Weiss, M. Aref Kyyaly, S. Hasan Arshad, John W</p>
      </sec>
      <sec id="sec-9-3">
        <title>Holloway et al, Prediction models for childhood asthma: A systematic review, Pediatr Allergy</title>
        <p>Immunol, 31 6 (2020) 616-627. doi: 10.1111/pai.13247.
[20] Y. Zhang, C. Zhou, J. Liu, H. Yang, S. Zhao, A new index to identify risk of multi-trigger
wheezing in infants with first episode of wheezing, J Asthma, 51 10 (2014) 1043–1048.
doi:10.3109/02770903.2014.936449.
[21] O. Pihnastyi, O. Kozhyna, Methods for constructing estimated two-factor linear regression
models for diagnosing the severity of bronchial asthma in children, Innovare Journal of
Medical Sciences, 9 1 (2021) 23–30. doi:10.22159/ijms.2021.v9i1.40408.
[22] UNESCO.ORG, Universal Declaration on Bioethics and Human Rights, 2005. URL:
http://portal.unesco.org/en/ev.php. 2005.
[23] ELISA-Kit. URL: https://www.cusabio.com/ELISA-Kit/Human-thymic-stromal
lymphopoietinTSLP-ELISA-Kit-110403.html
[24] B. Watson, G. Gauvreau, Thymic stromal lymphopoietin: a central regulator of allergic
asthma, Journal Expert Opinion on Therapeutic Targets, 18 7 (2014) 771-785. doi:
10.1517/14728222.2014.915314.
[25] S. Colicino, D. Munblit, C. Minelli, A. Custovic, P. Cullinan, Validation of childhood asthma
predictive tools: a systematic review, Clin Exp Allergy, 49 4 (2019) 410- 418.
[26] O. Kozhyna, O. Pihnastyi, Covariance coefficients factors from a clinical study of the severity
of bronchial asthma in children of the Kharkov region, 2017, Mendeley Data, 1 (2019).
[27] G. Luo, FL Nkoy, BL Stone, D. Schmick, MD. Johnson, A systematic review of predictive
models for asthma development in children, BMC Med Inform Decis Mak, 15 99 (2015). doi:
10.1186/s12911-015-0224-9.
[28] H. Mohammad, D. Belgrave, K. Kopec Harding, C. Murray, A. Simpson, A. Custovic, Age,
sex, and the association between skin test responses and IgE titres with asthma, Pediatr
Allergy Immunol, 27 (2016) 313–319. doi: 10.1111/pai.12534.
[29] G. Varricchi, A. Pecoraro , G. Marone, G. Criscuolo, G. Spadaro, A. Genovese et al, Thymic</p>
      </sec>
      <sec id="sec-9-4">
        <title>Stromal Lymphopoietin Isoforms, Inflammatory Disorders, and Cancer, Frontiers in</title>
        <p>Immunology, 9 (2018). doi:10.3389/fimmu.2018.01595.
[30] O. Kozhyna, O. Pihnastyi, Data Structure of Clinical Research, Human Health &amp; Disease, 3
9 (2019) 71-79.
[31] Student, The probable error of a mean, Biometrika, 6 1 (1908)1-25.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1] WHO. Asthma/ WHO,
          <year>2020</year>
          ). URL: https://www.who.int/news-room/factsheets/detail/asthma
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>S.</given-names>
            <surname>Sánchez-García</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. Habernau</given-names>
            <surname>Mena</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Quirce</surname>
          </string-name>
          ,
          <article-title>Biomarkers in inflammometry pediatric asthma: utility in daily clinical practice</article-title>
          ,
          <source>Eur Clin Respir, 4</source>
          <volume>1</volume>
          (
          <year>2017</year>
          ). doi:
          <volume>10</volume>
          .1080/20018525.
          <year>2017</year>
          .
          <volume>1356160</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A.</given-names>
            <surname>Licari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Castagnoli</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Brambilla</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Marseglia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Tosca</surname>
          </string-name>
          , G. Marseglia et al,
          <article-title>Asthma endotyping and bomarkers in childhood asthma</article-title>
          ,
          <source>Pediatr Allergy Immunol Pulmonol</source>
          ,
          <volume>31 2</volume>
          (
          <year>2018</year>
          )
          <fpage>44</fpage>
          -
          <lpage>55</lpage>
          . doi:
          <volume>10</volume>
          .1089/ped.
          <year>2018</year>
          .
          <volume>0886</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>L.</given-names>
            <surname>Bjerkan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sonesson</surname>
          </string-name>
          ,
          <string-name>
            <surname>K Schenck</surname>
          </string-name>
          ,
          <article-title>Multiple functions of the new cytokine-based antimicrobial peptide Thymic stromal lymphopoietin (TSLP)</article-title>
          ,
          <source>Pharmaceuticals 31 2</source>
          (
          <year>2016</year>
          ). doi:
          <volume>10</volume>
          .3390/ph9030041.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>A.</given-names>
            <surname>Chauchan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Agarwal</surname>
          </string-name>
          et al,
          <source>Correlation of TLSP</source>
          , IL-
          <volume>33</volume>
          , and CD4+, CD5+, Fox P3+, T regulatory (
          <article-title>Treg) in pediatric asthma</article-title>
          ,
          <source>J Asthma</source>
          ,
          <volume>52</volume>
          (
          <year>2015</year>
          )
          <fpage>868</fpage>
          -
          <lpage>872</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>C.</given-names>
            <surname>Kuo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Pavlidis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Loza</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Baribaud</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Rowe</surname>
          </string-name>
          ,
          <string-name>
            <surname>I.</surname>
          </string-name>
          <article-title>Pandis et al, A Transcriptome-driven Analysis of Epithelial Brushings and Bronchial Biopsies to Define Asthma Phenotypes in UBIOPRED</article-title>
          ,
          <source>American Journal of Respiratory and Critical Care Medicine</source>
          ,
          <volume>195 4</volume>
          (
          <year>2017</year>
          )
          <fpage>443</fpage>
          -
          <lpage>455</lpage>
          . doi:
          <volume>10</volume>
          .1164/rccm.201512-
          <fpage>2452OC</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>M.</given-names>
            <surname>Kuruvilla</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Lee</surname>
          </string-name>
          , Understanding Asthma Phenotypes, Endotypes, and Mechanisms of Disease,
          <source>Clinical Reviews in Allergy &amp; Immunology, 56</source>
          <volume>2</volume>
          (
          <year>2018</year>
          )
          <fpage>219</fpage>
          -
          <lpage>233</lpage>
          . doi:
          <volume>10</volume>
          .1007/s12016-018-8712-1.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M.</given-names>
            <surname>Elmaraghy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hodie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Khattab</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Abdelgalel</surname>
          </string-name>
          ,
          <article-title>Association between TSLP gene polymorphism and bronchial asthma in children in Beni Suef Governorate in Egypt</article-title>
          ,
          <source>Comparative Clinical Pathology</source>
          ,
          <volume>27</volume>
          (
          <year>2018</year>
          )
          <fpage>565</fpage>
          -
          <lpage>570</lpage>
          . doi:
          <volume>10</volume>
          .1007/s00580-017-2626-9.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>S.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Verma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Michalec</surname>
          </string-name>
          , W. Liu,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sripada</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Rollins</surname>
          </string-name>
          et al,
          <article-title>Steroid resistance of airway type 2 innate lymphoid cells from patients with severe asthma: the role of thymic stromal lymphopoietin</article-title>
          ,
          <source>J Allergy Clin Immunol</source>
          ,
          <volume>141 1</volume>
          (
          <year>2018</year>
          )
          <fpage>257</fpage>
          -
          <lpage>268</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.jaci.
          <year>2017</year>
          .
          <volume>03</volume>
          .032.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>A.</given-names>
            <surname>Koczulla</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Vogelmeier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Garn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Renz</surname>
          </string-name>
          ,
          <article-title>New concepts in asthma: clinical phenotypes and pathophysiological mechanisms</article-title>
          ,
          <source>Drug Discov Today</source>
          ,
          <volume>22 2</volume>
          (
          <year>2017</year>
          )
          <fpage>388</fpage>
          -
          <lpage>396</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.drudis.
          <year>2016</year>
          .
          <volume>11</volume>
          .008.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>M.</given-names>
            <surname>Loza</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Djukanovic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Chung</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Horowitz</surname>
          </string-name>
          , K. Ma, P. Branigan et al,
          <article-title>Validated and longitudinally stable asthma phenotypes based on cluster analysis of the ADEPT study</article-title>
          ,
          <source>Respiratory Research</source>
          ,
          <volume>17 1</volume>
          (
          <year>2016</year>
          ).
          <source>doi: 10.1186/s12931-016-0482-9.</source>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>