On linear regression for fuzzy data of different quality

Serhii Mashchenko, Oleksandr Marchenko
Taras Shevchenko National University of Kyiv, 64/13, Volodymyrska Street, City of Kyiv, 01601, Ukraine

The Sixth International Workshop on Computer Modeling and Intelligent Systems (CMIS-2023), May 3, 2023, Zaporizhzhia, Ukraine
EMAIL: s.o.mashchenko@gmail.com (S. Mashchenko); rozenkrans17@gmail.com (O. Marchenko)
ORCID: 0000-0003-4863-2763 (S. Mashchenko); 0000-0002-5408-5279 (O. Marchenko)

Abstract
The present paper is devoted to linear regression for a fuzzy set of fuzzy data samples. This model makes it possible to take into account data of different quality. It is shown that the regression parameters are type-2 fuzzy sets, and the corresponding type-2 membership functions are given. A decomposition approach is used to investigate the T2FSs of the linear regression parameters. It is shown that each T2FS of a regression parameter can be decomposed, according to its secondary membership grades, into a finite collection of fuzzy numbers. Each of these fuzzy numbers is the fuzzy regression parameter for a crisp set of data samples, namely the corresponding α-cut of the original fuzzy set of fuzzy data samples. An illustrative example is given.

Keywords
Linear regression, fuzzy least squares estimator, fuzzy number, type-2 fuzzy set.

1. Introduction

Classical regression analysis is based on crisp data and a crisp relationship between the dependent variable and the independent variables. In practice, there are many situations in which observations cannot be measured as crisp quantities, because the information is often fuzzy, incomplete, linguistic or noisy. Fuzzy regression analysis is a non-statistical method based on fuzzy set theory rather than probability theory (see [1]). In a general fuzzy regression model both input and output are fuzzy; accordingly, the model contains fuzzy parameters instead of error terms. Three main fields can be distinguished in fuzzy regression analysis: possibilistic regression analysis, fuzzy least squares methods, and machine learning techniques.

The possibilistic approach in fuzzy regression analysis was first proposed by Tanaka et al. [2]. Unlike conventional regression analysis, where deviations between observed and predicted values reflect a measurement error, deviations in fuzzy regression reflect the uncertainty of the system structure expressed by the fuzzy parameters of the regression model. The fuzzy parameters of the model are considered to be possibility distributions and are determined by solving a linear programming problem that minimizes the fuzzy deviations subject to constraints on membership degrees. Since the membership functions (MFs) of fuzzy sets (FSs) can be viewed as possibility distributions, this approach was called 'possibilistic regression analysis.' The possibilistic approach was explored and improved by many authors. Reviews of possibilistic regression analysis can be found in D'Urso [3].

Fuzzy regression analysis was also considered from the viewpoint of generalizing the classical least squares method to the case of fuzzy data. The idea of this approach is to minimize, in some sense, a distance measure between the predicted fuzzy values and the given fuzzy data. Fuzzy least squares methods were first proposed by Celmins [4]. Later, this approach was significantly developed by many researchers. In [1] one can find qualitative reviews of the fuzzy least squares and fuzzy least absolute methods.
Machine learning techniques made it possible to generalize fuzzy regression analysis through the use of genetic algorithms, neural networks, and support vector machines. Relevant references can be found in Chukhrova and Johannssen [1] and Hastie et al. [5].

Often, when solving applied problems, data of different quality can be used [6]. For example, according to [7], wind tunnel experiments provide high simulation accuracy (a source of high-fidelity data), while experiments based on computational physical models have a higher error (a source of low-fidelity data). In some applications, a significantly more accurate regression model can be built if low-precision data are also used. In this case, the problem arises of constructing a regression based on data of different quality.

Using data of different quality with the aim of improving model accuracy is not a new concept. For instance, Hevesi et al. [8] predict average annual precipitation values near a potential nuclear waste disposal site using a set of precipitation measurements from the region along with a more easily obtainable elevation map of the area. Kennedy and O'Hagan [9] approach the subject from the perspective of model construction using data resulting from computational simulations of varying fidelities and costs. In [7], to construct a Gaussian regression model, the problem of planning an experiment is solved by choosing the ratio between the sizes of samples of low-precision and high-precision data. Also, to process data with a variable degree of certainty, the methods of transfer learning [10], space mapping [11], and others [12] are used.

Data quality (the degree of usability of data) is a complex concept. It is characterized by objectivity, integrity, relevance, measurability, controllability, etc. In some cases, the quality of data may not be crisply defined [13]. According to [14], when data are used without expert knowledge, the choice of a representative sample becomes an NP-hard problem. Therefore, samples have to be found within a reasonable time, and this justifies the use of fuzzy methods that formalize expert knowledge expressed in natural language. For instance, in the framework of quality control, fuzzy expert assessments are used in [15] to construct acceptance sampling plans (how many units can be selected from a consignment and how many defective units are allowed in this sample).

In this article, we investigate a method of constructing a linear regression in the case when fuzzy data samples are of different quality and the degrees of membership of these samples to a FS are known. This makes it possible for the regression to take into account the fuzziness of the quality assessments of different samples, rather than just the uncertainty of the data. Examples of such FSs of data samples could be: 'High quality data samples', 'Questionable data samples', 'Actual data samples', etc. The main result of the article is that a FS of type-1 fuzzy data samples of different quality generates a type-2 fuzzy regression model. In this model, the regression parameters are T2FSs on the real line with constant secondary grades.
Although, in general, a T2FS is a rather complicated mathematical object, T2FSs with constant secondary grades are simple enough for practical use. This feature allows us to decompose such a set, by secondary grades, into a collection of corresponding fuzzy numbers. Each of them represents the fuzzy regression parameter for a crisp set of data samples, namely the corresponding α-cut of the original fuzzy set of data samples. We note that the well-known type-2 fuzzy regression models use crisp collections of type-2 data, while the model proposed in this article uses a fuzzy collection of type-1 fuzzy data samples. This is the principal difference between them. It should also be added that this article continues the line of research in the field of mathematical operations with a fuzzy set of operands, first introduced in the context of intersections and unions of fuzzy sets [16, 17].

2. Materials and Methods

In this section, we briefly review some existing theories and definitions.

2.1. Linear regression analysis for fuzzy input and output data using the extension principle

The article focuses on a linear regression in the case when the data samples form a FS. We stress that one could have exploited different known methods of constructing a linear regression for fuzzy data. Here, we modify the method of [18], which is based on the extension principle.

Let $\mathcal{K} = \{1,\dots,K\}$ be the set of indices of data samples $\{y_i, x_{i1},\dots,x_{ip}\}$, $i \in \mathcal{K}$, where $K$ is the cardinality of $\mathcal{K}$. A crisp statistical linear regression has the form

$y_i = x_i (\beta(\mathcal{K}))^T + \varepsilon_i$, $i \in \mathcal{K}$,   (1)

where, for each $i \in \mathcal{K}$, $y_i$ is the dependent variable; $x_i = (1, x_{i1},\dots,x_{ip})$ is the vector of independent variables (factors, regressors) $x_{il}$, $l = 1,\dots,p$; and $\varepsilon_i$ is an independent normal random variable. The symbol $T$ denotes the transpose, $p$ is the number of independent variables, and $\beta(\mathcal{K}) = (\beta_0(\mathcal{K}),\dots,\beta_p(\mathcal{K})) = (\beta_l(\mathcal{K}))_{l=0,\dots,p}$ is the vector of regression parameters. Let $y(\mathcal{K}) = X(\mathcal{K})(\beta(\mathcal{K}))^T + \varepsilon(\mathcal{K})$ be the matrix notation of equations (1) with $X(\mathcal{K}) = \{x_{is}\}_{i \in \mathcal{K},\, s=0,\dots,p}$ and $x_{i0} = 1$ for all $i \in \mathcal{K}$, $y(\mathcal{K}) = (y_i)^T_{i \in \mathcal{K}}$ and $\varepsilon(\mathcal{K}) = (\varepsilon_i)^T_{i \in \mathcal{K}}$. For the convenience of presentation, the set $\mathcal{K}$ of sample indices is indicated hereinafter as a parameter in these formulae. According to the least squares method, the estimate $\hat{\beta}(\mathcal{K}) = (\hat{\beta}_l(\mathcal{K}))_{l=0,\dots,p}$ of the parameter vector $\beta(\mathcal{K}) = (\beta_l(\mathcal{K}))_{l=0,\dots,p}$ has the form

$\hat{\beta}(\mathcal{K}) = [X^T(\mathcal{K}) X(\mathcal{K})]^{-1} X^T(\mathcal{K}) y(\mathcal{K})$.   (2)

Assume now that the data are fuzzy. To generalize formula (2), we denote by

$f_l(X(\mathcal{K}), y(\mathcal{K})) = \hat{\beta}_l(\mathcal{K})$   (3)

the $l$-th element, $l = 0,\dots,p$, of the estimate $\hat{\beta}(\mathcal{K})$. We also denote by

$\tilde{X}(\mathcal{K}) = \{\tilde{x}_{is}\}_{i \in \mathcal{K},\, s=0,\dots,p}$,  $\tilde{y}(\mathcal{K}) = (\tilde{y}_i)^T_{i \in \mathcal{K}}$   (4)

the matrix of independent variables and the vector of dependent variables, respectively, where, for each $i \in \mathcal{K}$, $\tilde{x}_{i0} = 1$ is the crisp number equal to 1, and $\tilde{x}_{is}$, $s = 1,\dots,p$, and $\tilde{y}_i$ are fuzzy numbers (FNs) with the MFs $\mu_{\tilde{x}_{is}}(x_{is})$, $x_{is} \in \mathbb{R}$, $s = 1,\dots,p$, and $\mu_{\tilde{y}_i}(y_i)$, $y_i \in \mathbb{R}$, $i \in \mathcal{K}$, respectively. Here, $\mathbb{R}$ is the real line.

Remark 1. Recall that a FN is a normal FS on $\mathbb{R}$ with an upper semicontinuous and quasi-concave MF (for example, see [19]).

The vector $\tilde{\beta}(\mathcal{K}) = (\tilde{\beta}_l(\mathcal{K}))_{l=0,\dots,p}$ of fuzzy parameters of the regression has the form $\tilde{\beta}(\mathcal{K}) = [\tilde{X}^T(\mathcal{K}) \tilde{X}(\mathcal{K})]^{-1} \tilde{X}^T(\mathcal{K}) \tilde{y}(\mathcal{K})$ according to the least squares method [18].
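For reference, the crisp estimate (2) and its $l$-th component (3) can be computed directly. The following is a minimal sketch in Python/NumPy; the function names and the toy data are ours and are not taken from [18].

```python
import numpy as np

def ls_estimate(X, y):
    """Crisp least-squares estimate (2): beta_hat = [X^T X]^{-1} X^T y.
    X is a K x (p+1) design matrix whose first column consists of ones."""
    return np.linalg.solve(X.T @ X, X.T @ y)

def f_l(X, y, l):
    """The l-th component f_l(X, y) = beta_hat_l of the estimate, cf. (3)."""
    return ls_estimate(X, y)[l]

# toy usage with one regressor plus an intercept column
X = np.array([[1.0, 1.0], [1.0, 2.0], [1.0, 3.0]])
y = np.array([2.0, 4.0, 6.0])
print(ls_estimate(X, y))  # [intercept, slope]
```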
For each $l = 0,\dots,p$, $\tilde{\beta}_l(\mathcal{K}) = f_l(\tilde{X}(\mathcal{K}), \tilde{y}(\mathcal{K})) = \{(r, \mu_{\tilde{\beta}_l(\mathcal{K})}(r)) : r \in \mathbb{R}\}$ is the FN with the MF

$\mu_{\tilde{\beta}_l(\mathcal{K})}(r) = \max_{X(\mathcal{K}),\, y(\mathcal{K})} \{ \min_{i \in \mathcal{K},\, s=0,\dots,p} \{ \mu_{\tilde{x}_{is}}(x_{is}), \mu_{\tilde{y}_i}(y_i) \} : r = f_l(X(\mathcal{K}), y(\mathcal{K})),\ X(\mathcal{K}) = \{x_{is}\}_{i \in \mathcal{K};\, s=0,\dots,p} \in \mathbb{R}^{pK},\ y(\mathcal{K}) = \{y_i\}_{i \in \mathcal{K}} \in \mathbb{R}^{K} \}$   (5)

by Zadeh's extension principle [20]. Here, $f_l(X(\mathcal{K}), y(\mathcal{K}))$ is the $l$-th element of the vector $\hat{\beta}(\mathcal{K}) = [X^T(\mathcal{K}) X(\mathcal{K})]^{-1} X^T(\mathcal{K}) y(\mathcal{K})$ by (2) and (3). According to Remark 1, the maximum in (5) exists.

As shown in [21], the representation of FNs by $u$-cuts is simpler for calculations than the functional approach. Therefore, for each $l = 0,\dots,p$, we represent the MF of the FN $\tilde{\beta}_l(\mathcal{K})$ in the form

$\mu_{\tilde{\beta}_l(\mathcal{K})}(r) = \max_{u \in [0,1]} u \cdot 1_{[\tilde{\beta}_l(\mathcal{K})]_u}(r)$,   (6)

where $[\tilde{\beta}_l(\mathcal{K})]_u$ is the $u$-cut of the FN $\tilde{\beta}_l(\mathcal{K})$. This $u$-cut is the set $[\tilde{\beta}_l(\mathcal{K})]_u = \{ r \in \mathbb{R} : \mu_{\tilde{\beta}_l(\mathcal{K})}(r) \ge u \}$ with the indicator function

$1_{[\tilde{\beta}_l(\mathcal{K})]_u}(r) = 1$ if $r \in [\tilde{\beta}_l(\mathcal{K})]_u$, and $1_{[\tilde{\beta}_l(\mathcal{K})]_u}(r) = 0$ if $r \notin [\tilde{\beta}_l(\mathcal{K})]_u$, $r \in \mathbb{R}$, $u \in [0,1]$.   (7)

According to [18] and Remark 1, formula (5) implies that the $u$-cut $[\tilde{\beta}_l(\mathcal{K})]_u$ of the FN $\tilde{\beta}_l(\mathcal{K})$ has the form

$[\tilde{\beta}_l(\mathcal{K})]_u = \{ f_l(X(\mathcal{K}), y(\mathcal{K})) : x_{is} \in [\tilde{x}_{is}]_u,\ s = 0,\dots,p;\ y_i \in [\tilde{y}_i]_u,\ i \in \mathcal{K} \}$,   (8)

where, for each $i \in \mathcal{K}$, the $u$-cuts of the FNs $\tilde{x}_{is}$, $s = 0,\dots,p$, and $\tilde{y}_i$ are the closed intervals

$[\tilde{x}_{is}]_u = [[\tilde{x}_{is}]_u^L, [\tilde{x}_{is}]_u^U]$, $s = 0,\dots,p$, and $[\tilde{y}_i]_u = [[\tilde{y}_i]_u^L, [\tilde{y}_i]_u^U]$,   (9)

respectively, of the real line $\mathbb{R}$. Since $\tilde{\beta}_l(\mathcal{K})$ is a FN, its $u$-cut is also an interval $[\tilde{\beta}_l(\mathcal{K})]_u = [[\tilde{\beta}_l(\mathcal{K})]_u^L, [\tilde{\beta}_l(\mathcal{K})]_u^U]$. Equality (8) entails

$[\tilde{\beta}_l(\mathcal{K})]_u^L = \min\{ f_l(X(\mathcal{K}), y(\mathcal{K})) : x_{is} \in [\tilde{x}_{is}]_u,\ s = 0,\dots,p;\ y_i \in [\tilde{y}_i]_u,\ i \in \mathcal{K} \}$,   (10)

$[\tilde{\beta}_l(\mathcal{K})]_u^U = \max\{ f_l(X(\mathcal{K}), y(\mathcal{K})) : x_{is} \in [\tilde{x}_{is}]_u,\ s = 0,\dots,p;\ y_i \in [\tilde{y}_i]_u,\ i \in \mathcal{K} \}$.   (11)

Thus, (6) ensures that formula (5) and the FN $\tilde{\beta}_l(\mathcal{K})$ take the forms

$\mu_{\tilde{\beta}_l(\mathcal{K})}(r) = \max\{ u \in [0,1] : [\tilde{\beta}_l(\mathcal{K})]_u^L \le r \le [\tilde{\beta}_l(\mathcal{K})]_u^U \}$, $r \in \mathbb{R}$,   (12)

$\tilde{\beta}_l(\mathcal{K}) = \{ (r,u) : r \in [[\tilde{\beta}_l(\mathcal{K})]_u^L, [\tilde{\beta}_l(\mathcal{K})]_u^U],\ u \in [0,1] \} = \{ ([[\tilde{\beta}_l(\mathcal{K})]_u^L, [\tilde{\beta}_l(\mathcal{K})]_u^U], u) : u \in [0,1] \}$,

respectively. Problems (10) and (11) are rather complicated. In view of this, it is suggested in [18] to use, for $l = 0,\dots,p$, the approximate value of the $l$-th fuzzy parameter

$\hat{\tilde{\beta}}_l(\mathcal{K}) = \{ ([[\hat{\tilde{\beta}}_l(\mathcal{K})]_u^L, [\hat{\tilde{\beta}}_l(\mathcal{K})]_u^U], u) : u \in [0,1] \}$,   (13)

with the MF

$\mu_{\hat{\tilde{\beta}}_l(\mathcal{K})}(r) = \max\{ u \in [0,1] : [\hat{\tilde{\beta}}_l(\mathcal{K})]_u^L \le r \le [\hat{\tilde{\beta}}_l(\mathcal{K})]_u^U \}$, $r \in \mathbb{R}$,   (14)

where

$[\hat{\tilde{\beta}}_l(\mathcal{K})]_u^L = \min\{ f_l([\tilde{X}(\mathcal{K})]_u^L, [\tilde{y}(\mathcal{K})]_u^L),\ f_l([\tilde{X}(\mathcal{K})]_u^L, [\tilde{y}(\mathcal{K})]_u^U),\ f_l([\tilde{X}(\mathcal{K})]_u^U, [\tilde{y}(\mathcal{K})]_u^L),\ f_l([\tilde{X}(\mathcal{K})]_u^U, [\tilde{y}(\mathcal{K})]_u^U) \}$,   (15)

$[\hat{\tilde{\beta}}_l(\mathcal{K})]_u^U = \max\{ f_l([\tilde{X}(\mathcal{K})]_u^L, [\tilde{y}(\mathcal{K})]_u^L),\ f_l([\tilde{X}(\mathcal{K})]_u^L, [\tilde{y}(\mathcal{K})]_u^U),\ f_l([\tilde{X}(\mathcal{K})]_u^U, [\tilde{y}(\mathcal{K})]_u^L),\ f_l([\tilde{X}(\mathcal{K})]_u^U, [\tilde{y}(\mathcal{K})]_u^U) \}$.   (16)

Here, the matrices $[\tilde{X}(\mathcal{K})]_u^L$, $[\tilde{X}(\mathcal{K})]_u^U$ and the vectors $[\tilde{y}(\mathcal{K})]_u^L$, $[\tilde{y}(\mathcal{K})]_u^U$ are comprised of the elements $[\tilde{x}_{is}]_u^L$, $[\tilde{x}_{is}]_u^U$, $s = 0,\dots,p$, and $[\tilde{y}_i]_u^L$, $[\tilde{y}_i]_u^U$, $i \in \mathcal{K}$, respectively (see (9)). It is clear that $[\tilde{\beta}_l(\mathcal{K})]_u^L \le [\hat{\tilde{\beta}}_l(\mathcal{K})]_u^L$ and $[\hat{\tilde{\beta}}_l(\mathcal{K})]_u^U \le [\tilde{\beta}_l(\mathcal{K})]_u^U$. Therefore, the inclusion $[\hat{\tilde{\beta}}_l(\mathcal{K})]_u \subseteq [\tilde{\beta}_l(\mathcal{K})]_u$ holds for the closed interval $[\hat{\tilde{\beta}}_l(\mathcal{K})]_u = [[\hat{\tilde{\beta}}_l(\mathcal{K})]_u^L, [\hat{\tilde{\beta}}_l(\mathcal{K})]_u^U]$. This entails $\mu_{\hat{\tilde{\beta}}_l(\mathcal{K})}(r) \le \mu_{\tilde{\beta}_l(\mathcal{K})}(r)$, and thereupon the value $\mu_{\hat{\tilde{\beta}}_l(\mathcal{K})}(r)$ is a lower bound of $\mu_{\tilde{\beta}_l(\mathcal{K})}(r)$ for $r \in \mathbb{R}$.
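A minimal sketch of the interval estimates (15)-(16): for a fixed $u$, the matrices and vectors of lower and upper $u$-cut endpoints are combined in the four 'corner' configurations, and $f_l$ is evaluated at each of them. The code below is illustrative only; the helper names and the toy data are ours, and `f_l` is the crisp estimator of (2)-(3).

```python
import numpy as np
from itertools import product

def f_l(X, y, l):
    """l-th component of the crisp least-squares estimate (2)-(3)."""
    return np.linalg.solve(X.T @ X, X.T @ y)[l]

def approx_u_cut_bounds(X_low, X_up, y_low, y_up, l):
    """Approximate u-cut bounds (15)-(16) of the l-th fuzzy regression parameter:
    evaluate f_l at the four corner combinations of the endpoint matrices/vectors
    and take the minimum and maximum of the obtained values."""
    corners = [f_l(X, y, l) for X, y in product((X_low, X_up), (y_low, y_up))]
    return min(corners), max(corners)

# toy usage: u-cut endpoint data for two samples, one regressor plus an intercept
X_low = np.array([[1.0, 1.9], [1.0, 2.9]]); X_up = np.array([[1.0, 2.1], [1.0, 3.1]])
y_low = np.array([3.8, 5.7]);               y_up = np.array([4.2, 6.3])
print(approx_u_cut_bounds(X_low, X_up, y_low, y_up, l=1))
```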
2.2. Type-2 fuzzy sets

Zadeh [22] introduced the notion of a type-2 fuzzy set (T2FS) as a generalization of a type-1 fuzzy set (T1FS), that is, of an ordinary FS. Unlike a T1FS, the membership degree of an element of a T2FS is a FS on the closed interval [0, 1]. Based on the ideas of Karnik and Mendel [23], Mendel and John [24] gave a different definition. The T2FS $C$ on a set $X$ is the collection

$C = \{ ((x,u), \mu_C(x,u)) : x \in X,\ u \in J_x \subseteq [0,1] \}$,   (17)

where $\mu_C(x,u)$ is a type-2 membership function (T2MF) and $J_x$ is the set of primary membership degrees $u$ of $x \in X$ to $C$. The value $\mu_C(x,u)$ is a crisp number from the closed interval [0, 1] which is called the secondary grade of the pair $(x,u)$.

Remark 2. According to the comments of Harding et al. [25] and Aisbett et al. [26], one has to define the T2MF $\mu_C(x,u)$ for all $x \in X$ and $u \in [0,1]$. To this end, one should put $\mu_C(x,u) = 0$ for all $u \notin J_x$, $x \in X$.

Remark 3. The primary membership degree $u$ is deemed to be the degree of manifestation of some property (the one which determines the given FS) of $x \in X$. According to [27], we interpret the secondary grade $\mu_C(x,u)$ as the degree of truth of the corresponding primary degree $u$ of this property for $x$.

Following [24], we define embedded T2FSs and T1FSs for a T2FS $C = \{ ((x,u), \mu_C(x,u)) : x \in X,\ u \in [0,1] \}$. Assume that, for each $x \in X$, $u_x = \mu_{C_{e1}}(x) \in [0,1]$ is a unique primary degree of membership, where $\mu_{C_{e1}}(x)$, $x \in X$, is the MF of the T1FS $C_{e1} = \{ (x, \mu_{C_{e1}}(x)) : x \in X \}$. This T1FS is called embedded in the T2FS $C$. We define the embedded T2FS $C_{e2}$ in $C$ in the form $C_{e2} = \{ ((x,u_x), \mu_{C_{e2}}(x,u_x)) : x \in X \}$ with $\mu_{C_{e2}}(x,u_x) = \mu_C(x, \mu_{C_{e1}}(x))$ for all $x \in X$.

Remark 4. According to [24], each element of the type-2 fuzzy collection $C = \{ ((x,u), \mu_C(x,u)) : x \in X,\ u \in [0,1] \}$ is interpreted as a subset. Thus, the collection is represented as the classical union of its elements in the sense of T1FSs. In [24], Mendel and John stated that each T2FS can be represented as a collection of all possible embedded T2FSs.

2.3. Collections of T2FSs with constant secondary grades

We shall need one special case of a T2FS, defined according to [28, 29, 30]. Let $A = \{ \mu_C(x,u) : \mu_C(x,u) > 0,\ x \in X,\ u \in [0,1] \}$ be the set of all possible positive values of secondary grades of the T2FS $C = \{ ((x,u), \mu_C(x,u)) : x \in X,\ u \in [0,1] \}$. Assume that the set $A$ is finite.

Definition 1. We say that an embedded T2FS $C_{e2}(\alpha) = \{ ((x,u_x), \mu_{C_{e2}(\alpha)}(x,u_x)) : x \in X \}$ in the T2FS $C$ has a constant secondary grade $\alpha \in A$ if, for each $x \in X$, there exists a unique primary degree $u_x = \mu_{C_{e1}(\alpha)}(x) \in [0,1]$ for which $\mu_{C_{e2}(\alpha)}(x,u_x) = \alpha$, i.e. $C_{e2}(\alpha) = \{ ((x, \mu_{C_{e1}(\alpha)}(x)), \alpha) : x \in X \}$. Here $\mu_{C_{e1}(\alpha)}(x)$, $x \in X$, is the MF of the embedded T1FS $C_{e1}(\alpha) = \{ (x, \mu_{C_{e1}(\alpha)}(x)) : x \in X \}$ in the T2FS $C$.

Remark 5. Obviously, for the T2FS $C$ and each $\alpha \in A$, there is a unique embedded T1FS $C_{e1}(\alpha) = \{ (x, \mu_{C_{e1}(\alpha)}(x)) : x \in X \}$ which corresponds to the embedded T2FS $C_{e2}(\alpha)$ with the constant secondary grade $\alpha$. Hence, $C_{e2}(\alpha) = \{ (C_{e1}(\alpha), \alpha) \} = \{ (\{ (x, \mu_{C_{e1}(\alpha)}(x)) : x \in X \}, \alpha) \} = \{ ((x, \mu_{C_{e1}(\alpha)}(x)), \alpha) : x \in X \}$.

3. Formulation of the problem and the main idea

Let $N = \{1,\dots,n\}$ be the set of indices of fuzzy data samples $\{\tilde{y}_i, \tilde{x}_{i1},\dots,\tilde{x}_{ip}\}$, $i \in N$, in the form of FNs with the MFs $\mu_{\tilde{x}_{is}}(x_{is})$, $x_{is} \in \mathbb{R}$, $s = 1,\dots,p$, and $\mu_{\tilde{y}_i}(y_i)$, $y_i \in \mathbb{R}$, $i \in N$, respectively.
The matrix X ( N ) of independent variables and the vector y ( N ) of dependent variables are given by formula (4). Assume that qualities of data samples { yi , xi1 ,..., xip }, i  N are different. Furthermore, the degrees  I ( j ), j  N of membership to the FS I  {( j ,  I ( j )) : j  N } of data samples indices are known. The following question arises: ‘what are linear regression parameters in the case when fuzzy data samples are involved in the calculation with the corresponding degrees  I ( j ) , j  N of membership?’ In other words: ‘what are the fuzzy parameters of the regression for the FS I of the data samples indices? We investigate this problem for the l -th regression parameter. First, we generalize formula (12) for the case of an arbitrary subset K  N of sample indices and represent it in a convenient form for us. For each r  , we consider the mapping U lr : 2 N  [0,1] given by U lr ( K )  max{u  [0,1]: [ l ( K )]uD  r  [ l ( K )]uH }, K  N . (18) According to (12), the mapping U lr associates each subset K  N of data sample indices with the value of the MF   (r ) of the fuzzy parameter  ( K ) (see (13)). The latter is the FN l ( K ) l l ( K )  {(r ,  ( K ) (r )) : r  } , l (19) with the MF   ( K ) (r )  U lr ( K ) , r  supp( l ( K ))  {r  : U lr ( K )  0} , l (20) where supp(l ( K )) is the support of the FN l ( K ) . Next, we generalize formulae (19) and (20) to the case of the FS I of sample indices. We denote by B l the l -th regression parameter, and by M B (r ) , r  its MF for the FS I of data samples indices. In this case, the value of the MF M B (r ) l l for each fixed r  r * coincides with the image U ( I ) of the FS I under U , i.e. M Bl ( r*)  U ( I ) l r* l r* l r* . According to Zadeh’s extension principle [20], the image of the FS I under the mapping U lr * : 2 N  [0,1] (see (18)) is the FS U l ( I )  {(u, U r* ( I ) (u )) : u [0,1]} with the MF r* l U ( I ) (u )  max{ [0,1]: u  U lr* ( I ( ))} , u [0,1] . r* (21) l Here, I ( )  { j  N :  I ( j )   } is the  -cut,  [0,1] of the FS I  {( j ,  I ( j )) : j  N } of sample indices; U lr * ( I ( ))   l ( I ( )) (r*) , (22) is the image of the  -cut I ( ) ,  [0,1] of the FS I of the samples indices in the mapping U lr * (see (18)). The value U lr * ( I ( )) is equal to the MF value  ( I ( )) (r*) of the l -th fuzzy parameter l  ( I ( )) for the set I ( ) of sample indices. l Remark 6. Let A  { I ( j ) : j  N } be the set of membership degrees values of the fuzzy set I  {( j ,  I ( j )) : j  N } of sample indices. Note that the cardinality of the set A is |A| ≤ n. The situation |A| < n may occur if the degrees of membership  I ( j ) coincide for different indices j  N of samples. It is clear that while obtaining  -cuts I ( )  { j  N :  I ( j )   }   of the fuzzy set I we can assume that   A rather than  [0,1] . Thus, according to (20) and (21), for fixed r  r * , values of M B (r*) form the T1FS l {(u ,  M B ( r *) (u )) : u  [0,1]} on [0, 1] with the MF M B ( r *) (u )  U r* ( I ) (u )  max{  A : u  U lr* ( I ( ))} , l l l u  supp( M Bl (r*))  {u  [0,1]: u  U ( I ( )),   A} . Then (22) entails l r*  M ( r *) (u )  max{  A : u    ( I ( )) (r*)}, Bl l (23) where u  supp( M Bl (r*))  {u [0,1]: u  l ( I ( )) (r*),   A} . Therefore, we conclude that B l is a FS on with the MF whose values form T1FS on [0,1] . Then, according to [22], B l is the T2FS on . 
In the manner of vertical slices [27], the T2FS $B_l$ on $\mathbb{R}$ has the form

$B_l = \{ (r, M_{B_l}(r)) : r \in \mathbb{R} \} = \{ (r, \{ (u, \mu_{M_{B_l}(r)}(u)) : u \in J_r \}) : r \in \mathbb{R} \}$,   (24)

where $\mu_{M_{B_l}(r)}(u)$, $u \in [0,1]$, is the MF of the T1FS $M_{B_l}(r) = \{ (u, \mu_{M_{B_l}(r)}(u)) : u \in [0,1] \}$ of values of the fuzzy degree of membership of the element $r \in \mathbb{R}$ to the T2FS $B_l$, and $J_r = \mathrm{supp}(M_{B_l}(r))$ is the set of primary membership degrees, where $\mathrm{supp}(M_{B_l}(r))$ is the support of the T1FS $M_{B_l}(r)$ for $r \in \mathbb{R}$. According to Section 2.2, we can also characterize the T2FS $B_l$ of the $l$-th regression parameter by means of the T2MF $\mu_{B_l}(r,u) = \mu_{M_{B_l}(r)}(u)$ for $u \in J_r$ and $\mu_{B_l}(r,u) = 0$ for $u \notin J_r$. This conclusion allows us to introduce the following notion.

Definition 2. By the regression parameter with index $l = 0,\dots,p$ for the FS $I$ of sample indices is meant the T2FS

$B_l = \{ ((r,u), \mu_{B_l}(r,u)) : u \in [0,1],\ r \in \mathbb{R} \}$,   (25)

with the T2MF

$\mu_{B_l}(r,u) = \max\{ \alpha \in A : u = \mu_{\tilde{\beta}_l(I(\alpha))}(r) \}$ if $u \in J_r$, and $\mu_{B_l}(r,u) = 0$ if $u \notin J_r$.   (26)

Here,

$J_r = \{ u \in [0,1] : u = \mu_{\tilde{\beta}_l(I(\alpha))}(r),\ \alpha \in A \}$   (27)

is the set of primary membership degrees $u \in [0,1]$ with strictly positive secondary grades $\mu_{B_l}(r,u)$, which coincides with the support $\mathrm{supp}(M_{B_l}(r))$ (see (23)) of the T1FS $M_{B_l}(r)$ of fuzzy membership degrees of the element $r \in \mathbb{R}$;

$\mu_{\tilde{\beta}_l(I(\alpha))}(r) = \max\{ u \in [0,1] : [\tilde{\beta}_l(I(\alpha))]_u^L \le r \le [\tilde{\beta}_l(I(\alpha))]_u^U \}$   (28)

is the MF of the $l$-th fuzzy parameter

$\tilde{\beta}_l(I(\alpha)) = \{ (r, \mu_{\tilde{\beta}_l(I(\alpha))}(r)) : r \in \mathbb{R} \}$   (29)

for the set $I(\alpha)$ of sample indices (see (19)-(20) for $\mathcal{K} = I(\alpha)$);

$[\tilde{\beta}_l(I(\alpha))]_u^L = \min\{ f_l([\tilde{X}(I(\alpha))]_u^L, [\tilde{y}(I(\alpha))]_u^L),\ f_l([\tilde{X}(I(\alpha))]_u^L, [\tilde{y}(I(\alpha))]_u^U),\ f_l([\tilde{X}(I(\alpha))]_u^U, [\tilde{y}(I(\alpha))]_u^L),\ f_l([\tilde{X}(I(\alpha))]_u^U, [\tilde{y}(I(\alpha))]_u^U) \}$   (30)

and

$[\tilde{\beta}_l(I(\alpha))]_u^U = \max\{ f_l([\tilde{X}(I(\alpha))]_u^L, [\tilde{y}(I(\alpha))]_u^L),\ f_l([\tilde{X}(I(\alpha))]_u^L, [\tilde{y}(I(\alpha))]_u^U),\ f_l([\tilde{X}(I(\alpha))]_u^U, [\tilde{y}(I(\alpha))]_u^L),\ f_l([\tilde{X}(I(\alpha))]_u^U, [\tilde{y}(I(\alpha))]_u^U) \}$   (31)

are the estimates of the lower and upper bounds (see (15)-(16) for $\mathcal{K} = I(\alpha)$) of the $u$-cut $[\tilde{\beta}_l(I(\alpha))]_u$ of the FN $\tilde{\beta}_l(I(\alpha))$;

$I(\alpha) = \{ j \in N : \mu_I(j) \ge \alpha \}$   (32)

is the $\alpha$-cut of the FS $I$ of sample indices; and $A$ is the set of the membership degree values $\mu_I(j)$, $j \in N$, of the FS $I = \{ (j, \mu_I(j)) : j \in N \}$ of sample indices (see Remark 6). According to (26), the set $A$ includes all possible positive values of secondary grades of the T2FS $B_l$ of the $l$-th regression parameter.

4. Regression for a fuzzy set of sample indices

4.1. Decomposition of T2FSs of regression parameters

For each $l$-th regression parameter, $l = 0,\dots,p$, we apply a decomposition approach to represent the T2FS $B_l$ in a more convenient form. Theorem 1 justifies the representation of the T2FS $B_l$ in the form of a collection of embedded T2FSs with constant secondary grades.

Theorem 1. The T2FS $B_l$ of the $l$-th regression parameter is represented in the form of the collection $B_l = \{ B_l^{e2}(I(\alpha)) : \alpha \in A \}$ of embedded T2FSs

$B_l^{e2}(I(\alpha)) = \{ (\tilde{\beta}_l(I(\alpha)), \alpha) \}$,   (33)

with the constant secondary grades $\alpha \in A$. For each $\alpha \in A$, the embedded T1FS $\tilde{\beta}_l(I(\alpha)) = \{ (r, \mu_{\tilde{\beta}_l(I(\alpha))}(r)) : r \in \mathbb{R} \}$ is the FN which is the $l$-th fuzzy parameter for the set $I(\alpha) = \{ j \in N : \mu_I(j) \ge \alpha \}$ of sample indices, with the MF $\mu_{\tilde{\beta}_l(I(\alpha))}(r)$ of form (28).

Proof.
According to (25), the T2FS of the regression parameter with index $l = 0,\dots,p$ has the form $B_l = \{ ((r,u), \mu_{B_l}(r,u)) : u \in [0,1],\ r \in \mathbb{R} \}$. Hence,

$B_l = \{ \{ ((r,u), \max\{ \alpha \in A : u = \mu_{\tilde{\beta}_l(I(\alpha))}(r) \}) : u \in J_r \} \cup \{ ((r,u), 0) : u \notin J_r \} : r \in \mathbb{R} \}$   (34)

by (26). Remark 4 allows us to ignore the pairs $(r,u)$ that have secondary grades equal to 0. Thus,

$B_l = \{ ((r,u), \max\{ \alpha \in A : u = \mu_{\tilde{\beta}_l(I(\alpha))}(r) \}) : u \in J_r,\ r \in \mathbb{R} \}$,   (35)

and thereupon $B_l = \{ ((r, \mu_{\tilde{\beta}_l(I(\alpha))}(r)) : \alpha \in A) : r \in \mathbb{R} \}$ by (27). Note that the collection $\{ (\mu_{\tilde{\beta}_l(I(\alpha))}(r), \alpha) : \alpha \in A \}$ is the T1FS formed by the values $u = \mu_{\tilde{\beta}_l(I(\alpha))}(r)$ of the fuzzy degree of membership of $r \in \mathbb{R}$. Different values $\alpha \in A$ may correspond to the same $u = \mu_{\tilde{\beta}_l(I(\alpha))}(r)$. Therefore, the equality $\{ (\mu_{\tilde{\beta}_l(I(\alpha))}(r), \alpha) : \alpha \in A \} = \{ (u, \max_{\alpha \in A}\{ \alpha : u = \mu_{\tilde{\beta}_l(I(\alpha))}(r) \}) \}$ holds true. Further, regrouping the elements yields

$B_l = \{ (r, (\mu_{\tilde{\beta}_l(I(\alpha))}(r), \alpha)) : \alpha \in A,\ r \in \mathbb{R} \} = \{ \{ ((r, \mu_{\tilde{\beta}_l(I(\alpha))}(r)), \alpha) : r \in \mathbb{R} \} : \alpha \in A \}$.   (36)

Finally, by virtue of formula (29) we conclude that $B_l = \{ (\tilde{\beta}_l(I(\alpha)), \alpha) : \alpha \in A \}$. The proof of Theorem 1 is complete.

4.2. Calculation of T2FSs of regression parameters

First, we construct the set $A = \{ \mu_I(j) : j \in N \}$ of membership degree values of the FS $I = \{ (j, \mu_I(j)) : j \in N \}$ of sample indices. For each $\alpha \in A$, according to (32), we construct the $\alpha$-cut $I(\alpha) = \{ j \in N : \mu_I(j) \ge \alpha \}$ of the FS $I$. Further, for each $l = 0,\dots,p$, we use the representation of the T2FS $B_l$ in the form of a collection of embedded T2FSs with constant secondary grades (see Theorem 1). This leads to the following sequence of calculations for each $\alpha \in A$.

We construct the embedded T1FS $\tilde{\beta}_l(I(\alpha)) = \{ (r, \mu_{\tilde{\beta}_l(I(\alpha))}(r)) : r \in \mathbb{R} \}$. This is the FN which is the $l$-th fuzzy parameter of the regression for the set $I(\alpha)$ of sample indices. To construct the FN $\tilde{\beta}_l(I(\alpha))$, one can use any known method. An application of the method worked out in Section 2.1, following [18], yields $\tilde{\beta}_l(I(\alpha)) = \{ ([[\tilde{\beta}_l(I(\alpha))]_u^L, [\tilde{\beta}_l(I(\alpha))]_u^U], u) : u \in [0,1] \}$ (see (13), (30), (31) for $\mathcal{K} = I(\alpha)$) with the MF

$\mu_{\tilde{\beta}_l(I(\alpha))}(r_l) = \max\{ u \in [0,1] : [\tilde{\beta}_l(I(\alpha))]_u^L \le r_l \le [\tilde{\beta}_l(I(\alpha))]_u^U \}$, $r_l \in \mathbb{R}$.   (37)

Formula (37) is justified by (14) with $\mathcal{K} = I(\alpha)$. Then, the corresponding embedded T2FS with the constant secondary grade $\alpha$ has the form $B_l^{e2}(I(\alpha)) = \{ (\tilde{\beta}_l(I(\alpha)), \alpha) \}$ according to (33).

Once all the embedded T2FSs $B_l^{e2}(I(\alpha))$ with constant secondary grades $\alpha \in A$ have been obtained, the resulting T2FS of the $l$-th regression parameter has the form $B_l = \{ B_l^{e2}(I(\alpha)) : \alpha \in A \} = \{ (\tilde{\beta}_l(I(\alpha)), \alpha) : \alpha \in A \}$ by Theorem 1. According to Remark 3, the T2FS $B_l$ can be interpreted as follows: for fuzzy data of different quality, the $l$-th regression parameter $B_l$ is equal to the $l$-th parameter $\tilde{\beta}_l(I(\alpha))$ of the regression for the corresponding crisp set $I(\alpha)$ of data samples, with the degree of truth equal to $\alpha$, $\alpha \in A$.

5. Illustration and discussion

5.1. Example

This example is devised to illustrate our approach. We stress that it does not use real data. All data are given to the second decimal place. Suppose that, for a historical study, we need to find out the dependence of the weight of a middle-aged person on his or her height in the 10th century. The data borrowed from four authentic historical documents are located in rows 1-4 of Table 1.
There is a reason to believe that the values in row 3 are slightly less reliable than the rest. We assume that in those days the height and the weight were measured with an accuracy of ±1 cm and ±5%, respectively. We denote by $\tilde{a} = (a^L, a^C, a^U)$ a 'triangular' FN with the MF

$\mu_{\tilde{a}}(r) = (r - a^L)/(a^C - a^L)$ for $r \in [a^L, a^C]$;  $\mu_{\tilde{a}}(r) = (a^U - r)/(a^U - a^C)$ for $r \in [a^C, a^U]$;  $\mu_{\tilde{a}}(r) = 0$ otherwise.   (38)

Table 1 contains the fuzzy data in the form of the 'triangular' FNs $\tilde{x}_{i1} = (x_{i1}^L, x_{i1}^C, x_{i1}^U)$ and $\tilde{y}_i = (y_i^L, y_i^C, y_i^U)$, $i \in \{1,\dots,4\}$. The fuzzy data $\tilde{x}_{i1}$ and $\tilde{y}_i$ are denoted as $\tilde{x}_{i1} = appr(x_{i1}^C)$ and $\tilde{y}_i = appr(y_i^C)$, and can be interpreted as 'approximately $x_{i1}^C$' and 'approximately $y_i^C$', respectively. The dependence of the weight on the height for the majority of the data in rows 1-4 of Table 1 may seem strange, since the weight gain decreases as the height increases. We assume that this can be explained by the small sample size and its non-representativeness.

Table 1
Weight depending on height

i    Height $\tilde{x}_{i1}$    Weight $\tilde{y}_i$
1    (149, 150, 151)            (55.10, 58, 60.90)
2    (159, 160, 161)            (63.65, 67, 70.35)
3    (169, 170, 171)            (63.65, 67, 70.35)
4    (189, 190, 191)            (80.75, 85, 89.25)
5    150                        50
6    160                        60
7    170                        70
8    180                        80

To solve this problem, we supplement the data sample. We know from the medical sources [31] that Broca's formula relating the height and the optimal weight has the form W = H − 100, where H is the height in cm and W is the optimal weight in kg. According to this formula, we calculate the optimal weights for four different heights and place these data in rows 5-8 of Table 1.

Thus, we have three sources of data of different quality. The high-quality source is historical, and we fully trust it. We evaluate the degree of reliability of the data from this source: the results located in rows 1, 2 and 4 of Table 1 have degrees of reliability equal to 1. The degree of the less reliable data from the second historical source, located in row 3 of Table 1, is estimated at 0.9. The third source of data is medical. It is known that modern persons are taller than persons who lived in the 10th century and had the same weight. As a consequence, the degree of reliability of the data from the medical source is not very high (we estimate it as 0.7), since this source only provides information on the ratio of the weight and the height for modern people. Thus, we construct the FS 'Reliable data samples' $I = \{(1,1), (2,1), (3,0.9), (4,1), (5,0.7), (6,0.7), (7,0.7), (8,0.7)\}$ on the set $N = \{1,\dots,8\}$ of data sample indices with the MF values $\mu_I(1) = \mu_I(2) = \mu_I(4) = 1$, $\mu_I(3) = 0.9$ and $\mu_I(5) = \dots = \mu_I(8) = 0.7$.

In view of Remark 6, the set of membership degree values of the FS $I$ takes the form $A = \{0.7, 0.9, 1\}$. For $\alpha = 1.0$, $\alpha = 0.9$ and $\alpha = 0.7$, according to (32), we construct the corresponding $\alpha$-cuts $I(1) = \{1,2,4\}$, $I(0.9) = \{1,\dots,4\}$ and $I(0.7) = \{1,\dots,8\}$ of the FS $I$ of data sample indices. Next, we construct the FNs of the parameters of the linear least-squares regressions (see Section 2.1). In this example, these FNs are of the 'triangular' type (in the general case, this is not necessary).
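The construction of Section 4.2 can be organized in code roughly as follows. This is an illustrative sketch of our own: triangular FNs are stored as $(a^L, a^C, a^U)$ triples, the $u$-cut bounds of the fuzzy parameters are obtained by the corner approximation (15)-(16), and the decomposition runs over the $\alpha$-cuts of the FS $I$. The toy data in the usage part are made up and are not the data of Table 1.

```python
import numpy as np
from itertools import product

def tri_cut(tri, u):
    """u-cut [aL + u*(aC - aL), aU - u*(aU - aC)] of a triangular FN (aL, aC, aU)."""
    aL, aC, aU = tri
    return aL + u * (aC - aL), aU - u * (aU - aC)

def ls(X, y):
    """Crisp least-squares estimate (2)."""
    return np.linalg.solve(X.T @ X, X.T @ y)

def fuzzy_param_bounds(samples, indices, u):
    """Approximate u-cut bounds (30)-(31) of all parameters for a crisp index set.
    samples[i] = (regressor FN, response FN) as triangular triples."""
    xs = [tri_cut(samples[i][0], u) for i in indices]   # (lower, upper) per regressor
    ys = [tri_cut(samples[i][1], u) for i in indices]   # (lower, upper) per response
    corners = [ls(np.array([[1.0, x[kx]] for x in xs]),
                  np.array([y[ky] for y in ys]))
               for kx, ky in product((0, 1), repeat=2)]  # four corner configurations
    corners = np.array(corners)
    return corners.min(axis=0), corners.max(axis=0)

def decompose(samples, mu_I, u_levels=(0.0, 1.0)):
    """For each alpha in A, return the alpha-cut and the parameter bounds per u-level."""
    result = {}
    for alpha in sorted(set(mu_I), reverse=True):        # the set A of secondary grades
        I_alpha = [j for j, m in enumerate(mu_I) if m >= alpha]
        result[alpha] = (I_alpha, {u: fuzzy_param_bounds(samples, I_alpha, u)
                                   for u in u_levels})
    return result

# made-up toy data: three samples, each a (regressor FN, response FN) pair
samples = [((1.9, 2.0, 2.1), (3.8, 4.0, 4.2)),
           ((2.9, 3.0, 3.1), (5.7, 6.0, 6.3)),
           ((3.9, 4.0, 4.1), (7.6, 8.0, 8.4))]
mu_I = [1.0, 1.0, 0.7]                                   # FS of sample indices
for alpha, (I_alpha, cuts) in decompose(samples, mu_I).items():
    print(alpha, I_alpha, {u: (b[0].round(2).tolist(), b[1].round(2).tolist())
                           for u, b in cuts.items()})
```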
For $\alpha = 1$, we get the embedded T1FSs $\tilde{\beta}_0(I(1)) = (-86.51, -82.2, -79.6) = appr(-82.2)$ and $\tilde{\beta}_1(I(1)) = (0.88, 0.91, 0.94) = appr(0.91)$ in the T2FSs $B_0^{e2}(I(1)) = \{ (\tilde{\beta}_0(I(1)), 1) \}$ and $B_1^{e2}(I(1)) = \{ (\tilde{\beta}_1(I(1)), 1) \}$, respectively, with the constant secondary grade (the degree of truth) equal to 1.

For $\alpha = 0.9$, we get the embedded T1FSs $\tilde{\beta}_0(I(0.9)) = (-68.47, -64.4, -60.41) = appr(-64.4)$ and $\tilde{\beta}_1(I(0.9)) = (0.77, 0.81, 0.85) = appr(0.81)$ in the T2FSs $B_0^{e2}(I(0.9)) = \{ (\tilde{\beta}_0(I(0.9)), 0.9) \}$ and $B_1^{e2}(I(0.9)) = \{ (\tilde{\beta}_1(I(0.9)), 0.9) \}$, respectively, with the degree of truth equal to 0.9.

For $\alpha = 0.7$, we get the embedded T1FSs $\tilde{\beta}_0(I(0.7)) = (-81.79, -77, -72.3) = appr(-77)$ and $\tilde{\beta}_1(I(0.7)) = (0.85, 0.9, 0.95) = appr(0.9)$ in the T2FSs $B_0^{e2}(I(0.7)) = \{ (\tilde{\beta}_0(I(0.7)), 0.7) \}$ and $B_1^{e2}(I(0.7)) = \{ (\tilde{\beta}_1(I(0.7)), 0.7) \}$, respectively, with the degree of truth equal to 0.7.

According to Theorem 1, the resulting T2FSs of the regression parameters have the forms

$B_0 = \{ (appr(-82.2), 1), (appr(-64.4), 0.9), (appr(-77), 0.7) \}$,   (39)

$B_1 = \{ (appr(0.91), 1), (appr(0.81), 0.9), (appr(0.9), 0.7) \}$.   (40)

The T2MFs $\mu_{B_0}(r,u)$ and $\mu_{B_1}(r,u)$ can be calculated with the help of formulae (26) and (27). Their levels $\alpha \in A = \{0.7, 0.9, 1\}$ are represented by solid (for $\alpha = 1$), dashed (for $\alpha = 0.9$) and dotted (for $\alpha = 0.7$) lines in Figure 1 for $\mu_{B_0}(r,u)$ and in Figure 2 for $\mu_{B_1}(r,u)$.

Figure 1. The lines of the levels $\alpha = 0.7$, $\alpha = 0.9$ and $\alpha = 1.0$ of the T2MF $\mu_{B_0}(r,u)$.
Figure 2. The lines of the levels $\alpha = 0.7$, $\alpha = 0.9$ and $\alpha = 1.0$ of the T2MF $\mu_{B_1}(r,u)$.

The obtained T2FSs can be interpreted as follows. For fuzzy data of different quality, the T2FSs $B_0$ and $B_1$ of the fuzzy regression parameter values are equal to:
- the FNs $\tilde{\beta}_0(I(0.7)) = appr(-77)$ and $\tilde{\beta}_1(I(0.7)) = appr(0.9)$, respectively, for the crisp set $I(0.7) = \{1,\dots,8\}$ of data sample indices with the degree of truth equal to 0.7;
- the FNs $\tilde{\beta}_0(I(0.9)) = appr(-64.4)$ and $\tilde{\beta}_1(I(0.9)) = appr(0.81)$, respectively, for the crisp set $I(0.9) = \{1,\dots,4\}$ of data sample indices with the degree of truth equal to 0.9; and
- the FNs $\tilde{\beta}_0(I(1)) = appr(-82.2)$ and $\tilde{\beta}_1(I(1)) = appr(0.91)$, respectively, for the crisp set $I(1) = \{1,2,4\}$ of data sample indices with the degree of truth equal to 1.

5.2. The discussion of the results

Let us consider a graphical interpretation of the obtained regression with T2FS parameters. In Figure 3, the thin solid lines show the graphs of the regression functions

$y = [\tilde{\beta}_0(I(1))]_0^L + [\tilde{\beta}_1(I(1))]_0^L x = -86.51 + 0.88x$,   (41)

$y = [\tilde{\beta}_0(I(1))]_0^U + [\tilde{\beta}_1(I(1))]_0^U x = -79.6 + 0.94x$,   (42)

whose parameters correspond to the lower and upper bounds of the 0-cuts $[[\tilde{\beta}_0(I(1))]_0^L, [\tilde{\beta}_0(I(1))]_0^U] = [-86.51, -79.6]$ and $[[\tilde{\beta}_1(I(1))]_0^L, [\tilde{\beta}_1(I(1))]_0^U] = [0.88, 0.94]$ of the FNs $\tilde{\beta}_0(I(1)) = appr(-82.2)$ and $\tilde{\beta}_1(I(1)) = appr(0.91)$, respectively. These FNs are the corresponding parameters of the fuzzy linear regression for the crisp set $I(1) = \{1,2,4\}$ of data sample indices. For the same set of sample indices, the thick solid line shows the graph of the regression function $y = [\tilde{\beta}_0(I(1))]_1^L + [\tilde{\beta}_1(I(1))]_1^L x = -82.2 + 0.91x$, whose parameters correspond to the 1-cuts $[-82.2, -82.2]$ and $[0.91, 0.91]$ of the FNs $\tilde{\beta}_0(I(1)) = appr(-82.2)$ and $\tilde{\beta}_1(I(1)) = appr(0.91)$, respectively.

Figure 3. The illustration of the regression with the T2FS parameters.
For the crisp set $I(0.9) = \{1,\dots,4\}$ of data sample indices, the dashed lines show the graphs of the regression functions $y = [\tilde{\beta}_0(I(0.9))]_0^L + [\tilde{\beta}_1(I(0.9))]_0^L x = -68.47 + 0.77x$ and

$y = [\tilde{\beta}_0(I(0.9))]_0^U + [\tilde{\beta}_1(I(0.9))]_0^U x = -60.41 + 0.85x$   (43)

(thin lines), and

$y = [\tilde{\beta}_0(I(0.9))]_1^L + [\tilde{\beta}_1(I(0.9))]_1^L x = -64.4 + 0.81x$   (44)

(a thick line). For the crisp set $I(0.7) = \{1,\dots,8\}$ of data sample indices, the dotted lines show the graphs of the regression functions $y = [\tilde{\beta}_0(I(0.7))]_0^L + [\tilde{\beta}_1(I(0.7))]_0^L x = -81.79 + 0.85x$ and

$y = [\tilde{\beta}_0(I(0.7))]_0^U + [\tilde{\beta}_1(I(0.7))]_0^U x = -72.3 + 0.95x$   (45)

(thin lines), and

$y = [\tilde{\beta}_0(I(0.7))]_1^L + [\tilde{\beta}_1(I(0.7))]_1^L x = -77 + 0.9x$   (46)

(a thick line).

Figure 3 allows us to draw the following conclusions. Regressions corresponding to data of different quality may differ from each other and express different relationships between the independent variables and the predicted output. Therefore, the resulting regression with T2FS parameters should be taken as a whole, as a collection of regressions corresponding to the $\alpha$-cuts $I(\alpha)$, $\alpha \in [0,1]$, of the FS $I$ of data sample indices. Only in this case do we get an idea of the dependence between the regression and the quality of the data used; sometimes understanding this is important. If we need some 'maximizing' regression, whose parameter values have primary membership degrees equal to 1, then it should be considered as a collection of regressions corresponding to the $\alpha$-cuts $I(\alpha)$, $\alpha \in [0,1]$, of the FS $I$ of data sample indices, in which, for each regression, we take the parameter values with membership degrees equal to 1. In our example, these are the regressions whose graphs are depicted by the thick lines in Figure 3.

The regression for fuzzy data of different quality can be represented as a set of fuzzy regressions with degrees of truth $\alpha \in A$. These fuzzy regressions correspond to data samples of different quality whose indices belong to the $\alpha$-cuts $I(\alpha)$, $\alpha \in A$, of the FS $I$. Thus, the degree of truth of a regression is determined by the quality of the data for which it is constructed. The higher the quality of the data samples, the greater the degree of membership of their indices to the FS $I$, and the higher the degree of truth of the corresponding fuzzy regression in the collection that forms the regression for fuzzy data of different quality. On the other hand, increasing the amount of low-quality data reduces the degree of truth of the corresponding fuzzy regression in the collection.

6. Conclusion

According to the proposed approach, the linear regression parameters for fuzzy data of different quality are T2FSs with constant secondary grades. Although, in general, a T2FS is a rather complicated mathematical object, T2FSs with constant secondary grades are simple enough for practical use. Therefore, to represent these sets in a form convenient for understanding and applications, we have used a decomposition method. The results obtained have allowed us to decompose the T2FSs of the linear regression parameters, according to secondary grades, into finite collections of FNs. These are the fuzzy parameters of the regressions which correspond to different $\alpha$-cuts of the FS of data sample indices. One direction of future investigation of regressions with a fuzzy set of data sample indices may be related to the development of a similar approach for possibilistic regression analysis.
7. Acknowledgements

This work has been supported by the Ministry of Education and Science of Ukraine: R&D Project 0122U001844 for the period 2022-2024 at Taras Shevchenko National University of Kyiv.

8. References

[1] N. Chukhrova, A. Johannssen, Fuzzy regression analysis: Systematic review and bibliography, Appl. Soft Comput. J. 84 (2019) 1–29. doi:10.1016/j.asoc.2019.105708.
[2] H. Tanaka, S. Uejima, K. Asai, Linear regression analysis with fuzzy model, IEEE Trans. Syst. Man Cybern. 12(6) (1982) 903–907.
[3] P. D'Urso, Exploratory multivariate analysis for empirical information affected by uncertainty and modeled in a fuzzy manner: a review, Granular Comput. 2(4) (2017) 225–247. doi:10.1007/s41066-017-0040-y.
[4] A. Celmins, Least squares model fitting to fuzzy vector data, Fuzzy Sets and Syst. 22(3) (1987) 245–269. doi:10.1016/0165-0114(87)90070-4.
[5] T. Hastie, R. Tibshirani, J. Friedman, The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Springer, New York, NY, 2009. doi:10.1007/978-0-387-84858-7.
[6] A. Forrester, A. Sobester, A. Keane, Multi-fidelity optimization via surrogate modelling, Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sci. 463 (2007) 3251–3269.
[7] A. Zaytsev, E. Burnaev, Minimax approach to variable fidelity data interpolation, in: Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, PMLR, volume 54, Fort Lauderdale, FL, USA, 2017, pp. 652–661. https://proceedings.mlr.press/v54/zaytsev17a.html.
[8] J. Hevesi, A. Flint, J. Istok, Precipitation estimation in mountainous terrain using multivariate geostatistics. Part II: isohyetal maps, J. Appl. Meteorol. 31 (1992) 677–688. doi:10.1175/1520-0450(1992)031<0677:PEIMTU>2.0.CO;2.
[9] M. Kennedy, A. O'Hagan, Predicting the output from a complex computer code when fast approximations are available, Biometrika 87(1) (2000) 1–13. doi:10.1093/biomet/87.1.1.
[10] S. Pan, Q. Yang, A survey on transfer learning, IEEE Trans. on Knowledge and Data Engineering 22(10) (2010) 1345–1359. doi:10.1109/TKDE.2009.191.
[11] J.W. Bandler, Q.S. Cheng, S.A. Dakroury, A.S. Mohamed, M.H. Bakr, K. Madsen, J. Sondergaard, Space mapping: the state of the art, IEEE Trans. on Microwave Theory and Techniques 52(1) (2004) 337–361. https://ieeexplore.ieee.org/document/1262727.
[12] C. E. Rasmussen, C. K. I. Williams, Gaussian Processes for Machine Learning, The MIT Press, 2006.
[13] N. Arruda, J. Alcantara, V. Vidal, A. Brayner, M. Casanova, V. Pequeno, W.A. Franco, Fuzzy approach for data quality assessment of linked datasets, in: Proceedings of the 21st International Conference on Enterprise Information Syst. (ICEIS 2019), volume 1, 2019, pp. 399–406. doi:10.5220/0007718803990406.
[14] J. Gamez, F. Modave, O. Kosheleva, Selecting the most representative sample is NP-hard: Need for expert (fuzzy) knowledge, in: Proceedings of the IEEE International Conference on Fuzzy Syst. (IEEE World Congress on Computational Intelligence), 2008, pp. 1069–1074. doi:10.1109/FUZZY.2008.4630502.
[15] M. Z. Khan, M. F. Khan, M. Aslam, A. R. Mughal, Design of fuzzy sampling plan using the Birnbaum-Saunders distribution, Mathematics 7(1) (2019) 9. doi:10.3390/math7010009.
[16] S. Mashchenko, Intersections and unions of fuzzy sets of operands, Fuzzy Sets and Syst. 352(1) (2018) 12–25. doi:10.1016/j.fss.2018.04.006.
[17] S.O. Mashchenko, D.O. Kapustian, Decomposition of intersections with fuzzy sets of operands, in: V.A. Sadovnichiy, M.Z. Zgurovsky (Eds.), Contemporary Approaches and Methods in Fundamental Mathematics and Mechanics. Understanding Complex Systems, Springer, Cham, 2020, pp. 417–432. https://link.springer.com/book/10.1007/978-3-030-50302-4.
[18] H.-C. Wu, Linear regression analysis for fuzzy input and output data using the extension principle, Computers and Mathematics with Applications 45(12) (2003) 1849–1859. doi:10.1016/S0898-1221(03)90006-X.
[19] S. Heilpern, Representation and application of fuzzy numbers, Fuzzy Sets and Syst. 91 (1997) 259–268.
[20] L. Zadeh, The concept of a linguistic variable and its application to approximate reasoning – I, Inform. Sci. 8 (1975) 199–249. doi:10.1016/0020-0255(75)90036-5.
[21] D. Dubois, H. Prade, Operations on fuzzy numbers, International J. of Syst. Sci. 9(6) (1978) 613–626.
[22] L. A. Zadeh, Quantitative fuzzy semantics, Inform. Sci. 3 (1971) 159–176. doi:10.1016/S0020-0255(71)80004-X.
[23] N. N. Karnik, J. M. Mendel, Introduction to type-2 fuzzy logic systems, IEEE International Conference on Fuzzy Syst. 2 (1998) 915–920.
[24] J. M. Mendel, R. I. John, Type-2 fuzzy sets made simple, IEEE Trans. on Fuzzy Syst. 10 (2002) 117–127.
[25] J. Harding, C. Walker, E. Walker, The variety generated by the truth value algebra of T2FSs, Fuzzy Sets and Syst. 161 (2010) 735–749.
[26] J. Aisbett, J. T. Rickard, D. G. Morgenthaler, Type-2 fuzzy sets as functions on spaces, IEEE Trans. on Fuzzy Syst. 18(4) (2010) 841–844.
[27] J. M. Mendel, Type-2 fuzzy sets: some questions and answers, IEEE Connections, Newsletter of the IEEE Neural Networks Society 1 (2003) 10–13.
[28] S.O. Mashchenko, Sums of fuzzy set of summands, Fuzzy Sets and Syst. 417 (2020) 140–151. doi:10.1016/j.fss.2020.10.006.
[29] S.O. Mashchenko, Sum of discrete fuzzy numbers with fuzzy set of summands, Cybernetics and Syst. Analysis 57(3) (2021) 374–382. doi:10.1007/s10559-021-00362-w.
[30] S.O. Mashchenko, Minimum of fuzzy numbers with a fuzzy set of operands, Cybernetics and Syst. Analysis 58(2) (2022) 210–219. doi:10.1007/s10559-021-00362-w.
[31] J. C. Segen, Concise Dictionary of Modern Medicine, McGraw-Hill, New York, NY, 2006.