=Paper= {{Paper |id=Vol-2917/paper48 |storemode=property |title=Bayesian Methods Application for the Differential Diagnosis of the Chronic Obstructive Pulmonary Disease |pdfUrl=https://ceur-ws.org/Vol-2917/paper48.pdf |volume=Vol-2917 |authors=Volodymyr Lytvynenko,Mariia Voronenko,Olena Kovalchuk,Ulzhalgas Zhunissova,Luidmyla Lytvynenko |dblpUrl=https://dblp.org/rec/conf/momlet/LytvynenkoVKZL21 }} ==Bayesian Methods Application for the Differential Diagnosis of the Chronic Obstructive Pulmonary Disease== https://ceur-ws.org/Vol-2917/paper48.pdf
Bayesian Methods Application for the Differential Diagnosis of
the Chronic Obstructive Pulmonary Disease
Volodymyr I. Lytvynenko1, Mariia O. Voronenko1, Olena Kovalchuk2,
Ulzhalgas Zhunissova3 and Luidmyla N. Lytvynenko4
1
  Kherson National Technical University, Berislavske Shosse, 24, Kherson, 73008, Ukraine
2
  National Pirogov Memorial Medical University, st. Pirogova, 56, Vinnytsya, 21018, Ukraine
3
  Astana Medical University, st. Beibitshilik 49A, Astana, 010000, Kazakhstan
4
  Kherson city neuropsychiatric center, st. Morska, 1, Kherson, 73003, Ukraine


                Abstract
                The paper proposes a methodology for attachment Bayesian models in the differential
                diagnosis of a disease such as chronic obstructive pulmonary disease in different age groups
                patients with the obligatory presence of 1, 2, or three concomitant diseases in anamnesis. The
                ways for building the Bayesian models structure are considered. The medical experts,
                pharmacists, specialists were screening of input data for creation the probabilistic predicting
                system.

                Keywords 1
                Diagnostic Methods, Accompanying Illnesses, Bayesian Networks, Feature Selection
                methods, Clustering, The algorithm MeanShift

1. Introduction
    The medical diagnostics main task is to define possible diagnoses based on examination data. The
main difficulties faced by doctors at the medical diagnosis stage are:
      the amount of present illnesses is greater than the amount of ways for their diagnosis, which
         rises the misdiagnosis risk;
      the diagnosis ambiguity, i.e. the presence of comorbidities that strongly distort the symptoms
         picture;
      a large indicators number, which also leads to data distortion;
      lack of individual approach to each patient [1].
    The medical diagnosis peculiarity is that a qualified doctor for a certain group of patients can
clearly diagnose any disease, but each of the patients may have not one disease, but several at the
same time. That is, the patient may have several diagnoses (the doctor may prescribe for each
diagnosis a subjective level of belonging).
    There may also be situations where the physician is unable to diagnose a patient's illness due to a
lack of experience or due to atypical or distorted patient symptoms. From the computational
intelligence standpoint, this situation is the subject of active learning [2], when the diagnostic system
in the process of its training processes both various observations, which know exactly the value of the
training signal (reference signal) and a priori unclassified observations (self-learning mode). The
chronic obstructive pulmonary illness is the reason of death among children and the elderly today.


MoMLeT+DS 2021: 3rd International Workshop on Modern Machine Learning Technologies and Data Science, June 5, 2021, Lviv-Shatsk,
Ukraine
EMAIL: immun56@gmail.com (V. Lytvynenko); mary_voronenko@i.ua (M. Voronenko); elena.kovalchuk972@gmail.com (O.
Kovalchuk); Ulzhalgaszhunisova@gmail.com (U. Zhunissova); llytvynenko58@gmail.com (L.Lytvynenko)
ORCID: 0000-0002-1536-5542 (V. Lytvynenko); 0000−0002−5392−5125 (M. Voronenko); 0000-0002-3102-7973 (O. Kovalchuk); 0000-
0001-5255-9314 (U. Zhunissova); 0000-0001-8445-5704 (L.Lytvynenko)
           ©️ 2021 Copyright for this paper by its authors.
           Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
           CEUR Workshop Proceedings (CEUR-WS.org)
The incidence of the chronic obstructive pulmonary disease in elderly patients increases dramatically
while they are being treated in hospital facilities for another concomitant disease.
   Modeling and differential diagnosis, taking into account the presence of several comorbidities, are
quite difficult, since they are characterized by a complex structure of dependences. The computational
problems are also obvious since the analysis of a large database requires a long processing time and
can be performed only with adequate computational infrastructure. When resources are limited and
research is mainly based on open data, there is a need for models that can be modified depending on
the variability of functions, and thus can further use the experience to improve the predictability of the
model. Bayesian networks (BNs) are the most suitable computational tools for this.
   The study is dedicated to the creation of a system that makes it possible to achieve the
effectiveness of drug therapy in the presence of the patient's main diagnosis and several concomitant
diagnoses that aggravate the course of the disease. The task of this work is to make a methodology for
constructing a BN in the differential diagnosis of a disease such as the chronic obstructive pulmonary
disease.
   The main contribution is as follows:
   (i) realization of a comprehensive causal probabilistic model for the differential diagnosis of three
types of pulmonary diseases,(ii) development of a specific methodology for planning the development
of Bayesian networks for differential diagnosis problems, (iii) application of the MeanShift cluster
analysis algorithm to identify specific signs, symptoms, laboratory and instrumental findings
characteristic of the disease forms under study, and (iv) the use of algorithms to estimate the
information entropy, which made it possible to assess the informativeness of the features about the
corresponding class of the disease.
   Thus, the main problem is related to improving the quality of differential diagnosis in medical
decision support systems. The key problem focuses on improving the quality of differential diagnosis
through the comprehensive use of methods for selecting the most informative features.
   The article is drawn up like this. Section 2 defines the tasks that we will solve in our study. Section
3 shows the literature list on existing ways for the differential diagnosis. In Section 4 we discuss the
general formulation of the solution to the problem. Section 5 describes the input data and methods for
obtaining structural model indicators. After that, in the Section 6 we describe the sequence of building
and validating a BN. We present the research process and its results. Section 7 presents the study
results analysis. Section 8 summarizes and completes.

2. Problem Statement
   Using Bayesian Approach the developed model will facilitate a faster diagnosis and thus help to
pre-select the correct treatment method. This mathematical tool let us to describe the notions, which
the patient uses when describing his health [3].
   For a set of events X i  , i  1,K , N that are related, and a set of learning data
                             1   2       N
                                              
D   d1 ,K , d n  , di  xi  xi  K xi  , is given. Here the subscript is the observation amount, and the
upper one is the variable amount, n –is the amount of surveillances, each surveillance comprises
N  N  2   variables, and each         j -th variable     j  1,K , N  has such conditions:
  j
          
A   0 ,1,K ,    1
                  j
                               2 .
                                      j



      Based on a given training sample, you need to build an acyclic graph connecting the event sets
X i , i=1,…,N . In addition, each BN structure g  G is presented by a set N of predecessors

 P  ,K , P   , that is, for each vertex j  1,K , N , P  it is a variety of parent vertices, such that
      1       N                                             j



P    X   ,K , X    \  X    . Research will be carried out in accordance with the following stages of
  j           1       N           j



Bayesian network development (Figure 1).
   The aim of this engineering is to create the Bayesian-based model for the early diagnosis of the
chronic obstructive pulmonary disease if the patient has the likelihood of concomitant diagnoses.
3. Review of the Literature
   Many different fuzzy approaches to data clustering are currently used [4-6]. They are able to
effectively cluster data in situations where clusters overlap, assuming that the cluster size is small, i.e.
do not contain abnormal emissions. Real medical and biological data sets contain up to 20% of
emissions. The data of these indicators allow forming clusters that allow to estimate dynamics of
indicators, to simplify and accelerate the process of diagnostics.
   Existing methods for estimating biological conditions are based on the values arrays analysis of
the measured parameters set given in the vectors form. These can be immunological, biochemical,
physiological, cytogenetic, cytomorphological, and other medical and biological data.




   Figure 1: The stages according to which the research goal will be achieved.

     The measurement data analysis of an individual condition of a separate organism consists in the
definition of belonging of this condition to any of in advance known conditions (diagnoses). Each
specialist in certain types of diseases makes "his" diagnosis. This is due to objective reasons related to
the overlap and coincidence of different numbers of indicators of the state of the body for different
diagnoses, and the widespread prevalence of so-called polysyndromic conditions.
    Making correct and effective decisions in such situations requires a significant amount of time and
money to organize consultations of highly qualified persons. The most stringent in terms of fuzzy
clustering procedures are the so-called fuzzy clustering algorithms, based on objective functions [5, 7,
8] and made automatic classification (without teacher) by optimization predefined quality criteria. As
a result of this procedure, the formed clusters have the shape of hyperspheres, which significantly
limits the possibility of using the above methods to process data of more complex forms. In the
artificial intelligence field, the computer system that imitates the possibility to make human decisions
is called an expert system [9]. The first expert systems were created in the 1970s and became
widespread in the 1980s. They were one of the first truly successful artificial intelligence forms.
However, some experts note that expert systems were not part of true artificial intelligence, as they do
not have the ability to learn independently of external data [10].
    The MYCIN system, for example, has been designed to recommend the required amount of
antibiotics depending on the patient's weight [11]. Studies at Stanford Medical School found that
MYCIN offered acceptable therapies in approximately 69% of cases, better than infectious disease
experts who were evaluated according to the same criteria.
   However, when looking at the expert systems in real operating conditions, there are other
problems, such as integration, access to large databases, and performance [12]. The CADUCEUS
medical expert system was created to help diagnose blood infections as well as diagnose internal
diseases. CADUCEUS was able to recognize up to 1,000 different diseases, as well as to recognize
the several concomitant diseases presence. Fuzzy medical diagnosis in terms of increasing numbers
signs and diagnoses are represented by online diagnosis systems. To date, there are medical online
diagnostic systems DNFS and AWDNFN, designed on the basis of W-neuron. During their operation,
there will be a worse criterion for the effectiveness of AWDNFN than in DNFS due to the longer
processing time with better diagnostic accuracy [13].
   In the past, many authors have tried to create computational intelligence systems for diagnosis
using data sets from the medical repository as input data [14-18]. The inability to process data in real
time, a fixed number of medical signs, and a low convergence rate are the disadvantages of these
systems. In [19] was proposed the use of probabilistic methods of medical diagnosis, which raised
expert systems to a new level of development.

4. Materials and Methods

4.1.    Data
    The study involved 137 patients of different ages, different anamnesis, and varying disease
severity. The data set describes the results of clinical tests of patients, symptoms, indicators of
preliminary examination and complaints with which the patient consulted a doctor. All patients were
given a preliminary diagnosis of the chronic obstructive pulmonary disease, and each patient has two
more concomitant diagnoses that can complicate the course of the infection and the patient's
treatment. Our primary task is to identify the presence of concomitant diagnoses and predict scenarios
in which these diagnoses may complicate the course of the disease or aggravate the ongoing medical
therapy. In total, the data set contains 104 parameters that describe the condition of each of 137
patients.
    In situations where medical symptoms may include abnormal emissions, obstructions, and other
artifacts, robust online procedures should be used that allow for consistent self-learning diagnostics
using arbitrary cluster diagnoses. We carried out preprocessing of the data using the Feature Selection
methods of optimization, and after clustering according to the algorithm MeanShift, the key indicators
that have the greatest impact were identified (see Table 1). Among them, there are known to us input
indicators and unknown indicators, the probability of occurrence of which we need to determine.

Table 1
The Input data
    Designation of the node                  Description                    Interaction scheme
               9                            Suffocation
               18                   Increased body temperature
               40                    Harmful working conditions
               47                           Chest shape
               50                      Breathing is weakened
               51                          Wheezing rales
               52                          Wet wheezing
               53                          Dry wheezing
               57                     The volume of wheezing
               89                      Focus of inflammation
               1                        Diagnosis 1 yes / no?
               2                        Diagnosis 2 yes / no?
               3                        Diagnosis 3 yes / no?
   The network contains ten nodes:
    1. 9 corresponds to the patient's suffocation. This indicator has two states s0 and s1:
         s0 means no suffocation from the patient,
         s1 means the patient has suffocation.
    2. 18 corresponds to the increased patient's body temperature. This indicator has two states s0
        and s1:
         s0 means the absence of information about increased patient's body temperature,
         s1 means the presence of information about increased patient's body temperature.
    3. 40 corresponds to the harmful working conditions in which the patient works. This indicator
        has two states s0 and s1:
         s0 means the absence of the harmful working,
         s1 means the presence of the harmful working.
    4. 47 is the patient's chest shape. This indicator has two states s0 and s1:
         s0 refers to the barrel-like shape of the patient's chest;
         s1 refers to the patient's round chest.
    5. 50 is the patient's breathing is weakened/not weakened. This indicator has two states s0 and
        s1:
         s0 means that the patient's breathing isn`t weakened;
         s1 means that the patient's breathing is weakened.
    6. 51 corresponds to the wheezing/no wheezing in the patient's lungs. This indicator has two
        states s0 and s1:
         s0 means no wheezing in the patient's lungs,
         s1 means wheezing in the patient's lungs.
    7. 52 means presence /absence wet wheezing in the patient's lungs. This indicator has two
        states s0 and s1:
         s0 means the absence of wet wheezing in the patient's lungs,
         s1 means the presence of wet wheezing in the patient's lungs.
    8. 53 means the presence/absence of dry wheezing in the patient's lungs. This indicator has two
        states s0 and s1:
         s0 means the absence of dry wheezing in the patient's lungs,
         s1 means the presence of dry wheezing in the patient's lungs.
    9. 57 corresponds to the examination results of the patient's wheezing loudness. This indicator
        has two states s0 and s1:
         s0 means low volume of wheezing;
         s1 means high volume wheezing.
    10. 89 corresponds to the information whether there are foci of inflammation. This indicator has
        two states s0 and s1:
         s0 means that there are no foci of inflammation;
         s1 means that there are foci of inflammation.


4.2.    Future Selection

    Modern data arrays, to which certain Data Mining methods can be applied, can be described by a
large amount of data that form a large-dimensional feature space. Therefore, the proportion of such a
space reducing to a dimension that allows data processing and/or visualization without unnecessary
difficulties is very urgent. The solution to such a problem is called the optimization of the feature
space or the search for significant features (Feature Selection, or Feature Engineering).
    Data preprocessing is the most important stage, the quality of which determines the possibility of
obtaining high-quality results of the entire data analysis process. Feature Selection for the task being
implemented consists of choosing the most informative, useful features and excluding uninformative
features from consideration without transforming the original data space [20, 21]. We used the ID3
(Iterative Dichotomizer) algorithm proposed by D. Quinlan, which determines the order of a variable
and its attributes through their informational significance (informational entropy) [22]. To do this,
find the entropy of all unused features and their attributes relative to test specimens and choose the
one for which the entropy is minimal (and the information content is maximal). The entropy under the
condition of not equiprobable events pi is found by the well-known Shannon formula:
                                      I       pi log2 pi ,
                                                   i
                                                                                                   (1)
where I is the amount of information in bits that can be transmitted using m elements in the message
with n letters in the alphabet, and pi=m/n.

4.3.    Clustering algorithm MeanShift
   MeanShift is a nonparametric modus for determining the location of the probability density
maximum [23]. The mean shift algorithm basically assigns data points to clusters iteratively, shifting
the points towards the highest data points density, that is, the centroid of the cluster. The Rosenblatt –
Parsen estimate is one of the most widely used for nonparametric data density estimation [24].
   The density is estimated as the total influence of the sample elements, while the contribution of
each element is described by the bell-shaped function K(x), which depends on the distance to this
element. The formula for calculating the density estimate f(x) with the smoothing parameter
(bandwidth) P at an arbitrary point x has the form:
                                                1 N  x  xi 
                                    fˆ  x        K      
                                               NP d i 1  P  .                                      (2)

   As K(x), we can use the classical Gaussian kernel:
                                       x  xi         x  xi 
                                   KG           exp     2 
                                       P              2P  .                                       (3)

   However, in practice, in order to reduce computational costs, limited kernels are used, such as, for
example, the Epanechnikov kernel:
                                        x  xi        x  xi 
                                  K Ep 
                                       
                                                  1 
                                                 
                                                            2      
                                                                  I x  xi  P2

                                           P              P                       ,                (4)
   where I(x) is the indicator function.
   In this approach, clusters correspond to local maxima of the density estimation function (modes).
And the data elements refer to clusters using the MeanShift procedure) [25], converging along the
gradient to the corresponding local maximum. An iterative procedure, starting its work from a point,
sequentially moves to a shift point xk+1=m(xk) until convergence, where:

                                             x K x  x  .
                                               N

                                   m  x     i 1 i          i
                                                                                                      (5)
                                              K x  x 
                                                   N
                                                   i 1    i


    The vector is called the "mean shift" vector and its direction coincides with the direction of the
maximum density growth at the point x. Clustering algorithms based on the use of the mean shift
procedure allow obtaining high-quality partitions, however, the main problem for using this approach
is the high computational complexity [26].

4.4.    Bayesian network methods
    Let G  (V , Bi ) be a graph in which the ending V is a set of variables; Bi is non-reflexive binary
relation on [27]. Each variable v has a kit of parent variables c(v)  V and a kit of all descendants
d (v) V . A set s(v) is a set of child variables for a variable v and s(v) is a subset of d (v) . Let's
also mark, that:
                                                (a(v) V )  V  (d (v) {v}) .                                 (6)
    That is, a(v) is a kit of propositional signs from the set V, excluding the variable v and its
descendants. The set of variables B, is the contexture of parameters defining the model. It constitution
                                                 i              i
Q x | pa ( X )  P ( x i | pa (X i )) for each x amount from X and pa( X i ) from Pa( X i ) , where Pa( X i ) means
   i   i



the variable Xi parents set in G . Each sign Xi in graph G is proposed as an apex. If we have more than
one graph, then we use the notation to recognize the parents PaG ( X i ) in graph G [28]. The total
probability B of Bayesian model is specified by the formula:
                                      PB ( X 1 ,..., X N )  i 1 PB (Xi | Pa(Xi )) .
                                                              N
                                                                                                               (7)

    The parametric learning procedure purpose is to discover the most likely θ variables that
interpret the data [29].
    For the validation procedure, we chose the algorithm presented in [30]. The method of
maximum expectation EM is a procedure of iterations, which was created for solve optimization tasks
of some functionality, using an analytical search for the objective function extremes. This way is
divided into two steps. At the first step of "expectation" (E - expectation) on the basis of available
observations (patients) the expected values for each incomplete observation are calculated. After
receiving the filled data set, the basic statistical parameters are estimated. In the second stage,
"maximization" (M - maximization) maximizes the degree of compliance of the expected and actually
substituted data [31].

5. Experiments and Results
    The Bayesian model was established via the GeNIe 2.4 Academic software environment. The
structural static Bayesian model is shown in Figure 2. The Bayesian formula is used in Bayesian
networks as an inference tool to find a solution. If the Bayesian network is used to recognize
(identify) objects, then many factors are replaced by factors or characteristics of a particular object.
    Selecting a set of instantiated variables separately has its advantages and disadvantages. The
advantage of this representation is that it prevents looping when forming the output. If the output is
not selected separately, there is a risk that the messages will affect each other and the network will
become unstable.




Figure 2: The static Bayesian network structural model
   The disadvantage of this representation is that computing costs increase. However, the advantages
are so great that the allocation of instantiated variables is completely justified. The final decision to
confirm the effect between pollution data and test results, as well as the appointment of treatment, is
made by the doctor. In Table 2, we present the results of modeling each specific case from a sample
of data. The table shows the predicting result of confirming or refuting an early diagnosis.



Table 2
The scenario analysis results
 Condition                   Result                           Description of cases, of interest
             50 will decrease by 11% (54 to 43)    When choking is at its maximum, the likelihood of
 Clinical    52 will increase by 3% (47 to 50)     breathing faster will decrease by 11%. The risk of
 case 1:     57 will increase by 3% (46 to 49)     being diagnosed with 3 will decrease by 10%
  9 max      1 will decrease by 3% (44 to 41)
               3 will decrease by 10% (61 to 51)
 Clinical     2 will decrease by 6% (43 to 37)     When the body temperature has reached its
 case 2:       3 will increase by 4% (61 to 65)    maximum, the risk of a diagnosis of 2 will decrease
  18 max                                           by 6%, and a diagnosis of 3 will increase by 4%.
 Clinical     2 will increase by 7% (43 to 50)     In the presence of harmful working conditions, the
 case 3:      1 will decrease by 5% (44 to 39)     risk of diagnosis 2 will increase by 7%, and 1 will
  40 max  3 will increase by 2% (61 to 63)        decrease by 5%.
 Clinical     1 will decrease by 5% (44 to 43)     In the presence of impaired breathing, the risk of
 case 4:      3 will increase by 4% (61 to 64)     choking increases by 10%
  50 min     9 will increase by 10% (45 to 64)
 Clinical     1 will increase by 4% (44 to 48)     If the patient's breathing is weakened as much as
 case 5:      3 will decrease by 3% (61 to 58)     possible, the risk of a diagnosis 1 increases by 4%.
  50 max 9 will decrease by 9% (45 to 36)
              2 will increase by 4% (43 to 47)     If the level of wet wheezing decreases, the risk of
 Clinical
              3 will decrease by 6% (61 to 55)     diagnosis 3 will decrease by 6%. In this case,
 case 6:
             9 will increase by 3% (45 to 48)      choking and wheezing volume may increase by 3%
  52 min
              57 will increase by 3% (46 to 49)
 Clinical     2 will decrease by 3% (43 to 40)     In the presence of pronounced wet wheezing, the
 case 7:       3 will increase by 6% (61 to 67)    risk of the diagnosis of 3 increases by 6%
  52 max
 Clinical     1 will increase by 4% (44 to 48)     If the level of wheezing is minimal, the risk of
 case 8:      2 will decrease by 4% (39 to 43)     diagnosis 2 will be reduced by 4%
  51 min      3 will increase by 4% (61 to 65)
 Clinical     1 will increase by 6% (44 to 50)     If the level of dry wheezing is minimal, the risk of
 case 9:      2 will decrease by 4% (43 to 39)     diagnosis 1 increases by 6%
  53 min      3 will increase by 2% (61 to 63)
              1 will decrease by 6% (44 to 38)     In the presence of pronounced dry wheezing, the
 Clinical
              2 will increase by 4% (43 to 47)     risk of the diagnosis of 2 increases by 4%, while
 case 10:
               3 will decrease by 2% (61 to 59)    the risk of making a diagnosis of 1 will decrease by
  53 max
                                                    6%
 Clinical      2 will increase by 5% (43 to 48)    At the maximum volume of wheezing, the risk of making a
 case 11:       3 will increase by 3% (61 to 64)   diagnosis of 2 increases by 5%
  57 max
6. Discussion
    Now let's analyze clinical cases. The analysis results are shown in Table 2 and in Figures 3-6. To
illustrate case 1 from Table 2: When choking is at its maximum, the likelihood that breathing will be
faster will decrease by 11%. At the same time, the risk of making a diagnosis of 3 will decrease by
10%, as shown in the Figure 3.



    70%
                                                                               -10%
    60%
                                        -11%                61%
    50%             54%                                                        51%
    40%                                 43%

    30%

    20%

    10%

     0%
                   X50                X50´                 Y3                 Y3´

Figure 3: The internal profitability index tends to the maximum

   Case 2: Upon reaching the maximum level of body temperature, the risk of a diagnosis of 2 will
decrease by 6%, and a diagnosis of 3 will increase by 4%, as shown in the Figure 4.


                                                                                 +4%
    70%

    60%                                                                          65%
                                                            61%
    50%                                  -6%

    40%             43%
                                         37%
    30%

    20%

    10%

     0%
                    Y2                 Y2´                 Y3                  Y3´

Figure 4: Clinical case 2

   Cases 4,11 and 12: If the patient's professional activity takes place in the presence of the maximum
harmful working conditions, the risk of diagnosis 2 will increase by 7%, and 2 will decrease by
5%, as shown in the Figure 5.
                                          +7%

    50%
                                          50%                                    -5%
    45%
                    43%                                     44%
    40%
                                                                                 39%
    35%
    30%
    25%
    20%
    15%
    10%
     5%
     0%
                   Y2                   Y2´                Y1                  Y1´

Figure 5: Clinical cases 4,11, and 12

   Results of the analysis of case 10 are shown in the Figure 6.


                                                                                 +4%
   50%
                                          -6%
   45%                                                                           47%
                   44%                                      43%
   40%
                                         38%
   35%
   30%
   25%
   20%
   15%
   10%
    5%
    0%
                   Y1                   Y1´                Y2                  Y2´

Figure 6: Clinical case 10

7. Conclusion
   The chronic obstructive pulmonary disease gives dangerous complications to various organs of a
sick person, that is, patients with hypertension, diabetes, and coronary heart disease are at risk of
increased adverse outcomes.
   The incidence of the chronic obstructive pulmonary disease depends on many factors: the standard
of living, social and marital status, working conditions, contact with animals, travel, the presence of
bad habits, contact with sick people, individual characteristics of a person, the geographic prevalence
of a particular pathogen.
   Our study received comments from physicians and community physicians regarding the
applicability of our probabilistic estimates of diagnosis as a working tool of daily practice. Our
Bayesian system has also been evaluated by non-medical independent experts for comments from a
patient perspective.

8. References
[1]  O. Turuta, I. Perova, A. Deineko, Evolving flexible neuro-fuzzy system for medical diagnostic
     tasks, International Journal of Computer Science and Mobile Computing IJCSMC (2015) vol.4,
     Issue 8, 475-480.
[2] H. Kahramanli, N. Allahverdi, Design of a hybrid system for the diabetes and heart diseases,
     Expert Systems with Applications (2008) vols. 1-2, no. 35, 82–89.
[3] E. Lughofer, Single pass active learning with conflict and ignorance, Evolving systems (2012)
     vol. 4, no. 3, 251-271.
[4] V. Lytvyn, A. Hryhorovych, V. Hryhorovych, L. Chyrun, V. Vysotska, M. Bublyk, Medical
     Content Processing in Intelligent System of District Therapist, in: Proceedings of the 3rd
     International Conference on Informatics & Data-Driven Medicine (IDDM 2020), Växjö, Sweden,
     November 19 - 21, 2020, 415-429. http://ceur-ws.org/Vol-2753/paper29.pdf.
[5] B. Badiani, A. Messori, Targeted Treatments for Pulmonary Arterial Hypertension: Interpreting
     Outcomes by Network Meta-analysis, PharmD Heart, lung & circulation, (2015), vol.25 (1), 46-
     52.
[6] G. Battineni, G. Sagaro, N. Chinatalapudi, F. Amenta, Applications of Machine Learning
     Predictive Models in the Chronic Disease Diagnosis, Journal of personalized medicine, 2020-06-
     01, vol.10 (2), p.21.
[7] D. Luciani, M. Marchesi, G. Bertolini, The role of Bayesian Networks in the diagnosis of
     pulmonary embolism Journal of thrombosis and haemostasis, 2003-04, vol.1 (4), 698-707.
[8] D. Luciani, S. Cavuto, L. Antiga, M. Miniati, S. Monti, M. Pistolesi, G. Bertolini, Bayes
     pulmonary embolism assisted diagnosis: a new expert system for clinical use, Emergency
     medicine journal : EMJ, 2007-03, vol.24 (3), 157-164.
[9] Y. Bodyanskiy, Computational intelligence techniques for data analysis, in: Lecture Notes in
     Informatics, 2005, vol.5(72), pp. 15-36.
[10] K. Kaplan and M. Haenlein, Siri, Siri, in my hand: Who’s the fairest in the land? On the
     interpretations, illustrations, and implications of artificial intelligence, Business Horizons, 2019,
     pp. 15-25.
[11] L. Chyrun, E. Leshchynskyy, V. Lytvyn, A. Rzheuskyi, V.Vysotska, Y. Borzov, Intellectual
     Analysis of Making Decisions Tree in Information Systems of Screening Observation for
     Immunological Patients. In: CEUR Workshop Proceedings, Vol-2362, (2019), 281-296.
[12] S. Kendal, M. Creen, An introduction to knowledge engineering, London: Springer, 2007, p. 287.
[13] S. Sahan, K. Polat, H. Kodaz, S. Gunes, The medical applications of attribute weighted artificial
     immune system (AWAIS): diagnosis of hearts and diabetes diseases, in: Proceedings of the 4th
     international conference on Artificial Immune Systems, Banff, AB, Canada, (2005).
[14] S. Lekkas, L. Mikhailov, Evolving fuzzy medical diagnosis of Pima Indians diabetes and of
     dermatological diseases, Artificial Intelligence in Medicine, (2010), no. 50, 117–126.
[15] K. Polat, S. Gunes, A. Arshlan, A cascade learning system for classification of diabetes disease:
     generalized discriminant analysis and least square support vector machine, Expert Systems with
     Applications, 2008, no. 34, pp. 482–487.
[16] P. Lucas, Bayesian networks in medicine : a model-based approach to medical decision making
     (2001). https://doi.org/10.1.1.22.4103
[17] H. Temurtas, N. Yumusak, F. Temurtas, A comparative study on diabetes disease diagnosis using
     neural networks, Expert Systems with Applications, 2009, no. 36, 8610–8615.
[18] C. Bojarczuk, H. Lopes, A. Freitas, E. Michalkiewicz, A constrained-syntax genetic programming
     system for discovering classification rules: application to medical data sets, Artificial Intelligence
     in Medicine, 2004, vol. 1, no. 10, 27-48.
[19] D.-C. Li, C.-W. Liu, S. C. Hu, A fuzzy-based data transformation for feature extraction to
     increase classification performance with small medical data sets, Artificial Intelligence in
     Medicine, 2011, vol. 1, no. 45, pp. 45-52.
[20] M.L. Raymer, W.F. Punch, E.D. Goodman, L.A. Kuhn, L.C. Jain, Dimensionality reduc-tion
     using genetic algorithms, IEEE Trans. on Evolutionary Computation, 2000, no. 4 (2), 164-171.
[21] V. Lytvynenko, M. Voronenko, D. Nikytenko, N. Savina, O. Naumov, Assessing the possibility
     of a country's economic growth using dynamic bayesian network models. In: IEEE-2019 14th
     International Scientific and Technical Conference on Computer Sciences and Information
     Technologies (CSIT). vol. CFP19D36-PRT, (2020), pp. 60-63.
[22] A. Magrini, D. Luciani, F. Stefanini, A probabilistic network for the diagnosis of acute
     cardiopulmonary diseases, Biometrical journal, 2018-01, vol.60 (1), 174-195.
[23] J. Amaral, A. Lopes, J. Jansen, A. Faria, P. Melo, Machine learning algorithms and forced
     oscillation measurements applied to the automatic identification of chronic obstructive pulmonary
     disease, Computer methods and programs in biomedicine, 2011, vol.105 (3), 183-193.
[24] Y. Cheng, Mean Shift, Mode Seeking, and Clustering, IEEE Transactions on Pattern Analysis and
     Machine Intelligence, 1995, August, vol. 17, issue 8, doi: 10.1109 / 34.400568.
[25] I.A. Pestunov, V.B. Berikov, Yu.N. Sinyavsky, Segmentation of multispectral images based on an
     ensemble of nonparametric clustering algorithms, Vestn. Sib. State aerospace un-that them.
     Academician M.F. Reshetnev. 2010, no. 5 (31), 56-64.
[26] I.A. Pestunov, Yu.N. Sinyavsky, Analysis and synthesis of signals and images, a nonparametric
     algorithm for clustering remote sensing data based on the grid approach, Avtometriya, 2006, vol.
     42, no. 2, 90-99.
[27] N. Friedman, D. Koller, Being Bayesian about network structure. A Bayesian approach to
     structure discovery in Bayesian Networks, Machine Learning, 2003, vol. 50, 95–125.
[28] A. A. Darwiche, Differential approach to inference in Bayesian networks, In Uncertainty in
     Artificial Intelligence: Proceedings of the Sixteenth Conference (UAI 2000), pp. 123–132. San
     Francisco, CA: Morgan Kaufmann Publishers, 2000.
[29] A.N. Terentyev, P.I. Bidyuk, The method of probabilistic inference in Bayesian networks
     according to training data, Cybernetics and system analysis, 2007, no. 3, 93—99 (in Rusian).
[30] M. Voronenko et al., Dynamic Bayesian Networks Application for Economy Competitiveness
     Situational Modelling. In: Advances in Intelligent Systems and Computing (2020), CSIT 2020,
     vol. 1293, 210-224.
[31] A.S. Cofino, R. Cano, C.Sordo, J.M. Gutierrez, Bayesian Networks For Probabilistic Weather
     Prediction, In Proceedings Of The 15th European Conference On Artificial Intelligence, Ios Press,
     2002, 695-700.