=Paper= {{Paper |id=Vol-3609/paper20 |storemode=property |title=Uncertain Model for Classification of Children's Neurological Diseases |pdfUrl=https://ceur-ws.org/Vol-3609/paper16.pdf |volume=Vol-3609 |authors=Olga Rybytska,Ivanna Dronyuk,Andriy Pushnyk |dblpUrl=https://dblp.org/rec/conf/iddm/RybytskaDP23 }} ==Uncertain Model for Classification of Children's Neurological Diseases== https://ceur-ws.org/Vol-3609/paper16.pdf
                         Uncertain Model for Classification of Children's Neurological
                         Diseases
                         Olga Rybytskaa, Ivanna Dronyuka,b and Andriy Pushnykc
                    a
                      Lviv Polytechnic National University, Bandery 12, Lviv, 79013, Ukraine
                    b
                      Jan Dlugosz University in Czestochowa, Waszyngtona 4/8, Czestochowa, 42200, Poland
                    c
                      Lviv National Medical University, Pekarska str.,69, Lviv,79010, Ukraine


                                        Abstract
                                        The dataset of thirty-three parameters about one hundred children from three to seven years
                                        old is considered. The biggest part of children (about 70%) have some neurological diseases.
                                        The dataset contains such parameters: body characteristics, physical and emotional
                                        functioning, physical health, communications, relationships in the family, and some results of
                                        medical analysis. Standard statistical analysis was realized for the preprocessing. Based on
                                        the obtained results the k-mean method was proposed for solving classification tasks. The
                                        Silhouette method and Elbow method were realized. Clusters corresponding to groups of
                                        indicators of Psychosocial Health Summary Score, Physical Health Summary Score, Total
                                        score (0-100%), and nutritional status of healthy children and children with neurological
                                        pathology were obtained. The calculation was made according to the PedsQLTM 2.0
                                        questionnaire and was calculated using a special licensed computer program Dietplan 7. An
                                        uncertain model for classification is proposed.

                                        Keywords 1
                                        Children's neurological diseases, classifications of diseases, improving diagnostics with AI,
                                        Silhouette method, Elbow method

                         1. Introduction

                            One of the most important signs of the well-being of society is the health of its members, and first
                         of all, the health of children. Modern research data indicate a tendency to decrease in children's health
                         indicators, particularly in Ukraine. Diseases showing the highest growth rates include damage to the
                         central and peripheral nervous system and primarily psychomotor disorders. The count of children’s
                         neurological diseases is increasing nowadays. The effectiveness of treatment heavily depends on the
                         timely and correct diagnosis. The development of AI technologies and their application for medicine
                         has made a real breakthrough in the diagnosis and treatment of neurological diseases in children.
                            It does not seem possible to establish the exact number (in percent of the child-age population) for
                         the purpose of comparisons by regions and the level of welfare of countries using today's existing
                         methods of diagnosis. This is due, in particular, to different possibilities in organizing samples,
                         methods, and criteria of randomization, diagnostic criteria, and instruments used. In some studies,
                         cases of pervasive developmental disorders associated with severe mental retardation, congenital
                         abnormalities of brain development, epilepsy, and prematurity are not taken into account. There is
                         often a lack of registration of children who demonstrate accelerated development, which is not
                         characteristic of their age. This explains why the prevalence of Autism Spectrum Disorders (ASD) in
                         particular in China (6.4 per 10,000) is significantly lower than in Asia, Europe, and North America

                         1                    IDDM’2023: 6th International Conference on Informatics & Data-Driven Medicine, November 17 - 19, 2023,
                         Bratislava, Slovakia
                                     EMAIL: olha.m.rybytska@lpnu.ua (A. 1); i.dronyuk@ujd.edu.pl (A. 2); apushnyk@gmail.com (A. 3)
                                     ORCID: 0000-0002-2394-355X (A. 1); 0000-0003-1667-2584 (A. 2); 0000-0001-7902-9639 (A. 3)
                                                      ©️ 2023 Copyright for this paper by its authors.
                                        Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
                                                      CEUR Workshop Proceedings (CEUR-WS.org)


CEUR
                  ceur-ws.org
Workshop      ISSN 1613-0073
Proceedings
(the average prevalence of ASD is about 1%) [1]. In South Korea, where schoolchildren were studied,
this indicator was 2.6% (3.7% among boys and 1.5% among girls) [2].
    The rate of incidence of ASD in Ukraine is increasing. Thus, from 2006 to 2017, the number of
newly diagnosed cases of the disease increased 8.5 times (by the end of 2017 it was 998) [3]. Over the
last decade, a stable trend of growth in the share of children with developmental disorders has been
recorded in Ukraine. According to information from the Center for Medical Statistics of the Ministry
of Health of Ukraine for 2017, among the children, the incidence and prevalence of central and
peripheral nervous system lesions were 40.19, respectively; 0.97 and 15.19 and 0.42 per 1000 child
population; behavioral disorders - 24.13 and 3.77 per 1000 child population.
    In particular, until 2006, ASD was almost undiagnosed in our country. The rate of incidence of
ASD in Ukraine has been increasing since 2006. The number of registered ASD patients increased
from 662 in 2005 to 7,491 in 2017. According to a large study conducted by the US Centers for
Disease Control and Prevention (CDC), the prevalence of autism spectrum disorders in the US was 1
in 88 children. The disease was five times more common in boys (1 in 54) than in girls (1 in 252).
    The prevalence of the syndrome (disorder) of attention deficit hyperactivity disorder (ADHD) is
from 4 to 12% worldwide, and of minimal brain dysfunction (MBD) - from 2 to 20%, according to
other data - from 3-8% to 30 - 50% of the child population. There is a steady trend towards an
increase in the number of children with these psychomotor disorders, as well as with mental
retardation (MR). Therefore, it can be argued that the available literature does not fully reflect the
current situation [4].
    This condition is associated with the lack of mass-planned diagnosis of psychomotor disorders at
an early age. This problem must be solved, because early diagnosis and medical, in particular, dietary
correction can significantly alleviate and sometimes eliminate symptoms. Therefore, the sooner the
disease is detected, the causes of its occurrence, as well as measures to reduce the severity of the
disease course (and in some cases complete elimination of symptoms) in children become an
extremely important task. The exact mechanisms of the effect of genetic factors on damage to the
central and peripheral nervous system are investigated with the help of genomic screening,
cytogenetic studies, and evaluation of candidate genes [5].
    Also, at the stage of primary medical care during routine preventive examinations at 9, 18, 24, and
36 months, screening of children's development is currently required, during which children with
developmental delays are identified. Additional special screening is recommended if the child has
previously described risk factors for ASD, diseases that have a high level of comorbidity with ASD.
Special screening is also recommended for older children if they have disorders of social interaction
and communication. The final diagnosis of ASD was carried out by a specialist doctor based on the
diagnostic criteria of ICD-10 (WHO, 1992), and since 2018 - ICD-11. However, the connection
between the disease and the nutritional background is mostly not considered today.
    Diseases of the central nervous system are currently considered as the result of the interaction of
genetic and environmental factors. Diseases that disrupt the work of the brain and physical influences,
particularly nutrition, are a risk of morbidity. Ensuring the quality of life is an integral part of the
treatment program for children with psychomotor disorders. In children with neurological pathology,
treatment tactics are aimed not only at alleviating the child's condition and controlling clinical
symptoms but also at improving the quality of his life [6].
    Adequate nutritional status is the key to the physiological development of the child's body, in
particular the central nervous system, in critical periods of ontogenesis (pregnancy and the first years
of life). Nutritional deficiency (excess) during these time periods leads to a violation of the formation
of cognitive functions, behavioral reactions, and productivity of the child in various spheres of his
activity, which will have a negative impact on both childhood and youth years, as well as on all
further adult life of the individual, limiting his opportunities for further development [7]. Thus, the
development and formation of the central nervous system critically depend on nutritional support that
is adequate to the child's needs in one or another period of life. Sufficient intake of proteins in the
child's body, and provision of energy substrates, including glucose, fats (long-chain polyunsaturated
fatty acids), iron, zinc, copper, iodine, and folic acid, are especially important. Their absence or
excess leads to increased metabolic disorders in the CNS, and structural and epigenetic changes in it,
which will have a long-term, often lifelong, impact on the regulation of its functions [8-12].
    Children with disorders of psychomotor development against the background of existing
neurological disorders and various somatic pathologies become especially vulnerable to deficits
(surplus) of nutritional support due to their inherent nutritional features, which significantly
complicates their rehabilitation [13]. The sensory profile of children with ASD, with its characteristic
hypo- and hyper-reactivity to tactile, olfactory, and visual stimuli, causes a strict selectivity in food
due to its appearance, consistency, taste, and temperature. [14-16]. In work [17-19] the systematic
review of the research about using modern information technologies for the diagnosis of neural
diseases. For medical diagnostics, the classification task is very important. In many modern kinds of
research, the method of linear regression [20], Nearest neighbor, Nearest Center [21], randomized
classification trees [22], and Bayesian classifier [23] are used for solving classification tasks. In work
[24,25,26] the systematic review of the research about using modern information technologies for the
diagnosis of neural diseases. For medical diagnostics, the classification task is very important. In
many modern kinds of research, the method of linear regression [27], Nearest neighbor, Nearest
Center, randomized classification trees, and Bayesian classifier [28] are used for solving classification
tasks. The recurrent approximation is used for the tasks of neural network synthesis in the medical
therapies control [29].
    Therefore, the problem of early and reliable detection of possible problems in the nutritional
supply of the child and the development of recommendations for its balancing is an important factor
in reducing the severity of the course of CNS diseases, rehabilitation, and improving the quality of life
and its expected duration.

2. Research methods

   The work uses such research methods as statistical collection of information, cluster analysis, and
construction of membership functions of a fuzzy logical model. Obtained statistical information
regarding the physical condition of children with absent central nervous system (CNS) diseases
(hereinafter "Healthy", abbreviated H) (20 children), attention deficit hyperactivity disorder (ADHD)
(13 children), mental retardation (MR) (30 children), autism spectrum disorder (ASD) (17 children),
and minimal brain dysfunction (MBD) (10 children) were systematized by age (3 years, 4-5 years, 6-7
years) and gender. The study used indicators (Group 1) of physical condition (Weight (kg), Height
(cm), Head Circumference (cm), Chest Circumference (cm), Shoulder Circumference (cm), Hip
Circumference (cm), Leg Circumference (cm), BMI (body mass index)), as well as subjective
indicators: Physical functioning, Social functioning, Emotional functioning, Functioning of
schools/kindergartens; calculated indicators: Psychosocial Health Summary Score, Physical Health
Summary Score, Total score (0-100%). The calculation was made according to the PedsQLTM 2.0
questionnaire.
   Also, in the clustering of the entire sample, the indicators were taken into account: Awareness,
Family involvement, Communication Technical skills, Emotional needs, Overall satisfaction, Total
score, Physical functioning, Emotional functioning, Social functioning, Mental functioning,
communication, Concern, Everyday activity, Relationships in the family, The Parent HRQL Summary
Score, The Family Functioning Summary.
   Average values (Group 2) of nutrients obtained from food (Water, Proteins, Fats, Carbohydrates,
Energy, Total sugar, Saturated fat, Monounsaturated fat, Polyunsaturated fat, Dietary fiber, Na, K Ca
Mg, Phosphorus (P), Fe, Cu, Zn, Cl, Mn, Se, I, Retinol, Vitamins D, E, B1, B2, PP, B6, B12, C,
Folates, Pantothenic acid, Biotin was calculated using a special of the licensed computer program
Dietplan 7, developed by the British company Forestfield Software Limited (Horsham, UK) (user
agreement dated 30.07.19).
   Clustering (k-means) was carried out in order to identify groups of similarity based on the
indicators of Group 1 and Group 2. The number of clusters [24-26] was determined according to the
elbow and silhouette methods. Initial value clusters have a large impact on the clustering model, so
algorithms are used for different given initial values. The location measure is used to find the closest
points in the clusters to the cluster center, By definition different measures of distance can be created
for different clusters. The number of clusters (k) is a defined parameter in K-Means clustering. To
determine the optimal value of various methods, in this research we use the following methods:
Elbow method; and Silhouette method.
    The Elbow method is an empirical method for determining the optimal number of clusters for a
data set. In this method, we select a subset of these k values and then apply K-Means clustering using
each k value. We find the average distance of each point in the cluster to its center. Then we calculate
the sum of squares S of the distances between the centroids. Now to determine the best number of
clusters (k), we plot k against their value S. We choose the value of k from which the plot looks like a
straight line.
    The silhouette method is unique in the way it interprets and checks for consistency in data clusters.
This method provides a concise graphical representation of how well each object is classified. The
size of the silhouette is given by a strictly mathematical formula for each cluster of points using the
minimum value of the average distance of a given point in relation to other points of the cluster.
Silhouette magnitude is a measure of how similar an object is to its own cluster (cohesion) and
different from other clusters (separation). The silhouette ranges from -1 to +1, where a high value
indicates that the object matches well with its own cluster and poorly matches neighboring clusters. If
one object has a high value, then the clustering configuration is appropriate. If many points have a low
or negative value, then the clustering configuration may have too many or too few clusters. The
silhouette can be calculated using any distance metric. In the presented research the Euclidian metric
is used.

3. Results of data clustering of patients aged 3-7 years according to all
indicators of Groups 1 and 2.

   The results of clustering the data of the entire sample into 3 clusters according to all indicators, in
particular of the 1st and 2nd groups, made it possible to confirm the fact that the composition of
nutrients is associated with diseases of the central nervous system. Thus, clusters 0 and 1 included
both healthy and sick patients. At that time, only patients with ASD, MR and ADHD were in cluster
2. The value of the centroids is shown in Fig. 1. The significant difference between the content in the
diet of such nutrients as Water, Carbohydrates, Energy, Na, K, Ca, P, Cl, Retinol, and Folates gave
the impetus for more detailed studies. Since early diagnosis and rehabilitation is the most effective, it
was decided to carry out more detailed research with data on children aged 3 years. It is obvious that
age differences played a significant role in the clustering of the sample of the 3-7 age group.


                                   0 Cluster       1 Cluster        2 Cluster

 6000
 5000
 4000
 3000
 2000
 1000
     0
                   The Family…
                    Emotional…




         Relationships in the …
                    Emotional…
           Leg circumference…


              Physical Health…
                         Head…




                       Energy




                            B2
                            B6
                   Awareness




                    Vitamin С
                             Cl
                        Water


                 Saturated fat
         Polyunsaturated fat

                            Ca

                            Cu
          Shoulder girth (cm)




                            Se

                             Е
            Emotional needs
                   Total score
                     Mass (kg)




          Mental functioning




                  Phosphorus



                       Retinol
                          Fats
                      Concern




                       Folates
         Physical functioning



             Communication




                           Na




Figure 1: The value of the centroids of clusters 0-2 of all sample participants by all indicators
4. Results of clustering of patients in the age category of 3 years.

   The results of clustering Fig. 2,3 were obtained in relation to groups of similarity according to the
indicators of Group 1 and centroid values (A group of 3-year-old patients according to the indicators
of Group 1 was clustered into 5 clusters (0-cluster - 4-cluster). Value centroid indicators for each
cluster are presented in Table 2. Both healthy and sick patients were included in clusters 0 and 3.
Patients with MR, ASD, and ADHD were included in clusters 1,2,4.




Figure 2: Results of clustering indicators of Group 1 (Emotional functioning – Social functioning)




Figure 3: Results of clustering of indicators of Group 1 (Psychosocial Health Summary Score - Physical
Health Summary Score)
   The obtained results drew attention to the non-synchronicity of the trends to increase or decrease
in the values of individual indicators of Group 1 with the presence (absence) of CNS diseases.
Moreover, when comparing the values of such an indicator as Weight in Ukrainian boys corresponds
to the norms (Tabl. 1,2), while Height and Head Circumference are deviated in the direction of
decrease from the norm. The percentage of head circumference to height corresponds to the norm. At
the same time, the breast circumference, as well as the relative index to height, exceeds the normal
values [27].
   Studies have shown that in patients with MR and ASD (to a lesser extent, ADHD), deviations in
many indicators of physical condition are extremely small and extremely large. Studies have shown
that in patients with MR and ASD (to a lesser extent, ADHD), the deviations of many indicators of
physical condition from the norm are significant, that is, their values are extremely small or extremely
large. Thus, according to the Weight indicator m, healthy children have fixed values 13.5