=Paper=
{{Paper
|id=Vol-3609/paper20
|storemode=property
|title=Uncertain Model for Classification of Children's Neurological Diseases
|pdfUrl=https://ceur-ws.org/Vol-3609/paper16.pdf
|volume=Vol-3609
|authors=Olga Rybytska,Ivanna Dronyuk,Andriy Pushnyk
|dblpUrl=https://dblp.org/rec/conf/iddm/RybytskaDP23
}}
==Uncertain Model for Classification of Children's Neurological Diseases==
Uncertain Model for Classification of Children's Neurological Diseases Olga Rybytskaa, Ivanna Dronyuka,b and Andriy Pushnykc a Lviv Polytechnic National University, Bandery 12, Lviv, 79013, Ukraine b Jan Dlugosz University in Czestochowa, Waszyngtona 4/8, Czestochowa, 42200, Poland c Lviv National Medical University, Pekarska str.,69, Lviv,79010, Ukraine Abstract The dataset of thirty-three parameters about one hundred children from three to seven years old is considered. The biggest part of children (about 70%) have some neurological diseases. The dataset contains such parameters: body characteristics, physical and emotional functioning, physical health, communications, relationships in the family, and some results of medical analysis. Standard statistical analysis was realized for the preprocessing. Based on the obtained results the k-mean method was proposed for solving classification tasks. The Silhouette method and Elbow method were realized. Clusters corresponding to groups of indicators of Psychosocial Health Summary Score, Physical Health Summary Score, Total score (0-100%), and nutritional status of healthy children and children with neurological pathology were obtained. The calculation was made according to the PedsQLTM 2.0 questionnaire and was calculated using a special licensed computer program Dietplan 7. An uncertain model for classification is proposed. Keywords 1 Children's neurological diseases, classifications of diseases, improving diagnostics with AI, Silhouette method, Elbow method 1. Introduction One of the most important signs of the well-being of society is the health of its members, and first of all, the health of children. Modern research data indicate a tendency to decrease in children's health indicators, particularly in Ukraine. Diseases showing the highest growth rates include damage to the central and peripheral nervous system and primarily psychomotor disorders. The count of children’s neurological diseases is increasing nowadays. The effectiveness of treatment heavily depends on the timely and correct diagnosis. The development of AI technologies and their application for medicine has made a real breakthrough in the diagnosis and treatment of neurological diseases in children. It does not seem possible to establish the exact number (in percent of the child-age population) for the purpose of comparisons by regions and the level of welfare of countries using today's existing methods of diagnosis. This is due, in particular, to different possibilities in organizing samples, methods, and criteria of randomization, diagnostic criteria, and instruments used. In some studies, cases of pervasive developmental disorders associated with severe mental retardation, congenital abnormalities of brain development, epilepsy, and prematurity are not taken into account. There is often a lack of registration of children who demonstrate accelerated development, which is not characteristic of their age. This explains why the prevalence of Autism Spectrum Disorders (ASD) in particular in China (6.4 per 10,000) is significantly lower than in Asia, Europe, and North America 1 IDDM’2023: 6th International Conference on Informatics & Data-Driven Medicine, November 17 - 19, 2023, Bratislava, Slovakia EMAIL: olha.m.rybytska@lpnu.ua (A. 1); i.dronyuk@ujd.edu.pl (A. 2); apushnyk@gmail.com (A. 3) ORCID: 0000-0002-2394-355X (A. 1); 0000-0003-1667-2584 (A. 2); 0000-0001-7902-9639 (A. 3) ©️ 2023 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). CEUR Workshop Proceedings (CEUR-WS.org) CEUR ceur-ws.org Workshop ISSN 1613-0073 Proceedings (the average prevalence of ASD is about 1%) [1]. In South Korea, where schoolchildren were studied, this indicator was 2.6% (3.7% among boys and 1.5% among girls) [2]. The rate of incidence of ASD in Ukraine is increasing. Thus, from 2006 to 2017, the number of newly diagnosed cases of the disease increased 8.5 times (by the end of 2017 it was 998) [3]. Over the last decade, a stable trend of growth in the share of children with developmental disorders has been recorded in Ukraine. According to information from the Center for Medical Statistics of the Ministry of Health of Ukraine for 2017, among the children, the incidence and prevalence of central and peripheral nervous system lesions were 40.19, respectively; 0.97 and 15.19 and 0.42 per 1000 child population; behavioral disorders - 24.13 and 3.77 per 1000 child population. In particular, until 2006, ASD was almost undiagnosed in our country. The rate of incidence of ASD in Ukraine has been increasing since 2006. The number of registered ASD patients increased from 662 in 2005 to 7,491 in 2017. According to a large study conducted by the US Centers for Disease Control and Prevention (CDC), the prevalence of autism spectrum disorders in the US was 1 in 88 children. The disease was five times more common in boys (1 in 54) than in girls (1 in 252). The prevalence of the syndrome (disorder) of attention deficit hyperactivity disorder (ADHD) is from 4 to 12% worldwide, and of minimal brain dysfunction (MBD) - from 2 to 20%, according to other data - from 3-8% to 30 - 50% of the child population. There is a steady trend towards an increase in the number of children with these psychomotor disorders, as well as with mental retardation (MR). Therefore, it can be argued that the available literature does not fully reflect the current situation [4]. This condition is associated with the lack of mass-planned diagnosis of psychomotor disorders at an early age. This problem must be solved, because early diagnosis and medical, in particular, dietary correction can significantly alleviate and sometimes eliminate symptoms. Therefore, the sooner the disease is detected, the causes of its occurrence, as well as measures to reduce the severity of the disease course (and in some cases complete elimination of symptoms) in children become an extremely important task. The exact mechanisms of the effect of genetic factors on damage to the central and peripheral nervous system are investigated with the help of genomic screening, cytogenetic studies, and evaluation of candidate genes [5]. Also, at the stage of primary medical care during routine preventive examinations at 9, 18, 24, and 36 months, screening of children's development is currently required, during which children with developmental delays are identified. Additional special screening is recommended if the child has previously described risk factors for ASD, diseases that have a high level of comorbidity with ASD. Special screening is also recommended for older children if they have disorders of social interaction and communication. The final diagnosis of ASD was carried out by a specialist doctor based on the diagnostic criteria of ICD-10 (WHO, 1992), and since 2018 - ICD-11. However, the connection between the disease and the nutritional background is mostly not considered today. Diseases of the central nervous system are currently considered as the result of the interaction of genetic and environmental factors. Diseases that disrupt the work of the brain and physical influences, particularly nutrition, are a risk of morbidity. Ensuring the quality of life is an integral part of the treatment program for children with psychomotor disorders. In children with neurological pathology, treatment tactics are aimed not only at alleviating the child's condition and controlling clinical symptoms but also at improving the quality of his life [6]. Adequate nutritional status is the key to the physiological development of the child's body, in particular the central nervous system, in critical periods of ontogenesis (pregnancy and the first years of life). Nutritional deficiency (excess) during these time periods leads to a violation of the formation of cognitive functions, behavioral reactions, and productivity of the child in various spheres of his activity, which will have a negative impact on both childhood and youth years, as well as on all further adult life of the individual, limiting his opportunities for further development [7]. Thus, the development and formation of the central nervous system critically depend on nutritional support that is adequate to the child's needs in one or another period of life. Sufficient intake of proteins in the child's body, and provision of energy substrates, including glucose, fats (long-chain polyunsaturated fatty acids), iron, zinc, copper, iodine, and folic acid, are especially important. Their absence or excess leads to increased metabolic disorders in the CNS, and structural and epigenetic changes in it, which will have a long-term, often lifelong, impact on the regulation of its functions [8-12]. Children with disorders of psychomotor development against the background of existing neurological disorders and various somatic pathologies become especially vulnerable to deficits (surplus) of nutritional support due to their inherent nutritional features, which significantly complicates their rehabilitation [13]. The sensory profile of children with ASD, with its characteristic hypo- and hyper-reactivity to tactile, olfactory, and visual stimuli, causes a strict selectivity in food due to its appearance, consistency, taste, and temperature. [14-16]. In work [17-19] the systematic review of the research about using modern information technologies for the diagnosis of neural diseases. For medical diagnostics, the classification task is very important. In many modern kinds of research, the method of linear regression [20], Nearest neighbor, Nearest Center [21], randomized classification trees [22], and Bayesian classifier [23] are used for solving classification tasks. In work [24,25,26] the systematic review of the research about using modern information technologies for the diagnosis of neural diseases. For medical diagnostics, the classification task is very important. In many modern kinds of research, the method of linear regression [27], Nearest neighbor, Nearest Center, randomized classification trees, and Bayesian classifier [28] are used for solving classification tasks. The recurrent approximation is used for the tasks of neural network synthesis in the medical therapies control [29]. Therefore, the problem of early and reliable detection of possible problems in the nutritional supply of the child and the development of recommendations for its balancing is an important factor in reducing the severity of the course of CNS diseases, rehabilitation, and improving the quality of life and its expected duration. 2. Research methods The work uses such research methods as statistical collection of information, cluster analysis, and construction of membership functions of a fuzzy logical model. Obtained statistical information regarding the physical condition of children with absent central nervous system (CNS) diseases (hereinafter "Healthy", abbreviated H) (20 children), attention deficit hyperactivity disorder (ADHD) (13 children), mental retardation (MR) (30 children), autism spectrum disorder (ASD) (17 children), and minimal brain dysfunction (MBD) (10 children) were systematized by age (3 years, 4-5 years, 6-7 years) and gender. The study used indicators (Group 1) of physical condition (Weight (kg), Height (cm), Head Circumference (cm), Chest Circumference (cm), Shoulder Circumference (cm), Hip Circumference (cm), Leg Circumference (cm), BMI (body mass index)), as well as subjective indicators: Physical functioning, Social functioning, Emotional functioning, Functioning of schools/kindergartens; calculated indicators: Psychosocial Health Summary Score, Physical Health Summary Score, Total score (0-100%). The calculation was made according to the PedsQLTM 2.0 questionnaire. Also, in the clustering of the entire sample, the indicators were taken into account: Awareness, Family involvement, Communication Technical skills, Emotional needs, Overall satisfaction, Total score, Physical functioning, Emotional functioning, Social functioning, Mental functioning, communication, Concern, Everyday activity, Relationships in the family, The Parent HRQL Summary Score, The Family Functioning Summary. Average values (Group 2) of nutrients obtained from food (Water, Proteins, Fats, Carbohydrates, Energy, Total sugar, Saturated fat, Monounsaturated fat, Polyunsaturated fat, Dietary fiber, Na, K Ca Mg, Phosphorus (P), Fe, Cu, Zn, Cl, Mn, Se, I, Retinol, Vitamins D, E, B1, B2, PP, B6, B12, C, Folates, Pantothenic acid, Biotin was calculated using a special of the licensed computer program Dietplan 7, developed by the British company Forestfield Software Limited (Horsham, UK) (user agreement dated 30.07.19). Clustering (k-means) was carried out in order to identify groups of similarity based on the indicators of Group 1 and Group 2. The number of clusters [24-26] was determined according to the elbow and silhouette methods. Initial value clusters have a large impact on the clustering model, so algorithms are used for different given initial values. The location measure is used to find the closest points in the clusters to the cluster center, By definition different measures of distance can be created for different clusters. The number of clusters (k) is a defined parameter in K-Means clustering. To determine the optimal value of various methods, in this research we use the following methods: Elbow method; and Silhouette method. The Elbow method is an empirical method for determining the optimal number of clusters for a data set. In this method, we select a subset of these k values and then apply K-Means clustering using each k value. We find the average distance of each point in the cluster to its center. Then we calculate the sum of squares S of the distances between the centroids. Now to determine the best number of clusters (k), we plot k against their value S. We choose the value of k from which the plot looks like a straight line. The silhouette method is unique in the way it interprets and checks for consistency in data clusters. This method provides a concise graphical representation of how well each object is classified. The size of the silhouette is given by a strictly mathematical formula for each cluster of points using the minimum value of the average distance of a given point in relation to other points of the cluster. Silhouette magnitude is a measure of how similar an object is to its own cluster (cohesion) and different from other clusters (separation). The silhouette ranges from -1 to +1, where a high value indicates that the object matches well with its own cluster and poorly matches neighboring clusters. If one object has a high value, then the clustering configuration is appropriate. If many points have a low or negative value, then the clustering configuration may have too many or too few clusters. The silhouette can be calculated using any distance metric. In the presented research the Euclidian metric is used. 3. Results of data clustering of patients aged 3-7 years according to all indicators of Groups 1 and 2. The results of clustering the data of the entire sample into 3 clusters according to all indicators, in particular of the 1st and 2nd groups, made it possible to confirm the fact that the composition of nutrients is associated with diseases of the central nervous system. Thus, clusters 0 and 1 included both healthy and sick patients. At that time, only patients with ASD, MR and ADHD were in cluster 2. The value of the centroids is shown in Fig. 1. The significant difference between the content in the diet of such nutrients as Water, Carbohydrates, Energy, Na, K, Ca, P, Cl, Retinol, and Folates gave the impetus for more detailed studies. Since early diagnosis and rehabilitation is the most effective, it was decided to carry out more detailed research with data on children aged 3 years. It is obvious that age differences played a significant role in the clustering of the sample of the 3-7 age group. 0 Cluster 1 Cluster 2 Cluster 6000 5000 4000 3000 2000 1000 0 The Family… Emotional… Relationships in the … Emotional… Leg circumference… Physical Health… Head… Energy B2 B6 Awareness Vitamin С Cl Water Saturated fat Polyunsaturated fat Ca Cu Shoulder girth (cm) Se Е Emotional needs Total score Mass (kg) Mental functioning Phosphorus Retinol Fats Concern Folates Physical functioning Communication Na Figure 1: The value of the centroids of clusters 0-2 of all sample participants by all indicators 4. Results of clustering of patients in the age category of 3 years. The results of clustering Fig. 2,3 were obtained in relation to groups of similarity according to the indicators of Group 1 and centroid values (A group of 3-year-old patients according to the indicators of Group 1 was clustered into 5 clusters (0-cluster - 4-cluster). Value centroid indicators for each cluster are presented in Table 2. Both healthy and sick patients were included in clusters 0 and 3. Patients with MR, ASD, and ADHD were included in clusters 1,2,4. Figure 2: Results of clustering indicators of Group 1 (Emotional functioning – Social functioning) Figure 3: Results of clustering of indicators of Group 1 (Psychosocial Health Summary Score - Physical Health Summary Score) The obtained results drew attention to the non-synchronicity of the trends to increase or decrease in the values of individual indicators of Group 1 with the presence (absence) of CNS diseases. Moreover, when comparing the values of such an indicator as Weight in Ukrainian boys corresponds to the norms (Tabl. 1,2), while Height and Head Circumference are deviated in the direction of decrease from the norm. The percentage of head circumference to height corresponds to the norm. At the same time, the breast circumference, as well as the relative index to height, exceeds the normal values [27]. Studies have shown that in patients with MR and ASD (to a lesser extent, ADHD), deviations in many indicators of physical condition are extremely small and extremely large. Studies have shown that in patients with MR and ASD (to a lesser extent, ADHD), the deviations of many indicators of physical condition from the norm are significant, that is, their values are extremely small or extremely large. Thus, according to the Weight indicator m, healthy children have fixed values 13.5