=Paper=
{{Paper
|id=Vol-2753/paper47
|storemode=property
|title=Retrospective Analysis by Multifactor Regression in the Evaluation of the Results of Fine-needle Aspiration Biopsy of Thyroid Nodules
|pdfUrl=https://ceur-ws.org/Vol-2753/short18.pdf
|volume=Vol-2753
|authors=Askold Kucher,Oksana Boyko,Kateryna Ilkanych,Andriy Fechana,Natalya Shakhovska
|dblpUrl=https://dblp.org/rec/conf/iddm/KucherBIFS20
}}
==Retrospective Analysis by Multifactor Regression in the Evaluation of the Results of Fine-needle Aspiration Biopsy of Thyroid Nodules==
Retrospective Analysis by Multifactor Regression in the Evaluation of the Results of Fine-needle Aspiration Biopsy of Thyroid Nodules Askold Kuchera, Oksana Boykoa, Kateryna Ilkanycha, Andriy Fechana,b, Natalya Shakhovskab a Danylo Halytsky Lviv National Medical University, 69 Pekarska str., Lviv, 79010, Ukraine b Lviv Polytechnic National University, 12 Bandera str., Lviv, 79013, Abstract A retrospective study of the results of the examination of 160 patients who underwent fine- needle aspiration biopsy of thyroid nodules has been conducted. Possibilities of using the TI- RADS (Thyroid Imaging Reporting and Data System) scale as a standardized USG thyroid examination protocol have been considered. Combinations of indicators (sex, age, ultrasound characteristics) were analyzed with the help of multifactor regression in order to recommend the thyroid fine-needle aspiration biopsy. Measures to increase the ultrasound database, which will allow further use of neural networks for their analysis, have been developed. Keywords 1 thyroid nodules, TI-RADS, fine-needle aspiration biopsy, multifactor regression 1. Introduction Nodal thyroid formation occurs in 30-40% of the general population and is the most common pathology among endocrine diseases. Malignant thyroid tumors account for about 3-4% of the total number of human tumors [1]. Recently, however, due to the Chernobyl accident and radioactive contamination of the territories, the number of thyroid tumors has increased markedly and the tendency to increase their frequency remains [2.3]. The use of modern ultrasonic technologies allows to detect nodular formations with a size of 0.3 cm. Fine-needle aspiration biopsy (FNAB) under ultrasonographic (USG) navigation has become widespread in the diagnosis of thyroid diseases. Cytological examination is the fastest method of preoperative diagnosis [3]. The purpose of this study was to conduct a comparative analysis of ultrasonographic (USG) examination of the thyroid gland with the results of cytological examination of biopsies of thyroid nodules based on multifactorial linear regression analysis to determine indications for thyroid FNAB. 2. Materials and Methods We have conducted a retrospective study of the results of the examination of 160 patients who underwent FNAB of nodal thyroid formations. Data on age, sex of patients, thyroid ultrasound protocols and cytological examination of bioptats have been analyzed. FNAB procedures of thyroid nodules have been performed under ultrasound control (transduser10 MHz). The obtained samples IDDM’2020: 3rd International Conference on Informatics & Data-Driven Medicine, November 19–21, 2020, Växjö, Sweden EMAIL: kuchermd@gmail.com (A. 1); oxana_bojko@ukr.net (A. 2); ilkanych.katja@gmail.com (A. 3); andrii.v.fechan@lpnu.ua (A. 4); nataliya.b.shakhovska@lpnu.ua (A. 5) ORCID: 0000-0002-7511-7057 (A. 1); 0000-0002-8810-8969 (A. 2); 0000-0002-6536-7802 (A. 3); 0000-0001-9970-5497 (A. 4); 0000- 0002-6875-8534 (A. 5) ©️ 2020 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). CEUR Workshop Proceedings (CEUR-WS.org) were fixed in 95% ethanol solution followed by Romanosky-Giemza staining with May-Grunwald fixation. Cytological examination of the material has been performed by a qualified cytologist. The ultrasonographic protocol was descriptive in accordance with the TI-RADS (Thyroid Imaging Reporting and Data System) scale: TI-RADS 1 –normal thyroid gland, absence of focal changes in the thyroid parenchyma; TI-RADS 2 –formations with a low level of malignancy (benign nodules); TI-RADS 3 – formations with an average level of malignancy (probably benign nodules); TI-RADS 4А – formations with a high level of malignancy without changes in regional lymph nodes; TI-RADS 4В – formations with a high level of malignancy with changes in regional lymph nodes; TI-RADS 5 – formations with a pre-established malignant process in it. The cytological conclusion was descriptive in nature, which included a category from I to VI according to the Bethesda system: category I – nondiagnostic/ unsatisfactory bioptat; category II – benign process; category III – atypia of undetermined significance or follicular lesion of undetermined significance; category IV – follicular neoplasm or suspicious for a follicular neoplasm; category V – suspicious for malignancy; category VI – malignant. Determining the relationship between ultrasonographic (USG) examination of the thyroid gland and the results of cytological examination of biopsies of thyroid nodules was carried out by multivariate regression analysis, which allows to make reasoned conclusions about the development of the studied processes based on and supported by specific mathematical calculations [4]. Linear regression involves the construction of such a straight line, in which the values of the indicators lying on it will be as close as possible to the actual, then extrapolating this line can predict the result. In the process of constructing multifactor regression models, the following stages can be distinguished: 1. Selection and analysis of all possible factors that affect the process (or indicator) being studied. 2. Mathematical and statistical analysis of factors - the main assumptions of classical regression analysis are checked. 3. Selection the type of regression multifactor model. 4. Verifying the significance of the found parameters of the model and assessing it for the adequacy of real reality. The following indications have been analyzed: age, sex, shape, margin, composition, echogenicity, conglomerate, vascularization and number of nodules. To conclude about the feasibility of using the found model, the analysis was conducted in the following areas: 1) The Fisher test was calculated and the found model was checked for adequacy with the source data; 2) Variance of indicators was calculated and analyzed; 3) The correlation coefficient was calculated and analyzed; 4) The coefficient of elasticity was calculated and analyzed; 5) The confidence interval for predicted measures was calculated. Fisher's F-statistic is calculated with m and (n-m-1) degrees of freedom: i 1 yip yc (1) n 2 F m i 1 yi yip n 2 n m 1 whеre m– the number of factors included into the model; n – total number of observations; уір – the calculated value of the dependent variable in the i-th observation; ус – the average value of the dependent variable; уі – the value of the dependent variable in the i-th observation. According to Fisher's F-tables, the critical value Fcr with m and (n-m-1) degrees of freedom and given confidence level is found. The procedure for finding the regression equation between different numerical sets, usually includes the following: 1. establishing the significance of the correlation between them; 2. the possibility of representing this dependence in the form of a mathematical expression (regression equation). The second stage at creation of regression model is the choose of a formula of the regression equation. Linear multifactor regression, which describes the linear relationship between the studied data, is written as: y a0 a1 x1 ... an xn (2) where у – dependent variable, function; а0, …, аn – regression coefficients; х1, …, хn – independent variables. Dependent variables in linear multifactorial regression were: sex, nodule size, presence of conglomerates, shape, contours, nodule structure, presence of calcifications, echogenicity, vascularization, elastography, presence of lymph nodes, number of nodules. To establish adequate coefficients, a logistic regression model with the same set of predictors was created, but with introducing L1-regularization (least absolute shrinkage and selection operator, LASSO) into the construction process. The essence of L1-regularization is in addition to the target regression function a fine for the complexity of the model, proportional to the norm of the coefficient vector. As a result, the coefficients were obtained: a0= 0,3699845; a1= 0,4097208; a2= -0,135977; a3= -0,5495012; a4= -0,0949706; a5= 0,154647; a6= 0,0502302; a7= 0,6084696; a8= -0,2844752; a9= 0,4817886; a10= 0,0574823; a11= 0,4017387; a12= 1,3212888. 3. Results The results of ultrasonography (USG). The results of USG in 160 patients have been analyzed. The material from 226 thyroid nodules was obtained by FNAB method. According to USG, 4 nodules are assigned to the TI-RADS 2 category; 145 nodules to the TI-RADS 3 category; 77 nodules to category TI-RADS 4. The results of FNAB. According to the results of cytological examination of 226 biopsies, 5 samples were assigned to category I, 119 to category II, 44 to category III; 24 to category IV, 15 to category V, 19 to category VI. Comparison of USG results and cytological examination of thyroid nodules is presented in Table.1. 4. Discussion Among 145 nodules that were assigned to TI-RADS 3 (formations with an average level of malignancy - probably benign nodules) in accordance with the TI-RADS scale, the benign process (Bethesda II) was confirmed in 88 (60.7%) cases. Given the large percentage (60.7%) of benign nodules that were suspected of being malignant on the TI-RADS scale, it can be assumed that there is a certain combination of indicators (sex, age, ultrasound characteristics) that will allow monitoring of the nodule and abandoning FNAB in first detected nodule. Table 1 Comparison of USG results and cytological examination of thyroid nodules Cytological TI-RADS 2 TI-RADS 3 TI-RADS 4 conclusion (summarily 4А and 4В) Bethesda I 0 4 (2,7%) 1(1,3%) (unsatisfactory bioptat) Bethesda II (benign process) 3(75,0%) 88 (60,7%) 28 (36,4%) Bethesda III (atypia of 1 (25,0%) 29 (20,0%) 14 (18,2%) undetermined significance or follicular lesion of undetermined significance) Bethesda IV (follicular 0 13 (8,9%) 11 (14,3%) neoplasm or suspicious for a follicular neoplasm) Bethesda V 0 9 (6,2%) 6 (7,8%) (suspicious for malignancy) Bethesda VI (malignant) 0 2 (1,4%) 17 (22,1%) Total 4 (100%) 145 (100%) 77 (100%) Unequivocally, it can be argued that it is very important to find certain combinations, which is typical for nodules that correspond to the TI-RADS 3, but were classified as Bethesda III, IV, V and VI. Among the 77 nodules that were assigned to TI-RADS 4 (formations with a high level of malignancy) on the TI-RADS scale, the benign process was confirmed in 28 (36.4%) cases. This once again confirms the importance of finding a certain combination of indicators in nodules corresponding to the cytological Bethesda II. In turn, if the nodule corresponds to the TI-RADS 4 in according with the USG, it is mandatory to perform FNAB, and not immediately perform surgery. 5. Neural Network Applications Perspectives Nowadays, neural networks are a promising area of research in medicine, especially in the field of oncology diagnostics [5-6]. Consider the prospects for the application of neural networks in the medical field of oncology diagnosis in a preclinical study. In carcinogenesis, artificial neural networks are successfully applied to the problems of both preclinical and post-clinical diagnoses. The primary goal of medical diagnostic research is to develop more cost-effective and easy-to-use systems, procedures, and methods to support clinicians. In our case, neural networks can be used to analyze ultrasound results in order to develop diagnostic algorithms that can improve data sorting practices. The neural network plays an important role in the decision support system. Preliminary data analysis suggests that the optimal system for our analysis would be a 5-layer model with 6 outputs and 37 inputs. However, nowadays, there is a critical lack of data for this analysis. To solve this problem, we propose to create a database of ultrasound results with remote access. The specifics of its work is that third-party users after a preliminary verification will be able to enter their own data into the general database. To facilitate our work, we use a browser application that easily adapts to work with stationary and mobile devices. Figer 1 shows the interface view of the first user login. The process of filling in the results of studies looks like a consistent selection of options from the drop-down windows. The first entry involves answering several narrowly professional questions to avoid clogging registration by unauthorized persons. In the future, the user is provided with a personal password and a code that will accompany his data in the database. Figure 1: The interface of the first input to fill the ultrasound database. 6. Conclusions • The use of the TI-RADS scale is recommended for mandatory use in order to standardize the USG thyroid examination protocol. • Cytological examination of biopsies of thyroid nodules with assessment by the Bethesda system is an informative method for diagnosing cancer. • In order to determine the indications for thyroid FNAB, it is recommended to continue searching for combinations of criteria that could clearly recommend this manipulation. • Measures to increase the ultrasound database, which will allow further use of neural networks for their analysis have been developed. [1] Cristina Romei, Raffaele Ciampi and Rossella Elisei “A comprehensive overview of the role of the RET proto-oncogene in thyroid carcinoma”. Nature Reviews Endocrinology, 12.4, (2016): 192-202. [2] M. Tronko, T. Bogdanova, L. Voskoboynyk, L. Zurnadzhy, V. Shpak and L. Gulak. “Radiation induced thyroid cancer: fundamental and applied aspects”. Experimental oncology. 32/3 (2010): 200-204. [3] Gabriella Pellegriti, Francesco Frasca, Concetto Regalbuto, Sebastiano Squatrito and Riccardo Vigneri. “Worldwide increasing incidence of thyroid cancer: update on epidemiology and risk factors”. Journal of cancer epidemiology, 2013 (2013): 965212 https://doi.org/10.1155/2013/965212. [4] Guiwen Rong, Xu Wang, Hui Xu and Baiqing Xiao. Multifactor Regression Analysis for Predicting Embankment Dam Breaching Parameters. Journal of Hydraulic Engineering, 146(2) (2020): 04019051 [5] Athanasios V. Vasilakos, Yu Tang and Yuanzhe Yao. “Neural networks for computer-aided diagnosis in medicine: A review”. Neurocomputing. 216. (2016): 700-708. https://doi.org/10.1016/j.neucom.2016.08.039 [6] Benjamin H. Kann, Reid Thompson, Charles R. Thomas, Jr, Adam Dicker and Sanjay Aneja. Artificial intelligence in oncology: current applications and future directions. Oncology, 33(2). (2019).