<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>R.O. Moiseienko, N.G. Gojda, O.O. Dudina, N.M. Bodnaruk, Development of perinatal
medicine in Ukraine in the context of international approaches, Wiadomosci Lekarskie</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.1016/j.socscimed.2014.05.034</article-id>
      <title-group>
        <article-title>of the Informative Features of Cardiac Studies Diagnostic Data using Shannon Method</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Kseniia Bazilevych</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Serhii Krivtsov</string-name>
          <email>krivtsovpro@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mykola Butkevych</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>National Aerospace University “Kharkiv Aviation Institute”</institution>
          ,
          <addr-line>Chkalow str., 17, Kharkiv</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2016</year>
      </pub-date>
      <volume>74</volume>
      <issue>3</issue>
      <fpage>761</fpage>
      <lpage>766</lpage>
      <abstract>
        <p>The paper is devoted to the important issue of separating more informative data from less informative data for further analysis and use. This determines the relevance of the study. As a result of the study, the methods for assessing the informativeness of signs based on medical data were analyzed. On the basis of Shannon's method, a model for assessing information content has been built and a software package has been implemented. For the experimental study, data from 303 patients and 13 signs were used. The informative value was calculated for various groups of cardiac data. We found that the following signs are the informative: tala, type of chest pain, colored vessels, angina pectoris, age. The Shannon method is also compared with other methods for assessing the informativeness of features. Features informativeness, Shannon method, diagnostics, heart disease, cardiac studies.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>The COVID-19 pandemic caused by the SARS-CoV-2 coronavirus has become a real challenge
not only for health systems, but also for the economy around the world [1]. Announced March 11,
2020. It began with the discovery at the end of December 2019 in the city of Wuhan in the Hubei
province of central China. There are still no specific antiviral drugs for treatment or prevention
against the disease [2]. In severe cases, funds are used to maintain the functions of vital organs.
People of all ages are susceptible to infection. Severe forms of the disease are more likely to develop
in older people and in people with certain medical conditions, including asthma, diabetes, and heart
disease [3].</p>
      <p>The coronavirus pandemic has clearly demonstrated that we must act together and give our fight
against this crisis the necessary momentum to achieve the Sustainable Development Goals [4]. The
COVID-19 pandemic has accelerated the digitalization of all spheres of social activity [5]: education
[6], commerce [7], public administration [8], personnel management [9-10], logistics [11], etc.
Particular attention should be paid to the many approaches to digitalizing medicine [12]. In this area,
information technologies have been developed for insurance [13], decision-making [14], medical
diagnostics [15-16], epidemic control systems [17] and morbidity simulation [18]. In this article, we
will focus on the diagnostic problem that has arisen sharply in connection with the pandemic. There
are not enough people in hospitals [19], and COVID-19 is especially difficult with concomitant
diseases [20].</p>
      <p>Diseases of the cardiovascular system continue to be the leading cause of death in many countries
of the world. Every year 17 million people die from diseases of the cardiovascular system in the
world. According to the Centers for Disease Control and Prevention, life expectancy would be 10
years longer in the absence of such a high prevalence of cardiovascular diseases, covering all
countries and continents [21]. They lead to long-term disability of the adult population and require
colossal economic costs.
EMAIL:
(K.</p>
      <p>Bazilevych);
nikolai.butkevych@gmail.com
(M. Butkevych).</p>
      <p>2021 Copyright for this paper by its authors.</p>
      <p>High-risk groups include people who have had heart attacks and strokes. It is important for
patients with repeated heart attacks and high blood pressure to be under medical supervision. High
cholesterol in the patient's blood contributes to narrowing of the blood vessels and requires long-term
medication. Excess weight, high blood sugar and a sedentary lifestyle have an extremely negative
effect on the state of the cardiovascular system, and smoking is one of the most common risk factors.
In the development of atherothrombosis, heredity and age play a significant role, and it is noted that in
recent years cardiovascular diseases have become significantly “younger” [22]. The growth and
occurrence of cardiovascular diseases in young people is associated not only with an incorrect
lifestyle, but also with increased neuropsychological stress. The Internet, TV, phones, radio give us
such a stream of information that our ancestor cannot cope with in a week. Negative emotions and
stress cause an increased amount of adrenaline in the blood, hence fear, anxiety, anxiety, panic, and
increased heart rate [23]. The state of the cardiovascular system quickly reacts to changes in mood,
and the constant imbalance between physical and neuropsychological stress leads to pathological
changes and the development of cardiovascular diseases.</p>
      <p>In Ukraine, cardiovascular diseases are the main cause of death among the population [24].
According to this indicator, the country remains one of the world leaders.</p>
      <p>According to the ranking data, based on the number of deaths of the population in Ukraine [25],
common causes are:
1. Cardiovascular diseases (64.3%)
2. Neoplasm (14.1%)
3. Diseases of the digestive system (4.3%)
4. Neurological disorders (3.1%)
5. Self-harm and interpersonal violence (2.7%)</p>
      <p>Nationally, mortality from cardiovascular diseases over the past 29 years has increased by almost
8%: to 449,376 in 2019 and accounts for 64.3% of the total number of deaths, while in 1990 there
were 350,605 deaths from cardiovascular diseases, amounted to 56.5% respectively [26].</p>
      <p>Thus, aim of the paper is development of intelligent information system of heart diseases
diagnostics. To achieve the aim, we are going to develop model based on Shannon method to evaluate
the informative features of cardiac studies.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Informative features evaluation</title>
      <p>The informativeness of signs is a relative concept. One and the same system of signs can be
considered informative for solving some problems and uninformative for others. For example, in
medicine, some signs may be significant for the differential diagnosis of diabetes diseases [27], and
others for the diagnosis of heart diseases.</p>
      <p>In the tasks of medical diagnostics, patients act as objects. Signs characterize the results of
examinations, symptoms of diseases and the methods of treatment used. The specifics of modern
requirements for data processing in order to discover knowledge are as follows: data are large,
heterogeneous (binary, ordinal, quantitative), the results must be specific and understandable.
Examples of binary signs are gender, headache, weakness, nausea, etc. An ordinal sign is the severity
of the condition (mild, moderate, severe, life-threatening). Quantitative signs are age, pulse, blood
pressure, hemoglobin content in the blood, respiratory rate, drug dose, etc. The symptomatic
description of the patient is, in fact, a formalized medical history. Having accumulated a sufficient
number of precedents, it is possible to solve various problems: to classify the type of disease
(differential diagnosis), to determine the most appropriate method of treatment, to predict the duration
and outcome of the disease, to assess the risk of complications, and to find syndromes - the most
characteristic set of symptoms for a given disease. When studying objects characterized by a large
number of factors, it is often important to determine which of these factors most affect the properties
of objects of interest to us. In particular, the determination of the informativeness of factors is one of
the important stages in the analysis of the object under study.</p>
      <p>The block diagram of the method for informative features evaluation is shown in Figure 1.</p>
      <p>It is also possible to single out various methods for assessing the informativeness of signs: energy
and information.</p>
      <p>The energy approach is based on the fact that the information content is assessed by the value of
the attribute. The signs are sorted by values, and those whose values are greater are considered the
most informative. For example, according to the amplitude-time analyzes of the electrocardiogram,
the amplitude of the R waves is considered the most informative signs among the amplitudes. But,
such approaches to assessing the information content may turn out to be poorly suitable for object
recognition. If some features are large in absolute values, but are almost the same for objects of
different classes, then by the values of these features it is difficult to assign objects to some classes.
Conversely, if the features are relatively small in magnitude, but differ greatly for objects of different
classes, then objects can be easily classified by their values.</p>
      <p>The method for determining the informativeness is selected depending on the purpose of the study,
the number of studied classes and medical data (coding methods, the number of gradations, the
sample size, etc.)</p>
      <p>Therefore, information methods are more suitable for classification in medical diagnostics,
according to which information of signs is considered as reliable differences between classes of
images in spaces of signs. If, when classifying objects, they need to be attributed to one of two
classes, then the differences in the probability distributions of features constructed from samples of
two compared classes can act as such a reliable difference.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Shannon method application</title>
      <p>Shannon's method suggests evaluating information content as a weighted average amount of
information per different grades of a feature [29]. In information theory, information is understood as
the value of the eliminated entropy.
where G is the number of gradations of the feature;</p>
      <p>K is quantity of classes;</p>
      <p>Pi is the probability of the i-th gradation of the feature
where mi,k is the frequency of occurrence of the i-th grade in the K-th class,</p>
      <p>
        N is the total number of observations;
Pi,k is probability of occurrence of the i-th gradation of a feature in the K-th class.
. (
        <xref ref-type="bibr" rid="ref3">3</xref>
        )
      </p>
      <p>Shannon's method gives an estimate of the informativeness as a normalized value, which varies
from 0 to 1. Therefore, the informativeness of a feature determined by Shannon's method can be said
in absolute terms: closer to 1 for high; closer to 0 for low.</p>
      <p>
        The block diagram of the Shannon method for informative features evaluation is shown in Figure
,
,
(
        <xref ref-type="bibr" rid="ref1">1</xref>
        )
(
        <xref ref-type="bibr" rid="ref2">2</xref>
        )
Figure 2: The block diagram of the Shannon method for informative features evaluation.
      </p>
    </sec>
    <sec id="sec-4">
      <title>4. Results</title>
      <p>The input data is a dataset of information on the diagnostic data of patients based on cardiac
studies, their age, gender, type of chest pain, cholesterol level, etc., a complete list of parameters in
Table 1.</p>
      <p>Before software implementation of an information system, it is necessary to design it. For this, the
IDEF0 and DFD methodologies were used.</p>
      <p>The model is based on the concepts of an external entity, process, data storage (storage) and data
flow.</p>
      <p>An external entity is a material object or individual acting as sources or receivers of information,
for example, customers, personnel, suppliers, bank customers, and the like.</p>
      <p>Process is converting input data streams to output in accordance with a certain algorithm. Each
process in the system has its own number and is associated with the executor who performs this
transformation. As in the case of functional diagrams, physical transformation can be carried out by
computers, manually or by special devices. At the upper levels of the hierarchy, when the processes
have not yet been defined, instead of the concept of “process”, the concepts of “system” and
“subsystem” are used, which respectively denote the system as a whole or its functionally complete
part.</p>
      <p>A data warehouse is an abstract device for storing information. The type of device and methods of
placement, removal and storage for such a device are not detailed. Physically, it can be a database, a
file, a table in RAM, a card file on paper, and the like.</p>
      <p>Data flow is the process of transferring some information from a source to a receiver. Physically,
the process of transferring information can occur through cables under the control of a program or
software system, or manually with the participation of devices or people outside the designed system.</p>
      <p>The functional model of the system is presented in Figure 3.</p>
      <p>Decomposition of the system is presented in Figure 4.</p>
      <p>In total, for example, data from 303 patients and 13 features was taken (their age, gender, type of
chest pain, cholesterol level, ECG, blood pressure, maximum pressure, blood sugar level, type and
presence of tonsillitis, colored vessels, etc.)</p>
      <p>For software implementation, the C# programming language was used in the Microsoft Visual
Studio environment. To start the software package, you need to upload the data presented in the *.csv
file (Figure 5).</p>
      <sec id="sec-4-1">
        <title>The data is divided into two classes A – “Healthy” and B – “Sick”. The results of the calculation by the Shannon method for assessing the informativeness of the attribute m = “Patient's age” is shown in Figure 6.</title>
      </sec>
      <sec id="sec-4-2">
        <title>The numerical results are shown in Table 2.</title>
        <p>Shannon method gives an estimate of the informativeness of the investigated feature in the form of
a value, takes values from 0 to 1. In this case, it is believed that the closer I (x) to 1, the higher the
informativeness of the feature, on the contrary, the closer I (x) to 0, the lower the informative value of
x.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusions</title>
      <p>As a result of the study, methods for assessing the informativeness of signs for medical data were
analyzed. The Shannon method was chosen as the most appropriate method for medical data. On the
basis of the Shannon method, a model for assessing the information content was built and a software
package was implemented. For the experimental study, data from 303 patients and 13 features were
used. The information content was calculated for various groups of cardiac data. We got that the
following signs are the most informative: thal, chest pain type, colored vessels, angina, age. The
Shannon method is used to determine the informativeness of a feature that is involved in the
recognition of two classes of objects. Also, comparisons of the Shannon method with other methods
(Kullback and Сumulative frequency method) for assessing the informativeness of features are made.</p>
    </sec>
    <sec id="sec-6">
      <title>6. Acknowledgements</title>
      <p>The study was funded by the National Research Foundation of Ukraine in the framework of the
research project 2020.02/0404 on the topic “Development of intelligent technologies for assessing the
epidemic situation to support decision-making within the population biosafety management” [30].
7. References
[5] A. Abd-Alrazaq, et. al., Artificial Intelligence in the Fight Against COVID-19: Scoping Review,</p>
      <p>Journal of Medical Internet Research 22 (12) (2020) e20756. doi: 10.2196/20756.
[6] D. Chumachenko, V. Balitskii, T. Chumachenko, V. Makarova, M. Railian, Intelligent expert
system of knowledge examination of medical staff regarding infections associated with the
provision of medical care, CEUR Workshop Proceedings 2386 (2019) 321-330.
[7] P. Piletskiy, et. al., Development and Analysis of Intelligent Recommendation System Using
Machine Learning Approach, Advances in Intelligent Systems and Computing 1113 (2020)
186197. doi: 10.1007/978-3-030-37618-5_17.
[8] N. Davidich, et. al., Monitoring of urban freight flows distribution considering the human factor,</p>
      <p>
        Sustainable Cities and Society 75 (2021) 103168. doi: 10.1016/j.scs.2021.103168.
[9] N. Dotsenko, et. al. Modeling of the processes of stakeholder involvement in command
management in a multi-project environment, Proceedings of 2018 IEEE 13th International
Scientific and Technical Conference on Computer Sciences and Information Technologies 1
(2018) 29-33. doi: 10.1109/STC-CSIT.2018.8526613
[10] N. Dotsenko, et. al. Project-oriented management of adaptive teams' formation resources in
multi-project environment, CEUR Workshop Proceedings 2353 (2019) 911-920.
[11] M. Bielecki, et. al., Air travel and COVID-19 prevention in the pandemic and peri-pandemic
period: A narrative review, Travel Medicine and Infectious Disease 39 (2021) 101915. doi:
10.1016/j.tmaid.2020.101915.
[12] S.C. Mathews, et. al., Digital health: a path to validation, NPJ Digital Medicine 2 (2019) 38. doi:
10.1038/s41746-019-0111-3.
[13] K. Bazilevych, et al. Stochastic modelling of cash flow for personal insurance fund using the
cloud data storage, International Journal of Computing 17 (
        <xref ref-type="bibr" rid="ref3">3</xref>
        ) (2018) 153-162.
doi: 10.47839/ijc.17.3.1035
[14] D. Chumachenko, et. al. On Intelligent Decision Making in Multiagent Systems in Conditions of
Uncertainty, Proceedings of 2019 11th International Scientific and Practical Conference on
Electronics and Information Technologies (2019) 150-154. doi: 10.1109/ELIT.2019.8892307
[15] M. Mazorchuck, et. al. Web-Application Development for Tasks of Prediction in Medical
Domain, 2018 IEEE 13th International Scientific and Technical Conference on Computer
Sciences and Information Technologies (CSIT) (2018) 5-8. doi:
10.1109/STCCSIT.2018.8526684
[16] O. Skitsan, et. al., Evaluation of the informative features of cardiac studies diagnostic data using
the Kullback method, CEUR Workshop Proceedings 2917 (2021) 186-195.
[17] D. Chumachenko, et. al. On-Line Data Processing, Simulation and Forecasting of the
Coronavirus Disease (COVID-19) Propagation in Ukraine Based on Machine Learning
Approach, Communications in Computer and Information Science 1158 (2020) 372-382. doi:
10.1007/978-3-030-61656-4_25
[18] Yu. Polyvianna, et. al. Computer Aided System of Time Series Analysis Methods for Forecasting
the Epidemics Outbreaks, 2019 15th International Conference on the Experience of Designing
and Application of CAD Systems (2019) pp. 7.1-7.4. doi: 10.1109/CADSM.2019.8779344
[19] J. Wosik, et. al., Telehealth transformation: COVID-19 and the rise of virtual care, Journal of
      </p>
      <p>American Medical Informatics Association 27 (6) (2020) 957-962. doi: 10.1093/jamia/ocaa067
[20] M.S. Gold, et. al., COVID-19 and comorbidities: a systematic review and meta-analysis,</p>
      <p>Postgraduate Medicine 132 (8) (2020) 749-755. doi: 10.1080/00325481.2020.1786964.
[21] C.Y. Cheng, C.Y. Hsu, T.C. Wang, Y.C. Jeng, W.H. Yang, The risk of cardiac mortality in
patients with status epilepticus: A 10-year study using data from the Centers for Disease Control
and Prevention (CDC), Epilepsy and Behaviour 117 (2021) 107901. doi:
10.1016/j.yebeh.2021.107901
[22] R.D. Bagnall, E.S. Singer, J. Tfelt-Hansen, Sudden Cardiac Death in the Young, Heart, Lung and</p>
      <p>
        Circulation 29 (
        <xref ref-type="bibr" rid="ref4">4</xref>
        ) (2020) 498-504. doi: 10.1016/j.hlc.2019.11.007.
[23] A. Tajbakhsh, et. al., COVID-19 and cardiac injury: clinical manifestations, biomarkers,
mechanisms, diagnosis, treatment, and follow up, Expert Review of Anti-Infective Therapy 19
(
        <xref ref-type="bibr" rid="ref3">3</xref>
        ) (2021) 345-357. doi: 10.1080/14787210.2020.1822737
[24] O. Makar, G. Siabrenko, Influence of physical activity on cardiovascular system and prevention
of cardiovascular diseases (review), Georgian Medical News 285 (2018) 69-74.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>E.R.</given-names>
            <surname>Fox</surname>
          </string-name>
          ,
          <article-title>Budgeting in the time of COVID-19, American Journal of Health-System Pharmacy: official journal of the American Society of Health-System Pharmacists</article-title>
          <volume>77</volume>
          (
          <issue>15</issue>
          ) (
          <year>2020</year>
          )
          <fpage>1174</fpage>
          -
          <lpage>1175</lpage>
          . doi:
          <volume>10</volume>
          .1093/ajhp/zxaa185.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>M. Gavriatopoulou</surname>
          </string-name>
          <article-title>M, et</article-title>
          . al.,
          <source>Emerging treatment strategies for COVID-19 infection, Clinical and Experimental Medicine</source>
          <volume>21</volume>
          (
          <issue>2</issue>
          ) (
          <year>2021</year>
          )
          <fpage>167</fpage>
          -
          <lpage>179</lpage>
          . doi:
          <volume>10</volume>
          .1007/s10238-020-00671-y.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>H.</given-names>
            <surname>Ejaz</surname>
          </string-name>
          , et. al.,
          <article-title>COVID-19 and comorbidities: Deleterious impact on infected patients</article-title>
          ,
          <source>Journal of Infection and Public Health</source>
          <volume>13</volume>
          (
          <issue>12</issue>
          ) (
          <year>2020</year>
          )
          <fpage>1833</fpage>
          -
          <lpage>1839</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.jiph.
          <year>2020</year>
          .
          <volume>07</volume>
          .014.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>K.</given-names>
            <surname>Heggen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.J.</given-names>
            <surname>Sandset</surname>
          </string-name>
          , E. Engebretsen, COVID-19 and sustainable development goals,
          <source>Bulletin of World Health Organization</source>
          <volume>98</volume>
          (
          <issue>10</issue>
          ) (
          <year>2020</year>
          )
          <article-title>646</article-title>
          . doi:
          <volume>10</volume>
          .2471/BLT.20.263533.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>