<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Features Contributing Towards Heart Disease Prediction Using Machine Learning</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Chetan Sharma</string-name>
          <email>chetan.sharma@chitkarauniversity.edu.in</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Shankar Shambhu</string-name>
          <email>shankar.shambhu@chitkarauniversity.edu.in</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Prasenjit Das</string-name>
          <email>prasenjit.das@chitkarauniversity.edu.in</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Shaily Jain</string-name>
          <email>shaily.jain@chitkarauniversity.edu.in</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sakshi</string-name>
          <email>sakshi@chitkara.edu.in</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Chitkara University Himachal Pradesh</institution>
          ,
          <country country="IN">India</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Chitkara University Institute of Engineering and Technology, Chitkara University</institution>
          ,
          <addr-line>Himachal Pradesh</addr-line>
          ,
          <country country="IN">India</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Chitkara University Institute of Engineering and Technology, Chitkara University</institution>
          ,
          <addr-line>Punjab</addr-line>
          ,
          <country country="IN">India</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Chitkara University School of Computer Applications, Chitkara University</institution>
          ,
          <addr-line>Himachal Pradesh</addr-line>
          ,
          <country country="IN">India</country>
        </aff>
      </contrib-group>
      <fpage>84</fpage>
      <lpage>92</lpage>
      <abstract>
        <p>WHO and other health organizations claimed that the death rate due to cardiovascular disease is one-third of worldwide. Although, many researchers have worked in this direction to help our medical professionals diagnose this disease at an early stage. This paper aims to apply data mining algorithms to predict heart disease occurrence in patients based on some features like diabetes, blood pressure, etc. We have implemented two data mining algorithms, Naive Bayes and NB tree, on two data different datasets of the UCI repository to evaluate the accuracy, f-measure, precision, and recall. Our results show NB tree outperforms with 84.6% accuracy compared to Naive Bayes with only 80.58 % accuracy.</p>
      </abstract>
      <kwd-group>
        <kwd>Machine Learning</kwd>
        <kwd>Classification</kwd>
        <kwd>Heart</kwd>
        <kwd>Disease</kwd>
        <kwd>WEKA</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        The heart is the essential central part of the human
body, which provides the purified blood to each
part of the body. Without a healthy working heart,
a person cannot live a single second. But,
nowadays, heart diseases are increasing at a rapid
speed. As per the WHO, over 17.9 million people
died every year because of heart disease, and 80%
of people died because of a heart attack [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Heart
disease has been recognized as one of the world's
most complex and life-threatening human
____________________________________
diseases. Typically, the heart is unable to push the
necessary amount of blood to other areas of the
body to satisfy the body's normal functioning.
Because of this, heart failure eventually occurs
[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. In the United States, the incidence of heart
illness is very high [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Swelling in the feet, Chest
pain, breathe shortness, body tiredness, Pain in
the neck and shoulders, etc., are some significant
symptoms of heart disease [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Techniques used
to diagnose heart diseases at an early stage have
been complicated, and the resulting difficulty is
one of the critical factors affecting the standard of
living [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Because of the low availability of
instruments and lack of physician, diagnosis of
heart diseases and their treatment is very involved
in developing countries [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. It affects the
prediction results and treatment of heart patients,
which is the main reason for the high mortality
rate of heart patients. Hence, to reduce the
mortality rate of heart patients and provide the
best treatment of heart diseases, appropriate and
accurate heart disease diagnosis techniques are
required [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. These techniques should be capable
of detecting heart disease at an early stage
[1920]. The rest of the paper is organized as
follows: Section 2 discusses the background and
history of this work, Methodology is explained in
section 3 along with the description of tools,
datasets, and algorithms used in evaluation,
evaluation matrices etc, Results are discussed in
section 4 and finally section 5 gives us conclusion
of the research done in this paper.
2.
      </p>
    </sec>
    <sec id="sec-2">
      <title>Work</title>
    </sec>
    <sec id="sec-3">
      <title>Literature Review and Related</title>
      <p>In the last decade, many researchers worked on
heart disease datasets to predict heart diseases.
They used multiple machine learning and data
mining algorithms for the implementation and
achieved different results. Yet today, we also face
a lot of issues with heart disease. Following are
the literate review of recent research:
The authors implemented three different
algorithms Naive Bayes(NB), Artificial Neural
Network, and J48 to find the best heart disease
prediction results. Researchers used a dataset of 8
additional attributes and 210 instances of male
persons. WEKA tool was used for the
3.
3.1</p>
    </sec>
    <sec id="sec-4">
      <title>Methodology</title>
    </sec>
    <sec id="sec-5">
      <title>Proposed Work</title>
      <p>
        implementations of the algorithms. Archived
results have shown that the Naive Bayes
algorithm provided the best results compared to
Artificial Neural Network and J48. Naive Bayes
achieved an accuracy of 79.90% and took 0.01
second to build the model, where J48 attained the
accuracy of 77.03% and took 0.01 second to build
the model. Artificial Neural Network achieved an
Singh et al. developed a new hybrid model named
"Hybrid Genetic Naive Bayes Model". This
model was developed with two different
supervised techniques (Naive Bayes, Genetic
Algorithm) for the correct prediction of heart
diseases. To develop this model, the researcher
used a dataset taken from the UCI repository with
303 instances and 14 important attributes.
Implementation results gave the accuracy of
97.14% with 98% precision value and 97.14%
recall value [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
      </p>
      <p>
        Krishnan et al. used two machine learning
algorithms, Decision Tree and Naive Bayes
algorithms, to predict Heart Diseases. They used
a dataset of 300 instances and 14 attributes taken
from the UCI repository. Researchers
implemented the python programming language
model and achieved the highest accuracy of 91%
with a Decision tree and 87% with Naive Bayes
[
        <xref ref-type="bibr" rid="ref11">11</xref>
        ].
WEKA 3.8.4 machine learning tool is used to
conduct this study, written in Java and developed
at the University of Waikato. WEKA tool
provides us with different classifiers to examine
the performance. WEKA is used to evaluate other
data mining tasks like preprocessing,
classification, regression, and many more.
WEKA accepts .csv and .arff file format and the
chosen dataset has already created the required
data in the mentioned format.
3.3
      </p>
    </sec>
    <sec id="sec-6">
      <title>Data Preprocessing</title>
      <p>
        The real-life data consists of redundant values
and lots of noise. The data needs to be cleaned,
and the missing values need to be filled before the
data is fed to generate a model [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. In the
preprocessing process, these issues are taken care
of so that the prediction can be made accurately.
Once the cleaning of data is done, i.e., the noise
is removed, and the missing values are filled, we
need to transform it. Many supervised learning
algorithms work on nominal or cardinal data. So
data transformation is applied to the dataset
obtained from UCI in the present work.
Reduction of the dataset is applied to convert the
complex dataset into a more straightforward
form, which improves the accuracy of the model.
3.4
      </p>
    </sec>
    <sec id="sec-7">
      <title>Classification Algorithms</title>
      <p>After going through an intensive literature
review, we have selected two classification
algorithms: naive Bayes tree, naive Bayes
classification based on their dependency on
attributes.</p>
      <p>
        Naive Bayes Tree [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]: It is a hybrid approach
in which the model is generated using the Naive
Bayes and Decision tree Approach. The naive
Bayes classification assumes that the features are
independent of each other, and the decision tree
assumes that the components are dependent on
each other. So the hybrid approach takes
advantage of both approaches. The decision tree
is built by considering only one feature, and
output is fed to the node. Based on the outcome
of each node, other features are selected. In this
hybrid approach, the split is done in the same
manner by considering only one feature at every
node but with Naive-Bayes classifiers at the
leaves. In large datasets, data splitting is regarded
as a vital and essential task for classification
using the features we have implemented the naive
Bayes tree classification.
      </p>
      <p>
        Naive Bayes Classification [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]–[
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] : This
classification technique is based on Baye's
theorem, which works on the assumption that the
existence of one feature is independent of the
other feature. The advantage of the Naive Bayes
classification is that it requires a small amount of
data to create/train the model.
      </p>
      <p>Bayes theorem provides a way of calculating
posterior probability (conditional probability
where we are finding probability under a given
condition assumed to be confirmed) P(c|x) from
P(c), P(x), and P(x|c). The following is the
formula to calculate posterior probability:</p>
      <p>P(c|x)=P(x|c)*P(c)/P(x|c)
Where:
P(c|x) is the conditional probability that occurs
when x has already occurred
P(c) is the known probability of the class.
P(x|c) is the conditional probability of x condition
that c has occurred.</p>
      <p>P(x) the known probability of the class.</p>
      <sec id="sec-7-1">
        <title>Dataset Description</title>
        <p>
          Two datasets were used in this study. The
first one was obtained from the "Cleveland
Clinic Foundation", the First dataset
comprises 303 instances. The second dataset
is taken from the public available platform, a
combination of five other datasets named
Heart Disease Dataset (Comprehensive). All
the dataset are available for heart disease
having a total of 76 attributes and each
dataset choose their dataset features
accordingly. Initially, both the dataset was
selected for the study with 76 attributes, but
they were preprocessed to produce 14 and 11
characteristics to reduce redundant variables.
Consequently, we used these specific
attributes (listed in Table 1 and Table 3) to
compare.
The first dataset is taken as the Cleveland
database, which is publically available at
[
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]. There are 303 instances in the dataset,
and their description is given in Table 1, and
the results using the WEKA tool are given in
Table 2.
        </p>
        <sec id="sec-7-1-1">
          <title>Attribute Used Age Sex</title>
        </sec>
        <sec id="sec-7-1-2">
          <title>Chest Pain</title>
        </sec>
        <sec id="sec-7-1-3">
          <title>Resting Blood</title>
        </sec>
        <sec id="sec-7-1-4">
          <title>Pressure</title>
        </sec>
        <sec id="sec-7-1-5">
          <title>Cholesterol</title>
        </sec>
        <sec id="sec-7-1-6">
          <title>Fasting Blood Sugar</title>
        </sec>
        <sec id="sec-7-1-7">
          <title>Resting ECG</title>
        </sec>
        <sec id="sec-7-1-8">
          <title>Heart Rate</title>
        </sec>
        <sec id="sec-7-1-9">
          <title>Exercise Included</title>
        </sec>
        <sec id="sec-7-1-10">
          <title>Angina</title>
        </sec>
        <sec id="sec-7-1-11">
          <title>Old Peak</title>
        </sec>
        <sec id="sec-7-1-12">
          <title>Slope</title>
          <p>Age of Patient. The value ranges from 29 years to 77 years
Gender of the patient represented in binary form
1 = male.
0 = female
Chest pain. Its value range from 1 to 4.
1 used to represent typical angina, 2 used to describe atypical angina, 3 used to
represent non-anginal Pain, and 4 is used to represent asymptomatic.
The attribute is used to represent the patient's resting BP, and the unit to
measure it is mm Hg.</p>
          <p>The attribute is used to represent the patient's serum cholesterol, and its unit
of measurement is mg/dl.</p>
          <p>An attribute represents the Fasting blood sugar of the patient. There are two
values used in the dataset if the recorded value is &gt; 120 mg/dl, then it is shown
by 1 (true), else it is shown by 0 (false).
1 = True.
0 = False.</p>
          <p>The attribute is used to represent the resting electro-cardiographic records of
the patient. The value ranges from 0 to 2
0 is representing the Normal range.
1 is representing the ST-T wave abnormality of the patient.
2 is used to show probable or definite left ventricular hypertrophy by Estes'
criteria.</p>
          <p>The attribute is used to represent the maximum heart rate of the patient
achieved.</p>
          <p>Exercise-induced angina and represented in binary
1 is used to represent yes.
0 is used to represent no.</p>
          <p>The attribute is used to represent ST depression induced by exercise, which is
relative to rest.</p>
          <p>The attribute is used to measure the slope for peak exercise. The range of the
recorded values is from 1 to 3.</p>
          <p>Up sloping is represented by 1, flat is shown through value 2, and 3 is used to
represent downsloping.</p>
        </sec>
        <sec id="sec-7-1-13">
          <title>Major Vessels</title>
        </sec>
        <sec id="sec-7-1-14">
          <title>Thallium Scan</title>
          <p>The attribute is used to represent the no. of significant vessels colored by
fluoroscopy. Recorded values are range from 0 to 3, and the value is related to
the darkness of the color.</p>
          <p>
            The attribute is used to record the Thallium Scan of the patient. It represents
the values 3, 6, or 7. 3 represents a normal range, 6 is used to represent fixed
defect, and 7 represents reversible defect.
The second dataset is taken from [
            <xref ref-type="bibr" rid="ref17">17</xref>
            ],
collected from five other heart disease
databases. There is a total of 1190 instances
in the dataset, and these instances are
collected from the dataset Cleveland heart
disease dataset instances taken 303,
Hungarian heart disease dataset instances
have taken 294, Switzerland heart disease
dataset instances have taken 123, Long Beach
VA heart disease dataset instances have taken
200 and Stalog heart disease dataset instances
taken 270. Dataset is a combination of 11
common features between all the datasets.
          </p>
          <p>Description of all feature used in the dataset
is given in Table 3, and their results using the</p>
          <p>WEKA tool is given in Table 4.
of measurement is mg/dl.
1 used to represent typical angina, 2 used to represent atypical angina, 3 used
to represent non-anginal Pain, and 4 is used to represent asymptomatic.</p>
          <p>The attribute is used to represent the patient's resting BP, and the unit to
The attribute is used to represent the patient's serum cholesterol, and its unit
Fasting Blood Sugar An attribute represents the Fasting blood sugar of the patient. There are two
values used in the dataset if the recorded value is &gt; 120 mg/dl then it is shown
by 1 (true), else it is shown by 0 (false).
1 = True.</p>
          <p>0 = False.</p>
          <p>Resting ECG The attribute is used to represent the resting electro-cardiographic records of
the patient. The value ranges from 0 to 2
0 is representing the Normal range.
1 is representing the ST-T wave abnormality of the patient.
2 is used to show probable or definite left ventricular hypertrophy by Estes'
criteria.</p>
          <p>Maximum Heart Rate The attribute is used to represent the maximum heart rate of the patient
achieved.</p>
          <p>Exercise Angina Exercise-induced angina and represented in binary
1 is used to represent yes.</p>
          <p>0 is used to represent no.</p>
          <p>Old Peak The attribute is used to represent ST depression induced by exercise, which is
relative to rest.</p>
          <p>ST Slope The attribute is used to measure the slope for peak exercise. The range of the
recorded values is from 1 to 3.</p>
          <p>Up sloping is represented by 1, flat is shown through value 2, and 3 is used to
represent downsloping.</p>
          <p>Target Used for the prediction</p>
        </sec>
        <sec id="sec-7-1-15">
          <title>Algorithms</title>
        </sec>
        <sec id="sec-7-1-16">
          <title>NB Tree</title>
        </sec>
        <sec id="sec-7-1-17">
          <title>Naive Bayes</title>
        </sec>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>Evaluation Matrices</title>
      <p>We have considered four parameters for our
paper. In the present work, the prediction class is
if the person having specific attributes has died
because of heart disease or not, so the class C in
the above table is no. of instances belonging to
the class. Figure 2 is the confusion matrix.
TP is the actual no of people who died because of
heart disease, and the model also predicted the
same. Similarly, TN is the person who didn't die
of a heart ailment, and our model also predicted
the same. False Positive (FP) is a Type I error
because the model predicted that the person died
of an ailment, but actually, the patient didn't.
False-negative is a type II error. The model
predicted that the person didn't die of the
alignment, but actually, he/she did.</p>
      <p>The accuracy of the model is calculated through
the formula given below:
Accuracy = (TP+TN)/Total no. of instance (1)
Recall is the measure of correctly predicted
classes out of the total positive classes. The
formula is as follows:
Recall= (TP)/(TP+FN) (2)
Precision is the measure of actual positive classes
out of all the correctly predicted positive classes.
The formula for the recall is as follows:
Precision = TP/(TP+FP) (3)
Comparing the two models becomes problematic
when the precision is low, and the recall value is
high. In the case of vice versa is true. The two
parameters are not of much use for comparison of
the models. F-score is used to compare the
models in such cases. F-score uses the harmonic
mean of the two values. This helps to measure the
recall and precision at the same time. Instead of
the Arithmetic mean, harmonic mean is used
because Arithmetic mean is sensitive to extreme
values.</p>
      <p>F-score= (2*Recall*Precision) / (Recall +</p>
      <sec id="sec-8-1">
        <title>Precision) (4)</title>
      </sec>
      <sec id="sec-8-2">
        <title>Actual class\Predicted class C Not in C</title>
        <p>C
Not in C</p>
      </sec>
      <sec id="sec-8-3">
        <title>True Positives (TP)</title>
      </sec>
      <sec id="sec-8-4">
        <title>False Negatives (FN)</title>
      </sec>
      <sec id="sec-8-5">
        <title>False Positives (FP)</title>
      </sec>
      <sec id="sec-8-6">
        <title>True Negatives (TN)</title>
      </sec>
    </sec>
    <sec id="sec-9">
      <title>Results and Discussion</title>
      <p>We have used two datasets with 303
instances in the present work in the first and
1190 in the second set. Naive Bayes and
Naive Bayes tree Algorithm has been applied
on the two datasets. We find that the NB tree
performs better in the two datasets, which are
of different sizes and attributes. The accuracy
and other measures are better in the NB tree
case, which is a hybrid of Naive Bayes and
Decision tree. We have applied these two
algorithms because the Naive Bayes
Algorithm works on the hypothesis that the
features are independent of each other. At the
same time, the decision tree assumes that the
features are dependent on each other. The
present work tries to determine if the
parameters age, gender, cholesterol, etc., do
contribute towards heart disease, and a
machine learning algorithm can be used to
predict the alignment based on these
parameters with an accuracy of 88%.
5.</p>
    </sec>
    <sec id="sec-10">
      <title>Conclusion</title>
      <p>The two datasets used in the present work
show a similar accuracy, which leads us to
conclude that the machine learning
algorithms can predict heart diseases in
patients with specific existing alignments like
High BP, cholesterol, etc. We find a
difference in the accuracy of the two methods
applied on the two datasets, namely Naive
Bayes and NB tree. The difference in
accuracy is that Naive Bayes assumes the
independence of features. NB Tree (a hybrid
of the Decision tree) assumes that the features
are dependent on each other. Higher accuracy
in the NB tree makes us conclude that
parameters like age, gender, cholesterol, and
high Bp are dependent on each other, leading
to a heart ailment in patients.</p>
      <sec id="sec-10-1">
        <title>References:</title>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>W. H. O.</surname>
          </string-name>
          (WHO),
          <article-title>"Cardiovascular Diseases." https://www</article-title>
          .who.int/healthtopics/cardiovascular-diseases#
          <article-title>tab=tab_1 (accessed Nov</article-title>
          .
          <volume>15</volume>
          ,
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>A. L.</given-names>
            <surname>Bui</surname>
          </string-name>
          , T. B.
          <string-name>
            <surname>Horwich</surname>
            , and
            <given-names>G. C.</given-names>
          </string-name>
          <string-name>
            <surname>Fonarow</surname>
          </string-name>
          ,
          <article-title>"Epidemiology and risk profile of heart failure,"</article-title>
          <source>Nat. Rev. Cardiol.</source>
          , vol.
          <volume>8</volume>
          , no.
          <issue>1</issue>
          , p.
          <fpage>30</fpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>P. A.</given-names>
            <surname>Heidenreich</surname>
          </string-name>
          et al.,
          <article-title>"Forecasting the future of cardiovascular disease in the United States: a policy statement from the American Heart Association,"</article-title>
          <source>Circulation</source>
          , vol.
          <volume>123</volume>
          , no.
          <issue>8</issue>
          , pp.
          <fpage>933</fpage>
          -
          <lpage>944</lpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M.</given-names>
            <surname>Durairaj</surname>
          </string-name>
          and
          <string-name>
            <given-names>N.</given-names>
            <surname>Ramasamy</surname>
          </string-name>
          ,
          <article-title>"A comparison of the perceptive approaches for preprocessing the data set for predicting fertility success rate,"</article-title>
          <source>Int. J. Control theory Appl.</source>
          , vol.
          <volume>9</volume>
          , no.
          <issue>27</issue>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>J.</given-names>
            <surname>Mourao-Miranda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. L. W.</given-names>
            <surname>Bokde</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Born</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Hampel</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Stetter</surname>
          </string-name>
          ,
          <article-title>"Classifying brain states and determining the discriminating activation patterns: support vector machine on functional MRI data,"</article-title>
          <source>Neuroimage</source>
          , vol.
          <volume>28</volume>
          , no.
          <issue>4</issue>
          , pp.
          <fpage>980</fpage>
          -
          <lpage>995</lpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>S.</given-names>
            <surname>Ghwanmeh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Mohammad</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A</given-names>
            .
            <surname>Al-Ibrahim</surname>
          </string-name>
          ,
          <article-title>"Innovative artificial neural networks-based decision support system for heart diseases diagnosis,"</article-title>
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>F.</given-names>
            <surname>Amato</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>López</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. M.</given-names>
            <surname>PeñaMéndez</surname>
          </string-name>
          , P. Va\vnhara,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hampl</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Havel</surname>
          </string-name>
          , “
          <article-title>Artificial neural networks in medical diagnosis</article-title>
          .
          <source>” Elsevier</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>S. K.</given-names>
            <surname>Gomath</surname>
          </string-name>
          ,
          <article-title>"Heart Disease Prediction Using Data Mining Classification,"</article-title>
          <source>Int. J. Res. Appl. Sci. Eng</source>
          . Technol., vol.
          <volume>4</volume>
          , no.
          <issue>2</issue>
          ,
          <year>2016</year>
          , doi: 10.18775/ijmsba.1849-
          <volume>5664</volume>
          -
          <fpage>5419</fpage>
          .
          <year>2014</year>
          .
          <volume>43</volume>
          .1004.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>R. V.</given-names>
            <surname>Sarangam Kodati</surname>
          </string-name>
          ,
          <article-title>"A Comparative Study on Open Source Data Mining Tool for Heart Disease,"</article-title>
          <source>Int. J. Innov. Adv. Comput. Sci.</source>
          , vol.
          <volume>7</volume>
          , no.
          <issue>3</issue>
          ,
          <year>2018</year>
          , [Online]. Available: http://www.divaportal.org/smash/get/diva2:1080911/FULLTEX T01.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>N.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Firozpur</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Jindal</surname>
          </string-name>
          ,
          <article-title>"Heart disease prediction system using hybrid technique of data mining algorithms,"</article-title>
          <source>Int. J. Adv. Res. Ideas Innov. Technol.</source>
          , vol.
          <volume>4</volume>
          , no.
          <issue>2</issue>
          , pp.
          <fpage>982</fpage>
          -
          <lpage>987</lpage>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>S.</given-names>
            <surname>Krishnan</surname>
          </string-name>
          and
          <string-name>
            <given-names>S.</given-names>
            <surname>Geetha</surname>
          </string-name>
          ,
          <article-title>"</article-title>
          <source>Prediction of Heart Disease Using Machine Learning Algorithms.," in 2019 1st International Conference on Innovations in Information and Communication Technology (ICIICT)</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>5</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>S.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Jiang</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <article-title>"Adapting naive Bayes tree for text classification,"</article-title>
          <source>Knowl. Inf. Syst.</source>
          , vol.
          <volume>44</volume>
          , no.
          <issue>1</issue>
          , pp.
          <fpage>77</fpage>
          -
          <lpage>89</lpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>L.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wu</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Ye</surname>
          </string-name>
          ,
          <article-title>"Experimental comparisons of multi-class classifiers,"</article-title>
          <source>Informatica</source>
          , vol.
          <volume>39</volume>
          , no.
          <issue>1</issue>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>P.</given-names>
            <surname>Ahmad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Qamar</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S. Q. A.</given-names>
            <surname>Rizvi</surname>
          </string-name>
          ,
          <article-title>"Techniques of data mining in healthcare: a review,"</article-title>
          <source>Int. J. Comput. Appl.</source>
          , vol.
          <volume>120</volume>
          , no.
          <issue>15</issue>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>S. S.</given-names>
            <surname>Nikam</surname>
          </string-name>
          ,
          <article-title>"A comparative study of classification techniques in data mining algorithms,"</article-title>
          <source>Orient. J. Comput. Sci. Technol</source>
          ., vol.
          <volume>8</volume>
          , no.
          <issue>1</issue>
          , pp.
          <fpage>13</fpage>
          -
          <lpage>19</lpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Ronit</surname>
            ,
            <given-names>"Heart</given-names>
          </string-name>
          <string-name>
            <surname>Disease</surname>
            <given-names>UCI</given-names>
          </string-name>
          ,"
          <year>2018</year>
          . https://www.kaggle.com/ronitf/heart
          <article-title>-disease-uci (accessed Nov</article-title>
          .
          <volume>12</volume>
          ,
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>M.</given-names>
            <surname>Siddhartha</surname>
          </string-name>
          ,
          <article-title>"Heart Disease Dataset (Comprehensive)," 2019</article-title>
          . https://www.kaggle.com/sid321axn/heartstatlog
          <article-title>-cleveland-hungary-final (accessed Nov</article-title>
          .
          <volume>12</volume>
          ,
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>V.</given-names>
            <surname>Madaan</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Goyal</surname>
          </string-name>
          ,
          <article-title>"Predicting Ayurveda-Based Constituent Balancing in Human Body Using Machine Learning Methods,"</article-title>
          <source>in IEEE Access</source>
          , vol.
          <volume>8</volume>
          , pp.
          <fpage>65060</fpage>
          -
          <lpage>65070</lpage>
          ,
          <year>2020</year>
          , doi: 10.1109/ACCESS.
          <year>2020</year>
          .
          <volume>2985717</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>Vishu</given-names>
            <surname>Madaan</surname>
          </string-name>
          and Anjali Goyal, “
          <article-title>Analysis and Synthesis of a Human Prakriti Identification System Based on Soft Computing Techniques”</article-title>
          ,
          <source>Recent Patents on Computer Science</source>
          ,
          <volume>12</volume>
          (
          <issue>1</issue>
          ), pp
          <fpage>1</fpage>
          -
          <lpage>10</lpage>
          ,
          <year>2019</year>
          . DOI:
          <volume>10</volume>
          .2174/2213275912666190207144831 [20]
          <string-name>
            <surname>Prateek</surname>
            <given-names>Agrawal</given-names>
          </string-name>
          , Vishu Madaan, Vikas Kumar, “
          <article-title>Fuzzy Rule Based Medical Expert System to Identify the Disorders of Eyes, ENT</article-title>
          and Liver”,
          <source>International Journal of Advanced Intelligence Paradigm (IJAIP)</source>
          , vol
          <volume>7</volume>
          ,
          <fpage>issue3</fpage>
          -
          <lpage>4</lpage>
          , pp.
          <fpage>352</fpage>
          -
          <lpage>367</lpage>
          ,
          <string-name>
            <given-names>Inderscience</given-names>
            <surname>Publications</surname>
          </string-name>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>