<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Sreekumari S</string-name>
          <email>sreekumari.sreelesh@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rajni Bhalla</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Geetha Ganesan</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Random</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Workshop</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Coronary Artery Disease</institution>
          ,
          <addr-line>Early Prediction, Regression and Classification, Machine Learning</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Jain (Deemed-to-be) University</institution>
          ,
          <addr-line>Bengaluru</addr-line>
          ,
          <country country="IN">India</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Lovely Professional University</institution>
          ,
          <addr-line>Phagwara, Punjab</addr-line>
          ,
          <country country="IN">India</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Network</institution>
          ,
          <addr-line>Logistic</addr-line>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>Regression (LR)</institution>
          ,
          <addr-line>Decision Tree, DT</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <fpage>72</fpage>
      <lpage>82</lpage>
      <abstract>
        <p>This abstract focuses on the significance of heart disease as the leading cause of disability in the human body, primarily stemming from the reduction or blockage of coronary arteries. Coronary Artery Disease (CAD) emerges as the most prevalent form of heart disease, often remaining asymptomatic until symptoms of a heart attack or heart failure arise. This paper aims to explore various machine learning techniques utilized in heart disease prediction, supporting the medical field in expedient diagnosis. The availability of vast amounts of patient information online presents an opportunity for effective utilization. Accurate diagnosis of heart disease is of utmost importance, as incorrect predictions can have severe consequences on individuals' lives. Numerous regression and classification Machine Learning Algorithms are available, including K Nearest Neighbor Algorithm (KNN), Support Vector Machine (SVM), Neural Classification. Previous research indicates that SVM and Random Forest methods often exhibit higher levels of accuracy compared to other approaches. This paper provides a comprehensive discussion of commonly employed machine learning algorithms in the field of heart disease prediction.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Congestive heart failure is the main root of dysfunction in anatomy. It is affected by the
reduction/blockage of coronary arteries. Coronary Artery Disease (CAD) is the most common type of
Congestive heart failure. Common signs of Heart attack are Chest pain, body discomfort, and breath
shortness. Common symptoms of Arrhythmia are fluttering feelings in the chest or in common we can
quote it as “Palpitation.” Whereas common symptom of Heart failure is Breath shortness, Fatigue, or
swelling on the Leg, feet, ankles, neck veins, or even in the abdomen. Other than these there are other
medical conditions and life cycles that can make people at high risk for heart disease like, Diabetes,
Obesity, an Unbalanced diet, Physical incapacity or inactivity, and excessive usageof alcohol.</p>
      <p>The current method of using an angiogram to identify heart diseases has few risks to be overruled.
They include allergic reactions to the dye used to see the coronary arteries, bleeding at the sitewhere
the catheter is placed, and the usage of dye can even cause damage to the kidney. So, we recommend
the usage of non-invasive methods. Predicting heart disease is extremely critical. Becauseone error in
prediction can lead to the death of the person. If a patient has diseases such as kidney failure or diabetes,
then they must experience a higher risk after an angiogram. Some of such risks are:
•</p>
      <sec id="sec-1-1">
        <title>May be allergic to anesthesia and contrast dye.</title>
        <p>2023 Copyright for this paper by its authors.
CEUR</p>
        <p>ceur-ws.org
•
•</p>
      </sec>
      <sec id="sec-1-2">
        <title>Bleeding at the insertion point of the catheter.</title>
        <p>Blood clots or injury to an artery or vein.</p>
        <p>These kinds of people must take contrast medicine before 24 hours to check whether they are allergic
or not. All human beings do not possess higher pain tolerance. Those who are having low pain tolerance
cannot undergo an angiogram because of the high-risk factors. In such cases, we must use non-invasive
methods to predict heart diseases. Heart diseases are of diverse types. They are:
•
•
•
•
•</p>
      </sec>
      <sec id="sec-1-3">
        <title>Coronary Artery disease (CAD) Heart Arrhythmias Heart failure Heart valve disease</title>
        <p>Stroke etc.</p>
        <p>To predict heart disease with the best accuracy prediction algorithm should be the right one.
Otherwise, the prediction will go wrong which leads to the death of a person. Here, we have summarized
recent activities in which we can find distinct predictive analytics.</p>
        <p>Since heart disease datasets are easily available online, data collection becomes remarkably simple.
There is a lack of knowledge of data. That will be achieved after processing the data. This should be
performed by collecting the proper data after avoiding inappropriate data. Then find out the missing
data and remove all the duplicates. This step is called data preprocessing. After this split the dataset into
two such as the training set and testing set. Then apply different classification and regression algorithms
to predict whether a person has a chance of any heart disease. From the accuracy, choose the best
predictive method. Fig-1 depicts the predictive method.</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>2. Supervised Machine Learning Algorithms</title>
      <p>In supervised machine learning, training data is provided to the machine which works as a supervisor
that teaches the machines to predict the output accurately. In this method, labeled data is provided to
train the model and the model learns about distinct types of data. After the training process, the model
is evaluated based on test data which is a subset of the training set and then it starts predicting the output.
Supervised learning can again be divided into two. They are Regression and Classification. (Fig-2). The
following are different machine learning algorithms.</p>
    </sec>
    <sec id="sec-3">
      <title>2.1. Logistic Regression</title>
      <p>It is one of the most significant machine learning algorithms because it can provide probabilities and
classify new data by using discrete and continuous datasets. The first step is to extract the dependent
and independent variables. Then splitting of the dataset and feature scaling is performed. The outcome
must be of a categorical or discrete value.</p>
    </sec>
    <sec id="sec-4">
      <title>2.2. K- nearest neighbor algorithm</title>
      <p>This supervised learning algorithm stores all the given data and based on the similarity it classifies
a new data point. It is also known as the “Lazy- Learner Algorithm.” Because it acts on the dataset
during classification instead of learning from the training dataset. It can be used for both Regression
and Classification. But typically used for Classification problems. It will be more effective if the training
data is large.</p>
    </sec>
    <sec id="sec-5">
      <title>2.3. Support Vector Machine</title>
      <p>This is a well-liked supervised learning algorithm. In this plot, each data is a point in n- dimension
space with each value of feature being a particular coordinate value. Then classification is performed
after finding the hyper-plane which is also known as the best decision boundary.</p>
    </sec>
    <sec id="sec-6">
      <title>2.4. Artificial Neural Network</title>
      <p>It is a special type of machine-learning algorithm that is used to model the human brain. ANN is
capable of learning from the data, and it provides results as classification or prediction such as neurons
in our nervous system learning from past data. ANNs are non-linear statistical models that show
complex relationships between inputs and outputs. The three important layers are the input layer, hidden
layers, and output layer. The hidden layer may be single or multiple. In this model, different parameters
affect the performance of the model. The output is dependent on this data. The input layer communicates
with the hidden layer and the activated neurons will continue passing until it reaches the output layer.</p>
    </sec>
    <sec id="sec-7">
      <title>2.5. Decision Trees</title>
      <p>It is a type of supervised machine learning algorithm, in which the data set is always split based on
a defined constraint. Decision nodes and leaves are the two results that can be derived from the decision
tree.</p>
    </sec>
    <sec id="sec-8">
      <title>2.6. Random Forest Classification (RFC):</title>
      <p>RFC is based on the concept of combo learning, which means a process of solving a complex
problem to an improved and more accurate level of performance of the model by combining the
multiple classifiers.</p>
      <p>Since this review paper is about machine learning techniques, we have gone through different
machine learning algorithms. There are ample methods used in heart disease prediction other than
machine learning. They are deep learning, Data mining, IoT tools, etc.</p>
    </sec>
    <sec id="sec-9">
      <title>3. Literature Review</title>
      <p>
        E. K. Oikonomou et al. [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], they have given a clear idea about Artificial intelligence, Machine
learning, big data, and different prediction methods like supervised and unsupervised methods. CT
(Cardiac Computed Tomography) imaging is the best way to predict Cardio Artery Disease (CAD),
which will mitigate the complexity of doctors to predict CAD by using images, and it will be helpful
for the patients to identify the disease faster. Predictions are performed using Regression analysis and
Classification methods. Regression analysis is performed on the image using a deep neural network
(DNN) algorithm or Support vector machines or decision trees. They have given a good explanation of
Radiomics. For image processing, detection, and segmentation they have used CCTA (Coronary CT
Angiography) scans and Machine Learning (ML) Algorithms. They stated that it can predict many risks
which are not possible with humans. They concluded by explaining the limitations and stating that this
work has offered a lot to the patients as well as doctors.
      </p>
      <p>
        Li Bin et al. [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] tried to establish a disease pre-alert model within the 3 Year gap for hypertension
patients. To achieve public data analysis through AI technologies they have used spark data analysis
for hypertension patients. Using data mining, unbalanced data is converted to usable form using the
Zscore standard. Stroke, Heart failure, and Renal Failure symptoms viz. 3 group predictions ascertain the
pre-alert risk warning using a data mining algorithm. AI technologies used in this model are Logistic
Regression (LR), Support Vector Machine (SVM), and Naive Bayes Algorithm (NB). For feature
selection, they have used the Chi-Square test. In this, they have used three types of datasets. They are
Stroke, Heart failure, and Renal Failure. But they perform classification on two datasets viz. Stroke and
Heart failure. The algorithms used were Naïve Bayes, SVM, and Logistic regression. The stroke data
set shows an accuracy of more than .75, while classification results on the heart failure dataset show an
accuracy of more than .85. Logistic regression is superior to Naïve Bayes and SVM. They have shown
the Receiver Operating Characteristic curve as well (ROC). They have used SVM-RFE for feature
selection, which is not suitable for choosing the most suitable feature subset. For getting better results
need to use a different algorithm for comparing and analyzing in the next research because SVM-RFE
cannot evaluate various feature selection algorithms.
      </p>
      <p>
        S Manish et al. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] they have given a brief description of the technologies used for the study. They
have stated that this is the first paper that uses Optimally Time-Frequency Concentrated (OTFC)
evenlength biorthogonal wavelet filter bank (BWFB) for automatically detecting CAD. The Fuzzy Entropy
(FE) and log Energy (LogE) features were extracted from various subbands of the filter. They used Raw
ECG data from twenty males and twenty females and CAD ECG from Six females and one male. Then
signals were classified using Gaussian Support Machine Classifier (GSVM). They have stated that this
study shows high accuracy compared with other methods and while others are using thirty features for
classification, they use only twelve features.
      </p>
      <p>
        A Moloud et al. [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] proposed a data mining method for CAD analysis. Data manipulation and
normalization were performed first. Then they selected features using GA and PSO algorithms. 10-fold
cross-validation was performed on subsets. For classification purposes in the prelims test, they used ten
algorithms. Among them, the best three SVM classifiers were selected for the final. They are nu-SVC,
nu-SVM, and Linear SVM. To optimize data, they used two optimization methods. Then they generated
a confusion matrix for evaluating results. They have selected fitness functions. It has shown an accuracy
of 93.08% based on the confusion matrix. They concluded by saying that instead of normalization other
preprocessing approaches should be used in the future.
      </p>
      <p>
        R Silvia et al. [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] The objective of this review article is to describe the contemporary state of artificial
intelligence in clinical practice. In this paper, they have explained very well about Artificial Intelligence
and different subfields of Artificial intelligence like Machine learning, and deep learning and their
subfields. They have given proper descriptions of new emerging communication and information
technologies like Mobile health and IOT. These are subfields of E-health. And explained applications
in cardiovascular imaging as well. Echocardiography, Magnetic resonance Imaging, Cardiac computed
tomography, and Electrocardiography are different applications in Cardiovascular Imaging.
      </p>
      <p>
        Johnson et al. [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] have done a study about how AI and ML relate to statistics and why cardiology
needs AI. They have explained in detail how to choose the best algorithm for feature selection. Through
this paper, we can easily understand different supervised Machine Learning algorithms used in
Cardiovascular disease prediction like Logistic Regression, Regularized Regression, Tree-Based
methods, Bootstrap, and Support Vector Machine. For explaining these they have given proper
examples. In Unsupervised Machine learning, they have given a brief description of Neural networks
and Deep Learning methods. The two most common forms of DL are CNN and RNN and explained the
major disadvantage of deep learning. They have concluded by stating to use this study as a decision tool
for medical practitioners.
      </p>
      <p>
        Acharya et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] explained invasive and non-invasive techniques used in CAD detection. They have
used Gaussian Mixture Model (GMM) Classifier for this purpose. Non-linear parameters are extracted
from echocardiography images. They have taken four hundred normal images of thirty normal subjects
and Four hundred CAD images of thirty affected subjects. They have noticed the ischemic changes.
Twelve lead electrocardiogram images were checked for wall motion abnormality. The training set and
test set are there. An expert physician can mark the image collected as belonging to a normal or
CADaffected person. Then different classifiers are used to extract the features. A total of 559 features are
extracted from each image. T-test was used for statistical analysis. If p&lt;.01 or.05, the feature is
measured as very perceptive. A three-fold stratified cross-validation technique is used for developing
and testing the classifiers. In this, they have used six classifiers. They are DT, FS, GMM, RBPNN,
KNN, and NBC. Then the heart index is calculated which is a combination of nine distinctive features.
The heat index for normal is 2.52± 0.07 and for CAD 2.86± 0.11. They have stated that this can be
transformed into software, can be implemented on any device, and concluded that this shows 100%
accuracy.
      </p>
      <p>
        S E Golovenkin et al. [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] tried to improve prediction by using the voting method of three competing
systems and by eliminating sparse data columns. In this study, the prediction was performed using
Matlab2016 System, the Neural Network Toolbox module. In Artificial Neural Network, there was one
input layer and two hidden layers. All layers are interconnected, and the output layer predicts the results.
The voting method is used in this. The “Sliding- Window” method was used to increase the efficiency
of the learning process. Standard indicators are Accuracy, Sensitivity, and specificity. They have
eliminated sparse set input data columns. They have also stated that the use of 3 Neural Networks
increases accuracy by 1.2%.
      </p>
      <p>
        Babaoglu et al. [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] used Principle Component Analysis (PCA) which is a method of identifying
patterns in data and expressing them by highlighting differences and similarities. They have taken a
dataset of 480 patients after excluding patients with some problems. They have used a Support Vector
Machine for classification and regression tasks. The assessments are implemented in Matlab 7.0
application using the LIBSVM package. The SVM method for twenty-three features shows less
accuracy. So, using the PCA method, features were reduced to eighteen. Then it shows an accuracy of
79.17%. They have concluded by stating that, PCA reduced dataset using the SVM method increases
the diagnostic accuracy rate, decreases the sum of the training and test time, and training error in the
assessment of EST (Exercise Stress test) in the determination of CAD with SVM method.
      </p>
      <p>
        Zong Chen et al. [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] tried to provide the recent adaptive image-based classification techniques.
They have explained diverse types of heart diseases. They have compared the prediction methods used
in different studies. The first step is image registration which can be performed on a raw image database
and de-noising processing should be done. The feature is extracted from the image. They have used two
different training models. They are Naive- Bayes Classification and SVM. They have concluded by
stating that SVM is more accurate than Naïve Bayes Classification. The database needs to be updated
with more descriptions of patients.
      </p>
      <p>
        M M Ali et al. [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] used the Kaggle dataset. They have done a comparison study and developed their
experimental model. They have applied six classification algorithms to compare the accuracy of the
best performer and statistical variables by using a ten-fold cross-validation method. The six classifiers
were KNN, Random Forest, Decision Tree, AdaboostM1(ABM1), Logistic Regression, and Multilayer
Perception (MLP). They have listed the five most crucial features based on feature importance and
correlation value. In this study compared to other classifiers RF shows 100% accuracy. KNN and MLP
failed to generate feature importance scores or coefficient values. But this method has one limitation,
this data model is not enough to address all the issues.
      </p>
      <p>
        Yazdani A et al. [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] used WARM (Weighted Associative Rule Mining) to predict heart disease.
They have compared the efficiency of earlier models that used the WARM method to predict heart
disease. In this model, weights are assigned to identify the best features mined. Feature values are
evaluated. For Eg: Feature values for sex are Male and Female. After that calculate the total weight for
the feature. Then applied WARM using the Apriori algorithm. Of thirteen features, eight significant
features were identified. Finally, the confidence score is generated. This model shows a 98% confidence
score which is better than the earlier studies.
      </p>
      <p>
        Sumit S et al. [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] proposed this study to create a deep learning neural network heart disease
prediction model by using a new optimization technique called Talos Hyperparameter. They have used
UCI Heart Disease Dataset. They applied different learning algorithms like Logistic regression, KNN,
SVM, Naïve Bayes, and Hyperparameter optimization (Talos). Talos follows POD (Prepare, Optimize,
and Deploy) process workflow. Reporting and evaluation are the last process after POD. Talos method
shows an accuracy of 90.78% which is better compared to other learning methods.
      </p>
      <p>Bharati R et al. [14] conducted this study to compare the performance of both Machine learning as
well as deep learning methods. They have considered a public dataset. In this, they compared both
machine learning as well as deep learning methods. In pre-processing the data, checked the distribution
of data, skewness of data, stats of normal distribution data, feature selection, and checking for
duplicates. after that proposed machine learning classifiers as well as deep learning classifiers. Then
they evaluated the process and concluded by stating that dataset size needs to be increased and more
optimization and normalization techniques need to be used.</p>
      <p>Ishaq A et al. [15] conducted this study for comparing different machine learning techniques to select
the most suitable method for heart disease survival prediction. The dataset used was UCI Repository.
Then applied RF to employ feature ranking. Classified the data into training and testing sets. They
applied one oversampling method called SMOTE (Synthetic Minority Oversampling Technique) to deal
with imbalanced data in medicine. Performance evaluation matrices were Accuracy, Precision, Recall,
and F-Score. Among 9 classifiers, they got more accuracy for RF, and it was .8889. With SMOTE, it
shows 10% more accuracy. Extra Tree Classifier (ETC) shows an accuracy of .9262. Here RF produces
Constant approximation, ETC produces multi-linear approximation.</p>
      <p>Rajni Bhalla et al. [16] presents a detailed examination of the effectiveness and performance of both
proposed and existing methodologies for classification task. The study aims to compare and evaluate
the advantages and limitations of different methodologies for collecting and analyzing data for
structured data. By conducting a comparative analysis, the paper provides valuable insights for
researchers and practitioners seeking to optimize their methodologies for classification tasks and for
data that is in a structured format.</p>
      <p>Abdel-Basset M et al. [17] proposed this study to develop an IoT model to detect and monitor heart
failure-infected patients. Achieved from various wearable sensors. They used WBAN or medical sensor
nodes. The data extracted from WBAN convey are sent to the mobile application through Bluetooth or
ZigBee and stored in the cloud server to store, process, and broadcast data. Then N- MCDM model is
used to determine the percentage of heart failure disease. If the patient has severe heart failure, then an
ambulance will be sent to the patient and treatments will be started. This will provide the ratio of heart
failure so that the clinicians can easily decide the type of treatment.</p>
      <p>Rajni Bhalla et al. [18] presents a novel ensemble-based machine learning model that combines
multiple categorical datasets to enhance prediction accuracy. The proposed approach consists of three
main components: dataset alignment, model training, and prediction aggregation. The dataset alignment
phase focuses on mapping categorical variables across different datasets, ensuring consistency and
compatibility.</p>
      <p>Rajni Bhalla et al. [19] concludes by identifying the current research trends and challenges in
sentiment classification using hybrid ensemble-based approaches. It highlights the need for further
investigation into the combination of diverse classifiers and the integration of deep learning techniques
in ensemble frameworks. The hybrid ensemble approach can be applied to classification dataset like
heart disease detection to achieve better accuracy.</p>
      <p>Sivagowry S et al. [20] created a review of the application of data mining in heart disease prediction.
In this, they have compared different data mining algorithms and concluded like heart disease prediction
with data mining will become most successful with a smaller number of attributes, and text mining the
medical data needs to be extended in predicting the health care data.</p>
    </sec>
    <sec id="sec-10">
      <title>4. Methodology</title>
      <p>Data collection is the first step. Data can be collected in two ways such as Online data and real-time
data. Then we have to remove noise, missing values, useless values and all. This step is called data
preprocessing. Most of the online datasets are already preprocessed. So, we can directly use it without
preprocessing. But for real-time data, preprocessing can be done in different ways such as by using
machine learning, deep learning or data mining methods. After this step we will get the data in usable
format. Now, need to split the dataset into Training and Testing set. We can split the dataset in such a
way that 70% for training the model and 30% for testing. First train the model with train set. Then select
the most significant feature for improving the prediction accuracy. Now, give test set to the model. It
will predict accordingly. There are different evaluation metrics for checking the efficiency of the model
such as Precision, Recall, F1-score, confusion matrix and ROC curves etc. (Fig-3 depicts the same).</p>
    </sec>
    <sec id="sec-11">
      <title>5. Results</title>
      <p>There are different regression and classification algorithms used to predict heart disease. Different
methods are used in different papers, and they have shown the accuracy that they obtained while using
different methods. Table 1 shows the comparison of different works. In most of the papers, the different
algorithms used are Naive Bayes, Logistic Regression, ANN, SVM, DT, RF, etc. Among them, ANN
and RF show more accuracy.</p>
      <sec id="sec-11-1">
        <title>Heart Index 400 normal images of 30 The heart index for normal subjects and 400 CAD images is 2.52± 0.07 and for CAD.</title>
      </sec>
      <sec id="sec-11-2">
        <title>Signals were segmented into</title>
        <p>two seconds (Set A) and five
seconds (Set B)duration.</p>
      </sec>
      <sec id="sec-11-3">
        <title>Then applied GSVM.</title>
      </sec>
      <sec id="sec-11-4">
        <title>Cross-validation – stratified 10-fold.</title>
      </sec>
      <sec id="sec-11-5">
        <title>Machine learning- KNN,</title>
        <p>reglog, GaussNB, LDA, QDA,</p>
      </sec>
      <sec id="sec-11-6">
        <title>RandomForest,MLP, SVM (3</title>
        <p>types, including C-SVC,
nu</p>
      </sec>
      <sec id="sec-11-7">
        <title>SVC, and Linear SVM.</title>
      </sec>
      <sec id="sec-11-8">
        <title>1. Signals used are</title>
        <p>i)Normal ECG database.
ii)CAD ECG signals.</p>
      </sec>
      <sec id="sec-11-9">
        <title>Raw ECG data from twenty</title>
        <p>males and twenty females and</p>
      </sec>
      <sec id="sec-11-10">
        <title>CAD ECG from 6 females and</title>
        <p>one male.
303 patients annotated with nuSVM method with GA
fifty-four factors. (CAD and and ACC fitness function</p>
      </sec>
      <sec id="sec-11-11">
        <title>Normal, with216 CAD and 87 testing set shows an Normal (non-CAD) patients). accuracy of 93.08% [8] [11]</title>
      </sec>
    </sec>
    <sec id="sec-12">
      <title>6. Conclusion and Future Work</title>
      <p>In conclusion, the diagnosis of heart disease remains a paramount concern within our medical
society. Manual predictions, though rare, can still yield incorrect results due to the need for precise
image assessment. Non-invasive methods provide an alternative for individuals concerned about
traditional diagnostic approaches, enabling them to take proactive preventive measures. This study
highlights various approaches such as machine learning, deep learning, data mining, and IoT tools for
predicting heart disease. However, the accuracy of these methods poses challenges, given the high
stakes involved in predicting human life. Even models achieving 100% accuracy during testing may
encounter failures when applied to real-time data. In this software-driven era, it is crucial to develop
robust and cost-effective solutions that deliver more accurate results. Further comprehensive studies
should be conducted to advance our understanding and achieve better outcomes in this field.</p>
    </sec>
    <sec id="sec-13">
      <title>7. References</title>
      <p>[14] Bharti R, Khamparia A, Shabaz M, Dhiman G, Pande S, Singh P. Prediction of Heart Disease
Using a Combination of Machine Learning and Deep Learning. Comput Intell Neurosci. 2021 Jul
1; 2021:8387680. doi- 10.1155/2021/8387680. PMID: 34306056; PMCID: PMC8266441.
[15] Abid Ishaq, Saima Sadiq, Muhammad Umer, Saleem Ullah, Seyedali Mirjalili, Vaibhav Rupapara,
and Michele Nappi, Improving the Prediction of Heart Failure Patients’ Survival Using SMOTE
and Effective Data Mining Techniques, IEEE Access, VOLUME 9, 2021, Pages 39707 -39716.
[16] Bhalla R, Amandeep. A comparative analysis of application of proposed and the existing
methodologies on a mobile phone survey. In Futuristic Trends in Networks and Computing
Technologies: Second International Conference, FTNCT 2019, Chandigarh, India, November 22–
23, 2019, Revised Selected Papers 2 2020 (pp. 107-115). Springer Singapore.
[17] Abdel-Basset, M., Gamal, A., Manogaran, G., Son, L. H., &amp; Long, H. V. (2019). A novel group
decision-making model based on neutrosophic sets for heart disease diagnosis. Multimedia Tools
and Applications. doi:10.1007/s11042-019-07742-7.
[18] Bhalla R, Sharma A, Gupta J. An Ensemble-based Machine Learning Model for Accurate
Predictions using Multiple Categorical Datasets. In2022 Second International Conference on
Interdisciplinary Cyber Physical Systems (ICPS) 2022 May 9 (pp. 1-4). IEEE.
[19] R. Bhalla, A. Sharma, and G. Ganesan, “A Bibliographic Survey of Sentiment Classification using
Hybrid Ensemble-based Machine Learning Approaches,” in ceur workshop proceedings, 2022,
vol. 3146, pp. 33–38, [Online]. Available: https://ceur-ws.org/Vol-3146/PAPER_04.pdf.
[20] Sivagowry, S.; Durairaj, M.; Persia, A. (2013). - An empirical study on applying data mining
techniques for the analysis and prediction of heart disease., [IEEE 2013 International Conference
on Information Communication and Embedded Systems (ICICES 2013)
doi:10.1109/ICICES.2013.6508204.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Oikonomou</surname>
            <given-names>E K</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Siddique</surname>
            <given-names>M</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Antoniades</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <article-title>Artificial Intelligence in Medical Imaging: a radiomic guide to precision phenotyping of Cardiovascular disease</article-title>
          ,
          <source>Cardiovascular Research</source>
          ,
          <volume>116</volume>
          (
          <issue>13</issue>
          ),
          <fpage>2040</fpage>
          -
          <lpage>2054</lpage>
          . https:// doi:10.1093/cvr/cvaa021.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>B</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ding</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Song</surname>
            <given-names>G</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            <given-names>Q.</given-names>
          </string-name>
          <year>2019</year>
          .
          <article-title>Computer-Aided Diagnosis and Clinical Trials of Cardiovascular Diseases Based on Artificial Intelligence Technologies for Risk-Early Warning Model</article-title>
          .
          <source>J Med Syst</source>
          <volume>43</volume>
          ,
          <fpage>228</fpage>
          . https://doi.org/10.1007/s10916-019- 1346-x.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Manish</given-names>
            <surname>Sharma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>U.</given-names>
            <surname>Rajendra Acharya</surname>
          </string-name>
          ,
          <year>2019</year>
          .
          <article-title>A New Method to Identify Coronary Artery Disease with ECG Signals and Time-Frequency Concentrated Antisymmetric Biorthogonal Wavelet Filter Bank, Pattern Recognition Letters</article-title>
          .
          <fpage>S0167</fpage>
          -
          <volume>8655</volume>
          (
          <issue>19</issue>
          )
          <fpage>30121</fpage>
          -
          <lpage>7</lpage>
          . https://doi.org/10.1016/j.patrec.
          <year>2019</year>
          .
          <volume>04</volume>
          .014.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Abdar</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ksiazek</surname>
            <given-names>W</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rajendra Acharya</surname>
            <given-names>U</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tan R S</surname>
            , Makarenkov
            <given-names>V</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Plaviak</surname>
            <given-names>P.</given-names>
          </string-name>
          <year>2019</year>
          .
          <article-title>A New Machine Learning Technique for an Accurate Diagnosis of Coronary Artery Disease</article-title>
          , Computer Methods, and Programs in Biomedicine. S0169-
          <volume>2607</volume>
          (
          <issue>18</issue>
          )
          <fpage>31458</fpage>
          -
          <lpage>5</lpage>
          . https://doi.org/10.1016/j.cmpb.
          <year>2019</year>
          .
          <volume>104992</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Romiti</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vinciguerra</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Saade</surname>
            <given-names>W</given-names>
          </string-name>
          et al.,
          <year>2020</year>
          .
          <article-title>"Artificial Intelligence (AI) and Cardiovascular Diseases: An Unexpected Alliance"</article-title>
          ,
          <source>Cardiology Research and Practice</source>
          , vol.
          <year>2020</year>
          ,
          <string-name>
            <surname>Article</surname>
            <given-names>ID</given-names>
          </string-name>
          4972346, 8 pages. https://doi.org/10.1155/
          <year>2020</year>
          /4972346
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Johnson</surname>
            ,
            <given-names>K. W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Torres</surname>
            <given-names>Soto</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Glicksberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. S.</given-names>
            ,
            <surname>Shameer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Miotto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Ali</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            , ... &amp;
            <surname>Dudley</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. T.</surname>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>Artificial intelligence in cardiology</article-title>
          .
          <source>Journal of the American College of Cardiology</source>
          ,
          <volume>71</volume>
          (
          <issue>23</issue>
          ),
          <fpage>2668</fpage>
          -
          <lpage>2679</lpage>
          . https://www.jacc.org/doi/abs/10.1016/j.jacc.
          <year>2018</year>
          .
          <volume>03</volume>
          .521.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Acharya</surname>
            <given-names>U R</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sree</surname>
            <given-names>S V</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krishnan M M R</surname>
            , Krishnananda
            <given-names>N</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ranjan</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Umesh</surname>
            <given-names>P</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Suri</surname>
            <given-names>J S</given-names>
          </string-name>
          , (
          <year>2013</year>
          ).
          <article-title>Automated classification of patients with coronary artery disease using grayscale features from left ventricle echocardiographic images</article-title>
          ,
          <source>Computer Methods and Programs in Biomedicine</source>
          , Volume
          <volume>112</volume>
          ,
          <string-name>
            <surname>Issue</surname>
            <given-names>3</given-names>
          </string-name>
          ,
          <year>2013</year>
          , Pages
          <fpage>624</fpage>
          -
          <lpage>632</lpage>
          . https://doi.org/10.1016/j.cmpb.
          <year>2013</year>
          .
          <volume>07</volume>
          .012.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Golovenkin</surname>
            <given-names>S E</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dorrer</surname>
            <given-names>M G</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nikulina</surname>
            <given-names>S Y</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Orlova</surname>
            <given-names>Y V</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Pelipeckaya</surname>
            <given-names>E Y</given-names>
          </string-name>
          , (
          <year>2020</year>
          ),
          <article-title>Evaluation of the effectiveness of using artificial intelligence to predict the response of the human body to cardiovascular diseases</article-title>
          .
          <source>Journal of Physics: Conference Series</source>
          , Volume
          <volume>1679</volume>
          . No.
          <volume>4</volume>
          , https://iopscience.iop.org/article/10.1088/
          <fpage>1742</fpage>
          - 6596/1679/4/042017/meta.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Babaoglu</surname>
            <given-names>I</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Findik</surname>
            <given-names>O</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Bayrak</surname>
            <given-names>M</given-names>
          </string-name>
          , (
          <year>2010</year>
          ).
          <article-title>Effects of principle component analysis on assessment of coronary artery diseases using support vector machine</article-title>
          ,
          <source>Expert Systems with Applications</source>
          , Volume
          <volume>37</volume>
          ,
          <string-name>
            <surname>Issue</surname>
            <given-names>3</given-names>
          </string-name>
          ,
          <year>2010</year>
          , Pages
          <fpage>2182</fpage>
          -
          <lpage>2185</lpage>
          . https://doi.org/10.1016/j.eswa.
          <year>2009</year>
          .
          <volume>07</volume>
          .055.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Chen</surname>
            <given-names>J I Z</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Hengjinda</surname>
            <given-names>P.</given-names>
          </string-name>
          (
          <year>2021</year>
          ).
          <article-title>Early Prediction of Coronary Artery Disease (CAD) by Machine Learning Method- A Comparative Study</article-title>
          ,
          <source>Journal of Artificial Intelligence and Capsule Networks</source>
          , Vol.
          <volume>03</volume>
          / No.01, Pages:
          <fpage>17</fpage>
          -
          <lpage>33</lpage>
          . https://doi.org/10.36548/jaicn.
          <year>2021</year>
          .
          <volume>1</volume>
          .002.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>M M</given-names>
            <surname>Ali</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B K</given-names>
            <surname>Paul</surname>
          </string-name>
          , Kawsar Ahmed,
          <string-name>
            <surname>Francis M B</surname>
            , Julian M W,
            <given-names>Q M A Moni</given-names>
          </string-name>
          ,
          <article-title>(</article-title>
          <year>2021</year>
          ),
          <article-title>Heart disease prediction using supervised machine learning algorithms: Performance analysis and comparison</article-title>
          ,
          <source>Computers in Biology and Medicine</source>
          , Volume
          <volume>136</volume>
          , https://doi.org/10.1016/j.compbiomed.
          <year>2021</year>
          .
          <volume>104672</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Yazdani</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Varathan</surname>
            <given-names>K D</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chiam Y K</surname>
            , Malik
            <given-names>A W</given-names>
          </string-name>
          , &amp; Wan
          <string-name>
            <surname>Ahmad</surname>
            ,
            <given-names>W A</given-names>
          </string-name>
          (
          <year>2021</year>
          ).
          <article-title>A novel approach for heart disease prediction using strength scores with significant predictors</article-title>
          .
          <source>BMC Medical Informatics and Decision Making</source>
          <volume>21</volume>
          (
          <issue>1</issue>
          ). https://doi.org/10.1186/s12911-021-01527-5.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Sumit</surname>
            <given-names>Sharma</given-names>
          </string-name>
          , Mahesh
          <string-name>
            <surname>Parmar</surname>
          </string-name>
          (
          <year>2020</year>
          ),
          <article-title>Heart Diseases Prediction using Deep Learning Neural Network Model</article-title>
          ,
          <source>International Journal of Innovative Technology and Exploring Engineering (IJITEE)</source>
          , Volume-
          <volume>9</volume>
          , Issue-3. https://doi.org/10.35940/ijitee.C9009.019320.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>