<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>A Comparative Study of Heart Disease Prediction using Machine Learning</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Sreekumari S</string-name>
          <email>sreekumari.sreelesh@gmail.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rajni Bhalla</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Geetha Ganesan</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Workshop</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Jain (Deemed-to-be) University</institution>
          ,
          <addr-line>Bengaluru</addr-line>
          ,
          <country country="IN">India</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Lovely Professional University</institution>
          ,
          <addr-line>Phagwara, Punjab</addr-line>
          ,
          <country country="IN">India</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>including Random Forest, Decision Tree Classifier, Logistic Regression, K-Nearest Neighbor</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <fpage>54</fpage>
      <lpage>65</lpage>
      <abstract>
        <p>The heart, as the primary organ responsible for circulating blood throughout the body, is of great concern in society due to heart disease. Diagnosing heart disease presents significant challenges for doctors, considering its diverse types. Multiple prediction methods have been explored by researchers to address this issue. The accuracy of these predictions remains a key consideration. In this study, we focused on five different machine learning algorithms, Classifier, and Decision Tree Classifier with Grid search. Additionally, we developed an ensemble model with the primary objective of accurately predicting heart disease. Our analysis utilized a heart disease dataset from Kaggle, and among the five algorithms examined, the Decision Tree Classifier achieved the highest accuracy of 92%. This finding highlights its effectiveness in predicting heart disease. Ensemble model, Prediction, machine learning algorithms.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>The heart is the primary organ that sends blood to entire parts of our body. This pumped blood
carries oxygen and nutrients. The embryology of cardiovascular disease has now become an unsolved
problem. Due to increasing stress and change in lifestyle the number of people getting affected by
cardiovascular diseases is growing enormously day by day. Earlier heart disease prediction is very
strenuous. Because it depends upon several factors. There are a variety of features that cause heart
disease such as Hypertension, cholesterol, abnormal pulse rate, etc. The symptoms also may vary for
different genders. For example: if females have symptoms such as Chest pain with chest discomfort,
men may have only chest pain. One American dies every 34 seconds due to cardiovascular diseases.</p>
      <p>There are diverse Types of cardiovascular diseases. Each has distinct types of symptoms and various
causes. Diverse types of CVD are cardiac arrhythmia, Aorta disease, coronary Thrombosis, ischemic
heart disease, angina pectoris, cardiac infarction, myocardial infarction, stroke, etc. If it is possible to
predict heart disease prematurely, it will save several lives.</p>
      <p>CAD is a condition that narrows or blocks the arteries that supply blood to our heart due to plaque
buildup. This decreased blood flow may lead to chest pain and dyspnea, and a complete impasse of
blood flow may cause a heart attack. The most widespread symptom of CAD is chest discomfort.</p>
      <p>2023 Copyright for this paper by its authors.
CEUR</p>
      <p>ceur-ws.org
Following a heart-healthy lifestyle can stave off one's life from CAD. In this work, we are going to use
machine learning techniques to predict heart disease.</p>
    </sec>
    <sec id="sec-2">
      <title>1.1. Machine Learning</title>
      <p>Artificial Intelligence is a sub-part of Machine Learning, in which machines are learning things from
past data like human beings learn from experiences. Nowadays, it is used commonly in the medical
field, computer - vision, speech recognition, etc. Supervised Learning, Unsupervised Learning,
Semisupervised Learning, and Reinforcement Learning are four kinds of machine learning.</p>
    </sec>
    <sec id="sec-3">
      <title>1.1.1. Supervised Learning</title>
      <p>It is a kind of Machine Learning, where we are providing a labeled dataset to train the model, and
based on this it predicts the output. In Supervised learning, the system creates a model by using the
labeled data and will test the model by providing sample data to check whether it predicts accurately
or not. Classification and Regression are the two classifications of supervised learning.</p>
    </sec>
    <sec id="sec-4">
      <title>1.1.2. Unsupervised Learning</title>
    </sec>
    <sec id="sec-5">
      <title>1.1.3. Semi-Supervised Learning</title>
      <p>Here, we are providing an unlabeled dataset. In unsupervised learning, training is provided without
any supervision. The two classifications of Unsupervised learning are Clustering and Association.</p>
      <sec id="sec-5-1">
        <title>Here, we are providing both labeled as well as unlabeled datasets.</title>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>1.1.4. Reinforcement Learning</title>
      <p>Reinforcement learning (RL) is a branch of machine learning that focuses on an agent's learning
process through interactions with an environment. It involves an agent learning optimal actions by
receiving rewards or penalties based on its actions. RL has gained significant attention in research due
to its ability to tackle sequential decision-making problems. It has been applied in various domains,
such as robotics, game playing, and autonomous systems. RL algorithms, such as Q-learning and Deep
Q-Networks (DQN), have shown promising results in training agents to make intelligent decisions in
dynamic and uncertain environments. It is feedback-based learning. It learns from experience and the
feedback it tries to improve.</p>
    </sec>
    <sec id="sec-7">
      <title>2. Related Work</title>
      <p>Heart disease prediction is extremely critical. A plethora of studies are conducted in this area.
Different methods are already used. Some are using different machine learning algorithms; some are
using deep learning, and some are using data mining techniques. The selection of features is especially
important.</p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] they used CT and CCTA scans as input and then they applied ML algorithms like DNN, SVM,
and Random Forest. They have concluded by stating that if it is possible to use AI-based Cardiovascular
images then it will be helpful for the prediction. Authors in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] stated that this was the first paper that
uses a modern technology called Optimally Time- Frequency Concentrated (OTFC) Even-length
Bioorthogonal wavelet Filter Bank (BWFB). They have concluded that they need more CAD ECG data
sets for better prediction. Authors in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], have used the CAD dataset and applied ten different machine
learning algorithms such as C-SVC, nu-SVC, Logistic regression, Naïve Bayes, etc. Applied Genetic
Algorithm and Particle Swarm Optimization. They eliminated the sparse set, but still, they noticed that
there is no effect on the accuracy. In [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], the authors explained very well about invasive and
noninvasive methods used in CAD detection. They have used another method called Gaussian Mixture
Model (GMM) classifier. Different classifiers were used in this. In this work, they have used six
classifiers Decision Tree, Fuzzy Sugeno, GMM, KNN, RBPNN, and Naïve Bayes Classifier. They have
introduced a new parameter, the Heart index which is a 16-digit number getting after combining all the
feature's values. For normal subjects, the heat index is 2.52 ± 0.07, and for CAD subjects 2.86 ± 0.11.
They concluded by stating that they have 100% accuracy, which can be developed as software and can
be implemented on mobile phones. Different methods used for prediction are well explained in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]
and [19] and about ANN in [21] as well.
      </p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], they used SVM for Classification and Regression tasks. They used Principal Component
Analysis for identifying patterns in such a way that it should highlight all the similarities and differences
as well. They have implemented this in Matlab 7.0 and tried the SVM method for 23 features. But they
got less accuracy. Then using PCA they reduced the features to 18. Then it shows an accuracy of
79.17%. In [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] they used SVM-RFE to reduce the dimension to find the optimal feature subset. In [10],
they compared various CAD prediction methods. In this study, they have used images for prediction
and two different training models such as Naïve-Bayes Classifier and SVM. They concluded by stating
that SVM is better for prediction and that a database needs to be updated with more descriptions of
patients.
      </p>
      <p>In [12] they used the Weighted Associative Rule Mining method to predict heart disease. In this
model, weights were assigned to unique features, and calculated the total score obtained for features
was. Then they applied WARM using Apriori Algorithm and selected the best features. When compared
with the previous models, this model got a 98% Confidence score. Vardhan S et al. [13] in 2021, used
the CVD dataset from Kaggle. Along with traditional classifiers, they applied some ensemble
techniques such as boosting, bagging, and stacking. The bagged model shows 1.96% more accuracy,
the bagged model shows an accuracy of 73.4%, and Boosted model shows an accuracy of 75.1%.</p>
      <p>In [14], a new method called Hybrid Random Forest with Linear Model (HRFLM) got introduced.
DT, Language model, SVM, RF, Naïve Bayes, neural network, and K nearest neighbor were the six
classifiers used for this study. They got an accuracy of 0.8847. In addition to these basic methods
AdaBoost M1 and MLP were used in [11]. Sabrina m et al., [15] used two data sets, an Italian data set,
and an American dataset. They have applied several scaling methods such as normalization, Min Max
scalar, and standardization. LR, KNN, classification decision tree (CART), NB, linear svc, and support
vector classifier with radial basis function Kernel were the different Machine learning algorithms. They
got more accuracy for SVC with RBF and Grid search algorithm.</p>
      <p>Sumit S et al. in [16] created a deep-learning Neural Network for heart disease prediction. Also used
a new optimization technique called Talos hyperparameter. The different learning algorithms used were
LR, KNN, SVM, NB, and Talos. Talos follows prepare, optimize, and deploy process flow. The last
process is the evaluation. Compared to other methods, it shows an accuracy of 90.78 percent. S M M
Hasan et al.[17] proposed a study with basic machine learning methods and LR got more accuracy and
in [28] KNN got 90.79% accuracy. In [18], they have applied enhancement methods such as AdaBoost,
Bagging, and Boosting along with LMT and Hoeffding Tree Techniques. For LMT they selected
AdaBoost with 80.32% accuracy and for Hoeffding, got 81.96% accuracy with Bagging Technique.</p>
      <p>Jaymin Patel et al., [20], To extract hidden patterns they used data mining techniques. They used
Waikato Environment for Knowledge Analysis (WEKA) Tool and the classification Tree Algorithms
used were J48 with reduced error pruning, Logistic model Tree Algorithm, and Random Forest
Algorithm. The J48 algorithm shows the highest accuracy, and it is 56.76% and LMT shows the lowest
accuracy at 55.77%. Their accuracy was less compared to other models. In [22], they used the KEEL
tool for feature selection, and for feature extraction, they used PCA under the WEKA tool. Under
WEKA Tool, 10-fold cross-validation majority voting got an accuracy of 80.20, and AdaBoostM1
without an ensemble model got an accuracy of 80.01. Under KEEL tool GFS- LogitBoost-c got an
accuracy of 80.53. Bagging and Boosting ensemble models were applied in [23].</p>
      <p>
        In [26], they used the Phonocardiogram heart sound dataset. They applied PCG augmentation, then
samples were doubled. On all signals, amplitude normalization was performed. CNN was used and this
study shows an accuracy of 98.60. In [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], they proposed voting ANN. CNN is used in [25]. In [27],
they used a new methodology called Swarm ANN on the UCI repository and they got an accuracy of
.9578. Abidishaq et al., [29], applied one oversampling method called SMOTE (Synthetic Minority
Oversampling Technique). Among 9 classifiers, they got more accuracy for RF, and it was .8889. With
SMOTE, it shows 10% more accuracy. Extra Tree Classifier shows an accuracy of .9262.
      </p>
      <p>Random Forest algorithm on spark framework is applied in [30] and they got an accuracy of 98%.
Basic machine learning models were applied in [31] and SVM got more accuracy, and it was 94.60%
and boosted SVM got more accuracy in [33]. ANN was created in [32] with a performance of 90%. In
[36] presents a detailed examination of the effectiveness and performance of both proposed and existing
methodologies for classification task. The study aims to compare and evaluate the advantages and
limitations of different methodologies for collecting and analyzing data for structured data. By
conducting a comparative analysis, the paper provides valuable insights for researchers and
practitioners seeking to optimize their methodologies for classification tasks and for data that is in
structured format. Liu J et al in [34], proposed a stacking model. They applied different methods such
as LR, RF, ET, MLP and CatBoost. And for stacking model they got an accuracy of 84.62%. In [35]
they applied RB-Bayes, NB and SVM and got more accuracy for SVM and it was 85.71%.</p>
      <p>In [37] concludes by identifying the current research trends and challenges in sentiment
classification using hybrid ensemble-based approaches. It highlights the need for further investigation
into the combination of diverse classifiers and the integration of deep learning techniques in ensemble
frameworks. The hybrid ensemble approach can be applied to classification dataset like heart disease
detection to achieve better accuracy. In [38] created a review of the application of data mining in heart
disease prediction. In this, they have compared different data mining algorithms and concluded like
heart disease prediction with data mining will become most successful with a smaller number of
attributes, and text mining the medical data needs to be extended in predicting the health care data.</p>
    </sec>
    <sec id="sec-8">
      <title>3. Proposed Methodology</title>
      <p>This section discusses the structure of the Model. Data collection is the first step. There are different
datasets available online or we can collect real-time datasets from hospitals. In this proposed
methodology we have used Kaggle dataset which contains 303 records of patients with 14 attributes
including the result. Then data preprocessing is needed. Since this was an online dataset, there were no
missed values or redundant values. Split the dataset into two as Training set and Testing set. Choose
the best correlated feature using correlation matrix. Train the model with the training set. Now the
model has been trained with the labels and tested the model with Testing set. Performance of the model
will be evaluated. Different evaluation metrics used here are Precision, Recall, F1-score, Accuracy,
Confusion matrix and ROC curve. Then compared the accuracy of different models. The framework of
the proposed model is in figure 1. These are briefly discussed in the following subsections.</p>
    </sec>
    <sec id="sec-9">
      <title>3.1. Collection of Dataset</title>
      <p>Dataset collection is the initial step in heart disease prediction. Here, we have used the Kaggle
dataset. This dataset contains 14 attributes including the output. Table 1 depicts the attributes of the
datasets used in this study.
thall
slp
output</p>
    </sec>
    <sec id="sec-10">
      <title>3.2. Selection of Attributes</title>
      <sec id="sec-10-1">
        <title>Thalium Stress results</title>
      </sec>
      <sec id="sec-10-2">
        <title>The slope of the peak exercise-ST segments</title>
        <p>int64
0 = less chance of heart attack, 1= more chance of int64
heart attack</p>
        <p>The correlation matrix is used for feature selection. Before doing this step, we must remove the
missing values to clean the dataset. Figure 2 shows the correlation matrix.</p>
      </sec>
    </sec>
    <sec id="sec-11">
      <title>3.3. Splitting of the Dataset</title>
      <p>We must split the dataset into Training and Testing. Here, we have split the dataset in 75:25
proportion. The training set is used for training the model and for testing the model test set is used.</p>
    </sec>
    <sec id="sec-12">
      <title>3.4. Applying Machine Learning Algorithms</title>
      <p>Different machine learning algorithms were applied to the dataset to find out the finest prediction
model. The 5 types of algorithms compared here are Decision Tree Classifier, Random Forest, Logistic
regression, KNN classifier, and Decision Tree Classifier with Gridsearch.</p>
    </sec>
    <sec id="sec-13">
      <title>3.4.1. Decision Tree Classifier</title>
      <p>It is a supervised machine-learning approach used in predictive models. If discrete values are taken
by the target variable, then it is called a classification tree and if it takes continuous values, then it is
called a regression tree.</p>
    </sec>
    <sec id="sec-14">
      <title>3.4.2. Random Forest</title>
      <p>It is suitable for both classification and regression tasks. Random forest algorithm combines several
decision trees. In this, each tree in the forest predicts in which category a new record belongs. Then it
is assigned to the category which got the majority vote.</p>
    </sec>
    <sec id="sec-15">
      <title>3.4.3. Logistic Regression</title>
    </sec>
    <sec id="sec-16">
      <title>3.4.4. KNN Classifier</title>
      <p>It is used to solve binary classification problems. This algorithm predicts one of the possible
outcomes based on distinctive features pertinent to the problem.</p>
      <p>KNN is used in both classifications as well as regression tasks. Here, we use the Euclidian-distance
formula to calculate the distance. ‘k’ in KNN is the number of neighbors we need to take into
consideration while predicting the output.</p>
    </sec>
    <sec id="sec-17">
      <title>3.4.5. Decision Tree classifier with Grid search</title>
      <p>We are using Grid search to get the best parameters. It passes all the possible parameters one by one
into the model and finally, it gives out the best parameters. These best parameters are used in Decision
Tree Classifier. It is a supervised machine-learning approach used in predictive models. If the target
variable takes discrete values, then it is called a classification tree and if it takes continuous values, it
is called a regression tree.</p>
      <p>Here, the voting method combines Naïve Bayes, Deep learning, Generalized Linear Model, and
Random Forest.</p>
      <p>Here we are calculating different evaluation metrics such as Precision, Recall, confusion matrix,
F1score, and accuracy.</p>
    </sec>
    <sec id="sec-18">
      <title>3.4.6. Voting Method</title>
    </sec>
    <sec id="sec-19">
      <title>3.5. Evaluation Metrics</title>
      <sec id="sec-19-1">
        <title>Precision is calculated as,</title>
      </sec>
      <sec id="sec-19-2">
        <title>Recall is calculated as,</title>
      </sec>
      <sec id="sec-19-3">
        <title>F1 -Score is calculated as,</title>
      </sec>
    </sec>
    <sec id="sec-20">
      <title>3. Results and Discussions</title>
      <p>.97</p>
      <sec id="sec-20-1">
        <title>F1-score .93</title>
      </sec>
      <sec id="sec-20-2">
        <title>Accuracy .92</title>
      </sec>
      <sec id="sec-20-3">
        <title>Random Forest</title>
      </sec>
      <sec id="sec-20-4">
        <title>Logistic regression</title>
      </sec>
      <sec id="sec-20-5">
        <title>Decision Tree with Grid search</title>
        <p>KNN
.88
.86
.86
.70
.95
.95
.93
.82
.92
.90
.89
.76
.91
.89
.88
.72</p>
        <p>In this study, as of now, we have compared five machine learning algorithms such as Decision Tree
Classifier with Grid search, DT, RF, LR, and KNN. Precision, Recall, F1- score, confusion matrix,
ROC curve, and accuracy are the different evaluation metrics calculated here.</p>
        <p>Table 2 and Figure 3 depict the confusion matrix and ROC curves for the five diverse
machinelearning algorithms.</p>
        <p>Predicted</p>
        <p>Total - 303
l
a
tu Positive (1) - 165
c
A Negative (0) - 138</p>
        <sec id="sec-20-5-1">
          <title>Positive (1)</title>
        </sec>
        <sec id="sec-20-5-2">
          <title>Negative (0) TP FP FN</title>
          <p>TN</p>
          <p>Here, the Decision Tree classifier got more accuracy, and it is 92%. In this study, we developed a
novel ensemble model which is a voting method that combines four different algorithms such as Naïve
Bayes, Deep learning, Generalized Linear Model, and Random Forest. For this model, we got an
accuracy of 85.44%. Figure 5 depicts the same.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-21">
      <title>4. Conclusion</title>
      <p>The field of heart disease prediction continues to witness extensive research, reflecting the critical
nature of accurate predictions in saving lives. As demonstrated in this study, numerous methods,
including deep learning, machine learning, data mining, Artificial Neural Networks, and IoT, are being
explored by researchers to predict heart disease. However, there is a persistent need for improved results
in this domain. Given the gravity of this life-threatening disease, the development of more effective
ensemble models becomes crucial. The Decision Tree Classifier demonstrated a higher accuracy rate
in predicting heart disease in this study. With an accuracy of 92%, it outperformed the other evaluated
machine learning algorithms, including Random Forest, Logistic Regression, K-Nearest Neighbour
Classifier, and Decision Tree Classifier with Grid search. This finding highlights the effectiveness of
the Decision Tree Classifier in accurately predicting heart disease. Future endeavors aim to incorporate
real-time data and identify the optimal ensemble model for heart disease prediction. It is imperative to
continue this quest for enhanced accuracy, as even a small error can have fatal consequences.</p>
    </sec>
    <sec id="sec-22">
      <title>5. References</title>
      <p>[10] Chen J I Z, &amp; Hengjinda P. (2021). Early Prediction of Coronary Artery Disease (CAD) by
Machine Learning Method- A Comparative Study, Journal of Artificial Intelligence and Capsule
Networks, Vol.03/ No.01, Pages: 17-33. https://doi.org/10.36548/jaicn.2021.1.002.
[11] M M Ali, B K Paul, Kawsar Ahmed, Francis M B, Julian M W, Q M A Moni, (2021), Heart disease
prediction using supervised machine learning algorithms: Performance analysis and comparison,
Computers in Biology and Medicine, Volume 136,
https://doi.org/10.1016/j.compbiomed.2021.104672.
[12] Yazdani A, Varathan K D, Chiam Y K, Malik A W, &amp; Wan Ahmad, W A (2021). A novel approach
for heart disease prediction using strength scores with significant predictors. BMC Medical
Informatics and Decision Making 21(1). https://doi.org/10.1186/s12911-021-01527-5.
[13] Vardhan Shorewala,(2021), Early detection of Coronary Heart Disease using ensemble
Techniques, Informatics in Medicine Unlocked, Volume 26, 100655,
https://doi.org/10.1016/j.imu.2021.100655.
[14] Mohan, S., Thirumalai, C., &amp; Srivastava, G. (2019). Effective heart disease prediction using hybrid
machine learning techniques. IEEE Access, 7, 81542-81554.
[15] Sabrina Mezzatesta, Claudia Torino, Pasquale De Meo, Giacomo Fiumara, Antonio Vilasi, (2019),
A MACHINE LEARNING-BASED APPROACH FOR PREDICTING
THE OUTBREAK OF CARDIOVASCULAR DISEASES IN PATIENTS
ON DIALYSIS, Computer Methods and Programs in Biomedicine,
https://doi.org/10.1016/j.cmpb.2019.05.005.
[16] Sumit Sharma, Mahesh Parmar (2020), Heart Diseases Prediction using Deep Learning
Neural Network Model, International Journal of Innovative Technology and Exploring
Engineering (IJITEE), Volume-9, Issue-3. https://doi.org/10.35940/ijitee.C9009.019320.
[17] S M M Hasan, M A Mamun, M P Uddin, and M A Hossain, (Hasan, S. M. M., Mamun, M. A.,
Uddin, M. P., &amp; Hossain, M. A. (2018, February). Comparative analysis of classification
approaches for heart disease prediction. In 2018 International Conference on Computer,
Communication, Chemical, Material and Electronic Engineering (IC4ME2) (pp. 1-4). IEEE.
[18] Pranav Motarwar, Ankitha Duraphe, G Suganya and M Premalatha (2020), Cognitive Approach
for Heart Disease Prediction Using Machine Learning, IEEE 2020 International Conference on
Emerging Trends in Information Technology and Engineering (ic-ETITE), 1-5,
doi:10.1109/icETITE47903.2020.242.
[19] Ramalingam, V. V., Dandapath, A., &amp; Raja, M. K. (2018). Heart disease prediction using machine
learning techniques: a survey. International Journal of Engineering &amp; Technology, 7(2.8),
684687.
[20] Patel, J., TejalUpadhyay, D., &amp; Patel, S. (2015). Heart disease prediction using machine learning
and data mining technique. Heart Disease, 7(1), 129-137.
[21] Diwakar, Manoj; Tripathi, Amrendra; Joshi, Kapil; Memoria, Minakshi; Singh, Prabhishek;
Kumar, Neeraj (2020). Latest trends on heart disease prediction using machine learning and
image fusion. Materials Today: Proceedings, (), S2214785320367675–. doi:
10.1016/j.matpr.2020.09.078
[22] Abdeldjouad, F. Z., Brahami, M., &amp; Matta, N. (2020, June). A hybrid approach for heart disease
diagnosis and prediction using machine learning techniques. In International conference on smart
homes and health telematics (pp. 299-306). Springer, Cham.
https://doi.org/10.1007/978-3-03051517-1_26.
[23] Pouriyeh, Seyedamin; Vahid, Sara; Sannino, Giovanna; De Pietro, Giuseppe; Arabnia, Hamid;
Gutierrez, Juan (2017). [IEEE 2017 IEEE Symposium on Computers and Communications (ISCC)
- Heraklion, Greece (2017.7.3-2017.7.6)] 2017 IEEE Symposium on Computers and
Communications (ISCC) - A comprehensive investigation and comparison of Machine Learning
Techniques in the domain of heart disease., 204–207. doi:10.1109/ISCC.2017.8024530.
[24] Bharti R, Khamparia A, Shabaz M, Dhiman G, Pande S, Singh P. Prediction of Heart Disease
Using a Combination of Machine Learning and Deep Learning. Comput Intell Neurosci. 2021 Jul
1; 2021:8387680. doi 10.1155/2021/8387680. PMID: 34306056; PMCID: PMC8266441.
[25] Neeraj Baghel, Malay Kishore Dutta, Radim Burget, Automatic diagnosis of multiple cardiac
diseases from PCG signals using convolutional neural network, Computer Methods, and Programs
in Biomedicine, Volume 197, 2020, 105750, ISSN 0169-2607,
https://doi.org/10.1016/j.cmpb.2020.105750.
[26] Harshit Jindal et al 2021 IOP Conf. Ser.: Mater. Sci. Eng. 1022 012072
[27] Nandy, S., Adhikari, M., Balasubramanian, V. et al. An intelligent heart disease prediction system
based on a swarm-artificial neural network. Neural Comput &amp; Applic (2021).
https://doi.org/10.1007/s00521-021-06124-1
[28] Shah, D., Patel, S. &amp; Bharti, S.K. Heart Disease Prediction using Machine Learning</p>
      <p>Techniques. SN COMPUT. SCI. 1, 345 (2020). https://doi.org/10.1007/s42979-020-00365-y
[29] ABID ISHAQ, SAIMA SADIQ, MUHAMMAD UMER, SALEEM ULLAH, SEYEDALI
MIRJALILI, VAIBHAV RUPAPARA, AND MICHELE NAPPI, Improving the Prediction of
Heart Failure Patients’ Survival Using SMOTE and Effective Data Mining Techniques, IEEE
Access, VOLUME 9, 2021, Pages 39707 -39716.
[30] Saboji, R. G. (2017, August). A scalable solution for heart disease prediction using classification
mining technique. In 2017 International Conference on Energy, Communication, Data Analytics
and Soft Computing (ICECDS) (pp. 1780-1785). IEEE.
[31] Vaidya, N., Kandu, M., Yadav, R. K., &amp; Bharadwaj, N. (2020). The Working of Various Prediction
Techniques for Heart Diseases–A Case Study. IJRAR-International Journal of Research and
Analytical Reviews (IJRAR), 7(1), 447-453.
[32] Nikolaiev, S., &amp; Timoshenko, Y. (2015, October). Reinvention of the cardiovascular disease
prevention and prediction due to ubiquitous convergence of mobile apps and machine learning.</p>
      <p>In 2015 Information Technologies in Innovation Business Conference (ITIB) (pp. 23-26). IEEE.
[33] Owusu, E., Boakye-Sekyerehene, P., Appati, J. K., &amp; Ludu, J. Y. (2021). Computer-aided
diagnostics of heart disease risk prediction using boosting support vector machine. Computational
Intelligence and Neuroscience, 2021.
[34] Liu J, Dong X, Zhao H, Tian Y. Predictive Classifier for Cardiovascular Disease Based on</p>
      <p>Stacking Model Fusion. Processes. 2022; 10(4):749. https://doi.org/10.3390/pr10040749
[35] Bhalla, R. and Bagga, A., 2019. Opinion mining framework using proposed rb-bayes model for
text classification. International Journal of Electrical &amp; Computer Engineering (2088-8708), 9(1).
[36] Bhalla, R., 2020. A comparative analysis of application of proposed and the existing
methodologies on a mobile phone survey. In Futuristic Trends in Networks and Computing
Technologies: Second International Conference, FTNCT 2019, Chandigarh, India, November 22–
23, 2019, Revised Selected Papers 2 (pp. 107-115). Springer Singapore.
[37] Bhalla R, Sharma A, Gupta J. An Ensemble-based Machine Learning Model for Accurate
Predictions using Multiple Categorical Datasets. In2022 Second International Conference on
Interdisciplinary Cyber Physical Systems (ICPS) 2022 May 9 (pp. 1-4). IEEE.
[38] R. Bhalla, A. Sharma, and G. Ganesan, “A Bibliographic Survey of Sentiment Classification using
Hybrid Ensemble-based Machine Learning Approaches,” in ceur workshop proceedings, 2022,
vol. 3146, pp. 33–38, [Online]. Available: https://ceur-ws.org/Vol-3146/PAPER_04.pdf.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Oikonomou</surname>
            <given-names>E K</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Siddique</surname>
            <given-names>M</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Antoniades</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <article-title>Artificial Intelligence in Medical Imaging: a radiomic guide to precision phenotyping of Cardiovascular disease</article-title>
          ,
          <source>Cardiovascular Research</source>
          ,
          <volume>116</volume>
          (
          <issue>13</issue>
          ),
          <fpage>2040</fpage>
          -
          <lpage>2054</lpage>
          . https:// doi:10.1093/cvr/cvaa021.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>B</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ding</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Song</surname>
            <given-names>G</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            <given-names>Q.</given-names>
          </string-name>
          <year>2019</year>
          .
          <article-title>Computer-Aided Diagnosis and Clinical Trials of Cardiovascular Diseases Based on Artificial Intelligence Technologies for Risk-Early Warning Model</article-title>
          .
          <source>J Med Syst</source>
          <volume>43</volume>
          ,
          <fpage>228</fpage>
          . https://doi.org/10.1007/s10916-019-1346-x.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Manish</given-names>
            <surname>Sharma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>U.</given-names>
            <surname>Rajendra Acharya</surname>
          </string-name>
          ,
          <year>2019</year>
          .
          <article-title>A New Method to Identify Coronary Artery Disease with ECG Signals and Time-Frequency Concentrated Antisymmetric Biorthogonal Wavelet Filter Bank, Pattern Recognition Letters</article-title>
          .
          <fpage>S0167</fpage>
          -
          <volume>8655</volume>
          (
          <issue>19</issue>
          )
          <fpage>30121</fpage>
          -
          <lpage>7</lpage>
          . https://doi.org/10.1016/j.patrec.
          <year>2019</year>
          .
          <volume>04</volume>
          .014.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Abdar</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ksiazek</surname>
            <given-names>W</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rajendra Acharya</surname>
            <given-names>U</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tan R S</surname>
            , Makarenkov
            <given-names>V</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Plaviak</surname>
            <given-names>P.</given-names>
          </string-name>
          <year>2019</year>
          .
          <article-title>A New Machine Learning Technique for an Accurate Diagnosis of Coronary Artery Disease</article-title>
          , Computer Methods, and Programs in Biomedicine. S0169-
          <volume>2607</volume>
          (
          <issue>18</issue>
          )
          <fpage>31458</fpage>
          -
          <lpage>5</lpage>
          . https://doi.org/10.1016/j.cmpb.
          <year>2019</year>
          .
          <volume>104992</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Romiti</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vinciguerra</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Saade</surname>
            <given-names>W</given-names>
          </string-name>
          et al.,
          <year>2020</year>
          .
          <article-title>"Artificial Intelligence (AI) and Cardiovascular Diseases: An Unexpected Alliance</article-title>
          ,”
          <source>Cardiology Research and Practice</source>
          , vol.
          <year>2020</year>
          ,
          <string-name>
            <surname>Article</surname>
            <given-names>ID</given-names>
          </string-name>
          4972346, 8 pages. https://doi.org/10.1155/
          <year>2020</year>
          /4972346
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Johnson</surname>
            ,
            <given-names>K. W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Torres</surname>
            <given-names>Soto</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Glicksberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. S.</given-names>
            ,
            <surname>Shameer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Miotto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Ali</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            , ... &amp;
            <surname>Dudley</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. T.</surname>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>Artificial intelligence in cardiology</article-title>
          .
          <source>Journal of the American College of Cardiology</source>
          ,
          <volume>71</volume>
          (
          <issue>23</issue>
          ),
          <fpage>2668</fpage>
          -
          <lpage>2679</lpage>
          . https://www.jacc.org/doi/abs/10.1016/j.jacc.
          <year>2018</year>
          .
          <volume>03</volume>
          .521.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Acharya</surname>
            <given-names>U R</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sree</surname>
            <given-names>S V</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krishnan M M R</surname>
            , Krishnananda
            <given-names>N</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ranjan</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Umesh</surname>
            <given-names>P</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Suri</surname>
            <given-names>J S</given-names>
          </string-name>
          , (
          <year>2013</year>
          ).
          <article-title>Automated classification of patients with coronary artery disease using grayscale features from left ventricle echocardiographic images</article-title>
          ,
          <source>Computer Methods and Programs in Biomedicine</source>
          , Volume
          <volume>112</volume>
          ,
          <string-name>
            <surname>Issue</surname>
            <given-names>3</given-names>
          </string-name>
          ,
          <year>2013</year>
          , Pages
          <fpage>624</fpage>
          -
          <lpage>632</lpage>
          . https://doi.org/10.1016/j.cmpb.
          <year>2013</year>
          .
          <volume>07</volume>
          .012.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Golovenkin</surname>
            <given-names>S E</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dorrer</surname>
            <given-names>M G</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nikulina</surname>
            <given-names>S Y</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Orlova</surname>
            <given-names>Y V</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Pelipeckaya</surname>
            <given-names>E Y</given-names>
          </string-name>
          , (
          <year>2020</year>
          ),
          <article-title>Evaluation of the effectiveness of using artificial intelligence to predict the response of the human body to cardiovascular diseases</article-title>
          .
          <source>Journal of Physics: Conference Series</source>
          , Volume
          <volume>1679</volume>
          . No.
          <volume>4</volume>
          , https://iopscience.iop.org/article/10.1088/
          <fpage>1742</fpage>
          -6596/1679/4/042017/meta.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Babaoglu</surname>
            <given-names>I</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Findik</surname>
            <given-names>O</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Bayrak</surname>
            <given-names>M</given-names>
          </string-name>
          , (
          <year>2010</year>
          ).
          <article-title>Effects of principle component analysis on assessment of coronary artery diseases using support vector machine</article-title>
          ,
          <source>Expert Systems with Applications</source>
          , Volume
          <volume>37</volume>
          ,
          <string-name>
            <surname>Issue</surname>
            <given-names>3</given-names>
          </string-name>
          ,
          <year>2010</year>
          , Pages
          <fpage>2182</fpage>
          -
          <lpage>2185</lpage>
          . https://doi.org/10.1016/j.eswa.
          <year>2009</year>
          .
          <volume>07</volume>
          .055.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>