<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>AutoML-Driven ECG Classification of Cardiac Pathologies with Explainable AI⋆</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Dmytro Tymoshchuk</string-name>
          <email>Tymoshchuk@tntu.edu.ua</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andriy Sverstiuk</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Vitaliy Tymoshchuk</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lyubomyr Mosiy</string-name>
          <email>lmosiy@gmail.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Oksana Bahrii-Zaiats</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>I. Horbachevsky Ternopil National Medical University</institution>
          ,
          <addr-line>Maidan Voli St., 1, Ternopil, 46002</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Ternopil Ivan Puluj National Technical University</institution>
          ,
          <addr-line>Ruska str. 56, Ternopil, 46001</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2026</year>
      </pub-date>
      <abstract>
        <p>Cardiovascular diseases (CVD) remain one of the leading causes of mortality in the world, which emphasizes the need to create accurate, interpretable and effective diagnostic systems. Electrocardiography is a key non-invasive method that provides critical information about the functional state of the heart. In this work, an information technology for automated diagnosis of CVD based on the time rhythm function taking into account the extreme amplitudes of the characteristic electrocardiogram (ECG) waves (P, Q, R, S, T) was proposed. A dataset of 924 samples from the open PhysioNet databases was formed, covering four diagnostic categories (conditional norm, norm with a pacemaker, arrhythmias and morphological pathologies). Ten statistical descriptors (Mean, Median, Mode, Standard Deviation, Sample Variance, Kurtosis, Skewness, Range, Minimum, Maximum) were used to describe temporal variability. The EvalML AutoML framework was used to build the models, which automatically determined the optimal data processing. The Extra Trees Classifier algorithm turned out to be the best, achieving an average classification accuracy of about 96.5% for four classes and an AUC of more than 0.92, which confirms its generalization capabilities. To ensure the transparency of the results, the SHAP method was used, which showed that the most significant features are Skewness and Kurtosis. The integration of AutoML and Explainable AI methods provided high accuracy and reliability of diagnostics while maintaining interpretability, which makes the proposed approach promising for clinical application and analysis of other biomedical signals.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;ECG signal</kwd>
        <kwd>machine learning</kwd>
        <kwd>EvalML</kwd>
        <kwd>AutoML</kwd>
        <kwd>Explainable AI</kwd>
        <kwd>SHAP 1</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>In recent decades, cardiovascular diseases (CVD) have become one of the most serious challenges
for the global healthcare system. According to the World Health Organization, they account for
almost a third of all deaths in the world. Such statistics indicate the extreme relevance of research
in the field of cardiology and the search for effective methods of early diagnosis, monitoring and
prevention. Timely detection of pathological changes in cardiac activity allows significantly
reducing the risk of complications and saving the lives of millions of patients.</p>
      <p>One of the key tools in this area is electrocardiography — a non-invasive method that provides a
safe and relatively simple registration of the electrical activity of the heart. The ECG signal
contains multilayered information about the functional state of the myocardium, which is reflected
in the shape, amplitude and time parameters of the characteristic P, Q, R, S, T waves, as well as the
intervals and segments between them. It is the analysis of the morphology of these components
that allows us to detect a wide range of pathologies — from rhythm and conduction disorders to
signs of ischemic disease or structural changes in the heart muscle. Due to this, the ECG remains</p>
      <p>0000-0003-0246-2236 (D. Tymoshchuk); 0000-0001-8644-0776 (A. Sverstiuk); 0009-0007-2858-9434 (V. Tymoshchuk);
0009-0000-9778-331X (L. Mosiy); 0000-0002-5533-3561 (O. Bahrii-Zaiats)
an indispensable method in modern clinical practice, serving as the basis for both routine medical
examinations and specialized scientific research.</p>
      <p>Despite significant progress in digital processing of ECG signal, there remains a gap between
the complexity of the temporal organization of the heart rhythm and the methods for its analysis.
Existing approaches often ignore interwave temporal variability or consider it in isolation from
amplitude characteristics. This limits the ability to detect complex arrhythmias, early signs of
ischemia, or conduction disorders, which may manifest themselves precisely in changes in the
temporal structure of individual ECG signal components.</p>
      <p>
        Therefore, there is a need to use approaches that can overcome the limitations of classical
methods. In this context, modern machine learning (ML) methods are of particular importance, as
they are a universal tool for working with large data sets, providing the ability to find hidden
dependencies where traditional approaches are powerless. ML is actively used in the financial
sector to predict risks and optimize investment strategies [
        <xref ref-type="bibr" rid="ref1 ref2">1,2</xref>
        ], in transport to develop intelligent
control systems and road safety [
        <xref ref-type="bibr" rid="ref3 ref4">3,4</xref>
        ], in medicine to create intelligent decision-making support
systems and predict the course of diseases [
        <xref ref-type="bibr" rid="ref5 ref6">5,6</xref>
        ], in materials science to model the structural
properties and durability of materials [
        <xref ref-type="bibr" rid="ref7 ref8">7,8</xref>
        ], in energy to assess consumption and improve resource
efficiency [
        <xref ref-type="bibr" rid="ref10 ref9">9,10</xref>
        ], as well as in cybersecurity to detect anomalies in network traffic and prevent
attacks [
        <xref ref-type="bibr" rid="ref11 ref12">11,12</xref>
        ]. Due to their ability to adapt to different types of data and conditions, ML methods
have become the basis for creating systems that provide accurate predictions, high speed of
analysis, and scalability in various application contexts.
      </p>
      <p>Recent advances in ML for ECG signal analysis demonstrate significant progress in automated
diagnosis of cardiovascular diseases. Based on a comprehensive review of publications in the
Scopus database, the following analysis presents advances in ML methods for the analysis of heart
rate variability and cardiac signals.</p>
      <p>
        Classical ML algorithms remain powerful tools for ECG signal classification. A hybrid approach
that combines Dual-Tree Complex Wavelet Transform (DTCWT) with ML classifiers to detect six
classes of cardiac arrhythmias was proposed [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. Such methodology, tested on the full 48-record
MIT-BIH Arrhythmia Database, demonstrates the effectiveness of combining advanced feature
extraction methods with traditional classifiers. A comprehensive approach to ECG classification
using wavelet scattering to extract low-variability features was developed [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. Using the MRMR
(Minimum Redundancy and Maximum Relevance) algorithm for feature selection, it was achieved
an outstanding accuracy of 99.84% using a cubic SVM model among twenty tested ML models.
Optimized ensemble methods for ECG anomaly detection were investigated, and the optimized
XGBoost model using Bayesian hyperparameter optimization achieved 100% accuracy, significantly
outperforming modified gradient boosting with an accuracy of 96.58% and SVM with 91.69% [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ].
      </p>
      <p>
        The revolutionary impact of deep learning on cardiac signal analysis is evident through
numerous innovative architectures. The study [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] presented an advanced approach to ECG
classification by integrating adaptive segmentation of heart beats with relative heart rate
information in a deep learning network. This methodology achieved a sensitivity of 99.81% for
normal beats, 99.08% for premature ventricular beats, and 97.83% for premature atrial beats. A
hybrid model combining a convolutional neural network (CNN) with a Vision Transformer (ViT)
was proposed for analyzing 12-lead ECG recordings [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. The model achieved an average accuracy
of 74% for five-class and 80% for four-class classification on the PTB-XL dataset, demonstrating the
potential of transformer architectures in cardiac diagnostics. A CNN model for automatic diagnosis
of multiple heart diseases from phonocardiographic signals was developed [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. Using data
augmentation techniques to improve robustness in noisy environments, the model achieved 98.60%
accuracy on the test set, highlighting the effectiveness of deep learning for non-invasive diagnosis.
The integration of deep learning paradigms has fundamentally transformed the capabilities of
cardiac signal analysis. The application of WaveGRU-Net for non-contact ECG reconstruction
using millimeter-wave technology with multiple inputs and outputs was pioneered [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]. This
innovative approach successfully distinguishes respiratory and cardiac components in the
timefrequency domain while maintaining robust semantic representation capabilities. A significant
architectural advance has been made with the development of a frequency-guided hierarchical
shifted window (FG-HSWIN) transformer incorporating inter-frequency attention mechanisms
[
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. This architecture demonstrates exceptional classification performance, achieving 98.72%
accuracy on the MIT-BIH arrhythmia dataset through frequency-stratified window attention
mechanisms. The integration of frequency-aware positional encoding (FAPE) and lightweight
multiscale feature fusion (LMFF) represents a significant methodological contribution to the field.
The neuro-fuzzy paradigm exemplifies the trend towards hybrid architectures, where a multimodal
feature fusion framework combining transformer-based processing with neuro-fuzzy systems
achieves 98.46% accuracy and 99.1% F1-score, demonstrating the effectiveness of integrated
computational approaches [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]. A sophisticated three-phase framework incorporating change point
detection via autoencoder architectures, facilitating real-time processing of sequential data for
cardiac anomaly detection, has been presented [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ].
      </p>
      <p>
        Combining different cardiac signal modalities improves diagnostic accuracy. A cardiovascular
disease prediction system integrating ECG and phonocardiogram using Hidden Semi-Markov
Model was developed [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ]. The proposed HSMM achieved sensitivity of 0.952, specificity of 0.92,
Fscore of 0.94, accuracy of 0.91, and AUC of 0.96. A multimodal approach to emotion recognition by
combining ECG signals with facial features was presented [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ]. Using LightGBM, Bagged Decision
Trees, Linear SVM, and Gaussian Naive Bayes, the combined model achieved an accuracy of
93.80%, demonstrating the effectiveness of multimodal analysis. Researchers in [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ] demonstrated
the effectiveness of hybrid approaches by integrating a dual tree complex wavelet transform with
traditional ML classifiers. Their methodology successfully distinguishes six different classes of ECG
beats, highlighting the continued relevance of traditional signal processing methods combined with
modern classification algorithms. The ResNet-34 architecture proposed in [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ], improved through
transfer learning mechanisms, further confirms the effectiveness of deep residual networks in
analyzing cardiac signals in the time-frequency domain. The integration of multiple sensory
modalities improves diagnostic complexity. Scientists [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ] developed a device with the support of
fog computing that integrates phonocardiography, ballistocardiography and seismocardiography to
estimate systolic blood pressure, achieving a mean absolute error of 3.5 mm Hg. This multimodal
approach uses additional information from mechanical and electrical cardiac signals. The
MDD2DG-IRA methodology presented by the researchers in [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ] is an example of sophisticated
multi-channel processing via dynamic graph convolution, achieving 99.94% classification accuracy
for myocardial infarction localization. This approach demonstrates the potential of graph-based
representations to capture inter-channel dependencies in multi-lead ECG analysis.
      </p>
      <p>
        Computational efficiency remains paramount for real-time and embedded applications. This
challenge was addressed by analyzing the reconstructed phase space with optimized delay state
networks, achieving 99.3% accuracy while reducing hardware requirements by an order of
magnitude [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ]. An ensemble compression technique combining CEEMD with LSTM autoencoder
architectures achieves compression ratios of 38.26 with minimal signal distortion (PRD=0.37) [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ].
The development of AutoML greatly simplifies the process of developing models for medical
diagnostics. The prospects of automated ML in biomedical signal processing, namely automation
for practical implementation in clinical practice, are relevant [
        <xref ref-type="bibr" rid="ref31">31</xref>
        ]. A study demonstrated improved
ECG-based stress classification using optimization techniques [
        <xref ref-type="bibr" rid="ref32">32</xref>
        ]. By applying genetic algorithms,
artificial bee colony and improved particle swarm optimization to tune Multi-kernel SVM, the
authors achieved an average accuracy of 98.93%, precision of 96.83%, completeness of 96.83% and
specificity of 96.72%.
      </p>
      <p>The aim of our study is to develop and experimentally test information technology for
automated diagnosis of cardiovascular pathologies based on the time function of the rhythm taking
into account the extreme amplitude values of the characteristic waves of the ECG signal, using
AutoML and Explainable AI methods to increase the accuracy and reliability of the diagnosis of
cardiovascular diseases.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Methods and Models</title>
      <sec id="sec-2-1">
        <title>2.1. Feature Extraction and Dataset Construction</title>
        <p>
          To construct datasets suitable for ML algorithms, the ECG signals were processed using a time
rhythm function taking into account extreme values of the amplitude of the characteristic waves of
the ECG signal, which include extreme values of the amplitude of the characteristic waves (P, Q, R,
S, and T) [
          <xref ref-type="bibr" rid="ref33">33</xref>
          ].
        </p>
        <p>
          The discrete mathematical model of the time rhythm function taking into account the extreme
values of the amplitude of the characteristic waves of the ECG signal [
          <xref ref-type="bibr" rid="ref33">33</xref>
          ] is represented by the
function T Ak ( m ), which takes into account the extreme values of the amplitude of the
characteristic waves of the ECG signal (P, Q, R, S and T):
        </p>
        <p>T Ak ( m) = t Ak ( m) - t Ak ( m - 1) , k∈ {P , Q , R , S , T }, m∈ Z (1)
where t Ak ( m ) – the moment of reaching the peak of the k-type wave in the m-th cardiac cycle
(с), t Ak ( m - 1 ) – time of peak k-wave in the previous cardiac cycle (m-1) (с), T Ak ( m ) – the value of
the time rhythm function taking into account the extreme values of the amplitude of the
characteristic waves of the ECG signal, reflecting the time interval between the peaks of the k-type
waves in the current m and the previous cardiac cycle (m-1), k∈ {P , Q , R , S , T } – type of
characteristic wave, m∈ Z – cycle number.</p>
        <p>
          The data source was publicly available databases from the PhysioNet repository. The primary
dataset [
          <xref ref-type="bibr" rid="ref34">34</xref>
          ] contained 12-lead electrocardiographic recordings from 45 152 subjects, digitized at a
sampling rate of 500 Hz. This comprehensive collection covers a variety of cardiac arrhythmias and
cardiovascular pathologies, with expert-verified annotations to ensure diagnostic accuracy.
Additionally, the study included data [
          <xref ref-type="bibr" rid="ref35">35</xref>
          ] obtained from extended electrocardiographic monitoring
of 15 individuals diagnosed with progressive congestive heart failure, including 11 men (age range:
22–71 years) and 4 women (age range: 54–63 years). The third dataset [
          <xref ref-type="bibr" rid="ref36">36</xref>
          ] provided over 100
twolead electrocardiographic recordings, each covering 15-minute intervals. These recordings
contained detailed morphological annotations delineating the temporal boundaries of cardiac
waveform components — specifically the onset, peak, and termination of P waves, QRS complexes,
T waves, and, when applicable, U waves — for a representative sample of 30 to 50 cardiac cycles per
recording.
        </p>
        <p>For each ECG signal recording, the values of the time rhythm function were determined taking
into account the extreme values of the amplitude of the characteristic ECG signal waves. Examples
of the time rhythm function taking into account the extreme values of the R wave amplitude are
shown in Figure 1.</p>
        <p>(a)
(b)</p>
        <p>For quantitative description of the function T Ak ( m ), a statistical processing method is used,
which allows calculating the following statistical parameters:
• Mean (arithmetic mean) is a measure of the central tendency of a distribution of temporal
variability.
• Median is a robust measure of central tendency that is robust to outliers.
• Mode is the most frequently occurring value of temporal variability.
• Standard Deviation is a measure of dispersion about the mean.
• Sample Variance is the square of the standard deviation, which reflects variability.
• Kurtosis is a measure of the skewness of the distribution.
• Skewness is a measure of the asymmetry of the distribution.
• Range is the difference between the maximum and minimum values.
• Minimum is the smallest value of the time variability.
• Maximum is the largest value of the time variability.</p>
        <p>The compiled dataset contained 924 samples. The following conventions were introduced into
the dataset: Mean - Mean_t, Median - Med_t, Mode - Mo_t, Standard Deviation-StD_t, Sample
Variance-SV_t, Kurtosis-Kur_t, Skewness-Sk_t, Range-Ra_t, Minimum-Min_t, Maximum-Max_t.</p>
        <p>The classification was carried out according to four diagnostic categories. The first class
represented patients without detected pathologies (conditional norm). The second class included
individuals with normal cardiac function, but with installed pacemakers. The third class combined
various forms of cardiac rhythm disorders. The fourth class covered pathologies associated with
structural changes in the heart muscle and conduction system, manifested as stable changes in the
morphology of ECG complexes. The last category also included the consequences of a previous
myocardial infarction with the formation of scar tissue, violations of intraventricular conduction of
varying degrees, an increase in the mass of the myocardium of individual heart departments and
various forms of cardiomyopathies.</p>
        <p>The development environment for building and testing ML models was Python, which provided
the necessary tools for data preprocessing, sample partitioning, and training ML algorithms. To
train and test the model, the original dataset was divided into training and test sets. The size of the
test set was 20% of the total data, while the remaining 80% was used to train the model. To ensure
reproducibility of the results, the initial value of the random number generator was fixed
(random_state = 22). Since the problem is multi-class, the stratify=y parameter was used, which
guaranteed the preservation of the initial class ratio in both the training and test sets. This
approach avoided biases in the class distribution and ensured the representativeness of the model
evaluation.</p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Machine Learning Methods</title>
        <p>Automated machine learning (AutoML) is a concept aimed at eliminating the need for manual
execution of routine tasks that accompany model building. Traditionally, the process of creating a
model includes the stages of data preprocessing, feature selection, algorithm selection,
hyperparameter optimization and validation of the obtained results. Performing these tasks
requires significant experience and time, which limits the widespread use of ML. AutoML offers an
approach that automatically combines data processing, model selection and optimization methods,
reducing the influence of the human factor and ensuring the stability of the obtained results.
Thanks to the use of optimization algorithms, the search for parameters is more efficient than in
classical brute force, and integrated evaluation mechanisms guarantee the objectivity and
reproducibility of the models.</p>
        <p>
          One of the modern implementations of AutoML is the EvalML library [
          <xref ref-type="bibr" rid="ref37">37</xref>
          ]. It is available in
Python and provides a full cycle of automation, starting from data preprocessing and ending with
obtaining a ready-made pipeline for practical application. EvalML automatically generates a
sequence of operations, which includes missing value imputation, coding of categorical variables,
scaling and selection of the optimal classification or regression algorithm. Using the AutoMLSearch
module, the model and parameter space is explored, and the results are presented in the form of a
ranked list ordered by the selected quality metric. The system allows to extend the pipeline with
custom models and transformers, which makes it suitable for both scientific research and industrial
applications. Thanks to compatibility with the Woodwork library and support for exporting
pipelines, EvalML can be easily integrated into a production environment. EvalML embodies the
principles of AutoML in a practical tool that allows you to quickly obtain reproducible and
competitive models. This makes it particularly useful in tasks that require testing a large number of
combinations of algorithms and data preprocessing procedures.
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>2.3. Model Evaluation and Interpretation</title>
        <p>
          The evaluation of the effectiveness of classification models is carried out on the basis of indicators
calculated from the confusion matrix. It reflects the ratio between the correct and incorrect
predictions of the model for individual samples and consists of four main components. True
Positives (TP) corresponds to the number of samples of the positive class correctly classified as
positive. True Negatives (TN) characterizes the number of samples of the negative class correctly
classified as negative. False Positives (FP) means the number of samples of the negative class that
the model mistakenly classified as positive. False Negatives (FN) reflects the number of samples of
the positive class that were incorrectly classified as negative. Based on these values, the metrics
Accuracy, Recall, Specificity, Precision, F1-score and G-Mean are calculated (Table 1) [
          <xref ref-type="bibr" rid="ref38">38</xref>
          ].
√ Recall × Specificity
        </p>
        <sec id="sec-2-3-1">
          <title>Interpretation</title>
        </sec>
        <sec id="sec-2-3-2">
          <title>Proportion of correctly classified samples among all observations.</title>
        </sec>
        <sec id="sec-2-3-3">
          <title>Ability of the model to correctly identify positive samples.</title>
        </sec>
        <sec id="sec-2-3-4">
          <title>Ability of the model to correctly identify negative samples.</title>
        </sec>
        <sec id="sec-2-3-5">
          <title>Proportion of correctly classified positive samples among all predicted positives.</title>
        </sec>
        <sec id="sec-2-3-6">
          <title>Harmonic mean of Precision and Recall, reflecting their balance.</title>
        </sec>
        <sec id="sec-2-3-7">
          <title>Geometric mean of Recall</title>
          <p>and Specificity, used to
assess classification balance.</p>
          <p>Taken together, these metrics provide a holistic view of the performance of a classification
model, allowing to evaluate not only the overall accuracy, but also the recognition efficiency of
each class and the balance of the classification.</p>
          <p>
            Interpreting ML results is an important part of modern research, as it allows not only to assess
the quality of the prediction, but also to understand the contribution of each feature to the model’s
decision-making. One of the most common approaches is the SHapley Additive exPlanations
(SHAP) method, based on the concept of Shapley values from cooperative game theory [
            <xref ref-type="bibr" rid="ref39">39</xref>
            ]. In the
classical formulation, Shapley values describe the contribution of each player to the overall win of
the coalition. In the context of ML, the “players” are the features, the “coalition” is their set, and the
“win” is the model’s prediction. The idea is to fairly distribute the predicted outcome among all
features depending on their contribution. The Shapley value for feature  is calculated as the
average marginal contribution of this feature to all possible subsets of features.
          </p>
          <p>SHAP has a number of desirable properties: local accuracy (the explanation corresponds to a
specific prediction), additivity (the contributions of features are summed to the prediction), fairness
(features with the same influence have the same values), which makes the method universal for
interpreting different types of models. Due to this, SHAP provides not only a global interpretation,
when the average influence of features in the entire sample is analyzed, but also a local one, which
allows explaining an individual decision for a specific sample. This opens up the possibility of
simultaneously evaluating generalized patterns and controlling the behavior of the model on
individual cases.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Results and Discussion</title>
      <p>The work of the AutoML algorithm, implemented in the EvalML library, made it possible to
automatically select the optimal data processing pipeline and classification algorithm for the task.
The AutoMLSearch function was used for training. Accuracy multiclass was chosen as the target
function, which served as the main optimization criterion. The maximum number of model search
iterations was set at 25, which allowed, on the one hand, to save computational resources, and on
the other hand, to ensure sufficient coverage of the space of possible models. The parameter
n_jobs=-1 ensured the use of all available processor cores, increasing the efficiency of calculations.
The initial value of the random number generator (random_seed=22) guaranteed the
reproducibility of the experiment.</p>
      <p>In the AutoML search process, K-fold stratified cross-validation was applied to the training
subsample. The number of folds corresponded to the default parameters of the framework (5
stratified splits). At each fold, all stages of data preprocessing and the selected classifier were
trained on the training part, after which their effectiveness was checked on the validation part. For
each pipeline, the target metric (accuracy multiclass) was calculated and averaged over the results
of all folds, forming the mean_cv_score indicator. It was this average CV score that determined the
model's place in the ranking (Figure 2).</p>
      <p>
        The best model was selected according to the performance values and used for further analysis.
The optimal pipeline generated by EvalML consisted of four sequential steps of preprocessing and
classification. The first stage used the Label Encoder, which provided encoding of categorical
variables into a numerical format required for further processing. The next component was the
Imputer, which performed validation and, if necessary, filled in missing values. For numerical
features, the median filling strategy was used. The third step was the Select Columns Transformer,
which selected a subset of the most relevant features for modeling. The final stage was the Extra
Trees Classifier, which belongs to the family of ensemble methods based on random trees [
        <xref ref-type="bibr" rid="ref40">40</xref>
        ].
EvalML automatically selected the hyperparameters: the number of trees n_estimators=997, the
maximum depth max_depth=10, the feature number selection strategy max_features=log2, as well
as the default settings for the partitioning parameters and node weights.
      </p>
      <p>Figure 3 presents the evaluation results of the Extra Trees Classifier model, including the
normalized confusion matrix (%) and multi-class ROC curves.</p>
      <p>The confusion matrix shows that the model provides a high level of correct classifications,
ranging from 89.13% to 95.74%. The lowest rate is observed for the third class, where 8.70% of
examples were incorrectly assigned to the first class, while for other classes the proportion of false
predictions does not exceed 4.35%. On average, the classification accuracy is about 96.5%, which
indicates a balanced quality of recognition between classes. The ROC curves reflect the ratio
between the frequency of true positive and false positive classifications when changing the model
threshold. High values of the area under the curve (AUC &gt; 0.92) were recorded for all four classes.
The curves are located significantly above the random guess line, which confirms the stable ability
of the model to distinguish positive and negative examples.</p>
      <p>In order to get a more complete picture of the model's performance, the model performance
indicators were calculated. The obtained values are given in Table 2.</p>
      <p>The set of obtained values of these metrics confirmed the high efficiency of the constructed
model, which not only demonstrated high overall accuracy, but also maintained the proper balance
between positive class detection, correct negative class recognition, and prediction reliability.</p>
      <p>To interpret the model's performance, the SHAP method was used in the KernelExplainer
variant for the multi-class classification task. The choice of this particular method is explained by
the fact that the resulting pipeline can contain various preprocessing steps and classifiers, not
limited to tree algorithms. KernelExplainer, being model-agnostic, ensures the correctness and
stability of the obtained explanations in such conditions. To construct the explanations, a
background distribution was formed based on the training subsample. Using the training data as
the background prevented information leakage from the test set, preserving the purity of the
evaluation. Next, a subset of test examples was formed for analysis, limited to 50 samples. This
provided a balance between reducing computational complexity and preserving the statistical
representativeness of the test data. The resulting subset was used to calculate local and global
SHAP values, allowing us to estimate the contribution of each feature to the class probability
prediction.</p>
      <p>Figure 4 presents a global SHAP bar plot that displays the mean absolute SHAP values for all
features, averaged across classes.</p>
      <p>This visualization allows us to assess the relative contribution of each feature to the
decisionmaking process of the model. The most influential feature was the Skewness indicator (Sk_t),
which indicates its greatest contribution to prediction and a decisive role in class differentiation. In
second place in importance is the Kurtosis (Kur_t), which also plays a significant role in
decisionmaking. The next most important are Range (Ra_t) and Maximum (Max_t), which demonstrate a
moderate impact on the classification results. The features Mean (Mean_t), Median (Med_t), Mode
(Mo_t), Standard Deviation (StD_t) and Minimum (Min_t) have close values of the average absolute
impact, which indicates their additional role in the prediction process. The value of Sample
Variance (SV_t) practically did not demonstrate a noticeable contribution to the model's work.</p>
      <p>Figure 5 shows a force plot for sample #35, which reflects the local explanation of the model's
prediction when assigning the sample to class 4.</p>
      <p>The horizontal scale shows the deviation of the forecast from the base value, which was 0.1676.
The final value for class 4 is 0.94, which corresponds to the high confidence of the model in the
correct classification of this sample. The visualization shows how individual features affected the
forecast bias. According to the explanation given, for this sample the most significant predictors
were the Skewness (Sk_t) and Kurtosis (Kur_t), which is consistent with the global results of the
SHAP analysis, where these features also took leading positions. Thus, the SHAP analysis confirms
that the model forecast for sample #35 is well-founded. Key features (Sk_t, Kur_t, Max_t, Mean_t)
provided a confident assignment to class 4, while their role in the forecasts for other classes was
minimal or even negative (Table 3).</p>
      <p>Thus, the SHAP analysis not only confirmed the accuracy of the results obtained, but also
ensured the transparency of the model's operation, which is an important factor for its scientific
substantiation and practical application.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Conclusion</title>
      <p>The paper proposed and implemented an approach to diagnosing cardiovascular diseases based on
the time function of the rhythm, taking into account extreme amplitude values of characteristic
ECG waves. The use of the AutoML library EvalML allowed to form an optimal preprocessing and
classification pipeline, where the Extra Trees Classifier algorithm showed the best results with an
average accuracy of about 96.5% and high AUC indicators (&gt;0.92) for all classes.</p>
      <p>Interpretation of the model using SHAP confirmed the transparency of the predictions and
identified the key features (Skewness, Kurtosis, Range, Maximum) that most influenced the
classification results. The obtained results indicate that the combination of AutoML and
Explainable AI provides high efficiency and reliability in the analysis of ECG signals, opening up
prospects for the practical implementation of such systems in clinical diagnostics and their
adaptation for other biomedical analytics tasks.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgements</title>
      <p>The authors would like to thank the ERASMUS-EDU-2025-CBHE project “Digital Transformation
and Curriculum Development for Healthcare Teams” (Digi-CHange - 101233888 – GAP-101233888)
for the idea, inspiration and funds that made this work possible.</p>
    </sec>
    <sec id="sec-6">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this work, the authors used Grammarly in order to grammar and spell
check, and improve the text readability. After using the tool, the authors reviewed and edited the
content as needed to take full responsibility for the publication’s content.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>S.</given-names>
            <surname>Ahmed</surname>
          </string-name>
          ,
          <string-name>
            <surname>M. M. Alshater</surname>
            ,
            <given-names>A. E.</given-names>
          </string-name>
          <string-name>
            <surname>Ammari</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <string-name>
            <surname>Hammami</surname>
          </string-name>
          ,
          <article-title>Artificial intelligence and machine learning in finance: A bibliometric review</article-title>
          ,
          <source>Res. Int. Bus. Financ</source>
          .
          <volume>61</volume>
          (
          <year>2022</year>
          )
          <article-title>101646</article-title>
          . doi:
          <volume>10</volume>
          .1016/j.ribaf.
          <year>2022</year>
          .
          <volume>101646</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>P.-F.</given-names>
            <surname>Tsai</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.-H. Gao</surname>
            ,
            <given-names>S.-M.</given-names>
          </string-name>
          <string-name>
            <surname>Yuan</surname>
          </string-name>
          ,
          <source>Stock Selection Using Machine Learning Based on Financial Ratios, Mathematics</source>
          <volume>11</volume>
          .
          <volume>23</volume>
          (
          <year>2023</year>
          )
          <article-title>4758</article-title>
          . doi:
          <volume>10</volume>
          .3390/math11234758.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>X.</given-names>
            <surname>Tao</surname>
          </string-name>
          , L. Cheng, R. Zhang,
          <string-name>
            <given-names>W. K.</given-names>
            <surname>Chan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Chao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Qin</surname>
          </string-name>
          ,
          <article-title>Towards Green Innovation in Smart Cities: Leveraging Traffic Flow Prediction with Machine Learning Algorithms for Sustainable Transportation Systems</article-title>
          ,
          <source>Sustainability 16.1</source>
          (
          <year>2023</year>
          )
          <article-title>251</article-title>
          . doi:
          <volume>10</volume>
          .3390/su16010251.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>T.</given-names>
            <surname>Yuan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W. Rocha</given-names>
            <surname>Neto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. E.</given-names>
            <surname>Rothenberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Obraczka</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Barakat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Turletti</surname>
          </string-name>
          ,
          <article-title>Machine learning for next‐generation intelligent transportation systems: A survey</article-title>
          ,
          <source>Trans. Emerg. Telecommun. Technol. 33.4</source>
          (
          <year>2021</year>
          ). doi:
          <volume>10</volume>
          .1002/ett.4427.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>S. O.</given-names>
            <surname>Nykytyuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. S.</given-names>
            <surname>Sverstiuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. I.</given-names>
            <surname>Klymnyuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. S.</given-names>
            <surname>Pyvovarchuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y. B.</given-names>
            <surname>Palaniza</surname>
          </string-name>
          ,
          <article-title>Approach to prediction and receiver operating characteristic analysis of a regression model for assessing the severity of the course Lyme borreliosis in children</article-title>
          ,
          <source>Rheumatology</source>
          <volume>61</volume>
          .5 (
          <year>2023</year>
          )
          <fpage>345</fpage>
          -
          <lpage>352</lpage>
          . doi:
          <volume>10</volume>
          .5114/reum/173115.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>M.</given-names>
            <surname>Herasymiuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sverstiuk</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Kit</surname>
          </string-name>
          ,
          <article-title>Multifactor regression model for prediction of chronic rhinosinusitis recurrence</article-title>
          ,
          <source>Wiadomosci Lek. 76.5</source>
          (
          <year>2023</year>
          )
          <fpage>928</fpage>
          -
          <lpage>935</lpage>
          . doi:
          <volume>10</volume>
          .36740/wlek202305106.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>O.</given-names>
            <surname>Yasniy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Didych</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Tymoshchuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Maruschak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Demchyk</surname>
          </string-name>
          ,
          <article-title>Prediction of structural elements lifetime of titanium alloy using neural network, Procedia Struct</article-title>
          . Integr.
          <volume>72</volume>
          (
          <year>2025</year>
          )
          <fpage>181</fpage>
          -
          <lpage>187</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.prostr.
          <year>2025</year>
          .
          <volume>08</volume>
          .090.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>O.</given-names>
            <surname>Yasniy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Tymoshchuk</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Didych</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Zolotyi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Tymoshchuk</surname>
          </string-name>
          ,
          <article-title>Modeling of shape memory alloys hysteresis behavior considering the loading cycle frequency</article-title>
          ,
          <source>Procedia Struct. Integr</source>
          .
          <volume>72</volume>
          (
          <year>2025</year>
          )
          <fpage>188</fpage>
          -
          <lpage>194</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.prostr.
          <year>2025</year>
          .
          <volume>08</volume>
          .091.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>E.</given-names>
            <surname>Aguilar Madrid</surname>
          </string-name>
          , N. Antonio, Short-Term
          <source>Electricity Load Forecasting with Machine Learning, Information 12.2</source>
          (
          <year>2021</year>
          )
          <article-title>50</article-title>
          . doi:
          <volume>10</volume>
          .3390/info12020050.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>W.</given-names>
            <surname>Pannakkong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V. T.</given-names>
            <surname>Vinh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. N. M.</given-names>
            <surname>Tuyen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Buddhakulsomsiri</surname>
          </string-name>
          ,
          <article-title>A Reinforcement Learning Approach for Ensemble Machine Learning Models in Peak Electricity Forecasting</article-title>
          ,
          <source>Energies</source>
          <volume>16</volume>
          .13 (
          <year>2023</year>
          )
          <article-title>5099</article-title>
          . doi:
          <volume>10</volume>
          .3390/en16135099.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Tymoshchuk</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yasniy</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mytnyk</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zagorodna</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tymoshchuk</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          <article-title>Detection and classification of DDoS flooding attacks by machine learning method</article-title>
          .
          <source>CEUR Workshop Proceedings</source>
          ,
          <year>2024</year>
          ,
          <volume>3842</volume>
          , pp.
          <fpage>184</fpage>
          -
          <lpage>195</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>B.</given-names>
            <surname>Lypa</surname>
          </string-name>
          , I. Horyn,
          <string-name>
            <given-names>N.</given-names>
            <surname>Zagorodna</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Tymoshchuk</surname>
          </string-name>
          , T. Lechachenko,
          <article-title>Comparison of feature extraction tools for network traffic data</article-title>
          ,
          <source>CEUR Workshop Proceedings</source>
          ,
          <volume>3896</volume>
          ,
          <year>2024</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>11</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>H. K. P. Katamreddi</surname>
            ,
            <given-names>T. K.</given-names>
          </string-name>
          <string-name>
            <surname>Battula</surname>
          </string-name>
          ,
          <article-title>A hybrid approach for machine learning based beat classification of ECG using different digital differentiators</article-title>
          and DTCWT,
          <source>Comput. Biol. Med</source>
          .
          <volume>194</volume>
          (
          <year>2025</year>
          )
          <article-title>110426</article-title>
          . doi:
          <volume>10</volume>
          .1016/j.compbiomed.
          <year>2025</year>
          .
          <volume>110426</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>K.J.K.</given-names>
            ,
            <surname>Sree Janani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Janani Kuralnatham; R.S</surname>
          </string-name>
          .,
          <string-name>
            <surname>Sabeenian</surname>
            ,
            <given-names>R. S.,</given-names>
          </string-name>
          <article-title>Machine Learning-based ECG Classification using Wavelet Scattered Features</article-title>
          ,
          <source>AIUB J. Sci. Eng</source>
          .
          <source>(AJSE) 23.2</source>
          (
          <year>2024</year>
          )
          <fpage>168</fpage>
          -
          <lpage>176</lpage>
          . doi:
          <volume>10</volume>
          .53799/ajse.v23i2.
          <fpage>821</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>A.</given-names>
            <surname>Mohapatra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Dastidar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. K.</given-names>
            <surname>Mohapatra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. N.</given-names>
            <surname>Mohanty</surname>
          </string-name>
          ,
          <article-title>Abnormal ECG Detection using Optimized Boosting Tree Classifier</article-title>
          , in: 2022
          <source>OITS International Conference on Information Technology (OCIT)</source>
          , IEEE,
          <year>2022</year>
          . doi:
          <volume>10</volume>
          .1109/ocit56763.
          <year>2022</year>
          .
          <volume>00012</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>J.</given-names>
            <surname>Lim</surname>
          </string-name>
          , D. Han,
          <string-name>
            <given-names>M.</given-names>
            <surname>Pirayesh Shirazi Nejad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. H.</given-names>
            <surname>Chon</surname>
          </string-name>
          ,
          <article-title>ECG classification via integration of adaptive beat segmentation and relative heart rate with deep learning networks</article-title>
          ,
          <source>Comput. Biol. Med</source>
          .
          <volume>181</volume>
          (
          <year>2024</year>
          )
          <article-title>109062</article-title>
          . doi:
          <volume>10</volume>
          .1016/j.compbiomed.
          <year>2024</year>
          .
          <volume>109062</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>D. E. P.</given-names>
            <surname>Moghaddam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Razavi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Aazhang</surname>
          </string-name>
          ,
          <article-title>Cardiac Condition Classification Using 12-Lead ECG Recordings</article-title>
          ,
          <source>in: 2024 58th Asilomar Conference on Signals, Systems, and Computers</source>
          , IEEE,
          <year>2024</year>
          , pp.
          <fpage>1349</fpage>
          -
          <lpage>1353</lpage>
          . doi:
          <volume>10</volume>
          .1109/ieeeconf60004.
          <year>2024</year>
          .
          <volume>10942637</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>N.</given-names>
            <surname>Baghel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. K.</given-names>
            <surname>Dutta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Burget</surname>
          </string-name>
          ,
          <article-title>Automatic diagnosis of multiple cardiac diseases from PCG signals using convolutional neural network</article-title>
          ,
          <source>Comput. Biomed</source>
          .
          <volume>197</volume>
          (
          <year>2020</year>
          )
          <article-title>105750</article-title>
          . doi:
          <volume>10</volume>
          .1016/j.cmpb.
          <year>2020</year>
          .
          <volume>105750</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>D.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Hu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Xing</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Gini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. S.</given-names>
            <surname>Greco</surname>
          </string-name>
          , WaveGRU-Net:
          <article-title>Robust Non-contact ECG Reconstruction via MIMO Millimeter-Wave Radar and Multi-Scale Semantic Analysis, Signal Process</article-title>
          . (
          <year>2025</year>
          )
          <article-title>110108</article-title>
          . doi:
          <volume>10</volume>
          .1016/j.sigpro.
          <year>2025</year>
          .
          <volume>110108</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>S.</given-names>
            <surname>Lamba</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Diwakar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A Novel</given-names>
            <surname>Frequency-Stratified Transformer</surname>
          </string-name>
          <article-title>Framework with Cross-Frequency Attention for Reliable Cardiac Arrhythmia Classification, Knowledge-Based Syst</article-title>
          . (
          <year>2025</year>
          )
          <article-title>114250</article-title>
          . doi:
          <volume>10</volume>
          .1016/j.knosys.
          <year>2025</year>
          .
          <volume>114250</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>X.</given-names>
            <surname>Lyu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Rani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Manimurugan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Maple</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Feng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A</given-names>
            <surname>Deep</surname>
          </string-name>
          Neuro
          <article-title>-Fuzzy Method for ECG Big Data Analysis via Exploring Multimodal Feature Fusion</article-title>
          ,
          <source>IEEE Trans. Fuzzy Syst</source>
          . (
          <year>2024</year>
          )
          <fpage>1</fpage>
          -
          <lpage>13</lpage>
          . doi:
          <volume>10</volume>
          .1109/tfuzz.
          <year>2024</year>
          .
          <volume>3416217</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>M.</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Wadhvani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Rasool</surname>
          </string-name>
          ,
          <article-title>Deep learning-based real-time diagnosis of cardiac diseases through behavioral changes in ECG signals</article-title>
          ,
          <source>Biomed. Signal Process. Control</source>
          <volume>104</volume>
          (
          <year>2025</year>
          )
          <article-title>107532</article-title>
          . doi:
          <volume>10</volume>
          .1016/j.bspc.
          <year>2025</year>
          .
          <volume>107532</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <surname>P. B. Patil</surname>
            ,
            <given-names>V. B.</given-names>
          </string-name>
          <string-name>
            <surname>Reddy</surname>
            ,
            <given-names>A. P. S.,</given-names>
          </string-name>
          <article-title>Prediction of Cardiovascular Diseases by Integrating Electrocardiogram (ECG) and Phonocardiogram (PCG) Multi-Modal Features using Hidden Semi Morkov Model</article-title>
          ,
          <source>Int. J. Recent Innov. Trends Comput. Commun</source>
          .
          <volume>10</volume>
          .10 (
          <year>2022</year>
          )
          <fpage>32</fpage>
          -
          <lpage>44</lpage>
          . doi:
          <volume>10</volume>
          .17762/ijritcc.v10i10.
          <fpage>5732</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>A.</given-names>
            <surname>Dua</surname>
          </string-name>
          ,
          <string-name>
            <surname>Emotion Detection</surname>
          </string-name>
          <article-title>Using Machine Learning</article-title>
          ,
          <source>ECG Signals and Facial Features, in: 2024 2nd International Conference on Intelligent Data Communication Technologies and Internet of Things (IDCIoT)</source>
          , IEEE,
          <year>2024</year>
          . doi:
          <volume>10</volume>
          .1109/idciot59759.
          <year>2024</year>
          .
          <volume>10467415</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <surname>H. K. P. Katamreddi</surname>
            ,
            <given-names>T. K.</given-names>
          </string-name>
          <string-name>
            <surname>Battula</surname>
          </string-name>
          ,
          <article-title>A hybrid approach for machine learning based beat classification of ECG using different digital differentiators</article-title>
          and DTCWT,
          <source>Comput. Biol. Med</source>
          .
          <volume>194</volume>
          (
          <year>2025</year>
          )
          <article-title>110426</article-title>
          . doi:
          <volume>10</volume>
          .1016/j.compbiomed.
          <year>2025</year>
          .
          <volume>110426</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>S.</given-names>
            <surname>Mavaddati</surname>
          </string-name>
          ,
          <article-title>ECG arrhythmias classification based on deep learning methods and transfer learning technique</article-title>
          ,
          <source>Biomed. Signal Process. Control</source>
          <volume>101</volume>
          (
          <year>2025</year>
          )
          <article-title>107236</article-title>
          . doi:
          <volume>10</volume>
          .1016/j.bspc.
          <year>2024</year>
          .
          <volume>107236</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>P.</given-names>
            <surname>Salas</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.-M. Mejia-Muñoz</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Gonzalez-Landaeta</surname>
          </string-name>
          ,
          <article-title>Fog-Enabled Multimodal Chest-Worn Device for Systolic Blood Pressure Monitoring</article-title>
          , IEEE Access (
          <year>2025</year>
          )
          <article-title>1</article-title>
          . doi:
          <volume>10</volume>
          .1109/access.
          <year>2025</year>
          .
          <volume>3571829</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>X.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Jiang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zhu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <fpage>MDD2DG</fpage>
          -IRA:
          <article-title>Multivariate Degree Distribution to Dynamic Graph With Inter-Channel Relevance Attention Mechanism for Multi-Channel Myocardial Infarction ECG Analysis</article-title>
          ,
          <string-name>
            <surname>IEEE J. Biomed. Inform.</surname>
          </string-name>
          (
          <year>2025</year>
          )
          <fpage>1</fpage>
          -
          <lpage>12</lpage>
          . doi:
          <volume>10</volume>
          .1109/jbhi.
          <year>2025</year>
          .
          <volume>3554309</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>B. B.</given-names>
            <surname>Purkayastha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Barma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Saikia</surname>
          </string-name>
          ,
          <article-title>A Resource-Efficient Cardiac Arrhythmia Detection Using Nonlinear Dynamics in Optimized Delay State Networks</article-title>
          ,
          <source>IEEE Trans. Biomed</source>
          . Eng. (
          <year>2025</year>
          )
          <fpage>1</fpage>
          -
          <lpage>13</lpage>
          . doi:
          <volume>10</volume>
          .1109/tbme.
          <year>2025</year>
          .
          <volume>3605297</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>M. N.</given-names>
            <surname>Mohanty</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Baliarsingh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. K.</given-names>
            <surname>Panda</surname>
          </string-name>
          ,
          <article-title>An Ensemble Technique for Cardiac Data Compression in Smart Healthcare System</article-title>
          ,
          <source>SN Comput. Sci. 6</source>
          .
          <issue>1</issue>
          (
          <year>2025</year>
          ).
          <source>doi:10.1007/s42979-024- 03605-7.</source>
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>L. M.</given-names>
            <surname>Lopez-Ramos</surname>
          </string-name>
          ,
          <source>Future Perspectives on Automated Machine Learning in Biomedical Signal Processing, in: Communications in Computer and Information Science</source>
          , Springer International Publishing, Cham,
          <year>2022</year>
          , pp.
          <fpage>159</fpage>
          -
          <lpage>170</lpage>
          . doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>031</fpage>
          -10525-8_
          <fpage>13</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>V.</given-names>
            <surname>Malhotra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Sandhu</surname>
          </string-name>
          ,
          <article-title>Improved ECG based Stress Prediction using Optimization and Machine Learning Techniques</article-title>
          ,
          <source>ICST Trans. Scalable Inf. Syst</source>
          . (
          <year>2018</year>
          )
          <article-title>169175</article-title>
          . doi:
          <volume>10</volume>
          .4108/eai.6- 4-
          <year>2021</year>
          .169175.
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <string-name>
            <given-names>L.</given-names>
            <surname>Mosiy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sverstiuk</surname>
          </string-name>
          .
          <article-title>Information technology for electrocardiographic signal analysis based on mathematical models of temporal and amplitude variability</article-title>
          .
          <source>Comput. Syst. Inf. Technol. (2)</source>
          (
          <year>2025</year>
          )
          <fpage>36</fpage>
          -
          <lpage>44</lpage>
          . doi:
          <volume>10</volume>
          .31891/csit-2025
          <source>-2-4</source>
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [34]
          <string-name>
            <given-names>J.</given-names>
            <surname>Zheng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Guo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Chu</surname>
          </string-name>
          .
          <article-title>A large scale 12-lead electrocardiogram database for arrhythmia study (Version 1</article-title>
          .0.0),
          <source>PhysioNet</source>
          (
          <year>2022</year>
          ). doi:
          <volume>10</volume>
          .13026/WGEX-ER52
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [35]
          <string-name>
            <given-names>D.S.</given-names>
            <surname>Baim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.S.</given-names>
            <surname>Colucci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.S.</given-names>
            <surname>Monrad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.S.</given-names>
            <surname>Smith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.F.</given-names>
            <surname>Wright</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Lanoue</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.F.</given-names>
            <surname>Gauthier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.J.</given-names>
            <surname>Ransil</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Grossman</surname>
          </string-name>
          ,
          <string-name>
            <surname>E. Braunwald.</surname>
          </string-name>
          <article-title>Survival of patients with severe congestive heart failure treated with oral milrinone</article-title>
          .
          <source>J. Am. Coll. Cardiol</source>
          .
          <volume>7</volume>
          (
          <issue>3</issue>
          ) (
          <year>1986</year>
          )
          <fpage>661</fpage>
          -
          <lpage>670</lpage>
          . doi:
          <volume>10</volume>
          .1016/s0735-
          <volume>1097</volume>
          (
          <issue>86</issue>
          )
          <fpage>80478</fpage>
          -
          <lpage>8</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          [36]
          <string-name>
            <given-names>P.</given-names>
            <surname>Laguna</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.G.</given-names>
            <surname>Mark</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Goldberg</surname>
          </string-name>
          , G.B. Moody,
          <article-title>A database for evaluation of algorithms for measurement of QT and other waveform intervals in the ECG</article-title>
          ,
          <source>in: Computers in Cardiology</source>
          <year>1997</year>
          , IEEE. doi:
          <volume>10</volume>
          .1109/cic.
          <year>1997</year>
          .648140
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          <source>[37] Automated Machine Learning (AutoML) Search - EvalML 0.84</source>
          .
          <article-title>0 documentation</article-title>
          . URL: https://evalml.alteryx.com/en/stable/user_guide/automl.html.
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          [38]
          <article-title>Classification performance metrics and indices</article-title>
          . https://adriancorrendo.github.io/metrica/articles/available_metrics_classification.html.
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          [39]
          <article-title>GitHub - shap/shap: A game theoretic approach to explain the output of any machine learning model</article-title>
          . URL: https://github.com/shap/shap.
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          [40]
          <article-title>ExtraTreesClassifier</article-title>
          . URL: https://scikit-learn.org/stable/modules/generated/sklearn.ensemble. ExtraTreesClassifier.html.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>