=Paper=
{{Paper
|id=Vol-2667/paper32
|storemode=property
|title=Transfer Learning for tuberculosis screening by single-channel ECG 
|pdfUrl=https://ceur-ws.org/Vol-2667/paper32.pdf
|volume=Vol-2667
|authors=Valeriia Guryanova
}}
==Transfer Learning for tuberculosis screening by single-channel ECG ==
<pdf width="1500px">https://ceur-ws.org/Vol-2667/paper32.pdf</pdf>
<pre>
       Transfer Learning for tuberculosis screening by
                    single-channel ECG
                                                             Valeriia Guryanova
                                             Faculty of Computational Mathematics and Cybernetics
                                                      Lomonosov Moscow State University
                                                                Moscow, Russia
                                                           kibitovavaleria@gmail.com
                                                         ORCID: 0000-0002-3613-8103


   Abstract—Tuberculosis is one of the leading causes of death in          The motivation for this study was the work that shows the
the world. The majority of the population is not able to regularly relationship between cardiological and tuberculosis diseases
conduct specific e x aminations, s u ch a s x - ray e x aminations, for [4]. Besides, often the tuberculosis bacteria themselves affect
the presence of tuberculosis. Currently, there are mobile devices
for measuring ECG, which allow taking measurements without the heart and thus affect its electrical activity. At the moment,
leaving home. This article explores the possibility of determining this is the first work that is devoted to the problem of
tuberculosis based on a single-channel mobile ECG. One of the tuberculosis detection via single-channel mobile ECG.
general top-performance neural networks is used as a classifier.           Now neural networks are one of the most popular methods
This article also explored the possibility of such a classification for analyzing medical signals and images. There are plenty
based not on raw data, but the generated image. The image
interprets the prediction of the neural network and makes it of works in which medical images and ECGs are analyzed
possible for the doctor to understand the model’s decision better. using neural networks (NN), for example, [5], [6], [7], [8],
The article shows the promising performance and provides proof [9]. Usually, raw ECGs are analyzed using recurrent [10] or
of concept of such screening. Different ratios of precision and convolutional neural [8] networks with 1d convolutions. In this
recall are provided, which can be adjusted depending on the paper, it is proposed to analyze an ECG visualization and not a
situation.
   Keywords—Neural Networks, ECG classification, Tubercu-               raw signal. So the image but not a signal is analyzed. There are
losis, Deep Learning, Transfer Learning                                 some works in which neural networks analyze the image of an
                                                                              ECG signal [11] [12] and show good performance. However,
                          I. I NTRODUCTION                                    in these articles, architectures were trained from scratch and
                                                                              were specially selected for the task. There are large number of
   Tuberculosis (TB) is caused by bacteria (Mycobacterium
                                                                              pretrained architectures for images [13], [14], [15]. Since, for
tuberculosis) that most often affect the lungs. Tuberculosis is
                                                                              many tasks, pretrained architectures show significantly better
curable and preventable. TB is easily spread from person to
                                                                              performance than architectures trained from scratch [16], there
person through the air. TB is one of the ten leading causes of
                                                                              is an assumption that they will show better performance on
death in the world [1]. It is crucial to determine tuberculosis
                                                                              the paper’s task. There is work [17] that shows that pretrained
in time to prevent its spread and begin treatment of an ill
                                                                              architectures can be used as a feature extractor for phono-
person. It takes a long time for both the physician and the
                                                                              cardiogram signals (PCG), which also provides additional
patient to carry out specific tests for TB (e.g., such as X-rays or
                                                                              motivation for exploring the possibility of using pretrained
Xpert MTB/RIF), and as such, they cannot be conducted with
                                                                              architectures.
relatively high regularity. So, it would be useful to have some
                                                                                 This paper aims to investigate the possibility of determining
pre-screening, which could be done quite often and which
                                                                              tuberculosis from ECG images, the possibility of using pre-
would allow identifying people with a high chance of having
                                                                              trained models for such tasks, and of the interpretation of such
tuberculosis and conduct specific t e sts o n ly f o r them.
                                                                              models. The main scientific novelty of this paper is the use of
   ECG is a signal that displays the electronic activity of
                                                                              pre-trained architecture for ECG classification. It is also the
the heart. Each ECG recording shows the potential difference
                                                                              first article that explores the feasibility of determining tubercu-
between two electrodes located on the surface of the body.
                                                                              losis by a single-channel ECG and explores the relationships
Each of the measured potential differences is called a lead.
                                                                              of metrics that a solution can achieve.
In medical institutions, 12-lead ECGs are commonly used.
                                                                                 The paper is organized as follows. Initially, the dataset and
Currently, on the market, there are mobile devices that are
                                                                              data preprocessing methods are described. Then, the process
capable of reading an electrocardiogram (ECG) of a person,
                                                                              of generation of the ECG image and the architecture of the
for example, AliveCor [2], CardioQvark [3]. Such
                                                                              neural network are given. Then the results of experiments and
devices read only one of the 12 leads. Such devices can be used
                                                                              interpretation of the neural network are reported. Finally, the
as often as required. Anybody can have just a few devices per
                                                                              limitations of the article are analyzed, and the conclusion is
institution or organization to conduct such pre-screenings
                                                                              given.
every day. People who have a high chance of having a disease
will need to see a TB specialist as soon as possible.


Copyright © 2020 for this paper by its authors.
Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0)
Data Science


                               II. DATASET                                    Since the signals were taken from a device that can have
                                                                           noise, and the signal could disappear for various reasons, it is
  All data used in this study was collected using the Cardio-
                                                                           necessary to remove signals that contain much noise or do not
Qvark device [3]. The dataset consists of 1232 ECGs from
                                                                           contain any useful information. R-peaks that represent heart-
136 people with TB and 3609 ECGs from people without
                                                                           beat were calculated using the Pan–Tompkins algorithm [19].
TB. Dataset sex distribution among people with tuberculosis
                                                                           Then the distances between the two peaks were calculated.
and people without tuberculosis is shown in Fig.1. Dataset
                                                                           It was measured heuristically that the signal section almost
age distribution among people with tuberculosis and people
                                                                           certainly does not contain useful information if the distance
without tuberculosis is shown in Fig.2.
                                                                           between the two R-peaks is more than 2.6s or less than 0.3s.
                                                                           The signal was removed from the dataset if there were more
                                                                           than 55 percent of such distances in it.
                                                                              After dividing the long signals into sub-signals and remov-
                                                                           ing the signals that contain mainly noise, 13776 signals were
                                                                           obtained. Ten thousand ninety-seven of them belong to 367
                                                                           TB-positive people, and 3679 of them belong to 136 TB-
                                                                           negative people. More than 90 percent of the resulting signals
                                                                           is 100 seconds long. The number of signals per TB-positive
                                                                           patients varies from 1 to 1540. The number of signals per TB
                                                                           negative patients varies from 3 to 198.
                                                                              The image was generated using the Matplotlib plot function
                                                                           [20]. The whole ECG signal was divided into seven parts
Fig. 1. Dataset sex distribution.                                          of 2857 points each, with dots per inch of 70. The size of
                                                                           the figure was set to 7.4 by 7.4 inches. Since the matplotlib
                                                                           generates an image in the RGBA format, and most neural
                                                                           networks use RGB format, only the 4th channel was used. This
                                                                           channel was multiplied to create three channels. The example
                                                                           of the generated image is given in Fig. 3.


Fig. 2. Dataset age distribution.

   ECG recordings were sampled as 1000 Hz, and they have
been filtered b y t h e C a rdioQvark d e vice. T h e l e ngth o f ECG
recordings varies from 30.5 seconds to 900 seconds. For trend
subtraction, the median filter w i th a k e rnel s i ze o f 1 8 7 was
used.

      III. I MAGE G ENERATION P ROCESS AND N EURAL
                 N ETWORK A RCHITECTURE
   Signals were resampled to a frequency of 200 Hz via the
polyphase resample method from the SciPy Python library [18]
to reduce the image size. Signals longer than 100 seconds were Fig. 3. Generated ECG image.
divided into sub-signals. The signal is divided sequentially into
sections of 100 seconds to receive the sub-signals, if the length    ResNext 101 [15] was used as the main model for the
of the last section is less than 30 seconds, then it was not added experiments. The main idea of this model is the introduction
to the sample set.                                                 of the new dimension, which authors called ”cardinality” -
                                                                   the size of the set of transformations. The main function of


VI International Conference on "Information Technology and Nanotechnology" (ITNT-2020)                                                 150
Data Science


this dimension is the control over complex transformations.                The test set contains original 1130 ECGs from 144 patients.
Even while maintaining complexity, increasing ”cardinality”                Among them, 37 patients with 288 ECGs have TB. After the
can improve classification accuracy. In this paper, a model                preprocessing stage, the validation set has 3205 ECGs and
with a ”cardinality” value of 32 was used. This model is one               among them 857 from TB-positive persons. The number of
of the top best models for classification on ImageNet [21] and             patients after the preprocessing stage stays the same in all
is easy to use and retrain on new data.                                    datasets.
   This model has four blocks of convolutional layers. The                    For reproducibility, all neural network configurations were
following neural network configurations were used to explore               trained with a random state of 10. This random state was fixed
the possibilities of transfer learning: the completely untrained           for NumPy, PyTorch CPU, and GPU variables and also for
network (UN), pretrained on ImageNet network with no frozen                standard Python random library. Also, the CUDA state was
layers (PN), pretrained on ImageNet network with frozen layer              set to deterministic. For all configurations, Adam was used as
1 (or 2, 3, 4) and all layers before it (L1(2, 3, 4)). In the case         an optimizer, and batch size was set to 16. For the untrained
of freezing up to layer four, only one fully connected classifier          network, the learning rate was set to 0.0005 and is dropped
layer is trained.                                                          by a factor of 0.0001 after the 8th epoch. For the pretrained
   Augmentation is a method in which artificially generated                network, the learning rate was set to 0.0005 and is dropped
data is added to the original data. This method is one of the              by a factor of 0.001 after the 8th. For the pretrained network
methods that is widely used to improve the quality of neural               with unfrozen and frozen layers, the learning rate was set to
network models [22]. Online augmentation is a method that                  0.0001 and is dropped by a factor of 0.001 after the first and
replaces the part of the data to the synthetic one during the              fourth epoch. All parameters that are not described were set
training stage. A study was conducted that investigated online             to default values of PyTorch library [24]. The metric that was
augmentation for the classification quality improvement of                 used for early stopping was the area under the precision-recall
single-channel ECGs by using a neural network [23]. The data               (PR AUC) curve since classes are not balanced in samples. The
that was used in that article contains the dataset discussed in            precision-recall curve illustrates the ratio between precision
this study. However, that study did not use the representation of          and recall of different thresholds. A high value of precision
the signal as an image, nor did it study the possibilities of such         indicates that a model has a low false-positive rate, and a high
training. Research in an article on online augmentation has                value of recall shows that model has a low false-negative rate.
shown that it helps in improving the quality of classification             A high value of the PR AUC corresponds to both high recall
of single-channel signals.                                                 and high precision.
   One of the most promising methods was the ResampleParts                    Another important metric that is useful to take into con-
method. This method resamples parts of the signal with a                   sideration during model evaluation is an area under the ROC
specific resampling coefficient using the polyphase resample               curve (ROC AUC). ROC curve shows the ratio between the
method from the SciPy Python library [18]. The specific size               true positive rate and the false positive rate. This metric could
of the signal part and resampling coefficient is set at random             be useful as it shows the probability that the model ranks
each time from a certain interval. Due to the reason mentioned             a random positive example higher than a random negative
above, this augmentation method was chosen to increase the                 example. However, this metric can be sensitive to unbalanced
quality of tuberculosis detection.                                         classes in data, and so it should not be the only metric
                                                                           for performance evaluation. Table I shows the results from
               IV. E XPERIMENTS AND R ESULTS
                                                                           different network configurations v ia t he ROC AUC s core and
   All experiments were conducted using Python language.                   PR AUC score.
PyTorch [24] was used as the main neural network framework.
Signals were read and transformed using NumPy [25] and
                                                                                              TABLE I. EXPERIMENT RESULTS
SciPy [18] libraries.
   To correctly evaluate the results, the entire dataset was
divided into training, test, and validation set. The training set              Quality                    Network Configuration
                                                                               Criteria     UN     PN         L1     L2         L3       L4
was used to train the model. The validation set was used to                   ROC AUC     0.9079   0.93     0.934  0.9307     0.9138   0.8089
select the hyperparameters and for early stopping. A test set                  PR AUC     0.7956   0.83      0.84  0.8399     0.8081   0.615
was used to check the final performance. The full dataset was
partitioned into three parts in a specific way, such that each
of the three groups contained different patients. The train set               As can be seen from table I, the best performance is
contains 2648 original ECGs from 247 patients. Among them,                 obtained with configurations L1 and L2. This table shows that
69 patients with 689 ECGs have TB. After the preprocessing                 transfer learning is useful for classifying ECG signals. These
stage the train set has 7495 ECGs and among them 2058 from                 results also correspond to the logic that the first layers of a
TB-positive persons. The validation set contains original 1062             neural network learn simple dependencies such as lines and
ECGs from 112 patients. Among them, 30 patients with 255                   simple shapes. When freezing a neural network up to the last
ECG have TB. After the preprocessing stage, the validation set             layers, a worse performance can be gotten than on a network
has 3076 ECGs and among them 765 from TB-positive person.                  without pretraining (UN), since the features on the last layers


VI International Conference on "Information Technology and Nanotechnology" (ITNT-2020)                                                          151
Data Science


are not relevant for this specific task. Thus, based on metrics,              Table II, table III, and table IV show the performance of
L1 can be considered the best model.                                       models trained with augmentation and without it. As shown in
   For the improvement of this best model earlier described,               these tables, the results are quite promising and customizable,
online augmentation was used. The interval for the resampling              depending on the situation. These tables show that augmen-
part was set to (40, 200). The two types of resampling were                tation increases the values of Fβ and F Cβ when β equals to
used: frequency decreasing (FD) and frequency increasing                   0.25 and 1, but does not improve the values when β equals to
(FI). The resulting sample rate is calculated as the upsample              2. It can be explained by the fact that augmentation parameters
factor divided on the downsampling factor. For the frequency,              were selected to maximize PR AUC value. This value is most
increasing the upsample factor was getting from 2 to 3, and                correlated with Fβ when β equals to 1. As can be seen from
the downsampling factor was gotten from 3 to 5. For the                    the ratio between precision and recall, the main increase in the
frequency decreasing upsample, factor was gotten from 3 to                 PR AUC metric was associated with an increase in precision.
6, and the downsampling factor was gotten from 1 to 3.                     It explains the increase in performance when β equal to 0.25.
The probabilities of augmentations was chosen from values                  Perhaps it is necessary to focus on other metrics during the
[0.1, 0.2, 0.3, 0.4, 0.5]. The best model was determined based             process of selection of augmentation parameters to increase
on the PR AUC metric on the validation set. The best model                 the value of F2.
was the model with the probability of augmentation of 0.5.
The best model has 0.9475 ROC AUC and 0.875 PR AUC                                                    TABLE II. EXPERIMENT RESULTS
on the test set. Based on the results of the experiments, it
is reasonable to conclude that augmentation improves model                       β                P                       R                        Fβ
performance.                                                                               M1            M2        M1             M2        M1            M2
                                                                               0.25      0.8918        0.9147     0.471        0.55075     0.617         0.687
   In most real-life scenarios, the probability of certain events                1        0.756         0.785    0.7911         0.782      0.773         0.784
can not be used to make a decision. A specific label is                          2        0.665        0.6274     0.878         0.9393     0.757        0.7523
assigned to each patient using probability thresholding. To
explore the balance between precision and recall in a situation
of unbalanced classes, Fβ score metric is used:
                                                                                                  TABLE III. EXPERIMENT RESULTS
                                  precision × recall
               Fβ = (1 + β 2 )                          .
                                 β 2 precision + recall                          β                PC                      RC                    F Cβ
                                                                                           M1            M2        M1            M2         M1        M2
                                                                               0.25      0.8688         0.894    0.5559        0.6223      0.678     0.734
The β is in range (0, +∞). The greater the β is, the larger the                  1        0.726         0.76      0.863         0.856      0.789    0.8059
weight the recall has. β can be selected differently based on the                2        0.621        0.5711     0.937        0.9825     0.7475    0.7223
situation. If we want to find all TB-positive people and the cost
of additional procedures is not that important, then we give the
recall the greater weight. If the cost of additional procedures
                                                                                                  TABLE IV. EXPERIMENT RESULTS
is crucial, then the precision is much more important even at
the cost of skipping some TB-positive people.
   This article investigated the following beta threshold op-                     β               A                      A1                       PS
                                                                                           M1           M2        M1            M2         M1            M2
tions: 0.25, 1, 2. The validation set was used for the threshold                 0.25     0.843        0.866    0.8655         0.885     0.8389        0.8498
calculation. The following metrics were investigated: accuracy                     1      0.876        0.884    0.8824         0.895      0.819         0.831
(A), which is the number of right guesses, as well as precision                    2      0.849        0.834    0.8388        0.8076     0.7611        0.7246
(P) and recall (R). Since each cardiogram was divided into
several images, the following rule was used to determine the
label for the entire cardiogram. If at least one cardiogram is                                          V. I NTERPRETATION
labeled as TB-positive, then the entire cardiogram is labeled                 Doctors often want to understand why a neural network
TB-positive. The metrics that are determined for the entire                made a confident decision. Therefore, the interpretation of the
ECG are called: AC, PC, and RC for accuracy for ECG,                       neural network is quite useful. Besides, interpretation can be
precision for ECG, and recall for ECG, respectively. Since                 used to identify previously unknown signs of the disease to
one patient can have multiple ECG, the patient score for                   think through treatment in the future.
cardiogram labels was calculated. The patient score (PS)                      Grad-CAM [26] shows quite good results in interpreting
equals to mean accuracy for person ECGs averaged through                   neural networks. Grad-CAM uses the gradient information
all people.                                                                flowing into the last convolutional layer of CNN to understand
   Models without augmentation (M1) and with it (M2) were                  the impact on each neuron for a decision of interest. Fig. 4
compared using the metrics described above. Fβ metric was                  and 5 show the interpretation results on the pretrained model
used to determine which model has a better ratio between                   without augmentation and with it. The lighter the points on
recall and precision. Fβ , which shows the ratio between ECG               the image, the more important these points are. Both images
precision and ECG recall is shown as F Cβ .                                represent the same ECG of TB-positive person. As can be seen


VI International Conference on "Information Technology and Nanotechnology" (ITNT-2020)                                                                           152
Data Science


from the images, the neural network draws attention to the
PQST complexes, the distance between the peaks, and also the
height of some peaks. Features selected by the network with
and without augmentation are almost identical. However, the
network with augmentation does not take into consideration
some features as one at the end of the 4th row. Perhaps the
elimination of some noisy features might help to improve the
classification quality.


                                                                           Fig. 5. Interpretation of NN with augmentation on TB-positive person.


                                                                                                     VII. CONCLUSION

                                                                              The new method for classifying the visualization of ECGs
                                                                           using pre-trained architecture was proposed. This method
                                                                           was used on a single-channel ECG to explore the possibility
                                                                           of determination of tuberculosis from such images. It was
Fig. 4. Interpretation of NN without augmentation on TB-positive person.   determined that the neural network could detect tuberculosis
                                                                           by a cardiogram with moderately high performance. The
                        VI. DISCUSSION                                     new method help to improve the quality of the ECG image
   The reader of this article should know that this article                classification. ROC AUC s core w as i ncreased f rom 0 .9079 to
has the following limitations: the sample size of TB-positive              0.934, and PR AUC score was increased from 0.7956 to 0.84.
people is quite small, so for further investigation of perfor-             The significance o f t he r esults s hows t hat t his m ethod should
mance, the bigger sample should be used. It should also be                 be used to classify the ECG for other tasks. For example, to
noted that the sample contains patients who received treatment             detect coronary heart disease and arrhythmia.
for tuberculosis. Due to the existing database structure, it is               Another paper novelty was the confirmation o f t he possi-
impossible to say whether the patient took medicine, and if                bility of increasing the classification q uality b y u sing online
he did, how recent the use is. For more robust experiments, a              augmentation with such architectures and data. It has been
new database must be compiled. However, even in the current                shown that online augmentation can improve the performance
circumstances, this study is useful for the following reasons.             of the classification. I n t his a rticle, t he p erformance was
Let us imagine an organization that employs a TB-positive                  improved from 0.934 ROC AUC to 0.9475 ROC AUC, and
person, who has already started taking medicine and hides                  PR AUC was increased from 0.84 to 0.875. The increase
his condition, thus increasing the chance of infecting people              shows that online augmentation should be used when using
around him. In such a situation, it is still beneficial t o screen         the proposed method.
people with the suggested algorithm. Besides, it is known that                The method for interpretation of such models is described,
people taking medicine may experience an increase in QT-                   which can be used further for drug and disease effect explo-
interval [27]. However, in this dataset, the average value of              ration purposes.
the QT-interval showed a low predictive quality for TB of                     Various ratios of quality metrics of the obtained solution for
about 0.51 ROC AUC. Based on this, we can conclude that                    tuberculosis detection have been received. They show that it is
the neural network finds some other defining signs. Also, with             reasonable to conduct TB pre-screening using mobile ECG in
the help of the neural network interpretation in the future, the           the future to identify potential patients and recommend further
effect of TB-drugs on the heart can be explored.                           specific t ests. T his s tudy i s a n i mportant p roof o f concept.


VI International Conference on "Information Technology and Nanotechnology" (ITNT-2020)                                                             153
Data Science


   Some disadvantages of the dataset were indicated in the                        [18] P. Virtanen, "SciPy 1.0: fundamental algorithms for scientific computing
                                                                                       in Python," Nature Methods, vol. 17, no. 3, pp. 261-272, 2020.
article. In the future, to build a production solution, it is
                                                                                  [19] J. Pan and W. J. Tompkins, "A Real-Time QRS Detection Algorithm,"
reasonable to increase and carefully verify the dataset.                               IEEE Transactions on Biomedical Engineering, vol. BME-32, no. 3, pp.
                                                                                       230-236, 1985.
                      ACKNOWLEDGMENT                                              [20] J. Hunter, "Matplotlib: A 2D Graphics Environment", Computing in
  This article contains the results of a project carried out                           Science & Engineering, vol. 9, no. 3, pp. 90-95, 2007.
                                                                                  [21] D. Mahajan, R. Girshick, V. Ramanathan, K. He, M. Paluri, Y. Li,
within the imple-mentation of the Program of the Center for                            A. Bharambe and L. van der Maaten, ”Exploring the limits of
Competence of the National Tech-nology Initiative “Center for                          weakly supervised pretraining,” Proceedings of the European
Storage and Analysis of Big Data”, supported by the Ministry                           Conference on Computer Vision (ECCV), pp. 181-196, 2018.
of Science and Higher Education of the Russian Federation                         [22] A. Mikołajczyk and M. Grochowski, ”Data augmentation for improving
                                                                                       deep learning in image classification problem,” International
under the Lomonosov Moscow State University Project                                    Interdisciplinary PhD Workshop (IIPhDW), Swinoujście, pp. 117-122,
Support Fund 13/1251/2018 from 11.12.2018.                                             2018.
                                                                                  [23] V. Guryanova, ”Online Augmentation for Quality Improvement of Neu-
                                                                                       ral Networks for Classification of Single-Channel Electrocardiograms,”
                               REFERENCES                                              Communications in Computer and Information Science, pp. 37-49, 2020.
  [1] ”Tuberculosis     (TB)”,    Who.int,     2020     [Online].   URL:          [24] A. Paszke, S. Gross, F. Massa, A. Lerer, J. Bradbury, G. Chanan and
      https://www.who.int/news-room/fact-sheets/detail/tuberculosis.                   T. Killeen, ”PyTorch: An imperative style, high-performance deep
  [2] ”AliveCor”,       Alivecor.com,       2020       [Online].    URL:               learning library,” Advances in Neural Information Processing Systems,
      https://www.alivecor.com.                                                        pp. 8024-8035, 2019.
  [3] ”Cardiomonitor (in Russian) CardioQVARK”, Cardioqvark.ru, 2020              [25] S. van der Walt, S. Colbert and G. Varoquaux, ”The NumPy Array: A
      [Online]. URL: http://www.cardioqvark.ru.                                        Structure for Efficient Numerical Computation,” Computing in Science
  [4] M. Huaman, D. Henson, E. Ticona, T. Sterling and B. Garvy, ”Tu-                  & Engineering, vol. 13, no. 2, pp. 22-30, 2011.
      berculosis and cardiovascular disease: linking the epidemics,” Tropical     [26] R. Selvaraju, M. Cogswell, A. Das, R. Vedantam, D. Parikh and D.
      Diseases, Travel Medicine and Vaccines, vol. 1, no. 1, 2015.                     Batra, ”Grad-cam: Visual explanations from deep networks via
  [5] T. Pashina, A. Gaidel, P. Zelter, A. Kapishnikov and A. Nikonorov, ”Au-          gradient-based localization,” Proceedings of the IEEE international
      tomatic highlighting of the region of interest in computed tomography            conference on computer vision, pp. 618-626, 2017.
      images of the lungs,” Computer Optics, vol. 44, no. 1, pp. 74-81, 2020.     [27] A. A. Bykova, M. I. Chashkina, M. V. Serova, A. V. Simonov
      DOI: 10.18287/2412-6179-CO-659.                                                  and F. Yu. Kopylov, "The prevalence of prolongation of the QT in-
  [6] J. Agafonova, A. Gaidel, P. Zelter and A. Kapishnikov, ”Efficiency               terval in patients receiving antituberculous chemotherapy," Cardiology
      of machine learning algorithms and convolutional neural network for              and Cardiovascular Surgery, vol. 12, no. 2, pp. 146-151, 2019.
      detection of pathological changes in MR images of the brain,” Computer
      Optics, vol. 44, no. 2, pp. 266-273, 2020. DOI: 10.18287/2412-6179-
      CO-671.
  [7] S. Kiranyaz, T. Ince and M. Gabbouj, ”Real-Time Patient-Specific
      ECG Classification by 1-D Convolutional Neural Networks,” IEEE
      Transactions on Biomedical Engineering, vol. 63, no. 3, pp. 664-675,
      2016.
  [8] M. Zihlmann, D. Perekrestenko and M. Tschannen, ”Convolutional
      Recurrent Neural Networks for Electrocardiogram Classification,”
      Computing in Cardiology Conference (CinC), pp. 1-4, 2017.
  [9] Z. Yao, Z. Zhu and Y. Chen, ”Atrial fibrillation detection by multi-scale
      convolutional neural networks,” 20th International Conference on
      Information Fusion (Fusion), Xi’an, pp. 1-6, 2017.
[10] R. Banerjee, A. Ghose and S. Khandelwal, ”A Novel Recurrent Neural
      Network Architecture for Classification of Atrial Fibrillation Using
      Single-lead ECG,” 27th European Signal Processing Conference
      (EUSIPCO), A Coruna, Spain, pp. 1-5, 2019.
[11] P. Sean and J. Huizinga, ”Robust and fast heart rate variability analysis
      of long and noisy electrocardiograms using neural networks and
      images,” arXiv: 1902.06151, 2019.
[12] M. Kim, H. Ko and S. Pan, ”A study on user recognition using 2D ECG
      based on ensemble of deep convolutional neural networks,” Journal of
      Ambient Intelligence and Humanized Computing, vol. 11, no. 5, pp.
      1859-1867, 2019.
 [13] H. Kaiming, X. Zhang, Sh. Ren and J. Sun, ”Deep residual
      learning for image recognition,” Proceedings of the IEEE conference
      on computer vision and pattern recognition, pp. 770-778, 2016.
[14] F. Chollet, ”Xception:            Deep     learning     with    depthwise
      convolutions,” Proceedings of the IEEE conference on computer
      vision and pattern recognition, pp. 1251-1258, 2017.
[15] X. Saining, R. Girshick, P. Dollár, Z. Tu and K. He, ”Aggregated
      residual transformations for deep neural networks,” Proceedings of the
      IEEE conference on computer vision and pattern recognition, pp.
      1492-1500, 2017.
[16] K. Simonyan, and A. Zisserman, ”Very deep convolutional networks
      for large-scale image recognition,” arXiv: 1409.1556, 2014.
[17] H. Alaskar, N. Alzhrani, A. Hussain and F. Almarshed, ”The Imple-
      mentation of Pretrained AlexNet on PCG Classification,” Intelligent
      Computing Methodologies, pp. 784-794, 2019.


VI International Conference on "Information Technology and Nanotechnology" (ITNT-2020)                                                                    154

</pre>