Electrocardiogram Signal Classification Based on Deep Learning Techniques

doi:10.21203/rs.3.rs-3093804/v1

Download PDF

Research Article

Electrocardiogram Signal Classification Based on Deep Learning Techniques

https://doi.org/10.21203/rs.3.rs-3093804/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

One of the most often used diagnostic tools in medicine and healthcare is the electrocardiogram (ECG). When it comes to healthcare prediction problems requiring ECG data, deep learning techniques seem promising. This paper aims to apply deep learning techniques to classify MIT-BIH arrhythmias on publicly available datasets. A new electrocardiogram classification for employing a spectrogram of signals algorithm is proposed. The proposed model depends on convolutional neural networks to automatically learn the characteristics of features and has used convolutional neural networks to detect normal and abnormal ECG heartbeats, with an average detection accuracy of 99.22%.

Artificial Intelligence and Machine Learning

The electrocardiogram is a valuable indicative tool for examining heart muscles. It contains details regarding the heart's structure with the electrical contraction structure's operation. Automatic detection of arrhythmias using ECG signals has become a major research area in recent years, as a manual examination of heartbeat rate is time-consuming and prone to errors. Nowadays, Deep learning (DL) algorithms have the advantage of automatic learning features without explicit feature extraction process, leading the automatic classification and recognition of ECG signals into a new development path [1][2][3]. Therefore, DL algorithms applies the classification and recognition of ECG signals to improve the accuracy of classification and recognition of ECG signals. ECG classification signals are an important research topic in the process of fusion of medicine and computer technology, and the key is to extract the effective features of ECG signals more accurately [4][5][6]. The traditional ECG classification method is to extract and select features first, and then classify them. However, this method relies heavily on manual work and cannot fully mine the deep features hidden in many ECG signals. The main research contents are as follows:(1) ECG signal preprocessing based on wavelet adaptive threshold denoising. To filter out the noise in the ECG signal, this paper proposes a wavelet denoising method based on an adaptive threshold, which can dynamically adjust the threshold of different decomposition scales [7][8][9]. The basic steps are as follows: first, the sym8 wavelet function is used to decompose the ECG signal on eight scales, then the approximation coefficients on the eighth scale and the detail coefficients on the first scale are directly set to zero, and the self-proposed method proposed in this paper is used on the remaining scales [10][11]. The adaptive threshold and soft threshold function process the wavelet coefficients and reconstruct the signal, and finally, obtain the demised signal. (2) ECG signal classification model based on DL. We can identify symptoms of cardiac disease processes (detect abnormal heart rhythm or cardiac abnormalities) by examining changes in normal ECG signals [2][12][13]. The ECG of normal hearts has a characteristic shape. The main part of the ECG contains a P wave, QRS complex, and T wave. Each part of this signal is indicated as:

Atrial depolarization is indicated by a P wave.
The QRS complex is made up of three waves that represent ventricular depolarization: The Q wave, the R wave, and the S wave.
The T wave denotes ventricular repolarization.

Due to difficulties with the classification process, classifying ECG signals is a difficult problem. Lack of standardization of ECG features, variability among ECG features, individuality of ECG patterns, absence of ideal classification methods for ECG classification, and variability in patient ECG waveforms are major problems [2] in ECG classification [14][15][16]. Another challenge in the classification of ECG arrhythmias is coming up with the best classifier that can classify arrhythmia in real-time. Applications of ECG signal categorization include the more accurate diagnosis of a new patient and the detection of different forms of abnormalities. Patients with heart disease can also be diagnosed and treated using it [17][18][19]. Preprocessing, feature extraction, feature normalization, and classification are the four key phases in the categorization of an ECG. The clinical diagnosis of cardiac disease mostly relies on the classification of ECG signals. The biggest issue with utilising an ECG to diagnose heart disease is that each person's normal ECG may be different, and occasionally different individuals' ECG signals will exhibit different symptoms of the same condition. Additionally, the effects of two separate disorders on healthy ECG signals may be nearly identical. The diagnosis of cardiac disease is complicated by these issues. Thus, the use of pattern classifier algorithms can enhance the ECG arrhythmia diagnosis of the new patient. The categorization of ECG signals has drawn the attention of numerous researchers. They have used several classifiers, feature extraction methods, and preprocessing methods [20][21]. For ECG categorization, most researchers used the MIT-BIH arrhythmia database. Dallali et al. [3] used a DWT to extract the RR interval and Z score to normalize the RR interval. They categorized ECG beats using FCM. They had an accuracy rate of 99.05 percent. The RR interval and R point position were retrieved as features in [4] using DWT. FCM was used for reclassifications, and 3-layer MLPNN for the final classification. 99.99 percent accuracy was attained.The remainder of the essay is organized as follows: ECG Classification strategies are discussed in Section 2, the proposed ECG Classification technique is discussed in Section 3, and the experimental findings are discussed in Section 5.

In [5], neural networks and fuzzy logic are used to categorise heart rate. Heart Rate Variability is used to categorise heart rates (HRV). This HRV has some properties that are taken from it, and these features are used as inputs to the fuzzy equivalence and neural networks for categorization. [6] uses characteristics to categorise the ECG waveform into normal and pathological states and provides this information as the input for classification. Artificial Neural Networks (ANNs) and Linear Discriminant Analysis (LDA) were used to classify this data (ANN). In this, ANN was used to do multilayer perception. The author of this study explains why the MLP of the ANN Network produces superior results to the LDA classification. The MLP neural network was once more employed in [7] to distinguish between normal and abnormal heartbeats. To identify and categorize normal and pathological heartbeats, artificial neural networks with adaptive multiple preceptors are used [7][22].

In this study, the author uses 12 separate aberrant heartbeats for categorization. Removing the nonlinear background noise is another goal of the study. In this, multilayer perception was accomplished using ANN. The study's author discusses why the Multi-Layer Perceptron (MLP) of the ANN Network outperforms the Linear discriminant analysis (LDA) classification in terms of outcomes. In [7], the MLP neural network was used once more to differentiate between regular and irregular heartbeats. Artificial neural networks with adaptive multiple preceptors are utilised to distinguish between normal and abnormal heartbeats and classify them [7]. For categorization in this study, the author employs 12 different aberrant heartbeats. Another objective of the study is to eliminate the nonlinear background noise. Principal Component Analysis (PCA) employs a variety of neural network topologies to identify and categorise heartbeats, as shown in [9]. The outcomes of this research have been contrasted with other neural network architectures to determine which neural network structure is most effective for classifying types of arrhythmias. To identify ischemia arrhythmia episodes in the ECG data, a neural network was put into use [10]. The input to the network was decreased thanks to the author's usage of the PCA for dimensional reduction. The research [10] indicates that the new result from the ESC ST-T database exceeds the prior one. In [11] They employed various kinds of multilayer neural networks as classifiers to identify the two types of ECG patterns. To reduce the dimensionality, PCA is used in this research to categorize the ECG data using several techniques [12]. This study demonstrates that the PCA neural network, when combined with the Fuzzy Clustering Algorithm (FCA), performs superior classification than the PCA neural network, Fuzzy C-means (FCM) neural network, and Wavelet neural network. [12] conducted a comparison of various methods for heart arrhythmia identification based on neural network, fuzzy cluster, wavelet transform, and principal component analysis. The K-Nearest technique was used as a classification to find the QRS waveform in [13]. High accuracy was attained in this study by classifying the data and calculating the cumulative from the R peak of the ECG signal. [14] reported the classification performance of an electrocardiogram (ECG) feature extraction stage for the detection of irregular beats using an artificial classifier [23][24]. Different feature sets are generated depending on the ECG shape and RR intervals. Kohonen Self-Organizing Maps (SOMs) were used by Configuration to analyse and cluster signal characteristics. Using the data from the records suggested by the ANSI/AAMI EC57 standard, the classifier was created using the SOM and Learning Vector Quantization (LVQ) techniques [25][26]. Their study contrasts two methods for categorising annotated QRS complexes: one based on the original ECG morphological features and the other, a brand-new strategy based on features that have been preprocessed. The preprocessing of the ECG signal utilised a mathematical morphological filter. The MIT-BIH Arrhythmia Database was used to assess the algorithm's performance in accordance with the AAMI guidelines [27]. This approach produced either normal or arrhythmic findings for the recognition of beats. Wavelet transformation and neural network characteristics are used to extract the ECG signal from [15]. Wavelet decomposition features are first extracted and used as the classification input. For classification, neural networks are employed. These attributes enabled the artificial neural network to learn the different diseases in the ECG data and achieve an accuracy of 92%.

A model learns to carry out classification tasks directly from images using a technique called DL. A Convolutional Neural Network (CNN) automatically learns features and sends them to a classifier for classification. A neural network for image identification is called CNN. The feature extractor is used by CNN during training [29]. It consists of pooling layers, dropout layers, fully connected layers, and convolutional layers followed by an activation function. A regularisation method for lowering overfitting uses the insertion of a dropout layer [16–18]. We examine the CNN-based image classification technique for ECG in this paper. An image of the ECG signal is produced using a spectrogram of the signal.

3. 1. CNN

The CNN algorithm consists of several convolution (CNV) operations followed by the image sequentially, which is followed by a pooling operation (PL) to generate the neurons feed into the fully connected (FC) layer. The input of CNV is typically 2D image data.

Convolutional Layer (CNV Layer): Convolution is a technique for filtering incoming data in general. To create the features, the 2D filters process data at several stages. The coefficients of these filters, which are pre-defined, are calculated during training. Basically, a backpropagation algorithm with gradient descent is used for training [5]. A 2D convolution operation is applied to 2D data. We have multiple CNV layers in the CNN image and the input to these CNV layers is called the input feature map, and the output of the CNV layers is called the output feature map. At the very first CNV layer, the input image with its every component becomes an input feature map with 3D data. This layer contains both the activation operation that is applied to each element after convolution and the subsampling of the data after activation. An activation operation is typically a nonlinear operation that is applied to each element of the convolution output, such as max (0,) (which is also called a Rectifier Linear Unit) or 1 (1 + ex), etc where x is the input data. The activation operation does not change the size of the input. Subsampling is applied after activation that reduces the size of the input to typically 12 of its original dimensions. The window size that is used during subsampling is also 2D, such as 2x2 or 3x3. If the input data size is (W, H), then the size after pooling will be (W/2, H/2) if 12 subsampling is used.

Fully Connected (FC) Layer: By integrating the output of the CNV layer with various weights, this layer will minimise the size of input data to the size of classes that the CNN is trained for. Like the CNV layer, the FC layer uses a backpropagation method to calculate the weight of these taps.
This is the final layer of the CNN that converts the output of FC to the probability of each object being in a certain class. Typically, soft-max types of algorithms are used in this layer. Softmax output can be calculated as follows:

$$P\left(y=j|x\right)=\frac{{e}^{{x}^{T}{w}_{j}}}{\sum _{k=1}^{K}{e}^{{x}^{T}{w}_{k}}}$$

Convolutional networks are like neural networks where neurons are replaced by the convolutional operation at the initial layers. We can perform such a replacement if we consider the focus on similar features at different positions in the image, which is achieved by convolutional filters. In the later stages, neurons in the neural network are also used in the fully connected layer of the convolutional network.

3.2. Proposed CNN Model

Six CNV levels come first, followed by six max-pooling layers in our suggested model. The last method is global average pooling. The model summary for each layer and its output shape are displayed in Table 1. At 224 224, images are imported. For layers 1, 2, 3, 4, 5, and 6, the number filters are 16, 32, 64, 128, 256, and 512, respectively. Finally, the classification choice is made using a dense layer with a size of 26.

The proposed DL model is built with Python 3.5, Tensorflow, and Keras. This model was carried out on the MIT-BIH arrhythmia database. Figures 6 and 7 show an observation of accuracy and loss along epochs of the training phase. There are 100 epochs set for training. Because of its 100-percent accuracy, this model is suitable for use in ECG applications. From the above experimental results, the loss function proposed in this paper obtains a high AUC on both S and V events, which proves that the new loss function has a positive effect on class imbalance. With the introduction of 5-minute segments, the sensitivity of this paper on S-type events has been greatly improved compared with the literature; the sensitivity on V-type events is also higher than that of the existing algorithm.

4.1 MIT-BIH arrhythmia database

The MIT-BIH Arrhythmia Database includes 48 half-hour-long samples of two-channel Holter recordings from 47 people who were examined between 1975 and 1979 by the BIH Arrhythmia Laboratory. The remaining 25 recordings came from the same group and were chosen to include less common but clinically significant arrhythmias that are not well represented in small random samples. A total of 4000 24-hour Holter recordings were collected from inpatients (60%) and outpatients (40%) at Beth Israel Hospital in Boston. 23 recordings were randomly chosen from this group.

The training of neural networks requires many manually labeled data sets. The more samples that can be provided for learning, the better the real data distribution can be reflected, and the smaller the generalization error of the obtained model is. Labeling data requires huge costs. When enough training samples cannot be obtained, data augmentation is often considered. In two-dimensional image processing, the data can generally be amplified by adding noise, cropping, rotation, scale change, etc., but doing the same processing on one-dimensional ECG signals will introduce errors and even change the signal's own meaning. Therefore, in the case of insufficient and unbalanced samples, this paper improves the loss function, increases the weight of the few-sample category, and increases the weight of the easy-to-misclassify sample category, so that the network can focus on the learning of the minority and difficult-to-identify classes. Negative effects of class imbalance at present, most of the literature generally has low sensitivity to S-type events, and when performing multi-classification tasks, S-type samples are easily mistakenly classified as N-type events. On the one hand, it is because the sample size of class S is too small to reflect the overall true distribution; on the other hand, the samples of class S in the training set DS1 are not typical enough, which is different from the distribution of samples of class S in the test set DS2 or the real distribution of samples of class S. The difference is large and cannot be generally representative; and the waveforms of abnormal supraventricular beats and normal-like beats are highly similar, resulting in a lot of overlap in the distribution of these two types of samples, which are easy to misclassify. These problems all bring certain difficulties to the classification task. This paper overcomes the problem of neural network degradation caused by the increase in layers by improving the training method, network architecture, and combining the two identity mappings in residual networks and dense networks so that the model can learn deeper features. To a certain extent, the performance of S-type events has improved. However, it can be seen from the experimental results that the method used in this paper still has room for improvement and improvement. Considering that the nature of the ECG signal is a time series, the state at a certain time point is not independent, but also related to the previous output. So, CNN reflects more of the morphological information of ECG than the long-short memory network can reflect the relationship of the sequence in the time domain, so it has obvious advantages in dealing with time series problems. In the follow-up work, this paper will combine CNN and long-term memory networks to build a network and explore the impact of this combination on arrhythmia classification. The heartbeat-based end-to-end arrhythmia classification method proposed in this paper combines artificial intelligence with ECG signal classification and recognition, achieves the classification task well, and has a good ability to identify class S and V arrhythmia events. It also achieves high sensitivity and provides a new technical reference scheme for the automatic classification of arrhythmias.

In this paper, manual reading of electrocardiograms (ECGs) is time-consuming and labor-intensive, and diagnostic errors are unavoidable. To reduce the cost of clinical diagnosis, how to analyze ECG effectively and accurately with the help of computers has become a research hotspot in the field of biological signals. However, there are many types of arrhythmias, and there are individual differences in the ECG of different patients when an arrhythmia occurs. Automatic detection and classification are still technical difficulties in the field. Many scholars have combined ECG analysis with DL, which has greatly promoted the development of ECG analysis algorithms. Some studies have proposed an ECG anomaly detection method based on an artificial neural network, which first extracts the ECG waveform as feature information, and then trains the model. However, the fitting ability of various ECG waveform extraction methods is relatively general, and there are certain limitations. In order to better reflect the intrinsic properties of the samples, some studies have used neural networks to automatically learn the characteristics of features and have used convolutional neural networks to detect normal and abnormal ECG heartbeats, with an average detection accuracy of 99.22%.

Competing interests: The authors declare no competing interests.

Y. N. Singh, S. K. Singh, and A. K. Ray, "Bioelectrical signals asemerging biometrics: Issues and challenges," ISRN Signal Processing,pp. 1-13, 2012.
M. E. A. Bashir et al., "Highlighting the current issues with pridesuggestions for improving the performance of real time cardiac health monitoring," Inform. Technology in Bio-and Medical Informatics,ITBAM, Springer Berlin Heidelberg, pp. 226-233, 2010.
Hassan, Esraa, et al. "The effect of choosing optimizer algorithms to improve computer vision tasks: a comparative study." Multimedia Tools and Applications (2022): 1-43.
A. Dallali, A. Kachouri, and M. Samet, "Classification of Cardiac Arrhythmia Using WT, HRV, and Fuzzy C-Means Clustering," SignalProcessing: An Int. J. (SPJI), vol. 5, no. 3, pp. 101-109, 2011.
A. Dallali, A. Kachouri, and M. Samet, "Fuzzy c-means clustering,Neural Network, wt, and Hrv for classification of cardiac arrhythmia," ARPN J. of Eng. and Appl. Sci., vol. 6, no. 10, pp. 112-118,2011.
Hassan, Esraa, et al. "COVID-19 diagnosis-based deep learning approaches for COVIDx dataset: A preliminary survey." Artificial Intelligence for Disease Diagnosis and Prognosis in Smart Healthcare (2023): 107.
Rajendra Acharya, U., Subbanna Bhat, P., Iyengar, S.S. 2003. Ashok Rao and Sumeet Dua., Classification of heart rate data using artificial neural network and fuzzy equivalence relation”, Pattern Recognition 36 (2003) 61 – 68.
Hassan E, El-Rashidy N, Talaat FM (2022) Review: Mask R-CNN Models. https://doi.org/10.21608/njccs.2022.280047.
Alexakis, C., Nyongesa, HO., Saatchi, R., Harris, ND., Davies, C., Emery, C., Ireland, RH and Heller SR. 2003. Feature Extraction and Classification of Electrocardiogram (ECG) Signals Related to Hypoglycemia”, Conference on computers in Cardiology, pp. 537-540, IEEE.
E. Hassan, M. Y. Shams, N. A. Hikal and S. Elmougy, “A novel convolutional neural network model for malaria cell images classification,” Computers, Materials & Continua, vol. 72, no. 3, pp. 5889–5907, 2022.
Hu, Y.H., Tompkins, W.J., Urrusti, J.L and Afonso, V.X. 1993. Applications of artificial neural networks for ECG signal detection and classification, J. Electro cardiology, vol. 26 (Suppl.), pp. 66-73.
Talaat, Fatma M., and Esraa Hassan. "Artificial Intelligence in 3D Printing." Enabling Machine Learning Applications in Data Science: Proceedings of Arab Conference for Emerging Technologies 2020. Springer Singapore, 2021.
Xiaomin, Xu. and Ying, Liu. 2004. ECG QRS Complex Detection Using Slope Vector Waveform (SVW) Algorithm, Proceedings of the26th Annual International Conference of the IEEE EMBS, pp. 3597-3600.
Gamel, S.A., Hassan, E., El-Rashidy, N. et al. Exploring the effects of pandemics on transportation through correlations and deep learning techniques. Multimed Tools Appl (2023). https://doi.org/10.1007/s11042-023-15803-1
Silipo, R and Marchesi, C. 1998. Artificial neural networks for automatic ECG analysis, Signal Processing 1998. 46; 1417-1425.
Papaloukas, C and Fotiadis, D.I. 2002. An ischemia detection method based on artificial neural network, Artificial Intelligence in Medicine 2002; 24: 167-178.
Foo, SY and Stuart, G. 2002. Neural network-based ECG pattern recognition, Engineering Applications of Artificial Intelligence 2002; 15: 253-260.
Hassan, Esraa, et al. "Breast Cancer Detection: A Survey." Artificial Intelligence for Disease Diagnosis and Prognosis in Smart Healthcare. CRC Press, 2023. 169-176.‏
Ceylan, R and Ozbay, Y. 2007. Comparison of FCM, PCA and WT techniques for classification ECG arrhythmias using artificial neural network, Expert Systems with Applications, 286-295.
Schreicr, G., Kastner, P. and Marko, W. 2001. An Automatic ECG Processing Algorithm to Identify Patients Prone to Paroxysmal Atrial Fibrillation, IEEE Computers in Cardlology, vol. 28, pp. 133- 135.
Tadejko, P and Rakowski, W. 2007. Mathematical Morphology Based ECG Feature Extraction for the Purpose of Heartbeat Classification, 6th International Conference on Computer Information Systems and Industrial Management Applications, CISIM '07, pp. 322-327
Tayel, M.B and El-Bouridy, M.E. 2006. ECG Images Classification Using Feature Extraction Based on Wavelet Transformation And Neural Network, ICGST, International Conference on AIML.
LeCun, Y., Bottou, L., Bengio, Y., & Haffner, P. (1998). ''Gradient-based learning applied to document recognition''. Proceedings of the IEEE, 86(11), 2278-2324.
Ranzato, M. A., Huang, F. J., Boureau, Y. L., & LeCun, Y. (2007). Unsupervised learning of invariant feature hierarchies with applications to object recognition. In Computer Vision and Pattern Recognition, 2007. CVPR'07. IEEE Conference on (pp. 1-8). IEEE.
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., & Salakhutdinov, R. (2014).'Dropout: A simple way to prevent neural networks from overfitting’. The Journal of Machine Learning Research, 15(1), 1929-1958.
https://www.python.org/downloads/release/python-350/
Hassan, E.; Elmougy, S.; Ibraheem, M.R.; Hossain, M.S.; AlMutib, K.; Ghoneim, A.; AlQahtani, S.A.; Talaat, F.M. Enhanced Deep Learning Model for Classification of Retinal Optical Coherence Tomography Images. Sensors 2023, 23, 5393. https://doi.org/10.3390/s23125393

Download PDF

Version 1

posted

You are reading this latest preprint version

Electrocardiogram Signal Classification Based on Deep Learning Techniques

Status:

Version 1

Abstract

Figures

1. Introduction

2. ECG Classification techniques

3. Proposed ECG Classification technique

3. 1. CNN

3.2. Proposed CNN Model

4. Experimental Results

5. Conclusion

Declarations

References

Status:

Version 1