A Pooling Convolution Model for Multi-classification of ECG and PCG Signals

doi:10.21203/rs.3.rs-3107018/v1

Download PDF

Research Article

A Pooling Convolution Model for Multi-classification of ECG and PCG Signals

https://doi.org/10.21203/rs.3.rs-3107018/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Cardiovascular disease has always been one of the major threats to human health. The recognition of ECG and PCG signals, which are physiological signals generated when the heart beats, can greatly improve the efficiency of cardiovascular disease detection using deep learning techniques. Based on this, it is proposed that a simple and effective pooling convolution model for multi-classification of ECG and PCG signals. Firstly, ECG, PCG and synchronized ECG-PCG data are pre-processed. Then, several structure blocks are designed, including the block stacked by max-pooling layer(MCM), convolution layer and max-pooling layer, its multiple variants and the residual block(REC). All network models that use these structure blocks separately are based on the ResNet-18 framework. By changing the number of structure blocks, the model can be applied to ECG and PCG data at different sampling rates. The final test shows that the accuracy of the model using the MCM structure block is 98.70%, 92.58% and 99.19% respectively on the ECG, PCG fusion dataset and synchronized ECG-PCG dataset, which is higher than all the networks using its multiple variants. At the same time, the accuracy is improved by 0.02%, 4.30% and 1.43% compared to the model using the REC structure block. In addition, this work also carries out tests on multiple ECG and PCG datasets and compares them with other published literature, further verifying that the model using the MCM structure block has a higher detection rate for ECG and PCG signals.

Cardiovascular disease

Deep learning

ECG and PCG signal

Signal processing and classification

IN recent years, the number of patients with cardiovascular disease has been gradually increasing, especially sudden cardiac death due to stress-related factors. Simple and effective diagnosis of cardiovascular disease is an urgent technology in the field of cardiac health [1]. Single-mode signal diagnosis methods are widely used in clinical medicine, such as electrocardiogram (ECG), phonocardiogram (PCG), ultrasonic cardiogram (UCG) and other methods as routine means of diagnosing and treating heart diseases. And multimodal signal detection involves the accurate classification of diseases using synchronized ECG-PCG signals or other synchronous physiological signals [2].

A. Background

In this study, we mainly discuss and analyze the single mode and synchronous signals of ECG and PCG. ECG signals usually include single-lead, three-lead and even multi-lead signals. A model using ECG multi-lead signal for disease classification is to fuse several standard single-lead data for network training and compare with the unfused data. Finally, the experiment shows that the classification accuracy can be improved by fusing the multi-lead data [3]. A multi-channel PCG detection network uses multi-channel PCG signals to classify coronary artery disease. The classification accuracy of multi-channel signals is improved by 4% compared to single-channel signals [4]. However, there is still a lack of relevant research on the fusion and training of multi-channel ECG and PCG data, and the current synchronized ECG-PCG dataset is smaller, so the research on it is still at a slow development stage.

B. Related work

Currently, the use of ECG or PCG signals to identify and classify cardiovascular disease has been widely studied. For example, Baygin et al. used feature extraction based on homeomorphically irreducible tree graph pattern and feature generation based on maximum absolute pooling to finally classify grade 7 and grade 4 arrhythmia on a large ECG dataset, the accuracy was 92.92% and 97.18%, respectively [5]. Wang et al. used convolution with different kernel sizes to extract multidimensional features from ECG signals, and finally used the max-pooling layer for feature screening, achieving 99.06% accuracy on the MITDB [6]. Ullah et al. proposed a two-dimensional (2-D) convolutional neural network (CNN) model to classify ECG signals into eight classes, and the final average accuracy on the MITDB was 99.11% [7]. Tuncer et al. proposed a graph-based feature generator and applied it to the classification of PCG signals. Finally, the accuracy of five categories of PCG reached 100% [8]. Zeng et al. proposed a PCG classification method based on mixed signal processing and deterministic theory, and finally achieved 97.75%, 98.69% and 98.48% accuracy on several databases [9]. Ghosh performed time-frequency analysis on the signal and used the multi-class composite classifier to classify the three types of PCG, namely AS, MS and MR, with sensitivities of 99.44%, 98.66% and 96.22% [10].

At the same time, many scholars have improved the classification accuracy of cardiovascular diseases by using synchronized ECG-PCG signals. Li used 1-D and 2-D convolutional networks combined with the time-frequency characteristics of ECG and PCG signals to discriminate coronary artery disease, and finally achieved 96.51% accuracy [11]. Chakir used several machine learning methods to verify that using synchronized ECG-PCG signals can achieve higher accuracy than using PCG signals alone [12]. The complete summary is shown in Table 1.

Table 1 - Previous classification approaches for ECG,PCG and synchronized ECG-PCG signals

Ref	First author surname		Description
1	Baygin et al [5]	Using feature extraction and generation on the ECG dataset to for classification
2	Wang et al [6]	Using CNN with different kernel sizes to extract features from ECG signals
3	Ullah et al [7]	Proposing a two-dimensional CNN for 8 classification of ECG signals
4	Tuncer et al [8]	Proposing a graph-based feature generator to classify PCG signals
5	Zeng et al [9]	Designing a PCG classification method based on mixed signal processing and deterministic theory
6	Ghosh et al [10]	Time–frequency analysis is carried out, and three types of PCGs are classified using a multi class composite classifier
7	Li et al [11]	Using 1-D and 2-D CNN, combining the time-frequency characteristics of ECG and PCG signals to differentiate disease
8	Chakir et al [12]	Using several machine learning methods to verify that using synchronized ECG and PCG signals can improve accuracy

C. The objectives of this work

In conclusion, 1-D and 2-D convolutions have been widely used in ECG and PCG signal classification, but the feature extraction ability of the pooling layer has not been significantly explored. At the same time, it is necessary to test the classification effect of merging multiple ECG or PCG databases. And it is also valuable to identify and compare synchronized ECG-PCG data with single-channel ECG or PCG data. As a result, this study proposes the structure block (MCM) stacked by max-pooling layer, convolution layer and max-pooling layer, integrates multiple ECG and PCG datasets, and compares the differences in classification performance between synchronized ECG-PCG data and single-channel ECG or PCG data, as well as the effect of different arrangement of ECG and PCG signals on the classification results.

D. Contribution of our work

The main contributions of our work are as follows:

(1) The MCM structure block is proposed, and several network models are constructed based on the ResNet18 network. Finally, compared with the model using residual block (REC) and other structure blocks, the model using the MCM structure block achieves higher accuracy on several datasets.

(2) Several ECG and PCG datasets are fused, and model training and test are completed for the datasets before and after fusion, which verifies that the dataset formed by multi-database fusion can also be used for disease classification research.

(3) A series of comparison tests have been conducted using the synchronized ECG-PCG dataset, and it has been verified that compared with ECG or PCG data, their synchronized data can obtain richer classification features.

A. ECG databases

We chose the ten databases described in Table 2 after reviewing several ECG databases in turn, including the database abbreviation, full name, and categories used in this work.

Table 2 – Each ECG database and category

Database	Category
MIT-BIH Supraventricular Arrhythmia Database (SVDB) [13, 14]	NB, V, S, \|
MIT-BIH Long Term ECG Database (LTDB) [13]	NB, S, V, F
PAF Prediction Challenge Database (AFPDB) [15]	NB, PAF
MIT-BIH Arrhythmia Database (MITDB) [13, 16]	NB, L, R, V, /, A, +, F, ~
MIT-BIH ST Change Database (STDB) [13, 17]	NB, V, S, ~
European ST-T Database (EDB) [18]	NB, ~, V, +, s, T, S, F
Sudden Cardiac Death Holter Database(SDDB) [19]	NB, r, S, V, s, +, E
MIT-BIH Normal Sinus Rhythm Database (NSRDB) [13]	NB
MIT-BIH Atrial Fibrillation Database(AFDB) [13, 20]	AF
AHA Database Sample Excluded Record (AHADB) [13]	NB, V

Table 3 – Full name and quantity of each ECG category

Category	TR	VA	TE
Normal beat (NB)	172995	57666	57668
Paroxysmal atrial fibrillation (PAF)	82382	27461	27461
Atrial fibrillation (AF)	414000	138000	138000
Premature ventricular contraction (V)	27899	9301	9304
Premature or ectopic supraventricular beat (S)	58750	19582	19587
MA (A, E, L, R, T, r, s, +, /, \|, ~)	60449	20153	20157
Atrial premature contraction (A)	1510	504	504
Ventricular escape beat (E)	247	83	83
Left bundle branch block beat (L)	4813	1605	1605
Right bundle branch block beat (R)	4312	1438	1438
T-wave change (T)	793	265	265
R-on-T premature ventricular contraction (r)	35275	11759	11759
ST change (s)	2788	929	930
Rhythm change (+)	1729	576	577
Paced beat (/)	2150	717	718
Isolated QRS-like artifact (\|)	1315	438	439
Signal quality change (~)	5517	1839	1839

As shown in Table 2, NSRDB and AFDB have only one category in these databases. At the same time, we eliminated only a few disease categories. Table 3 shows the full name and number of each ECG category.

In Table 3, because the number of the last 11 categories is relatively less, we combine them into one category, which will be referred to in the text as multiple abnormality (MA). There are 6 categories of ECG data after the final fusion. TRS, VAS and TES are the samples of training set, validation set and test set respectively. In addition to the learning rate, the rest of the hyperparameters we use in the training process are the standard parameters, which will be conducive to the subsequent comparison between different structural blocks. The division ratio of training set, validation set and test set in this work is 3:1:1, and the PCG dataset in Table 4 and the synchronized ECG-PCG dataset in Table 6 are consistent with the division ratio. At the same time, due to the noise and interference involved in the collection process of each data, it is necessary to filter them before data segmentation.

For each ECG dataset, we need to intercept ECG segments based on R-wave points and sampling rate. In this study the sampling rate was unified to 200 Hz and the dataset consisting of multiple ECG data fusions is called ECGF.

B. PCG databases

Consistent with the processing of ECG datasets, NHS is present in multiple databases in Table 4, so the remaining categories are merged into one category after data fusion. The full name and number of each category is shown in Table 5.

Table 4 – Each PCG database and category

Database	Category
Classification of Heart Sound Recordings: The PhysioNet in Cardiology Challenge 2016 (C2016) [13, 21]	NHS, AHS
Classification of Heart Sound Signal Using Multiple Features (Y2018) [22]	NHS, AS, MS, MR, MVP
A machine learning challenge to classify heart beat sounds (K2016) [23]	NHS, ECS, CM, AHS
Heart Sound & Murmur Library (Mfour) [24]	NHS, APA, AA, AMA

Table 5 – Full name and quantity of each PCG category

Category	TR	VA	TE
Normal heart sound (NHS)	11585	3863	3864
AHSS (AHS, AS, MR, MS, MVP, ECS, CM, FHS, APA, AA, AMA)	5515	1840	1841
Abnormal heart sound (AHS)	3380	1127	1127
Aortic stenosis (AS)	120	40	40
Mitral regurgitation (MR)	120	40	40
Mitral stenosis (MS)	120	40	40
Mitral valve prolapse (MVP)	120	40	40
Extra cardiac sound (ECS)	217	73	73
Cardiac murmur (CM)	554	185	185
Artificial heart sound (FHS)	192	64	64
Abnormal pulmonary artery (APA)	162	54	54
Abnormal aorta (AA)	109	37	37
Abnormal mitral valve (AMA)	421	140	141

In Table 5, 11 types of PCG signals are integrated as abnormal heart sounds (AHSS). To split the PCG data, first the sampling rate of these data must be unified to 2000Hz, then the data segment with the length of 6000 is extracted, and finally the length is reduced to 2000. The dataset resulting from the fusion of several PCG data is called PCGF.

C. Synchronized ECG-PCG database

Table 6 – Synchronized ECG-PCG database and category

Database	Category
EPHNOGRAM: A Simultaneous Electrocardiogram and Phonocardiogram Database (EP2021) [25]	RS, RL, EP, ESW, EBP, EW, EBS

All synchronized ECG-PCG data are divided into 7 categories according to the marked acquisition status. Database information is shown in Table 6. The full names of the different categories and data division are listed in Table 7. Among them, RS and RL are the data collected under different rest states, EP, ESW, EBP, EW and EBS are under different motion states.

Table 7 – Full name and quantity of each synchronized data category

Category	TR	VA	TE
Rest: sitting on armchair (RS)	1123	374	375
Rest: laying on bed (RL)	2280	960	960
Exercise: pedaling a stationary bicycle(EP)	4680	1560	1560
Exercise: slow walk (7 min); fast walk (8 min); sit down and stand up (4 min); slow walk (6 min); rest (ESW)	1080	360	360
Exercise: Bruce protocol treadmill stress test (EBP)	3960	1320	1320
Exercise: walking at constant speed (3.7 km/h) (EW)	4680	1560	1560
Exercise: bicycle stress test (EBS)	3600	1200	1200

D. Data preprocessing

ECG and PCG, as important research objects in biomedical signals, are receiving more and more attention from academia and industry [26]. The advantages of the stable wavelet transform over other denoising methods have been demonstrated [27]. In this study, the stable wavelet method is used to denoise ECG and PCG signals. First, the sym8 wavelet basis function is selected to decompose the signal into 8 layers and denoise these decomposition coefficients. These layers are then reconstructed. Finally, the effects of the ECG and PCG signals before and after denoising are shown in Fig. 1. The grey curve is the original unfiltered data, while the red curve is the waveform drawn using the filtered data. From the comparison, it can be seen that the baseline drift, power frequency interference and other problems are eliminated after the ECG waveform is filtered. The comparison also shows that the filtered PCG waveform removes some of the noise and better preserves the information contained in the original waveform.

E. Our methods

The residual network is a milestone breakthrough in the field of deep learning. Due to its good performance, it has also been widely used in the field of intelligent recognition of medical data. The residual network adopts the idea of short connection, which effectively delays the degradation of the network [28]. In the exploration and research of applying ECG and PCG signals to the classification of cardiovascular diseases, the models designed by many scholars have achieved great success. The pooling layer is generally used to down-sample the input data [29], while the max-pooling layer can extract the main features of the current data, and at the same time, it can ensure that the input and output sizes are consistent by setting parameters. Compared to the extensive use of the convolutional layer, the max-pooling layer is generally used less frequently in the network. Professionals often distinguish ECG and PCG signals based on the abnormality of a particular segment of data, and the pooling layer has an excellent ability to capture the abnormality of the data and ignore other unimportant information. For this reason, the MCM and REC structure block was constructed for comparative testing.

In Fig. 2, the MCM structure block consists of: max-pooling + convolution + max-pooling + batch normalization (BN) [30] + ReLU [31]. The REC structure block consists of: convolution + BN + ReLU + convolution + BN + ReLU. At the same time, the obtained results are added to the original input data before the second ReLU. For the change of network channel number and feature number, the MCM and REC blocks are consistent and processed, which can further improve the comparability. In the MCM structure block, remove the first max-pooling layer is MCM1 structure block, remove the second max-pooling layer is MCM2 structure block, remove both the two max-pooling layers is MCM3 structure block, add a max-pooling layer before the first largest pooling layer is MCM4 structure block, and add a max-pooling layer after the last max-pooling layer of MCM4 is MCM5 structure block. In this paper, MCM1, MCM2, MCM3, MCM4 and MCM5 structure blocks are collectively referred to as variants of the MCM structure block. The specific structure is shown in the Appendix.

Compared with REC, the MCM proposed uses the stacking of max-pooling layer and convolution layer, which greatly reduces the number of parameters and computation. The overall network framework and structure block repetition times are shown in Fig. 3.

Both the MCM and REC blocks are reused 4 and 8 times respectively in Fig. 3. This is due to the different sampling rates of the ECGF and PCGF datasets. When these two structure blocks are used four times, they correspond to two models: MCM-4 and REC-4. Similarly, MCM-8 and REC-8 can be obtained by using them eight times. It can be seen that the other structures in the nets are the same, except for the structure blocks and their usage times, which can also improve the contrast. If the MCM structure block is replaced by one of its variants, then ten network models such as MCM1-4, MCM1-8 can be obtained.

For this network framework, the input data is first passed through a convolution layer, BN layer, ReLU and max-pooling layer, then a structure block was used and the number of times it was used was determined. For the ECGF dataset, the structure block must be repeated 4 times. For the PCGF and synchronized ECG-PCG datasets, the structure block must be repeated 8 times. Finally, the adaptive avg-pooling layer and the linear layer are input and the classification results are output.

When the same data is input, the parameters and computation amount of the two nets of MCM-4 and REC-4 are (534918, 1.36G) and (175840, 4.29G) respectively. Similarly, the results for MCM-8 and REC-8 are (1582791, 12.86G) and (4201799, 33.49G) respectively. It can be seen that the network model built by the MCM structural block has only a third of the number of parameters and computations compared to the REC, which has greater advantages in terms of memory usage and computational speed.

Multiple ECG and PCG datasets are used to compare the network models developed by MCM and REC structure block, and all training and test results are included in Table 1-2 of the Appendix. We used all structure blocks to compare the accuracy changes during training and the test set results after training: REC, MCM, and its multiple variants. Finally, in Table 8-10, we only show the test set results of some networks. Tables 3-4 in the Appendix contain the complete test set results.

A. Evaluation criteria

In this work, the accuracy of each network model in the training set, validation set and test set is used to check whether the model has relative over-fitting problems and the generalization ability. The accuracy formula is as follows:

total samples represent two meanings, namely, the total number of samples in a dataset or category. Similarly, the number of samples correctly classified in a dataset or category is represented by correct samples.

B. Accuracy in ECGF and PCGF datasets

Table 8 – Accuracy in ECGF and PCGF dataset

Dataset	Network	Accuracy of single category						All
Dataset	Network	NB	PAF	AF	V	S	MA	All
ECGF	MCM-4	98.81	99.63	99.73	92.24	97.11	94.44	98.70
ECGF	REC-4	98.83	99.73	99.75	91.82	97.02	94.18	98.68
		NHS	AHSS
PCGF	MCM-8	94.24	89.05					92.58
PCGF	REC-8	90.11	84.04					88.28

In Table 8, "All" represents the accuracy of the network model for total data in the test set. Comparing the accuracy of the two networks for each ECG category, we can see that the recognition rate of MCM-4 network for NB, PAF, AF is slightly lower than that of REC-4, but the difference is not large, while the recognition rate of V, S, MA is higher. For the whole test set, the recognition accuracy of MCM-4 network is also slightly high. In the test set of PCGF, it can be seen that the accuracy of MCM-8 network is higher than that of REC-8 network for both categories, i.e. the extraction ability of MCM structure block for the important features contained in ECG and PCG signals is higher than that of REC structure block.

In the training process of the network, we perform 20 iterations for each training round and the same training 10 times in total. The purpose is to observe whether the network has high stability. The results are shown in Fig. 4.

On the ECGF dataset, the single training process of multiple networks built with various structural blocks is shown in the Fig. 4 (a) and (b), a total of 20 epochs. Among them, TR and VA represent the accuracy of training set and validation set, REC, MCM, MCM1, etc. are all structure blocks mentioned in the article, and their corresponding network models are REC-4, MCM-4, MCM1-4, etc. Similarly, the network framework is also used when testing the PCGF dataset and the synchronized ECG-PCG dataset, but the number of these structure blocks has changed from 4 to 8. Compared with the training set, the REC-4 network model has a higher accuracy, followed by MCM-4 and MCM1-4 networks, and finally the other networks. Compared with the accuracy of validation set, MCM-4 network is higher, followed by REC-4, MCM5-4, MCM4-4, MCM1-4, MCM2-4 and MCM3-4.

In Fig. 4 (c), TE represents the accuracy of the test set. Each network model is trained 10 times under the same parameters and tested on the test set. It can be seen that the accuracy of the test set is consistent with the order of the validation set, which also fully demonstrates that the MCM structure block is more conducive to strengthening the generalization of the network than its variants. At the same time, combining the accuracy of REC-4 network and MCM-4 network in Fig. 4 (a), it can be concluded that compared with MCM-4 network, REC-4 network has over-fitting problem, which further verifies the advantages of MCM structure block.

A comprehensive comparison of the test results of each model in the PCGF dataset in Fig. 5 shows that the REC-8 network has better accuracy in the training set, while the MCM-8 network is relatively low. However, in the validation and test sets, the accuracy of each network is consistent. MCM-8 and MCM4-8 have similar accuracy, and then from high to low are MCM5-8, MCM2-8, MCM1-8, REC-8 and MCM3-8 networks, and the highest accuracy appears in the test results of the MCM-8 networks. At this point, we can still come to the same conclusion as in Fig. 4, i.e. compared to the MCM-8 network, the REC-8 network has an obvious over-fitting problem. Compared with other networks, the MCM-8 network also has higher accuracy in the test set, which again confirms that the network using the MCM structure block has better generalization.

C. Accuracy in synchronized ECG-PCG database

In the synchronized data experiment, we used four different permutations of ECG and PCG signals. The first way is to arrange ECG and PCG data as two-channel data, with ECG as way is a sequence of alternating ECG and PCG data. The third way is to arrange ECG and PCG data as single channel data, with ECG in the front and PCG in the back, the fourth way is the sequence of changing ECG and PCG data. At the same time, ECG and PCG data alone are used as the fifth, sixth way. They are used to compare the MCM-8 and the REC-8 network.

The numbers under Arrangement correspond to the different arrangements of ECG and PCG data in Table 9. We can see that the third arrangement achieves the higher classification accuracy. However, regardless of the arrangement method, the classification accuracy of our MCM-8 network is much higher than that of the REC-8 network. Overall, arranging the synchronized data as single-channel data achieved the highest accuracy, and the results also confirmed that under the condition of standard data acquisition, using synchronized ECG-PCG data can achieve higher accuracy than using single ECG or PCG data.

Table 9 – Accuracy in synchronized ECG-PCG database

Arrange	Network	RS	RL	EP	ESW	EBP	EW	EBS	ALL
1	MCM-8	99.73	99.79	98.13	98.33	98.71	99.16	96.23	98.45
1	REC-8	100.00	99.58	95.34	98.87	96.90	97.28	92.82	96.58
2	MCM-8	100.00	99.48	98.38	97.25	98.70	98.73	96.78	98.42
2	REC-8	100.00	99.27	95.87	98.04	96.60	97.47	91.63	96.40
3	MCM-8	100.00	100.00	98.78	99.16	99.54	99.42	98.17	99.19
3	REC-8	100.00	99.58	96.81	99.15	97.64	98.98	95.03	97.76
4	MCM-8	100.00	100.00	98.59	98.61	99.32	99.17	98.33	99.06
4	REC-8	100.00	99.48	96.94	98.87	97.95	98.85	95.65	97.90
5	MCM-8	100.00	99.06	94.42	98.31	97.53	98.28	91.87	96.47
5	REC-8	100.00	98.74	93.13	97.48	95.20	97.66	89.69	95.23
6	MCM-8	99.73	99.58	90.77	88.35	92.86	96.21	90.77	93.80
6	REC-8	100.00	98.94	90.01	88.71	91.98	94.06	86.12	92.23

Next, we conduct a comparison test on the network models formed by each structure block according to the third arrangement of ECG and PCG.

Through comparison, it is not difficult to find that although we only use accuracy as an evaluation indicator for multiple comparison experiments, the classification accuracy of the model for each category in the dataset is detailed in Tables 8-9, which is crucial for evaluating the quality of the model. Because it can verify the generalization ability of the model in the case of unbalanced multi-category datasets. Obviously, our proposed MCM structure network has an excellent generalization ability compared to the REC structure network. To further validate the recognition ability of the model, we conducted training and testing on several ECG and PCG datasets. To further validate the effect of the type of structural blocks on the recognition ability of the model, we designed five structural blocks, MCM1, MCM2, MCM3, MCM4 and MCM5, and constructed a network based on ResNet18. Several ablation experiments were performed on ECGF, PCGF and ECG synchronization datasets, and the specific structural blocks and results are presented in the Appendix.

In Fig. 6, the accuracy of each network is relatively close in the training set, while it is consistent in the validation and test sets, and the accuracy of MCM-8 is higher than that of other networks. Therefore, we can see that compared to MCM-8, other networks have a relative overfitting problem. We can also see that MCM-8 has better generalization.

The following conclusions can be drawn from a comprehensive comparison with Fig. 4-6:

(1) Compared with using REC structure blocks, using MCM structure block in the networks have better anti-overfitting ability and excellent generalization.

(2) Compared with multiple variants of MCM structure block, itself has better classification performance, which also shows that the max-pooling layer and convolution layer in MCM structure block play a key role.

Table 10 – Classification accuracy of ECG and PCG Dataset

Database	Network	Accuracy	Number of categories
MIT-BIH (MITDB)	Rajkumar[32]	93.60	7
	Singh[33]	99.59	5
	Mougoufan[34]	93.62	2
	REC-4	99.56	6
	MCM-4	99.63	6
PhysioNet Challenge 2016 (C2016)	Alkhodari[35]	87.31	2
	Al-Issa[36]	93.76	2
	Karhade[37]	85.16	2
	REC-8	95.92	2
	MCM-8	97.14	2

As shown in Table 10, to compare the classification effect of ECG signals, we choose the MITDB dataset, which is commonly used in the field of ECG classification. However, since the number of each category in this dataset is not balanced, each scholar uses this data in a different way. When we use this dataset, because a small number of categories are grouped into one category, the dataset remains relatively complete. In the PCG dataset, we selected the C2016 dataset for comparison. This dataset only contains normal and abnormal heart sounds, so all the data in this database can be used. The network models used in this study that include the MCM structure block, namely MCM-4 and MCM-8, give the best results. The accuracy of REC-4, REC-8, MCM-4 and MCM-8 in Table 10 are taken from Table 1-2 in the Appendix.

There is little research on the classification of fusing multiple datasets, and the network models devised in most of the literature are relatively complex. Therefore, multiple ECG and PCG datasets were collected and fused into ECGF and PCGF datasets. Meanwhile, the synchronized ECG-PCG dataset collected under different states was also used for comparative experiments. And several structural blocks are designed: REC, MCM and its variants.

Finally, through the model training and testing of the ECGF and PCGF datasets, it is verified that the network with MCM structure block has better classification performance and recognition ability for the PCG signals compared to the recognition of the ECG signals. For the synchronized ECG-PCG data, six different ways are used, which verifies that the use of synchronized signals can achieve better classification performance than the use of separate ECG or PCG signals. The complete test set results of each network are shown in Table 3-4 of the Appendix, and the variants of the MCM structure block are shown in Fig.1 of the Appendix. In addition, several ECG and PCG data sets were also trained and tested. Please refer to Table 1-2 in the Appendix.

G.A. Roth, G.A. Mensah, C.O. Johnson, "Global burden of cardiovascular diseases and risk factors," J. Am. Coll. Cardiol., vol. 76, no. 25, pp. 2982-3021, Dec. 2020.
S.B. WOLFE, R.L. Popp, H. FEIGENBAUM, "Diagnosis of atrial tumors by ultrasound," Circulation, vol. 39, no. 5, pp. 615-622, May. 1969.
J. Park, J. An, J. Kim, et al., "Study on the use of standard 12-lead ECG data for rhythm-type ECG classification problems," Computer Methods Programs in Biomedicine, vol. 214, no. pp. 106521, Feb. 2022.
P. Samanta, A. Pathak, K. Mandana, et al., "Classification of coronary artery diseased and normal subjects using multi-channel phonocardiogram signal," Biocybernetics Biomedical Engineering, vol. 39, no. 2, pp. 426-443, Apr. 2019.
M. Baygin, T. Tuncer, S. Dogan, et al., "Automated arrhythmia detection with homeomorphically irreducible tree technique using more than 10,000 individual subject ECG records," Information Sciences, vol. 575, no. pp. 323-337, Oct. 2021.
H. Wang, H. Shi, X. Chen, et al., "An improved convolutional neural network based approach for automated heartbeat classification," Journal of Medical Systems, vol. 44, no. 35, pp. 1-9, Dec. 2020.
A. Ullah, S.M. Anwar, M. Bilal, et al., "Classification of arrhythmia by using deep learning with 2-D ECG spectral image representation," Remote Sensing, vol. 12, no. 10, pp. 1685, May. 2020.
T. Tuncer, S. Dogan, R.-S. Tan, et al., "Application of Petersen graph pattern technique for automated detection of heart valve diseases with PCG signals," Information Sciences, vol. 565, no. pp. 91-104, Jul. 2021.
W. Zeng, Z. Lin, C. Yuan, et al., "Detection of heart valve disorders from PCG signals using TQWT, FA-MVEMD, Shannon energy envelope and deterministic learning," Artificial Intelligence Review, vol. 54, no. pp. 6063-6100, Feb. 2021.
S.K. Ghosh, R. Ponnalagu, R. Tripathy, et al., "Automated detection of heart valve diseases using chirplet transform and multiclass composite classifier with PCG signals," Computers in biology medicine, vol. 118, no. pp. 103632, Mar. 2020.
H. Li, X. Wang, C. Liu, et al., "Integrating multi-domain deep features of electrocardiogram and phonocardiogram for coronary artery disease detection," Computers in biology medicine, vol. 138, no. pp. 104914, Nov. 2021.
F. Chakir, A. Jilbab, C. Nacir, et al., "Recognition of cardiac abnormalities from synchronized ECG and PCG signals," Physical Engineering Sciences in Medicine, vol. 43, no. pp. 673-677, Apr. 2020.
A.L. Goldberger, L.A. Amaral, L. Glass, et al., "PhysioBank, PhysioToolkit, and PhysioNet: components of a new research resource for complex physiologic signals," Circulation, vol. 101, no. 23, pp. e215-e220, Jun. 2000.
S.D. Greenwald, R.S. Patil, R.G. Mark, "Improved detection and classification of arrhythmias in noise-corrupted electrocardiograms using contextual information," IEEE, 1990, pp.461-464. [Online].
G. Moody, A. Goldberger, S. McClennen, et al., "Predicting the onset of paroxysmal atrial fibrillation: The Computers in Cardiology Challenge 2001," in Computers in Cardiology 2001, 2001, pp.113-116.
G.B. Moody, R.G. Mark, "The impact of the MIT-BIH arrhythmia database," vol. 20, no. 3, pp. 45-50, May-June. 2001.
P. Albrecht, ST segment characterization for long term automated ECG analysis, Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 1983.
A. Taddei, G. Distante, M. Emdin, et al., "The European ST-T database: standard for evaluating systems for the analysis of ST-T changes in ambulatory electrocardiography," vol. 13, no. 9, pp. 1164-1172, Sep. 1992.
S.D. Greenwald, The development and analysis of a ventricular fibrillation detector, Massachusetts Institute of Technology, 1986.
G. Moody, "A new method for detecting atrial fibrillation using RR intervals," Computers in Cardiology, vol. 10, no. pp. 227-230, 1983.
C. Liu, D. Springer, Q. Li, et al., "An open access database for the evaluation of heart sound algorithms," Physiological measurement, vol. 37, no. 12, pp. 2181, Nov. 2016.
G.-Y. Son, S. Kwon, "Classification of heart sound signal using multiple features," Applied Sciences, vol. 8, no. 12, pp. 2344, Nov. 2018.
E.F. Gomes, P.J. Bentley, E. Pereira, et al., "Classifying Heart Sounds-Approaches to the PASCAL Challenge," in HEALTHINF, 2013, pp.337-340.
R. Judge, R. Mangrulkar, "Heart sound and murmur library," 2015, [Online].
A. Kazemnejad, P. Gordany, R. Sameni, "An Open–Access Simultaneous Electrocardiogram and Phonocardiogram Database," bioRxiv, vol. no. pp. Jun. 2021.
N.C. Jones, P.A. Pevzner, "An introduction to bioinformatics algorithms," MIT press, 2004, [Online].
A. Kumar, H. Tomar, V.K. Mehla, et al., "Stationary wavelet transform based ECG signal denoising method," ISA transactions, vol. 114, no. pp. 251-262, Aug. 2021.
K. He, X. Zhang, S. Ren, et al., "Deep residual learning for image recognition," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp.770-778.
Z.J. Wang, R. Turko, O. Shaikh, et al., "CNN explainer: learning convolutional neural networks with interactive visualization," IEEE Transactions on Visualization Computer Graphics, vol. 27, no. 2, pp. 1396-1406, Oct. 2020.
S. Ioffe, C. Szegedy, "Batch normalization: Accelerating deep network training by reducing internal covariate shift," in International conference on machine learning, 2015, pp.448-456.
X. Glorot, A. Bordes, Y. Bengio, "Deep sparse rectifier neural networks," in Proceedings of the fourteenth international conference on artificial intelligence and statistics, 2011, pp.315-323.
A. Rajkumar, M. Ganesan, R. Lavanya, "Arrhythmia classification on ECG using Deep Learning," in 2019 5th international conference on advanced computing & communication systems (ICACCS), 2019, pp.365-369.
V. Singh, S. Tewary, V. Sardana, et al., "Arrhythmia detection-a machine learning based comparative analysis with MIT-BIH ECG data," in 2019 IEEE 5th International Conference for Convergence in Technology (I2CT), 2019, pp.1-5.
J.B.B. à Mougoufan, J.A.E. Fouda, M. Tchuente, et al., "Adaptive ECG beat classification by ordinal pattern based entropies," Communications in Nonlinear Science Numerical Simulation, vol. 84, no. pp. 105156, May. 2020.
M. Alkhodari, L. Fraiwan, "Convolutional and recurrent neural networks for the detection of valvular heart diseases in phonocardiogram recordings," Computer Methods Programs in Biomedicine, vol. 200, no. pp. 105940, Mar. 2021.
Y. Al-Issa, A.M. Alqudah, "A lightweight hybrid deep learning system for cardiac valvular disease classification," Scientific Reports, vol. 12, no. 1, pp. 1-20, Aug. 2022.
J. Karhade, S. Dash, S.K. Ghosh, et al., "Time–Frequency-Domain Deep Learning Framework for the Automated Detection of Heart Valve Disorders Using PCG Signals," IEEE Transactions on Instrumentation Measurement, vol. 71, no. pp. 1-11, Mar. 2022.

Appendix.docx

Download PDF

Version 1

posted

You are reading this latest preprint version

A Pooling Convolution Model for Multi-classification of ECG and PCG Signals

Status:

Version 1

Abstract

Figures

I. Introduction

A. Background

B. Related work

C. The objectives of this work

D. Contribution of our work

II. Methodology

A. ECG databases

B. PCG databases

C. Synchronized ECG-PCG database

D. Data preprocessing

E. Our methods

III. RESULTS AND ANALYSIS

A. Evaluation criteria

B. Accuracy in ECGF and PCGF datasets

C. Accuracy in synchronized ECG-PCG database

IV. DISCUSSION

V. CONCLUSION

References

Supplementary Files

Status:

Version 1