gms | German Medical Science

65th Annual Meeting of the German Association for Medical Informatics, Biometry and Epidemiology (GMDS), Meeting of the Central European Network (CEN: German Region, Austro-Swiss Region and Polish Region) of the International Biometric Society (IBS)

06.09. - 09.09.2020, Berlin (online conference)

The Effect of Various Strengths of Noises and Data Augmentations on Classification of Short Single-Lead ECG Signals Using Deep Neural Networks

Meeting Abstract

  • Faezeh Nejati Hatamian - Pattern Recognition Lab, Department of Computer Science, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany, Erlangen, Germany
  • AmirAbbas Davari - Pattern Recognition Lab, Department of Computer Science, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany, Erlangen, Germany
  • Andreas Maier - Pattern Recognition Lab, Department of Computer Science, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany, Erlangen, Germany

Deutsche Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie. 65th Annual Meeting of the German Association for Medical Informatics, Biometry and Epidemiology (GMDS), Meeting of the Central European Network (CEN: German Region, Austro-Swiss Region and Polish Region) of the International Biometric Society (IBS). Berlin, 06.-09.09.2020. Düsseldorf: German Medical Science GMS Publishing House; 2021. DocAbstr. 264

doi: 10.3205/20gmds165, urn:nbn:de:0183-20gmds1654

Published: February 26, 2021

© 2021 Nejati Hatamian et al.
This is an Open Access article distributed under the terms of the Creative Commons Attribution 4.0 License. See license information at http://creativecommons.org/licenses/by/4.0/.


Outline

Text

Background: Due to the multiple imperfections during the signal acquisition, Electrocardiogram (ECG) datasets are typically contaminated with numerous types of noise, like salt and pepper and baseline drift. These datasets may contain different recordings with various types of noise [1] and thus, denoising may not be the easiest task. Furthermore, usually, the number of labeled bio-signals is very limited for a proper classification task.

Methods: In this work, we investigate the performance of deep convolutional neural networks and their separable alternatives (i.e., separable CNN) that are trained using noisy training data, tested on a clean test set. It is interesting to see if variants of CNNs and different schemes of data augmentation can help the classifier in the presence of noise contamination. To this end, we add different types and strengths of noise to the training data and train the network. We would like to see how noise in the training data impacts the inference performance and which noise is more destructive. For the purpose of simulation, we use additive white Gaussian noise (AWGN) in one variant and linear noise (y=ax) in the other variant. In the case of the AWGN, we use 20, 40, 60, 80 values as the standard deviation to generate various strengths of this noise. Analogously, we use 0.2, 0.4, 0.6 and 0.8 as different slopes for the linear noise. Figure 1 [Fig. 1] shows a sample signal from the training set being contaminated by different types and strengths of noise.

Moreover, we use oversampling and Gaussian Mixture Model (GMM) [2], [3] for data augmentation and compare the results with the variant without data augmentation. The metric that we adopt to measure the performance of the classifier is the F1-score. We use the same dataset [4] in a similar fashion as in our previous work [5] (Figure 2.1 [Fig. 2]). Finally, Figure 2.2 [Fig. 2] depicts the CNN architecture and its separable variant.

Results: Figure 3 [Fig. 3] visualizes the F1-score vs. the corresponding noise strength for different data augmentation algorithms, types of noise and classifiers. Table 1 [Tab. 1] shows the detailed quantitative results.

Conclusion: It can be observed that in the presence of various types and strengths of the noise, the separable CNN is more stable and robust than the normal CNN. In the majority of the experiments, linear noise is more destructive than the AWGN. In fact, both CNN and separable CNN perform equally well in the presence of the AWGN. However, CNN simply falls apart in the presence of the linear noise. In this case, the separable CNN brings robustness to the classification pipeline. Finally, except in the case of linear noise and normal CNN, the highest classification performance is almost consistently being achieved when the GMM data augmentation is in use.

The authors declare that they have no competing interests.

The authors declare that an ethics committee vote is not required.


References

1.
Abdelazez M, Rajan S, Chan ADC. Detection of Noise Type in Electrocardiogram. In: 2018 IEEE International Symposium on Medical Measurements and Applications (MeMeA). IEEE; 2018. p. 1-6.
2.
Davari A, Aptoula E, Yanikoglu B, Maier A, Riess C. GMM-based synthetic samples for classification of hyperspectral images with limited training data. IEEE Geoscience and Remote Sensing Letters. 2018;15(6): 942-946.
3.
Davari A, Özkan HC, Maier A, Riess C. Fast and Efficient Limited Data Hyperspectral Remote Sensing Image Classification via GMM-Based Synthetic Samples. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing. 2019;12(7): 2107-2120.
4.
Clifford GD, Liu C, Moody B, Lehman Li-wei H, Silva I, Li Q, Johnson AE, Mark RG. AF Classification from a short single lead ECG recording: the PhysioNet/Computing in Cardiology Challenge 2017. In: 2017 Computing in Cardiology (CinC). IEEE; 2017. p. 1-4.
5.
Nejati Hatamian F, Ravikumar N, Vesal S, Kemeth FP, Struck M, Maier A. The Effect of Data Augmentation on Classification of Atrial Fibrillation in Short Single-Lead ECG Signals Using Deep Neural Networks. arXiv. 2020. Available from: https://arxiv.org/abs/2002.02870 External link