[HTML][HTML] Data augmentation using Variational Autoencoders for improvement of respiratory disease classification

J Saldanha, S Chakraborty, S Patil, K Kotecha… - Plos one, 2022 - journals.plos.org
J Saldanha, S Chakraborty, S Patil, K Kotecha, S Kumar, A Nayyar
Plos one, 2022journals.plos.org
Computerized auscultation of lung sounds is gaining importance today with the availability
of lung sounds and its potential in overcoming the limitations of traditional diagnosis
methods for respiratory diseases. The publicly available ICBHI respiratory sounds database
is severely imbalanced, making it difficult for a deep learning model to generalize and
provide reliable results. This work aims to synthesize respiratory sounds of various
categories using variants of Variational Autoencoders like Multilayer Perceptron VAE (MLP …
Computerized auscultation of lung sounds is gaining importance today with the availability of lung sounds and its potential in overcoming the limitations of traditional diagnosis methods for respiratory diseases. The publicly available ICBHI respiratory sounds database is severely imbalanced, making it difficult for a deep learning model to generalize and provide reliable results. This work aims to synthesize respiratory sounds of various categories using variants of Variational Autoencoders like Multilayer Perceptron VAE (MLP-VAE), Convolutional VAE (CVAE) Conditional VAE and compare the influence of augmenting the imbalanced dataset on the performance of various lung sound classification models. We evaluated the quality of the synthetic respiratory sounds’ quality using metrics such as Fréchet Audio Distance (FAD), Cross-Correlation and Mel Cepstral Distortion. Our results showed that MLP-VAE achieved an average FAD of 12.42 over all classes, whereas Convolutional VAE and Conditional CVAE achieved an average FAD of 11.58 and 11.64 for all classes, respectively. A significant improvement in the classification performance metrics was observed upon augmenting the imbalanced dataset for certain minority classes and marginal improvement for the other classes. Hence, our work shows that deep learning-based lung sound classification models are not only a promising solution over traditional methods but can also achieve a significant performance boost upon augmenting an imbalanced training set.
PLOS
以上显示的是最相近的搜索结果。 查看全部搜索结果