Deep Autoencoder Based Speech Features for Improved Dysarthric Speech Recognition

B Vachhani, C Bhat, B Das, SK Kopparapu - Interspeech, 2017 - researchgate.net
Abstract
Dysarthria is a motor speech disorder that results in mumbled, slurred, or slow speech, which is generally difficult for both humans and machines to understand. Traditional Automatic Speech Recognizers (ASR) perform poorly on dysarthric speech recognition tasks. In this paper, we propose using deep autoencoders to enhance Mel Frequency Cepstral Coefficient (MFCC) based features in order to improve dysarthric speech recognition. Speech from healthy control speakers is used to train an autoencoder, which is in turn used to obtain an improved feature representation for dysarthric speech. Additionally, we analyze the use of severity-based tempo adaptation followed by autoencoder-based speech feature enhancement. All evaluations were carried out on the Universal Access dysarthric speech corpus. An overall absolute improvement of 16% was achieved using tempo adaptation followed by the autoencoder-based speech front-end representation for DNN-HMM-based dysarthric speech recognition.
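
The feature-enhancement idea in the abstract (train an autoencoder on healthy-speaker features, then pass dysarthric features through it) can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration, not the paper's configuration: the single tanh hidden layer, the 13-dimensional frames, the learning rate, and the helper names `make_synthetic_features`, `train_autoencoder`, and `enhance` are all hypothetical, and random synthetic arrays stand in for real MFCC frames. Tempo adaptation is not shown.

```python
import numpy as np


def make_synthetic_features(n_frames, dim=13, seed=0):
    """Return synthetic MFCC-like frames (a stand-in for real features)."""
    rng = np.random.default_rng(seed)
    # Fixed mixing matrix so every call shares the same low-rank structure.
    mixing = np.random.default_rng(42).normal(0.0, 0.5, (4, dim))
    latent = rng.normal(0.0, 1.0, (n_frames, 4))
    return latent @ mixing + rng.normal(0.0, 0.05, (n_frames, dim))


def train_autoencoder(features, hidden_dim=8, epochs=500, lr=0.1, seed=0):
    """Fit a one-hidden-layer autoencoder by full-batch gradient descent."""
    rng = np.random.default_rng(seed)
    dim = features.shape[1]
    w_enc = rng.normal(0.0, 0.1, (dim, hidden_dim))
    b_enc = np.zeros(hidden_dim)
    w_dec = rng.normal(0.0, 0.1, (hidden_dim, dim))
    b_dec = np.zeros(dim)
    losses = []
    for _ in range(epochs):
        # Forward pass: tanh encoder, linear decoder.
        hidden = np.tanh(features @ w_enc + b_enc)
        recon = hidden @ w_dec + b_dec
        err = recon - features
        losses.append(float(np.mean(err ** 2)))
        # Backward pass for the mean squared reconstruction error.
        grad_recon = 2.0 * err / err.size
        grad_w_dec = hidden.T @ grad_recon
        grad_b_dec = grad_recon.sum(axis=0)
        grad_hidden = (grad_recon @ w_dec.T) * (1.0 - hidden ** 2)
        grad_w_enc = features.T @ grad_hidden
        grad_b_enc = grad_hidden.sum(axis=0)
        w_enc -= lr * grad_w_enc
        b_enc -= lr * grad_b_enc
        w_dec -= lr * grad_w_dec
        b_dec -= lr * grad_b_dec
    return (w_enc, b_enc, w_dec, b_dec), losses


def enhance(features, params):
    """Pass frames through the trained autoencoder and return its output."""
    w_enc, b_enc, w_dec, b_dec = params
    return np.tanh(features @ w_enc + b_enc) @ w_dec + b_dec


# Train on "healthy" frames, then transform a separate set of frames.
healthy = make_synthetic_features(200, seed=1)
params, losses = train_autoencoder(healthy)
dysarthric_like = make_synthetic_features(50, seed=2)
enhanced = enhance(dysarthric_like, params)
```

In a real pipeline the synthetic arrays would be replaced by MFCC frames extracted from recordings, and `enhanced` would feed the recognizer's acoustic model in place of the raw features.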