Deep Autoencoder Based Speech Features for Improved Dysarthric Speech Recognition

B Vachhani, C Bhat, B Das, SK Kopparapu - Interspeech, 2017 - researchgate.net

Abstract

Dysarthria is a motor speech disorder that results in mumbled, slurred, or slow speech that is generally difficult for both humans and machines to understand. Traditional automatic speech recognizers (ASR) perform poorly on dysarthric speech recognition tasks. In this paper, we propose the use of deep autoencoders to enhance Mel Frequency Cepstral Coefficient (MFCC) based features in order to improve dysarthric speech recognition. Speech from healthy control speakers is used to train an autoencoder, which is in turn used to obtain an improved feature representation for dysarthric speech. Additionally, we analyze the use of severity-based tempo adaptation followed by autoencoder-based speech feature enhancement. All evaluations were carried out on the Universal Access dysarthric speech corpus. An overall absolute improvement of 16% was achieved using tempo adaptation followed by the autoencoder-based speech front-end representation for DNN-HMM based dysarthric speech recognition.
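The core idea in the abstract, training an autoencoder on healthy-speaker features and then passing dysarthric features through it, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the single tanh hidden layer, the layer sizes, the learning rate, the function names `train_autoencoder` and `enhance`, and the synthetic 13-dimensional arrays that stand in for real MFCC frames. The abstract does not say whether the reconstructed output or a hidden-layer activation is used as the enhanced feature; this sketch returns the reconstruction.

```python
import numpy as np

def train_autoencoder(x, hidden_dim=8, epochs=300, lr=0.01, seed=0):
    """Train a one-hidden-layer autoencoder (tanh encoder, linear decoder).

    Illustrative only: the architecture and hyperparameters are assumptions,
    not the configuration used in the paper.
    """
    rng = np.random.default_rng(seed)
    n, d = x.shape
    w1 = rng.normal(0.0, 0.1, (d, hidden_dim))
    b1 = np.zeros(hidden_dim)
    w2 = rng.normal(0.0, 0.1, (hidden_dim, d))
    b2 = np.zeros(d)
    losses = []
    for _ in range(epochs):
        h = np.tanh(x @ w1 + b1)           # encoder activations
        err = (h @ w2 + b2) - x            # reconstruction error per frame
        losses.append(0.5 * float(np.mean(np.sum(err ** 2, axis=1))))
        g_out = err / n                    # gradient of the loss w.r.t. the output
        g_pre = (g_out @ w2.T) * (1.0 - h ** 2)  # backprop through tanh
        w2 -= lr * (h.T @ g_out)
        b2 -= lr * g_out.sum(axis=0)
        w1 -= lr * (x.T @ g_pre)
        b1 -= lr * g_pre.sum(axis=0)
    return (w1, b1, w2, b2), losses

def enhance(x, params):
    """Pass features through the trained autoencoder and return the reconstruction."""
    w1, b1, w2, b2 = params
    return np.tanh(x @ w1 + b1) @ w2 + b2

# Synthetic stand-ins for MFCC frames (rows = frames, 13 coefficients each).
rng = np.random.default_rng(1)
mixing = rng.normal(size=(4, 13))
healthy = rng.normal(size=(500, 4)) @ mixing + 0.1 * rng.normal(size=(500, 13))
dysarthric = rng.normal(size=(50, 4)) @ mixing + 0.5 * rng.normal(size=(50, 13))

# Normalize with statistics from the healthy (training) data only.
mean, std = healthy.mean(axis=0), healthy.std(axis=0)
params, losses = train_autoencoder((healthy - mean) / std)
enhanced = enhance((dysarthric - mean) / std, params)
```

In a real pipeline the `enhanced` frames would replace the raw MFCCs as input to the recognizer's acoustic model; the severity-based tempo adaptation described in the abstract would be applied to the audio before feature extraction and is not modeled here.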
