Dysarthric Speech Recognition Using Time-delay Neural Network Based Denoising Autoencoder

C Bhat, B Das, B Vachhani, SK Kopparapu - Interspeech, 2018 - academia.edu

Abstract

Dysarthria is a manifestation of a disruption in neuromuscular physiology that results in uneven, slow, slurred, harsh or quiet speech. Dysarthric speech poses serious challenges to automatic speech recognition, since it is difficult to decipher for both humans and machines. The objective of this work is to enhance dysarthric speech features to match those of healthy control speech. We use a Time-Delay Neural Network based Denoising Autoencoder (TDNN-DAE) to enhance the dysarthric speech features. The dysarthric speech thus enhanced is recognized using a DNN-HMM based Automatic Speech Recognition (ASR) engine. This methodology was evaluated for speaker-independent (SI) and speaker-adapted (SA) systems. Absolute improvements of 13% and 3% were observed in ASR performance for the SI and SA systems, respectively, compared with unenhanced dysarthric speech recognition.
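The abstract does not give the network's configuration, so the following is only a minimal sketch of the general idea behind a TDNN-style denoising autoencoder: each layer splices a frame with its neighbours at fixed time offsets and applies a shared affine transform, and the output would be trained to match healthy-control features. The feature dimension, hidden size, context offsets, random weights, and the names `splice`, `affine`, `init_layer`, `tdnn_dae_forward`, and `mse` are all assumptions made for illustration, not the authors' setup.

```python
import random


def splice(frames, offsets):
    """Concatenate each frame with its neighbours at the given time offsets.

    Indices that fall outside the utterance are clamped to the first or
    last frame (an assumed edge-handling choice).
    """
    last = len(frames) - 1
    spliced = []
    for t in range(len(frames)):
        row = []
        for off in offsets:
            idx = min(max(t + off, 0), last)
            row.extend(frames[idx])
        spliced.append(row)
    return spliced


def affine(rows, weights, bias, relu=True):
    """Apply the same affine transform (plus optional ReLU) to every frame."""
    out = []
    for row in rows:
        vals = [sum(w * x for w, x in zip(w_row, row)) + b
                for w_row, b in zip(weights, bias)]
        out.append([max(v, 0.0) for v in vals] if relu else vals)
    return out


def init_layer(n_in, n_out, rng):
    """Small random weights; a real system would learn these by training."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


def tdnn_dae_forward(frames, layers):
    """Run frames through a stack of (offsets, weights, bias, relu) layers."""
    h = frames
    for offsets, weights, bias, relu in layers:
        h = affine(splice(h, offsets), weights, bias, relu)
    return h


def mse(predicted, target):
    """Mean squared error between two equally shaped frame sequences."""
    total, count = 0.0, 0
    for p_row, t_row in zip(predicted, target):
        for p, t in zip(p_row, t_row):
            total += (p - t) ** 2
            count += 1
    return total / count


rng = random.Random(0)
feat_dim, hidden = 13, 16        # hypothetical sizes
offsets1 = [-2, -1, 0, 1, 2]     # hypothetical narrow context, first layer
offsets2 = [-3, 0, 3]            # hypothetical wider, subsampled context
w1, b1 = init_layer(feat_dim * len(offsets1), hidden, rng)
w2, b2 = init_layer(hidden * len(offsets2), feat_dim, rng)
layers = [(offsets1, w1, b1, True), (offsets2, w2, b2, False)]

# Stand-in for one utterance of dysarthric features (20 frames).
utterance = [[rng.gauss(0, 1) for _ in range(feat_dim)] for _ in range(20)]
enhanced = tdnn_dae_forward(utterance, layers)
```

In a trained system, the weights would be fitted by minimizing a reconstruction loss such as `mse(enhanced, healthy_target)` over paired dysarthric and healthy-control features, which presumes the two sequences have been time-aligned. The enhanced frames would then be passed to the ASR engine in place of the raw dysarthric features.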
