
Recognition of Dysarthric Speech Using Voice Parameters for Speaker Adaptation and Multi-Taper Spectral Estimation

C Bhat, B Vachhani, SK Kopparapu - Interspeech, 2016 - academia.edu


Abstract

Dysarthria is a motor speech disorder resulting from impairment of the muscles responsible for speech production. It is often characterized by slurred or slow speech, which results in low intelligibility. With speech-based applications such as voice biometrics and personal assistants gaining popularity, automatic recognition of dysarthric speech becomes imperative as a step towards including people with dysarthria in the mainstream. In this paper we examine the applicability of voice parameters that are traditionally used for pathological voice classification, such as jitter, shimmer, F0 and Noise Harmonic Ratio (NHR) contour, in addition to Mel Frequency Cepstral Coefficients (MFCC), for dysarthric speech recognition. Additionally, we show that using multi-taper spectral estimation to compute MFCC improves recognition of unseen dysarthric speech. A deep neural network (DNN)-hidden Markov model (HMM) recognition system fared better than a Gaussian Mixture Model (GMM)-HMM based system for dysarthric speech recognition. We propose a method to optimally use incremental dysarthric data to improve dysarthric speech recognition for an ASR system with DNN-HMM. All evaluations were done on the Universal Access Speech Corpus.
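
To illustrate the multi-taper idea mentioned in the abstract, here is a minimal sketch in plain Python. The abstract does not say which taper family or how many tapers the authors used, so the sine tapers, the default of four tapers, and the function names (`sine_tapers`, `power_spectrum`, `multitaper_spectrum`) are all assumptions made for illustration. The sketch windows one frame with several orthogonal tapers and averages the resulting power spectra, which lowers the variance of the estimate compared with a single-window periodogram.

```python
import cmath
import math


def sine_tapers(n, k):
    """Return k orthonormal sine tapers of length n (assumed taper family)."""
    scale = math.sqrt(2.0 / (n + 1))
    return [
        [scale * math.sin(math.pi * (j + 1) * (i + 1) / (n + 1)) for i in range(n)]
        for j in range(k)
    ]


def power_spectrum(frame):
    """Naive O(n^2) DFT power spectrum for bins 0..n//2 (fine for short frames)."""
    n = len(frame)
    spec = []
    for b in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * b * i / n) for i, x in enumerate(frame))
        spec.append(abs(acc) ** 2)
    return spec


def multitaper_spectrum(frame, num_tapers=4):
    """Average the power spectra of the frame under each taper (uniform weights)."""
    n = len(frame)
    avg = [0.0] * (n // 2 + 1)
    for taper in sine_tapers(n, num_tapers):
        tapered = [x * w for x, w in zip(frame, taper)]
        for b, p in enumerate(power_spectrum(tapered)):
            avg[b] += p / num_tapers
    return avg


# Example: a 64-sample sinusoid centred on DFT bin 8.
frame = [math.sin(2 * math.pi * 8 * i / 64) for i in range(64)]
spectrum = multitaper_spectrum(frame)
peak_bin = max(range(len(spectrum)), key=spectrum.__getitem__)
```

In an MFCC front end, a spectrum like this would stand in for the usual single-window (for example Hamming) periodogram before the mel filterbank, log, and DCT stages. The averaging widens the spectral peak slightly, so `peak_bin` lands near bin 8 rather than necessarily on it.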


