Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains

S Sun, B Zhang, L Xie, Y Zhang - Neurocomputing, 2017 - Elsevier

This paper addresses the robust speech recognition problem as a domain adaptation task.
Specifically, we introduce an unsupervised deep domain adaptation (DDA) approach to …

被引用次数：187 相关文章所有 2 个版本

Transfer linear subspace learning for cross-corpus speech emotion recognition

P Song - IEEE transactions on affective computing, 2017 - ieeexplore.ieee.org

Speech emotion recognition has received an increasing interest in recent years, which is
often conducted on the assumption that speech utterances in training and testing datasets …

被引用次数：97 相关文章所有 3 个版本

Feature extraction methods for speaker recognition: A review

G Chaudhary, S Srivastava… - International Journal of …, 2017 - World Scientific

This paper presents main paradigms of research for feature extraction methods to further
augment the state of art in speaker recognition (SR) which has been recognized extensively …

被引用次数：30 相关文章所有 2 个版本

An end-to-end deep learning approach to simultaneous speech dereverberation and acoustic modeling for robust speech recognition

B Wu, K Li, F Ge, Z Huang, M Yang… - IEEE Journal of …, 2017 - ieeexplore.ieee.org

We propose an integrated end-to-end automatic speech recognition (ASR) paradigm by joint
learning of the front-end speech signal processing and back-end acoustic modeling. We …

被引用次数：84 相关文章所有 4 个版本

New Era for Robust Speech Recognition

S Watanabe, M Delcroix, F Metze… - Cham, Switzerland …, 2017 - Springer

The field of automatic speech recognition has evolved greatly since the introduction of deep
learning, which began only about 5 years ago. In particular, as more and more products …

被引用次数：64 相关文章所有 6 个版本

[HTML] nih.gov

Regularized speaker adaptation of KL-HMM for dysarthric speech recognition

M Kim, Y Kim, J Yoo, J Wang… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org

This paper addresses the problem of recognizing the speech uttered by patients with
dysarthria, which is a motor speech disorder impeding the physical production of speech …

被引用次数：53 相关文章所有 8 个版本

Adaptive unscented Kalman filter based on maximum posterior and random weighting

Z Gao, D Mu, S Gao, Y Zhong, C Gu - Aerospace Science and Technology, 2017 - Elsevier

The unscented Kalman filter (UKF) is an effective technique of state estimation for nonlinear
dynamic systems. However, its performance depends on prior knowledge on system noise. If …

被引用次数：47 相关文章所有 4 个版本

Adaptation of deep neural network acoustic models for robust automatic speech recognition

KC Sim, Y Qian, G Mantena, L Samarakoon… - New Era for Robust …, 2017 - Springer

Deep neural networks (DNNs) have been successfully applied to many pattern classification
problems, including acoustic modelling for automatic speech recognition (ASR). However …

被引用次数：17 相关文章所有 3 个版本

[PDF] arxiv.org

Embedding-based speaker adaptive training of deep neural networks

X Cui, V Goel, G Saon - arXiv preprint arXiv:1710.06937, 2017 - arxiv.org

An embedding-based speaker adaptive training (SAT) approach is proposed and
investigated in this paper for deep neural network acoustic modeling. In this approach …

被引用次数：44 相关文章所有 8 个版本

[PDF] researchgate.net

[PDF][PDF] Acoustic scene classification using a CNN-SuperVector system trained with auditory and spectrogram image features.

R Hyder, S Ghaffarzadegan, Z Feng, JHL Hansen… - Interspeech, 2017 - researchgate.net

Enabling smart devices to infer about the environment using audio signals has been one of
the several long-standing challenges in machine listening. The availability of public-domain …

被引用次数：41 相关文章所有 6 个版本

高级搜索

QQ 群