Speech recognition using deep neural networks: A systematic review

AB Nassif, I Shahin, I Attili, M Azzeh, K Shaalan - IEEE access, 2019 - ieeexplore.ieee.org
Over the past decades, a tremendous amount of research has been done on the use of
machine learning for speech processing applications, especially speech recognition …

Deep learning for acoustic modeling in parametric speech generation: A systematic review of existing techniques and future trends

ZH Ling, SY Kang, H Zen, A Senior… - IEEE Signal …, 2015 - ieeexplore.ieee.org
Hidden Markov models (HMMs) and Gaussian mixture models (GMMs) are the two most
common types of acoustic models used in statistical parametric approaches for generating …

Deep learning: methods and applications

L Deng, D Yu - Foundations and trends® in signal processing, 2014 - nowpublishers.com
This monograph provides an overview of general deep learning methodology and its
applications to a variety of signal and information processing tasks. The application areas …

Unsupervised learning of acoustic features via deep canonical correlation analysis

W Wang, R Arora, K Livescu… - 2015 IEEE International …, 2015 - ieeexplore.ieee.org
It has been previously shown that, when both acoustic and articulatory training data are
available, it is possible to improve phonetic recognition accuracy by learning acoustic …

Modeling spectral envelopes using restricted Boltzmann machines and deep belief networks for statistical parametric speech synthesis

ZH Ling, L Deng, D Yu - IEEE transactions on audio, speech …, 2013 - ieeexplore.ieee.org
This paper presents a new spectral modeling method for statistical parametric speech
synthesis. In the conventional methods, high-level spectral parameters, such as mel-cepstra …

Hybrid convolutional neural networks for articulatory and acoustic information based speech recognition

V Mitra, G Sivaraman, H Nam, C Espy-Wilson… - Speech …, 2017 - Elsevier
Studies have shown that articulatory information helps model speech variability and,
consequently, improves speech recognition performance. But learning speaker-invariant …

[PDF][PDF] Low Resource Acoustic-to-articulatory Inversion Using Bi-directional Long Short Term Memory.

A Illa, PK Ghosh - Interspeech, 2018 - isca-archive.org
Estimating articulatory movements from speech acoustic features is known as acoustic-to-
articulatory inversion (AAI). Large amount of parallel data from speech and articulatory …

Modeling spectral envelopes using restricted Boltzmann machines for statistical parametric speech synthesis

ZH Ling, L Deng, D Yu - 2013 IEEE International Conference …, 2013 - ieeexplore.ieee.org
This paper presents a new spectral modeling method for statistical parametric speech
synthesis. In contrast to the conventional methods in which high-level spectral parameters …

Articulatory features from deep neural networks and their role in speech recognition

V Mitra, G Sivaraman, H Nam… - … on acoustics, speech …, 2014 - ieeexplore.ieee.org
This paper presents a deep neural network (DNN) to extract articulatory information from the
speech signal and explores different ways to use such information in a continuous speech …

Integrating articulatory data in deep neural network-based acoustic modeling

L Badino, C Canevari, L Fadiga, G Metta - Computer Speech & Language, 2016 - Elsevier
Hybrid deep neural network–hidden Markov model (DNN-HMM) systems have become the
state-of-the-art in automatic speech recognition. In this paper we experiment with DNN-HMM …