Recent advances in convolutional neural networks

J Gu, Z Wang, J Kuen, L Ma, A Shahroudy, B Shuai… - Pattern recognition, 2018 - Elsevier
In the last few years, deep learning has led to very good performance on a variety of
problems, such as visual recognition, speech recognition and natural language processing …

A tutorial survey of architectures, algorithms, and applications for deep learning

L Deng - APSIPA transactions on Signal and Information …, 2014 - cambridge.org
In this invited paper, my overview material on the same topic as presented in the plenary
overview session of APSIPA-2011 and the tutorial material presented in the same …

Deep learning: methods and applications

L Deng, D Yu - Foundations and trends® in signal processing, 2014 - nowpublishers.com
This monograph provides an overview of general deep learning methodology and its
applications to a variety of signal and information processing tasks. The application areas …

Deep learning for acoustic modeling in parametric speech generation: A systematic review of existing techniques and future trends

ZH Ling, SY Kang, H Zen, A Senior… - IEEE Signal …, 2015 - ieeexplore.ieee.org
Hidden Markov models (HMMs) and Gaussian mixture models (GMMs) are the two most
common types of acoustic models used in statistical parametric approaches for generating …

Speech synthesis based on hidden Markov models

K Tokuda, Y Nankaku, T Toda, H Zen… - Proceedings of the …, 2013 - ieeexplore.ieee.org
This paper gives a general overview of hidden Markov model (HMM)-based speech
synthesis, which has recently been demonstrated to be very effective in synthesizing …

[PDF][PDF] Three classes of deep learning architectures and their applications: a tutorial survey

L Deng - APSIPA transactions on signal and information …, 2012 - academia.edu
In this invited paper, my overview material on the same topic as presented in the plenary
overview session of APSIPA-2011 and the tutorial material presented in the same …

Deep encoder-decoder models for unsupervised learning of controllable speech synthesis

GE Henter, J Lorenzo-Trueba, X Wang… - arXiv preprint arXiv …, 2018 - arxiv.org
Generating versatile and appropriate synthetic speech requires control over the output
expression separate from the spoken text. Important non-textual speech variation is seldom …

Analysis of speech production real-time MRI

V Ramanarayanan, S Tilsen, M Proctor, J Töger… - Computer Speech & …, 2018 - Elsevier
Recent advances in real-time magnetic resonance imaging (RT-MRI) have made it possible
to study the anatomy and dynamic motion of the vocal tract during speech production with …

The artificial intelligence renaissance: deep learning and the road to human-level machine intelligence

KH Tan, BP Lim - APSIPA Transactions on Signal and Information …, 2018 - cambridge.org
In this paper we look at recent advances in artificial intelligence. Decades in the making, a
confluence of several factors in the past few years has culminated in a string of …

Principles for learning controllable TTS from annotated and latent variation

G Henter, J Lorenzo-Trueba, X Wang… - Interspeech …, 2017 - research.ed.ac.uk
For building flexible and appealing high-quality speech synthesisers, it is desirable to be
able to accommodate and reproduce fine variations in vocal expression present in natural …