Articulatory control of HMM-based parametric speech synthesis using feature-space-switched...

J Gu, Z Wang, J Kuen, L Ma, A Shahroudy, B Shuai… - Pattern recognition, 2018 - Elsevier

In the last few years, deep learning has led to very good performance on a variety of
problems, such as visual recognition, speech recognition and natural language processing …

被引用次数：6118 相关文章所有 7 个版本

[PDF] cambridge.org

A tutorial survey of architectures, algorithms, and applications for deep learning

L Deng - APSIPA transactions on Signal and Information …, 2014 - cambridge.org

In this invited paper, my overview material on the same topic as presented in the plenary
overview session of APSIPA-2011 and the tutorial material presented in the same …

被引用次数：964 相关文章所有 4 个版本

[PDF] nowpublishers.com

Deep learning: methods and applications

L Deng, D Yu - Foundations and trends® in signal processing, 2014 - nowpublishers.com

This monograph provides an overview of general deep learning methodology and its
applications to a variety of signal and information processing tasks. The application areas …

被引用次数：5843 相关文章所有 13 个版本

[PDF] academia.edu

Deep learning for acoustic modeling in parametric speech generation: A systematic review of existing techniques and future trends

ZH Ling, SY Kang, H Zen, A Senior… - IEEE Signal …, 2015 - ieeexplore.ieee.org

Hidden Markov models (HMMs) and Gaussian mixture models (GMMs) are the two most
common types of acoustic models used in statistical parametric approaches for generating …

被引用次数：295 相关文章所有 11 个版本

[PDF] nii.ac.jp

Speech synthesis based on hidden Markov models

K Tokuda, Y Nankaku, T Toda, H Zen… - Proceedings of the …, 2013 - ieeexplore.ieee.org

This paper gives a general overview of hidden Markov model (HMM)-based speech
synthesis, which has recently been demonstrated to be very effective in synthesizing …

被引用次数：576 相关文章所有 8 个版本

[PDF] academia.edu

[PDF][PDF] Three classes of deep learning architectures and their applications: a tutorial survey

L Deng - APSIPA transactions on signal and information …, 2012 - academia.edu

In this invited paper, my overview material on the same topic as presented in the plenary
overview session of APSIPA-2011 and the tutorial material presented in the same …

被引用次数：190 相关文章所有 3 个版本

[PDF] arxiv.org

Deep encoder-decoder models for unsupervised learning of controllable speech synthesis

GE Henter, J Lorenzo-Trueba, X Wang… - arXiv preprint arXiv …, 2018 - arxiv.org

Generating versatile and appropriate synthetic speech requires control over the output
expression separate from the spoken text. Important non-textual speech variation is seldom …

被引用次数：67 相关文章所有 2 个版本

[PDF] sciencedirect.com

Analysis of speech production real-time MRI

V Ramanarayanan, S Tilsen, M Proctor, J Töger… - Computer Speech & …, 2018 - Elsevier

Recent advances in real-time magnetic resonance imaging (RT-MRI) have made it possible
to study the anatomy and dynamic motion of the vocal tract during speech production with …

被引用次数：65 相关文章所有 15 个版本

[PDF] cambridge.org

The artificial intelligence renaissance: deep learning and the road to human-level machine intelligence

KH Tan, BP Lim - APSIPA Transactions on Signal and Information …, 2018 - cambridge.org

In this paper we look at recent advances in artificial intelligence. Decades in the making, a
confluence of several factors in the past few years has culminated in a string of …

被引用次数：59 相关文章所有 2 个版本

[PDF] ed.ac.uk

Principles for learning controllable TTS from annotated and latent variation

G Henter, J Lorenzo-Trueba, X Wang… - Interspeech …, 2017 - research.ed.ac.uk

For building flexible and appealing high-quality speech synthesisers, it is desirable to be
able to accommodate and reproduce fine variations in vocal expression present in natural …

被引用次数：47 相关文章所有 4 个版本

高级搜索

QQ 群