Integrating articulatory features into HMM-based parametric speech synthesis

JA Gonzalez-Lopez, A Gomez-Alanis… - IEEE …, 2020 - ieeexplore.ieee.org

This review summarises the status of silent speech interface (SSI) research. SSIs rely on non-
acoustic biosignals generated by the human body during speech production to enable …

被引用次数：91 相关文章所有 7 个版本

[HTML] nih.gov

Harnessing the power of artificial intelligence in otolaryngology and the communication sciences

BS Wilson, DL Tucci, DA Moses, EF Chang… - Journal of the …, 2022 - Springer

Use of artificial intelligence (AI) is a burgeoning field in otolaryngology and the
communication sciences. A virtual symposium on the topic was convened from Duke …

被引用次数：19 相关文章所有 8 个版本

[PDF] ieee.org

Biosignal-based spoken communication: A survey

T Schultz, M Wand, T Hueber… - … on Audio, Speech …, 2017 - ieeexplore.ieee.org

Speech is a complex process involving a wide range of biosignals, including but not limited
to acoustics. These biosignals-stemming from the articulators, the articulator muscle …

被引用次数：192 相关文章所有 6 个版本

[PDF] nii.ac.jp

Speech synthesis based on hidden Markov models

K Tokuda, Y Nankaku, T Toda, H Zen… - Proceedings of the …, 2013 - ieeexplore.ieee.org

This paper gives a general overview of hidden Markov model (HMM)-based speech
synthesis, which has recently been demonstrated to be very effective in synthesizing …

被引用次数：576 相关文章所有 8 个版本

[PDF] microsoft.com

Modeling spectral envelopes using restricted Boltzmann machines and deep belief networks for statistical parametric speech synthesis

ZH Ling, L Deng, D Yu - IEEE transactions on audio, speech …, 2013 - ieeexplore.ieee.org

This paper presents a new spectral modeling method for statistical parametric speech
synthesis. In the conventional methods, high-level spectral parameters, such as mel-cepstra …

被引用次数：194 相关文章所有 8 个版本

[PDF] researchgate.net

[PDF][PDF] Announcing the electromagnetic articulography (day 1) subset of the mngu0 articulatory corpus

K Richmond, P Hoole, S King - Twelfth Annual Conference of the …, 2011 - researchgate.net

This paper serves as an initial announcement of the availability of a corpus of articulatory
data called mngu0. This corpus will ultimately consist of a collection of multiple sources of …

被引用次数：167 相关文章所有 14 个版本

A deep recurrent approach for acoustic-to-articulatory inversion

P Liu, Q Yu, Z Wu, S Kang, H Meng… - 2015 IEEE International …, 2015 - ieeexplore.ieee.org

To solve the acoustic-to-articulatory inversion problem, this paper proposes a deep
bidirectional long short term memory recurrent neural network and a deep recurrent mixture …

被引用次数：90 相关文章所有 3 个版本

[PDF] sciencedirect.com

Analysis of speech production real-time MRI

V Ramanarayanan, S Tilsen, M Proctor, J Töger… - Computer Speech & …, 2018 - Elsevier

Recent advances in real-time magnetic resonance imaging (RT-MRI) have made it possible
to study the anatomy and dynamic motion of the vocal tract during speech production with …

被引用次数：65 相关文章所有 15 个版本

[PDF] isca-archive.org

[PDF][PDF] Low Resource Acoustic-to-articulatory Inversion Using Bi-directional Long Short Term Memory.

A Illa, PK Ghosh - Interspeech, 2018 - isca-archive.org

Estimating articulatory movements from speech acoustic features is known as acoustic-to-
articulatory inversion (AAI). Large amount of parallel data from speech and articulatory …

被引用次数：47 相关文章所有 6 个版本

[PDF] ed.ac.uk

Deep architectures for articulatory inversion

B Uria, I Murray, S Renals… - INTERSPEECH 2012 13th …, 2012 - research.ed.ac.uk

We implement two deep architectures for the acoustic-articulatory inversion mapping
problem: a deep neural network and a deep trajectory mixture density network. We find that …

被引用次数：84 相关文章所有 13 个版本

高级搜索

QQ 群