Silent speech interfaces for speech restoration: A review

JA Gonzalez-Lopez, A Gomez-Alanis… - IEEE …, 2020 - ieeexplore.ieee.org
This review summarises the status of silent speech interface (SSI) research. SSIs rely on non-
acoustic biosignals generated by the human body during speech production to enable …

Harnessing the power of artificial intelligence in otolaryngology and the communication sciences

BS Wilson, DL Tucci, DA Moses, EF Chang… - Journal of the …, 2022 - Springer
Use of artificial intelligence (AI) is a burgeoning field in otolaryngology and the
communication sciences. A virtual symposium on the topic was convened from Duke …

Biosignal-based spoken communication: A survey

T Schultz, M Wand, T Hueber… - … on Audio, Speech …, 2017 - ieeexplore.ieee.org
Speech is a complex process involving a wide range of biosignals, including but not limited
to acoustics. These biosignals-stemming from the articulators, the articulator muscle …

Speech synthesis based on hidden Markov models

K Tokuda, Y Nankaku, T Toda, H Zen… - Proceedings of the …, 2013 - ieeexplore.ieee.org
This paper gives a general overview of hidden Markov model (HMM)-based speech
synthesis, which has recently been demonstrated to be very effective in synthesizing …

Modeling spectral envelopes using restricted Boltzmann machines and deep belief networks for statistical parametric speech synthesis

ZH Ling, L Deng, D Yu - IEEE transactions on audio, speech …, 2013 - ieeexplore.ieee.org
This paper presents a new spectral modeling method for statistical parametric speech
synthesis. In the conventional methods, high-level spectral parameters, such as mel-cepstra …

[PDF][PDF] Announcing the electromagnetic articulography (day 1) subset of the mngu0 articulatory corpus

K Richmond, P Hoole, S King - Twelfth Annual Conference of the …, 2011 - researchgate.net
This paper serves as an initial announcement of the availability of a corpus of articulatory
data called mngu0. This corpus will ultimately consist of a collection of multiple sources of …

A deep recurrent approach for acoustic-to-articulatory inversion

P Liu, Q Yu, Z Wu, S Kang, H Meng… - 2015 IEEE International …, 2015 - ieeexplore.ieee.org
To solve the acoustic-to-articulatory inversion problem, this paper proposes a deep
bidirectional long short term memory recurrent neural network and a deep recurrent mixture …

Analysis of speech production real-time MRI

V Ramanarayanan, S Tilsen, M Proctor, J Töger… - Computer Speech & …, 2018 - Elsevier
Recent advances in real-time magnetic resonance imaging (RT-MRI) have made it possible
to study the anatomy and dynamic motion of the vocal tract during speech production with …

[PDF][PDF] Low Resource Acoustic-to-articulatory Inversion Using Bi-directional Long Short Term Memory.

A Illa, PK Ghosh - Interspeech, 2018 - isca-archive.org
Estimating articulatory movements from speech acoustic features is known as acoustic-to-
articulatory inversion (AAI). Large amount of parallel data from speech and articulatory …

Deep architectures for articulatory inversion

B Uria, I Murray, S Renals… - INTERSPEECH 2012 13th …, 2012 - research.ed.ac.uk
We implement two deep architectures for the acoustic-articulatory inversion mapping
problem: a deep neural network and a deep trajectory mixture density network. We find that …