Speech production knowledge in automatic speech recognition

S King, J Frankel, K Livescu, E McDermott… - The Journal of the …, 2007 - pubs.aip.org
Although much is known about how speech is produced, and research into speech
production has resulted in measured articulatory data, feature systems of different kinds, and …

[PDF][PDF] Encoding of articulatory kinematic trajectories in human speech sensorimotor cortex

J Chartier, GK Anumanchipalli, K Johnson, EF Chang - Neuron, 2018 - cell.com
When speaking, we dynamically coordinate movements of our jaw, tongue, lips, and larynx.
To investigate the neural mechanisms underlying articulation, we used direct cortical …

Harnessing the power of artificial intelligence in otolaryngology and the communication sciences

BS Wilson, DL Tucci, DA Moses, EF Chang… - Journal of the …, 2022 - Springer
Use of artificial intelligence (AI) is a burgeoning field in otolaryngology and the
communication sciences. A virtual symposium on the topic was convened from Duke …

Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model

T Toda, AW Black, K Tokuda - Speech communication, 2008 - Elsevier
In this paper, we describe a statistical approach to both an articulatory-to-acoustic mapping
and an acoustic-to-articulatory inversion mapping without using phonetic information. The …

IoT-enabled flood severity prediction via ensemble machine learning models

M Khalaf, H Alaskar, AJ Hussain, T Baker… - IEEE …, 2020 - ieeexplore.ieee.org
River flooding is a natural phenomenon that can have a devastating effect on human life and
economic losses. There have been various approaches in studying river flooding; however …

Operator-valued kernels for learning from functional response data

H Kadri, E Duflos, P Preux, S Canu… - Journal of Machine …, 2016 - jmlr.org
In this paper we consider the problems of supervised classification and regression in the
case where attributes and labels are functions: a data is represented by a set of functions …

Hybrid convolutional neural networks for articulatory and acoustic information based speech recognition

V Mitra, G Sivaraman, H Nam, C Espy-Wilson… - Speech …, 2017 - Elsevier
Studies have shown that articulatory information helps model speech variability and,
consequently, improves speech recognition performance. But learning speaker-invariant …

{V-Cloak}: Intelligibility-, Naturalness-& {Timbre-Preserving}{Real-Time} Voice Anonymization

J Deng, F Teng, Y Chen, X Chen, Z Wang… - 32nd USENIX Security …, 2023 - usenix.org
Voice data generated on instant messaging or social media applications contains unique
user voiceprints that may be abused by malicious adversaries for identity inference or …

A subject-independent acoustic-to-articulatory inversion

PK Ghosh, SS Narayanan - 2011 IEEE International …, 2011 - ieeexplore.ieee.org
Acoustic-to-articulatory inversion is usually done in a subject-dependent manner, ie, the
inversion procedure may not work well if the parallel acoustic and articulatory training data is …

A generalized smoothness criterion for acoustic-to-articulatory inversion

PK Ghosh, S Narayanan - The Journal of the Acoustical Society of …, 2010 - pubs.aip.org
The many-to-one mapping from representations in the speech articulatory space to acoustic
space renders the associated acoustic-to-articulatory inverse mapping non-unique. Among …