A multispeaker dataset of raw and reconstructed speech production real-time MRI video and 3D volumetric images

Y Lim, A Toutios, Y Bliesener, Y Tian, SG Lingala… - Scientific data, 2021 - nature.com
Real-time magnetic resonance imaging (RT-MRI) of human speech production is enabling
significant advances in speech science, linguistics, bio-inspired speech technology …

Multimodal segmentation for vocal tract modeling

R Jain, B Yu, P Wu, T Prabhune… - arXiv preprint arXiv …, 2024 - arxiv.org
Accurate modeling of the vocal tract is necessary to construct articulatory representations for
interpretable speech processing and linguistics. However, vocal tract modeling is …

Vocal tract contour tracking in rtMRI using deep temporal regression network

S Asadiabadi, E Erzin - IEEE/ACM Transactions on Audio …, 2020 - ieeexplore.ieee.org
Recent advances in real-time Magnetic Resonance Imaging (rtMRI) provide an invaluable
tool to study speech articulation. In this paper, we present an effective deep learning …

Bilinguals from Larynx to Lips: Exploring Bilingual Articulatory Strategies with Anatomic MRI Data

P Badin, TR Sawallis, M Tabain… - Language and …, 2024 - journals.sagepub.com
The goal of this article is to illustrate the use of MRI for exploring bi-and multi-lingual
articulatory strategies. One male and one female speaker recorded sets of static midsagittal …

[PDF][PDF] Air tissue boundary segmentation using regional loss in real-time Magnetic Resonance Imaging video for speech production.

A Roy, V Belagali, PK Ghosh - INTERSPEECH, 2022 - isca-archive.org
The SegNet model has been shown to provide the best performance in air-tissue boundary
(ATB) segmentation in real-time Magnetic Resonance Imaging (rtMRI) videos in seen …

Automatic segmentation of vocal tract articulators in real-time magnetic resonance imaging

V Ribeiro, K Isaieva, J Leclere, J Felblinger… - Computer Methods and …, 2024 - Elsevier
Abstract Background and Objectives The characterization of the vocal tract geometry during
speech interests various research topics, including speech production modeling, motor …

Realistic dynamic numerical phantom for MRI of the upper vocal tract

J Martin, M Ruthven, R Boubertakh, ME Miquel - Journal of Imaging, 2020 - mdpi.com
Dynamic and real-time MRI (rtMRI) of human speech is an active field of research, with
interest from both the linguistics and clinical communities. At present, different research …

A Review of Multi-modal Human Motion Recognition Based on Deep Learning

Y Li, Y Pan, X Wu - IJLAI Transactions on Science and Engineering, 2024 - ijlaitse.com
Human motion recognition is a research hotspot in the field of computer vision, which has a
wide range of applications, including biometrics, intelligent surveillance and human …

[PDF][PDF] DEEP ARTICULATORY SEGMENTATION AND SPEECH SYNTHESIS USING RT-MRI

B Yu, P Wu, R Jain, T Prabhune, GK Anumanchipalli - 2024 - eecs.berkeley.edu
Accurate modeling of the vocal tract is necessary to construct articulatory representations for
interpretable speech processing and linguistics. However, vocal tract modeling is …

[PDF][PDF] Air-Tissue Boundary Segmentation in Real Time Magnetic Resonance Imaging Video Using 3-D Convolutional Neural Network.

R Mannem, N Gaddam, PK Ghosh - INTERSPEECH, 2020 - interspeech2020.org
Abstract The real-time Magnetic Resonance Imaging (rtMRI) is often used for speech
production research as it captures the complete view of the vocal tract during speech. Air …