Self-supervised learning based domain regularization for mask-wearing speaker verification

R Zhang, J Wei, X Lu, W Lu, D Jin, L Zhang, Y Ji… - Speech …, 2023 - Elsevier
Automatic speaker verification (ASV) faces an unprecedented problem due to mask-wearing
speakers, a consequence of COVID-19. Masked speakers unconsciously alter their normal …

Advances in vocal tract imaging and analysis

A Toutios, D Byrd, L Goldstein… - … Routledge handbook of …, 2019 - taylorfrancis.com
A long-standing challenge in speech research is obtaining accurate information about the
movement and shaping of the vocal tract. Dynamic vocal tract imaging data, recorded in real …

Morphological characteristics of male and female hypopharynx: A magnetic resonance imaging-based study

J Zhang, K Honda, J Wei, T Kitamura - The Journal of the Acoustical …, 2019 - pubs.aip.org
Studies with three-dimensional (3D) vocal tract visualization using magnetic resonance
imaging (MRI) have suggested that hypopharyngeal cavities, i.e., laryngeal cavity and …

Retrieving vocal-tract resonance and anti-resonance from high-pitched vowels using a rahmonic subtraction technique

Z Zhang, K Honda, J Wei - ICASSP 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org
Vocal tract resonances give rise to core spectral information of speech signals. Linear
prediction and cepstral methods are widely used for this purpose. However, both …

Ultrasound- and MRI-based Speech Synthesis Applying Neural Networks

R Trencsenyi, L Czap - 2024 25th International Carpathian …, 2024 - ieeexplore.ieee.org
Starting from 2D dynamic ultrasound and MRI sources recording the movement of the vocal
organs and the speech signal of the speaker in a simultaneous and synchronised manner …

Modeling voiced stop consonants using the 3D dynamic digital waveguide mesh vocal tract model

AJ Gully, B Tucker - Proceedings of the International Congress of …, 2019 - pure.york.ac.uk
ABSTRACT Three-dimensional (3D) acoustic simulations of the vocal tract are showing
significant promise for the study of speech acoustics. Recent models have demonstrated …

Resonance Tuning in Professional Operatic Sopranos

R Vos - 2018 - etheses.whiterose.ac.uk
Soprano singers are capable of singing at pitches exceeding 1000 Hz, where the spacing of
the harmonics means that the vocal tract resonances are not fully utilised. Sopranos …

Research on Modeling of Vocal State Duration Based on Spectrogram Analysis

X Zhang - E3S Web of Conferences, 2021 - e3s-conferences.org
In the early stage of vocal music education, students generally do not understand the
structure of the human body, and have doubts about how to pronounce their voices …

Acoustic Analysis of the Open End Effect Using Solid Vocal Tract Models Constructed from MRI Data during Vowel Production

Z Zhang, J Wei, J Zhang… - 2018 5th International …, 2018 - ieeexplore.ieee.org
This paper investigates acoustic effect of the vocal-tract open end to examine whether the
open-end correction coefficient known to date is adequate for modeling the vocal tracts with …