[HTML] Speaker discrimination performance for “easy” versus “hard” voices in style-matched and -mismatched speech

A Afshan, J Kreiman, A Alwan - The Journal of the Acoustical Society of …, 2022 - pubs.aip.org
This study compares human speaker discrimination performance for read speech versus
casual conversations and explores differences between unfamiliar voices that are “easy” …

[HTML] Acoustic voice variation in spontaneous speech

Y Lee, J Kreiman - The Journal of the Acoustical Society of America, 2022 - pubs.aip.org
This study replicates and extends the recent findings of Lee, Keating, and Kreiman [J.
Acoust. Soc. Am. 146 (3), 1568–1579 (2019)] on acoustic voice variation in read speech …

[HTML] Towards understanding speaker discrimination abilities in humans and machines for text-independent short utterances of different speech styles

SJ Park, G Yeung, N Vesselinova, J Kreiman… - The Journal of the …, 2018 - pubs.aip.org
Little is known about human and machine speaker discrimination ability when utterances
are very short and the speaking style is variable. This study compares text-independent …

An integrated approach for teaching speech spectrogram analysis to engineering students

A Johnson - The Journal of the Acoustical Society of America, 2022 - pubs.aip.org
Spectrogram analysis is a vital skill for learning speech acoustics. Spectrograms are
necessary for visualizing cause-effect relationships between speech articulator movements …

Target and non-target speaker discrimination by humans and machines

SJ Park, A Afshan, J Kreiman… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org
The manner in which acoustic features contribute to perceiving speaker identity remains
unclear. In an attempt to better understand speaker perception, we investigated human and …

Attention-based conditioning methods using variable frame rate for style-robust speaker verification

A Afshan, A Alwan - arXiv preprint arXiv:2206.13680, 2022 - arxiv.org
We propose an approach to extract speaker embeddings that are robust to speaking style
variations in text-independent speaker verification. Typically, speaker embedding extraction …

Comparing human and machine's use of coarticulatory vowel nasalization for linguistic classification

G Zellou, L Kim, C Gendrot - The Journal of the Acoustical Society of …, 2024 - pubs.aip.org
Anticipatory coarticulation is a highly informative cue to upcoming linguistic information:
listeners can identify that the word is ben and not bed by hearing the vowel alone. The …

Learning from human perception to improve automatic speaker verification in style-mismatched conditions

A Afshan, A Alwan - arXiv preprint arXiv:2206.13684, 2022 - arxiv.org
Our prior experiments show that humans and machines seem to employ different
approaches to speaker discrimination, especially in the presence of speaking style …

[PDF] Linguistic versus biological factors governing acoustic voice variation

Y Lee, J Kreiman - Interspeech Proceedings, 2022 - par.nsf.gov
This study presents a cross-linguistic investigation of acoustic voice spaces in English,
Seoul Korean, and White Hmong, which differ in whether they phonologically contrast …

[BOOK] Speaking style variability in speaker discrimination by humans and machines

A Afshan - 2022 - search.proquest.com
A speaker's voice constantly varies in everyday situations, such as when talking to a friend,
reading aloud, talking to pets, or narrating a happy incident. These changes in speaking …