OPENGLOT–An open environment for the evaluation of glottal inverse filtering

P Alku, T Murtola, J Malinen, J Kuortti, B Story… - Speech …, 2019 - Elsevier
Glottal inverse filtering (GIF) refers to technology to estimate the source of voiced speech,
the glottal flow, from speech signals. When a new GIF algorithm is proposed, its accuracy …

A comparison between straight, glottal, and sinusoidal vocoding in statistical parametric speech synthesis

M Airaksinen, L Juvela, B Bollepalli… - … on Audio, Speech …, 2018 - ieeexplore.ieee.org
A vocoder is used to express a speech waveform with a controllable parametric
representation that can be converted back into a speech waveform. Vocoders representing …

[PDF][PDF] Data Augmentation Improves Recognition of Foreign Accented Speech.

T Fukuda, R Fernandez, A Rosenberg, S Thomas… - Interspeech, 2018 - isca-archive.org
Speech recognition of foreign accented (non-native or L2) speech remains a challenge to
the state-of-the-art. The most common approach to address this scenario involves the …

Experimental Investigation of Acoustic Features to Optimize Intelligibility in Cochlear Implants

F Henry, A Parsi, M Glavin, E Jones - Sensors, 2023 - mdpi.com
Although cochlear implants work well for people with hearing impairment in quiet conditions,
it is well-known that they are not as effective in noisy environments. Noise reduction …

The role of vocal persona in natural and synthesized speech

C Noufi, L May, J Berger - 2023 IEEE 17th International …, 2023 - ieeexplore.ieee.org
The inclusion of voice persona in synthesized voice can be significant in a broad range of
human-computer-interaction (HCI) applications, including augmentative and assistive …

[PDF][PDF] Testing the GlórCáil system in a speaker and affect voice transformation task

A Murphy, I Yanushevskaya, AN Chasaide… - Speech Prosody …, 2020 - isca-archive.org
This paper describes the results of a voice transformation task experiment conducted as part
of the evaluation of a speech synthesis system (the GlórCáil system, also described). The …

Global waveshape parameter Rd in signaling focal prominence: Perceptual salience in the absence of f0 variation

I Yanushevskaya, A Murphy, C Gobl… - Frontiers in …, 2022 - frontiersin.org
This paper explores perceptual salience of voice source parameter manipulation in
signaling prominence in the absence of f 0 variation. Synthetic stimuli were generated based …

[PDF][PDF] The Role of Voice Quality in the Perception of Prominence in Synthetic Speech.

A Murphy, I Yanushevskaya, AN Chasaide… - …, 2019 - researchgate.net
This paper explores how prominence can be modelled in speech synthesis through voice
quality variation. Synthetic utterances varying in voice quality (breathy, modal, tense) were …

[PDF][PDF] Context, Perception, Production: A Model of Vocal Persona

C Noufi, L May, J Berger - PsyArXiv. July, 2023 - files.osf.io
We present a contextualized production-perception model of vocal persona based on a
deductive thematic analysis of interviews with voice and performance experts. The model …

Adding personality to neutral speech synthesis voices

CG Buchanan, MP Aylett, DA Braude - Speech and Computer: 20th …, 2018 - Springer
A synthetic voice personifies the system using it. Previous work has shown that using sub-
corpora with different voice qualities (eg tense and lax) can be used to modify the perceived …