Semi Parametric Concatenative TTS with Instant Voice Modification Capabilities.

P Alku, T Murtola, J Malinen, J Kuortti, B Story… - Speech …, 2019 - Elsevier

Glottal inverse filtering (GIF) refers to technology to estimate the source of voiced speech,
the glottal flow, from speech signals. When a new GIF algorithm is proposed, its accuracy …

Cited by 46 · Related articles · All 9 versions


A comparison between STRAIGHT, glottal, and sinusoidal vocoding in statistical parametric speech synthesis

M Airaksinen, L Juvela, B Bollepalli… - … on Audio, Speech …, 2018 - ieeexplore.ieee.org

A vocoder is used to express a speech waveform with a controllable parametric
representation that can be converted back into a speech waveform. Vocoders representing …

Cited by 52 · Related articles · All 7 versions


Data Augmentation Improves Recognition of Foreign Accented Speech.

T Fukuda, R Fernandez, A Rosenberg, S Thomas… - Interspeech, 2018 - isca-archive.org

Speech recognition of foreign accented (non-native or L2) speech remains a challenge to
the state-of-the-art. The most common approach to address this scenario involves the …

Cited by 46 · Related articles · All 4 versions


Experimental Investigation of Acoustic Features to Optimize Intelligibility in Cochlear Implants

F Henry, A Parsi, M Glavin, E Jones - Sensors, 2023 - mdpi.com

Although cochlear implants work well for people with hearing impairment in quiet conditions,
it is well-known that they are not as effective in noisy environments. Noise reduction …

Cited by 4 · Related articles · All 7 versions


The role of vocal persona in natural and synthesized speech

C Noufi, L May, J Berger - 2023 IEEE 17th International …, 2023 - ieeexplore.ieee.org

The inclusion of voice persona in synthesized voice can be significant in a broad range of
human-computer-interaction (HCI) applications, including augmentative and assistive …

Cited by 4 · Related articles · All 6 versions


Testing the GlórCáil system in a speaker and affect voice transformation task

A Murphy, I Yanushevskaya, AN Chasaide… - Speech Prosody …, 2020 - isca-archive.org

This paper describes the results of a voice transformation task experiment conducted as part
of the evaluation of a speech synthesis system (the GlórCáil system, also described). The …

Cited by 11 · Related articles · All 5 versions


Global waveshape parameter R_d in signaling focal prominence: Perceptual salience in the absence of f₀ variation

I Yanushevskaya, A Murphy, C Gobl… - Frontiers in …, 2022 - frontiersin.org

This paper explores perceptual salience of voice source parameter manipulation in
signaling prominence in the absence of f₀ variation. Synthetic stimuli were generated based …

Cited by 3 · Related articles · All 2 versions


The Role of Voice Quality in the Perception of Prominence in Synthetic Speech.

A Murphy, I Yanushevskaya, AN Chasaide… - …, 2019 - researchgate.net

This paper explores how prominence can be modelled in speech synthesis through voice
quality variation. Synthetic utterances varying in voice quality (breathy, modal, tense) were …

Cited by 6 · Related articles · All 6 versions


Context, Perception, Production: A Model of Vocal Persona

C Noufi, L May, J Berger - PsyArXiv. July, 2023 - files.osf.io

We present a contextualized production-perception model of vocal persona based on a
deductive thematic analysis of interviews with voice and performance experts. The model …

Adding personality to neutral speech synthesis voices

CG Buchanan, MP Aylett, DA Braude - Speech and Computer: 20th …, 2018 - Springer

A synthetic voice personifies the system using it. Previous work has shown that using
sub-corpora with different voice qualities (e.g. tense and lax) can be used to modify the perceived …

Cited by 6 · Related articles · All 3 versions
