Creating synthetic voices for children by adapting adult average voice using stacked transformati...

A situational analysis of current speech-synthesis systems for child voices: A scoping review of qualitative and quantitative evidence

C Terblanche, M Harty, M Pascoe, BV Tucker - Applied Sciences, 2022 - mdpi.com

(1) Background: Speech synthesis has customarily focused on adult speech, but with the
rapid development of speech-synthesis technology, it is now possible to create child voices …

被引用次数：10 相关文章所有 9 个版本

[PDF] isca-archive.org

[PDF][PDF] Objective evaluation measures for speaker-adaptive HMM-TTS systems

U Remes, R Karhila, M Kurimo - Eighth ISCA Workshop on Speech …, 2013 - isca-archive.org

This paper investigates using objective quality measures to evaluate speaker adaptation
performance in HMM-based speech synthesis. We compare several objective measures to …

被引用次数：20 相关文章所有 5 个版本

[PDF] epfl.ch

Combining vocal tract length normalization with hierarchical linear transformations

L Saheer, J Yamagishi, PN Garner… - IEEE Journal of …, 2013 - ieeexplore.ieee.org

Recent research has demonstrated the effectiveness of vocal tract length normalization
(VTLN) as a rapid adaptation technique for statistical parametric speech synthesis. VTLN …

被引用次数：15 相关文章所有 17 个版本

[PDF] isca-archive.org

[PDF][PDF] Controlling formant frequencies with neural text-to-speech for the manipulation of perceived speaker age

Z Khan, L Wihlborg, C Valentini-Botinhao… - Proc. Interspeech …, 2023 - isca-archive.org

In this paper, we present a framework for formant-controllable neural text-to-speech. We
train a model that predicts formant frequencies which then condition melspectrogram …

[PDF][PDF] Expressive speech synthesis in human interaction

É Székely - 2015 - researchgate.net

When a synthetic voice represents a human being, its capability to facilitate social interaction
becomes paramount. Extralinguistic aspects of the synthetic voice, such as age, gender …

被引用次数：3 相关文章

[PDF] yok.gov.tr

Speaker adaptation with minimal data in statistical speech synthesis systems

A Mohammadi - 2014 - acikbilim.yok.gov.tr

Statistical speech synthesis (SSS) systems have the ability to adapt to a target speaker with
a couple of minutes of adaptation data. Developing adaptation algorithms to further reduce …

被引用次数：1 相关文章所有 2 个版本

[引用][C] Voice conversion for dubbing using linear predictive coding and hidden markov model

FM Mukhneri, I Wijayanto, S Hadiyoso - Journal of Southwest Jiaotong University, 2020

被引用次数：13 相关文章所有 2 个版本

[PDF] uef.fi

[PDF][PDF] Perceptual and acoustic similarities between the voices of family members: an approach to synthesize a voice based on family-shared f0 characteristics

E Rykova - 2018 - erepo.uef.fi

Human voice provides the means for verbal communication and forms a part of personal
identity. Unfortunately, not every individual can produce speech output. In clinical …

[引用][C] 线性预测编码和隐马尔可夫模型的配音语音转换

V CONVERSION - 西南交通大学学报, 2020

高级搜索

QQ 群