Flavored tacotron: Conditional learning for prosodic-linguistic features

文章

学术资源搜索

获得 4 条结果（用时0.02秒）

我的图书馆

Flavored tacotron: Conditional learning for prosodic-linguistic features

在引用文章中搜索

[PDF] ed.ac.uk

Liaison and pronunciation learning in end-to-end text-to-speech in French

J Taylor, S Le Maguer, K Richmond - The 11th ISCA Speech …, 2021 - research.ed.ac.uk

Abstract Sequence-to-sequence (S2S) TTS models like Tacotron have grapheme-only
inputs when trained fully end-to-end. Grapheme inputs map to phone sounds depending on …

被引用次数：10 相关文章所有 5 个版本

[PDF] isca-archive.org

[PDF][PDF] Can Prosody Transfer Embeddings be Used for Prosody Assessment?

M Juliao, A Abad, H Moniz - Proc. Speech Prosody 2022, 2022 - isca-archive.org

In voice conversion, it is possible to transfer some characteristic components of a (target)
speech utterance, such as the content, pitch, or speaker identity, from the corresponding …

被引用次数：3 相关文章所有 4 个版本

[PDF] purdue.edu

Believe in the Sound You See: The Effects of Body Type and Voice Pitch on the Perceived Audio-Visual Correspondence and Believability of Virtual Characters

L Lam - 2023 - hammer.purdue.edu

Lam, Luchcha. MS, Purdue University, May 2023. Believe in the Sound You See: The Effects
of Body Type and Voice Pitch on the Perceived Audio-Visual Correspondence and …

[PDF][PDF] INVESTIGATING EMOTION EMBEDDING BASED TEXT-TO-SPEECH MODELS UNDER LIMITED TRAINING DATA

H PIJPELINK - arno.uvt.nl

This thesis studies the effect of limiting training data on emotion embedding models for Text-
to-Speech (TTS) systems. In order to reproduce natural human prosody, TTS models use …

高级搜索

QQ 群

Flavored tacotron: Conditional learning for prosodic-linguistic features

Liaison and pronunciation learning in end-to-end text-to-speech in French

[PDF][PDF] Can Prosody Transfer Embeddings be Used for Prosody Assessment?

Believe in the Sound You See: The Effects of Body Type and Voice Pitch on the Perceived Audio-Visual Correspondence and Believability of Virtual Characters

[PDF][PDF] INVESTIGATING EMOTION EMBEDDING BASED TEXT-TO-SPEECH MODELS UNDER LIMITED TRAINING DATA

引用