Experimental and theoretical advances in prosody: A review

M Wagner, DG Watson - Language and cognitive processes, 2010 - Taylor & Francis
Research on prosody has recently become an important focus in various disciplines,
including Linguistics, Psychology, and Computer Science. This article reviews recent …

A prosody tutorial for investigators of auditory sentence processing

S Shattuck-Hufnagel, AE Turk - Journal of psycholinguistic research, 1996 - Springer
In this tutorial we present evidence that, because syntax does not fully predict the way that
spoken utterances are organized, prosody is a significant issue for studies of auditory …

A survey on neural speech synthesis

X Tan, T Qin, F Soong, TY Liu - arXiv preprint arXiv:2106.15561, 2021 - arxiv.org
Text to speech (TTS), or speech synthesis, which aims to synthesize intelligible and natural
speech given text, is a hot research topic in speech, language, and machine learning …

Style tokens: Unsupervised style modeling, control and transfer in end-to-end speech synthesis

Y Wang, D Stanton, Y Zhang… - International …, 2018 - proceedings.mlr.press
In this work, we propose “global style tokens”(GSTs), a bank of embeddings that are jointly
trained within Tacotron, a state-of-the-art end-to-end speech synthesis system. The …

Towards end-to-end prosody transfer for expressive speech synthesis with tacotron

RJ Skerry-Ryan, E Battenberg, Y Xiao… - international …, 2018 - proceedings.mlr.press
We present an extension to the Tacotron speech synthesis architecture that learns a latent
embedding space of prosody, derived from a reference acoustic representation containing …

[图书][B] Intonational phonology

DR Ladd - 2008 - books.google.com
This second edition presents a completely revised overview of research on intonational
phonology since the 1970s, including new material on research developments since the mid …

[PDF][PDF] Speech and language processing

D Jurafsky - 2000 - dcs.bbk.ac.uk
" This book is an absolute necessity for instructors at all levels, as well as an indispensible
reference for researchers. Introducing NLP, computational linguistics, and speech …

Emotion recognition in human-computer interaction

R Cowie, E Douglas-Cowie… - IEEE Signal …, 2001 - ieeexplore.ieee.org
Two channels have been distinguished in human interaction: one transmits explicit
messages, which may be about anything or nothing; the other transmits implicit messages …

[引用][C] The phonology of tone and intonation

C Gussenhoven - 2004 - books.google.com
Tone and Intonation are two types of pitch variation, which are used by speakers of all
languages in order to give shape to utterances. More specifically, tone encodes segments …

[PDF][PDF] V-measure: A conditional entropy-based external cluster evaluation measure

A Rosenberg, J Hirschberg - … of the 2007 joint conference on …, 2007 - aclanthology.org
We present V-measure, an external entropybased cluster evaluation measure. V-measure
provides an elegant solution to many problems that affect previously defined cluster …