作者
Zhen-Hua Ling, Korin Richmond, Junichi Yamagishi
发表日期
2010/10/1
期刊
Speech Communication
卷号
52
期号
10
页码范围
834-846
出版商
North-Holland
简介
This paper presents an investigation into predicting the movement of a speaker’s mouth from text input using hidden Markov models (HMM). A corpus of human articulatory movements, recorded by electromagnetic articulography (EMA), is used to train HMMs. To predict articulatory movements for input text, a suitable model sequence is selected and a maximum-likelihood parameter generation (MLPG) algorithm is used to generate output articulatory trajectories. Unified acoustic-articulatory HMMs are introduced to integrate acoustic features when an acoustic signal is also provided with the input text. Several aspects of this method are analyzed in this paper, including the effectiveness of context-dependent modeling, the role of supplementary acoustic input, and the appropriateness of certain model structures for the unified acoustic-articulatory models. When text is the sole input, we find that fully context-dependent …
引用总数
201020112012201320142015201620172018201920202021202220232024137745656464121
学术搜索中的文章