作者
Christopher J Leggetter, Philip C Woodland
发表日期
1995/4/1
期刊
Computer speech & language
卷号
9
期号
2
页码范围
171-185
出版商
Academic Press
简介
A method of speaker adaptation for continuous density hidden Markov models (HMMs) is presented. An initial speaker-independent system is adapted to improve the modelling of a new speaker by updating the HMM parameters. Statistics are gathered from the available adaptation data and used to calculate a linear regressionbased transformation for the mean vectors. The transformation matrices are calculated to maximize the likelihood of the adaptation data and can be implemented using the forward–backward algorithm. By tying the transformations among a number of distributions, adaptation can be performed for distributions which are not represented in the training data. An important feature of the method is that arbitrary adaptation data can be used—no special enrolment sentences are needed.
Experiments have been performed on the ARPA RM1 database using an HMM system with cross-word triphones and mixture Gaussian output distributions. Results show that adaptation can be performed using as little as 11 s of adaptation data, and that as more data is used the adaptation performance improves. For example, using 40 adaptation utterances, a 37% reduction in error from the speakerindependent system was achieved with supervised adaptation and a 32% reduction in unsupervised mode.
引用总数
19961997199819992000200120022003200420052006200720082009201020112012201320142015201620172018201920202021202220232024288360861161491361351661491841691561751671751822251661691408566553938252612