Authors
Valtcho Valtchev, Julian J Odell, Philip C Woodland, Steve J Young
Publication date
1997/9/1
Journal
Speech Communication
Volume
22
Issue
4
Pages
303-314
Publisher
North-Holland
Description
This paper describes a framework for optimising the structure and parameters of a continuous density HMM-based large vocabulary recognition system using the Maximum Mutual Information Estimation (MMIE) criterion. To reduce the computational complexity of the MMIE training algorithm, confusable segments of speech are identified and stored as word lattices of alternative utterance hypotheses. An iterative mixture splitting procedure is also employed to adjust the number of mixture components in each state during training such that the optimal balance between the number of parameters and the available training data is achieved. Experiments are presented on various test sets from the Wall Street Journal database using up to 66 hours of acoustic training data. These demonstrate that the use of lattices makes MMIE training practicable for very complex recognition systems and large training sets. Furthermore …
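For orientation, the MMIE criterion named in the description is commonly written in the form below. This is the standard textbook statement, not a formula quoted from the paper, and every symbol is an assumption introduced here: \lambda for the HMM parameters, O_r for the r-th training utterance, w_r for its reference transcription, M_w for the composite model of word sequence w, and P(w) for the language model probability.

```latex
% Standard MMIE objective (notation assumed here, not taken from the paper)
\mathcal{F}_{\mathrm{MMIE}}(\lambda)
  = \sum_{r=1}^{R} \log
    \frac{p_{\lambda}(O_r \mid M_{w_r})\, P(w_r)}
         {\sum_{w} p_{\lambda}(O_r \mid M_{w})\, P(w)}
```

The denominator sums over competing word sequences, which is the expensive term for a large vocabulary. Under the lattice approach outlined in the description, that sum would be restricted to the alternative hypotheses stored in each utterance's word lattice.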
Total citations
[Chart: citations per year, 1998-2023]
Scholar articles
V Valtchev, JJ Odell, PC Woodland, SJ Young - Speech Communication, 1997