作者
Hamid Sheikhzadeh, Li Deng
发表日期
1994/1
期刊
IEEE Transactions on Speech and Audio Processing
卷号
2
期号
1
页码范围
80-89
出版商
IEEE
简介
The authors describe a novel approach to speech recognition by directly modeling the statistical characteristics of the speech waveforms. This approach allows them to remove the need for using speech preprocessors, which conventionally serve a role of converting speech waveforms into frame-based speech data subject to a subsequent modeling process. Central to their method is the representation of the speech waveforms as the output of a time-varying filter excited by a Gaussian source time-varying in its power. In order to formulate a speech recognition algorithm based on this representation, the time variation in the characteristics of the filter and of the excitation source is described in a compact and parametric form of the Markov chain. They analyze in detail the comparative roles played by the filter modeling and by the source modeling in speech recognition performance. Based on the result of the analysis …
引用总数
19931994199519961997199819992000200120022003200420052006200720082009201020112012201320142015201620172018201920202021202220231235315211526312576638111111