作者
Joe Frankel, Korin Richmond, Simon King, Paul Taylor
发表日期
2000
出版商
International Speech Communication Association
简介
We describe a speech recognition system which uses articulatory parameters as basic features and phone-dependent linear dynamic models. The system first estimates articulatory trajectories from the speech signal. Estimations of x and y coordinates of 7 actual articulator positions in the midsagittal plane are produced every 2 milliseconds by a recurrent neural network, trained on real articulatory data. The output of this network is then passed to a set of linear dynamic models, which perform phone recognition
引用总数
学术搜索中的文章
J Frankel, K Richmond, S King, P Taylor - 2000