作者
Joe Frankel, Korin Richmond, Simon King, Paul Taylor
发表日期
2000
出版商
International Speech Communication Association
简介
We describe a speech recognition system which uses articulatory parameters as basic features and phone-dependent linear dynamic models. The system first estimates articulatory trajectories from the speech signal. Estimations of x and y coordinates of 7 actual articulator positions in the midsagittal plane are produced every 2 milliseconds by a recurrent neural network, trained on real articulatory data. The output of this network is then passed to a set of linear dynamic models, which perform phone recognition
引用总数
20002001200220032004200520062007200820092010201120122013201420152016201720182019202020212022163271143933523225231