Authors
Piotr Kozierski, Talar Sadalla, Szymon Drgas, Adam Dąbrowski, Joanna Zietkiewicz
Publication date
2017/8/28
Conference
2017 22nd International Conference on Methods and Models in Automation and Robotics (MMAR)
Pages
616-621
Publisher
IEEE
Description
This article presents a study of automatic recognition of whispery speech. The research used a new corpus of whispery speech. The aim was to check how vocabulary size and language model order influence speech recognition quality. The authors conclude that a large vocabulary continuous speech recognition (LVCSR) model can be prepared even from recordings containing only 5,000 different words, and that a third-order language model is the best choice. The difference between normal and whispery speech was found to be small, appearing only as a higher word error rate (about 1.5 times higher for whispery speech).