Effects of phoneme type and frequency on distributed speaker identification and verification

Authors

Mohamed Abdel Fattah, Fuji Ren, Shingo Kuroiwa

Publication date

2006/5/1

Journal

IEICE transactions on information and systems

Volume

Issue

Pages

1712-1719

Publisher

The Institute of Electronics, Information and Communication Engineers

Description

In the European Telecommunications Standards Institute (ETSI) Distributed Speech Recognition (DSR) front-end, the distortion introduced by feature compression on the front-end side increases the variance flooring effect, which in turn increases the identification error rate. The penalty incurred in reducing the bit rate is a degradation in speaker recognition performance. In this paper, we present a nontraditional solution to this problem. To reduce the bit rate, a speech signal is segmented at the client, and the most effective phonemes for speaker recognition (determined according to their type and frequency) are selected and sent to the server. Speaker recognition occurs at the server. Applying this approach to the YOHO corpus, we achieved an identification error rate (ER) of 0.05% using an average segment of 20.4% for a testing utterance in a speaker identification task. We also achieved an …
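
The client-side selection step described in the abstract can be pictured with a minimal sketch. It is an illustration only, not the authors' implementation: the `Segment` type, the `select_segments` function, the per-phoneme scores, and the greedy duration-budget rule are all assumptions made for this example. The abstract states only that phonemes are chosen according to their type and frequency.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """One phoneme-labelled span of an utterance (hypothetical type)."""
    phoneme: str   # label from a client-side segmenter, assumed to exist
    start_ms: int  # span start, in milliseconds
    end_ms: int    # span end, in milliseconds

    @property
    def duration_ms(self) -> int:
        return self.end_ms - self.start_ms


def select_segments(segments, scores, budget_fraction):
    """Greedily keep the highest-scoring segments within a duration budget.

    `scores` maps a phoneme label to an assumed usefulness score for
    speaker recognition; `budget_fraction` is the share of the total
    utterance duration that may be sent to the server.
    """
    total = sum(s.duration_ms for s in segments)
    budget = budget_fraction * total
    # Rank by score, highest first; unknown phonemes get score 0.
    ranked = sorted(segments, key=lambda s: scores.get(s.phoneme, 0.0),
                    reverse=True)
    chosen, used = [], 0
    for seg in ranked:
        if used + seg.duration_ms <= budget:
            chosen.append(seg)
            used += seg.duration_ms
    # Restore temporal order before transmission.
    return sorted(chosen, key=lambda s: s.start_ms)


# Made-up example data: five segments covering 700 ms.
segments = [
    Segment("s", 0, 100),
    Segment("a", 100, 300),
    Segment("n", 300, 400),
    Segment("i", 400, 600),
    Segment("t", 600, 700),
]
scores = {"a": 0.9, "i": 0.8, "n": 0.6, "s": 0.2, "t": 0.1}  # placeholder values

kept = select_segments(segments, scores, budget_fraction=0.5)
print([s.phoneme for s in kept])
```

With these made-up scores and a 50% budget, only part of the utterance would be forwarded to the server, which is the bit-rate saving the abstract refers to. How the real scores are derived is not given in the text shown here.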

Total citations

Cited by 17

[Chart: citations per year, 2006 to 2024; bar values in order: 1, 3, 3, 2, 1, 1, 3, 1, 1, 1]

Scholar articles

Effects of phoneme type and frequency on distributed speaker identification and verification

MA Fattah, F Ren, S Kuroiwa - IEICE transactions on information and systems, 2006

Cited by 17 · Related articles · All 9 versions