Use of PLP cepstral features for phonetic segmentation

BB Vachhani, HA Patil - 2013 International Conference on …, 2013 - ieeexplore.ieee.org
2013 International Conference on Asian Language Processing, 2013ieeexplore.ieee.org
Phonetic segmentation can find its potential application for Text-to-Speech (TTS) synthesis
and Automatic Speech Recognition (ASR) systems. In this paper, we propose use of
Perceptual Linear Prediction Cepstral Coefficients (PLPCC) feature for phonetic
segmentation task. To detect phonetic boundaries, we used spectral transition measure
(STM). Using proposed approach, we achieve 85%(ie, 3% better than state-of-the art Mel-
frequency Cepstral Coefficients (MFCC) for 20 ms agreement duration) accuracy and 15 …
Phonetic segmentation can find its potential application for Text-to-Speech (TTS) synthesis and Automatic Speech Recognition (ASR) systems. In this paper, we propose use of Perceptual Linear Prediction Cepstral Coefficients (PLPCC) feature for phonetic segmentation task. To detect phonetic boundaries, we used spectral transition measure (STM). Using proposed approach, we achieve 85 % (i.e., 3 % better than state-of-the art Mel-frequency Cepstral Coefficients (MFCC) for 20 ms agreement duration) accuracy and 15 % over-segmentation rate (i.e., 8 % less than MFCC) for automatic boundary detection of 2, 34, 925 phone boundaries corresponding 630 speakers of entire TIMIT database.
ieeexplore.ieee.org
以上显示的是最相近的搜索结果。 查看全部搜索结果

Google学术搜索按钮

example.edu/paper.pdf
搜索
获取 PDF 文件
引用
References