Semi-supervised learning for acoustic model retraining: Handling speech data with noisy transcript

A Madan, A Khopkar, S Nadig… - 2020 International …, 2020 - ieeexplore.ieee.org
A Madan, A Khopkar, S Nadig, KMS Raghavan, D Eledath, V Ramasubramanian
2020 International Conference on Signal Processing and …, 2020ieeexplore.ieee.org
We address the problem of retraining a seed acoustic model from a large corpus which is
associated with noisy labeling. We propose a forced-alignment likelihood and fuzzy string
matching score based iterative selection of the corpus data to retrain the acoustic model in
an order of increasing degree of noise in the transcript, yielding a succession of enhanced
acoustic models, offering progressively lower error rates on an held-out test data. We show
results in terms of PER (phoneme-error-rate) on a large broadcast news data from a national …
We address the problem of retraining a seed acoustic model from a large corpus which is associated with noisy labeling. We propose a forced-alignment likelihood and fuzzy string matching score based iterative selection of the corpus data to retrain the acoustic model in an order of increasing degree of noise in the transcript, yielding a succession of enhanced acoustic models, offering progressively lower error rates on an held-out test data. We show results in terms of PER (phoneme-error-rate) on a large broadcast news data from a national broadcast network containing multiple languages of transcribed-speech, demonstrating the strong utility of such an approach for training of acoustic models from noisy-transcript.
ieeexplore.ieee.org
以上显示的是最相近的搜索结果。 查看全部搜索结果