Leveraging pre-trained representations to improve access to untranscribed speech from endangered languages

N San, M Bartelds, M Browne, L Clifford… - 2021 IEEE Automatic …, 2021 - ieeexplore.ieee.org
N San, M Bartelds, M Browne, L Clifford, F Gibson, J Mansfield, D Nash, J Simpson
2021 IEEE Automatic Speech Recognition and Understanding Workshop …, 2021ieeexplore.ieee.org
Pre-trained speech representations like wav2vec 2.0 are a powerful tool for automatic
speech recognition (ASR). Yet many endangered languages lack sufficient data for pre-
training such models, or are predominantly oral vernaculars without a standardised writing
system, precluding fine-tuning. Query-by-example spoken term detection (QbE-STD) offers
an alternative for iteratively indexing untranscribed speech corpora by locating spoken
query terms. Using data from 7 Australian Aboriginal languages and a regional variety of …
Pre-trained speech representations like wav2vec 2.0 are a powerful tool for automatic speech recognition (ASR). Yet many endangered languages lack sufficient data for pre-training such models, or are predominantly oral vernaculars without a standardised writing system, precluding fine-tuning. Query-by-example spoken term detection (QbE-STD) offers an alternative for iteratively indexing untranscribed speech corpora by locating spoken query terms. Using data from 7 Australian Aboriginal languages and a regional variety of Dutch, all of which are endangered or vulnerable, we show that QbE-STD can be improved by leveraging representations developed for ASR (wav2vec 2.0: the English monolingual model and XLSR53 multilingual model). Surprisingly, the English model outperformed the multilingual model on 4 Australian language datasets, raising questions around how to optimally leverage self-supervised speech representations for QbE-STD. Nevertheless, we find that wav2vec 2.0 representations (either English or XLSR53) offer large improvements (56-86% relative) over state-of-the-art approaches on our endangered language datasets.
ieeexplore.ieee.org
以上显示的是最相近的搜索结果。 查看全部搜索结果