Incremental learning for end-to-end automatic speech recognition

L Fu, X Li, L Zi, Z Zhang, Y Wu, X He… - 2021 IEEE Automatic …, 2021 - ieeexplore.ieee.org
In this paper, we propose an incremental learning method for end-to-end Automatic Speech
Recognition (ASR) which enables an ASR system to perform well on new tasks while …

Synthasr: Unlocking synthetic data for speech recognition

A Fazel, W Yang, Y Liu, R Barra-Chicote… - arXiv preprint arXiv …, 2021 - arxiv.org
End-to-end (E2E) automatic speech recognition (ASR) models have recently demonstrated
superior performance over the traditional hybrid ASR models. Training an E2E ASR model …

A multi-task learning framework for overcoming the catastrophic forgetting in automatic speech recognition

J Xue, J Han, T Zheng, X Gao, J Guo - arXiv preprint arXiv:1904.08039, 2019 - arxiv.org
Recently, data-driven based Automatic Speech Recognition (ASR) systems have achieved
state-of-the-art results. And transfer learning is often used when those existing systems are …

End-to-end speech recognition with adaptive computation steps

M Li, M Liu, H Masanori - ICASSP 2019-2019 IEEE …, 2019 - ieeexplore.ieee.org
In this paper, we present Adaptive Computation Steps (ACS) algorithm, which enables end-
to-end speech recognition models to dynamically decide how many frames should be …

Towards lifelong learning of end-to-end ASR

HJ Chang, H Lee, L Lee - arXiv preprint arXiv:2104.01616, 2021 - arxiv.org
Automatic speech recognition (ASR) technologies today are primarily optimized for given
datasets; thus, any changes in the application environment (eg, acoustic conditions or topic …

Cross-language transfer learning and domain adaptation for end-to-end automatic speech recognition

J Luo, J Wang, N Cheng, E Xiao, J Xiao… - … on Multimedia and …, 2021 - ieeexplore.ieee.org
In this paper, we demonstrate the efficacy of transfer learning and continuous learning for
various automatic speech recognition (ASR) tasks using end-to-end models trained with …

Mutual-learning sequence-level knowledge distillation for automatic speech recognition

Z Li, Y Ming, L Yang, JH Xue - Neurocomputing, 2021 - Elsevier
Automatic speech recognition (ASR) is a crucial technology for man-machine interaction.
End-to-end models have been studied recently in deep learning for ASR. However, these …

Advanced long-context end-to-end speech recognition using context-expanded transformers

T Hori, N Moritz, C Hori, JL Roux - arXiv preprint arXiv:2104.09426, 2021 - arxiv.org
This paper addresses end-to-end automatic speech recognition (ASR) for long audio
recordings such as lecture and conversational speeches. Most end-to-end ASR models are …

Triggered attention for end-to-end speech recognition

N Moritz, T Hori, J Le Roux - ICASSP 2019-2019 IEEE …, 2019 - ieeexplore.ieee.org
A new system architecture for end-to-end automatic speech recognition (ASR) is proposed
that combines the alignment capabilities of the connectionist temporal classification (CTC) …

Improving hybrid ctc/attention end-to-end speech recognition with pretrained acoustic and language models

K Deng, S Cao, Y Zhang, L Ma - 2021 IEEE Automatic Speech …, 2021 - ieeexplore.ieee.org
Recently, self-supervised pretraining has achieved impressive results in end-to-end (E2E)
automatic speech recognition (ASR). However, the dominant sequence-to-sequence (S2S) …