Online continual learning of end-to-end speech recognition models

M Yang, I Lane, S Watanabe - arXiv preprint arXiv:2207.05071, 2022 - arxiv.org
Continual Learning, also known as Lifelong Learning, aims to continually learn from new
data as it becomes available. While prior research on continual learning in automatic …

Towards lifelong learning of end-to-end ASR

HJ Chang, H Lee, L Lee - arXiv preprint arXiv:2104.01616, 2021 - arxiv.org
Automatic speech recognition (ASR) technologies today are primarily optimized for given
datasets; thus, any changes in the application environment (eg, acoustic conditions or topic …

Alternative pseudo-labeling for semi-supervised automatic speech recognition

H Zhu, D Gao, G Cheng, D Povey… - … /ACM Transactions on …, 2023 - ieeexplore.ieee.org
When labeled data is insufficient, pseudo-labeling based semi-supervised learning can
significantly improve the performance of automatic speech recognition. However, pseudo …

Using adapters to overcome catastrophic forgetting in end-to-end automatic speech recognition

S Vander Eeckt, H Van Hamme - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
Learning a set of tasks in sequence remains a challenge for artificial neural networks, which,
in such scenarios, tend to suffer from Catastrophic Forgetting (CF). The same applies to End …

Blockchain and Machine Learning for Fraud Detection: A Privacy-Preserving and Adaptive Incentive Based Approach

TH Pranto, KTAM Hasib, T Rahman, AB Haque… - IEEE …, 2022 - ieeexplore.ieee.org
Financial fraud cases are on the rise even with the current technological advancements.
Due to the lack of inter-organization synergy and because of privacy concerns, authentic …

A multi-tasking model of speaker-keyword classification for keeping human in the loop of drone-assisted inspection

Y Li, A Parsan, B Wang, P Dong, S Yao… - Engineering Applications of …, 2023 - Elsevier
Audio commands are a preferred communication medium to keep inspectors in the loop of
civil infrastructure inspection performed by a semi-autonomous drone. To understand job …

Continual learning for monolingual end-to-end automatic speech recognition

S Vander Eeckt, H Van Hamme - 2022 30th European Signal …, 2022 - ieeexplore.ieee.org
Adapting Automatic Speech Recognition (ASR) models to new domains results in a
deterioration of performance on the original domain (s), a phenomenon called Catastrophic …

Ufo2: A unified pre-training framework for online and offline speech recognition

L Fu, S Li, Q Li, L Deng, F Li, L Fan… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
In this paper, we propose a Unified pre-training Framework for Online and Offline (UFO2)
Automatic Speech Recognition (ASR), which 1) simplifies the two separate training …

Updating only encoders prevents catastrophic forgetting of end-to-end ASR models

Y Takashima, S Horiguchi, S Watanabe… - arXiv preprint arXiv …, 2022 - arxiv.org
In this paper, we present an incremental domain adaptation technique to prevent
catastrophic forgetting for an end-to-end automatic speech recognition (ASR) model …

[PDF][PDF] Scala: Supervised contrastive learning for end-to-end automatic speech recognition

L Fu, X Li, R Wang, Z Zhang, Y Wu, X He… - arXiv preprint arXiv …, 2021 - researchgate.net
ABSTRACT End-to-end Automatic Speech Recognition (ASR) models are usually trained to
reduce the losses of the whole token sequences, while neglecting explicit phonemic …