Incremental learning for end-to-end automatic speech recognition

M Yang, I Lane, S Watanabe - arXiv preprint arXiv:2207.05071, 2022 - arxiv.org

Continual Learning, also known as Lifelong Learning, aims to continually learn from new
data as it becomes available. While prior research on continual learning in automatic …

被引用次数：21 相关文章所有 9 个版本

[PDF] arxiv.org

Towards lifelong learning of end-to-end ASR

HJ Chang, H Lee, L Lee - arXiv preprint arXiv:2104.01616, 2021 - arxiv.org

Automatic speech recognition (ASR) technologies today are primarily optimized for given
datasets; thus, any changes in the application environment (eg, acoustic conditions or topic …

被引用次数：34 相关文章所有 8 个版本

[PDF] arxiv.org

Alternative pseudo-labeling for semi-supervised automatic speech recognition

H Zhu, D Gao, G Cheng, D Povey… - … /ACM Transactions on …, 2023 - ieeexplore.ieee.org

When labeled data is insufficient, pseudo-labeling based semi-supervised learning can
significantly improve the performance of automatic speech recognition. However, pseudo …

被引用次数：3 相关文章所有 4 个版本

[PDF] kuleuven.be

Using adapters to overcome catastrophic forgetting in end-to-end automatic speech recognition

S Vander Eeckt, H Van Hamme - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org

Learning a set of tasks in sequence remains a challenge for artificial neural networks, which,
in such scenarios, tend to suffer from Catastrophic Forgetting (CF). The same applies to End …

被引用次数：17 相关文章所有 6 个版本

[PDF] ieee.org

Blockchain and Machine Learning for Fraud Detection: A Privacy-Preserving and Adaptive Incentive Based Approach

TH Pranto, KTAM Hasib, T Rahman, AB Haque… - IEEE …, 2022 - ieeexplore.ieee.org

Financial fraud cases are on the rise even with the current technological advancements.
Due to the lack of inter-organization synergy and because of privacy concerns, authentic …

被引用次数：8 相关文章所有 6 个版本

[PDF] sciencedirect.com

A multi-tasking model of speaker-keyword classification for keeping human in the loop of drone-assisted inspection

Y Li, A Parsan, B Wang, P Dong, S Yao… - Engineering Applications of …, 2023 - Elsevier

Audio commands are a preferred communication medium to keep inspectors in the loop of
civil infrastructure inspection performed by a semi-autonomous drone. To understand job …

被引用次数：4 相关文章所有 6 个版本

[PDF] kuleuven.be

Continual learning for monolingual end-to-end automatic speech recognition

S Vander Eeckt, H Van Hamme - 2022 30th European Signal …, 2022 - ieeexplore.ieee.org

Adapting Automatic Speech Recognition (ASR) models to new domains results in a
deterioration of performance on the original domain (s), a phenomenon called Catastrophic …

被引用次数：12 相关文章所有 4 个版本

[PDF] arxiv.org

Ufo2: A unified pre-training framework for online and offline speech recognition

L Fu, S Li, Q Li, L Deng, F Li, L Fan… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org

In this paper, we propose a Unified pre-training Framework for Online and Offline (UFO2)
Automatic Speech Recognition (ASR), which 1) simplifies the two separate training …

被引用次数：5 相关文章所有 4 个版本

[PDF] arxiv.org

Updating only encoders prevents catastrophic forgetting of end-to-end ASR models

Y Takashima, S Horiguchi, S Watanabe… - arXiv preprint arXiv …, 2022 - arxiv.org

In this paper, we present an incremental domain adaptation technique to prevent
catastrophic forgetting for an end-to-end automatic speech recognition (ASR) model …

被引用次数：4 相关文章所有 8 个版本

[PDF] researchgate.net

[PDF][PDF] Scala: Supervised contrastive learning for end-to-end automatic speech recognition

L Fu, X Li, R Wang, Z Zhang, Y Wu, X He… - arXiv preprint arXiv …, 2021 - researchgate.net

ABSTRACT End-to-end Automatic Speech Recognition (ASR) models are usually trained to
reduce the losses of the whole token sequences, while neglecting explicit phonemic …

被引用次数：7 相关文章

高级搜索

QQ 群