Mutual-learning sequence-level knowledge distillation for automatic speech recognition

Z Li, Y Ming, L Yang, JH Xue - Neurocomputing, 2021 - Elsevier
Automatic speech recognition (ASR) is a crucial technology for human-machine interaction.
Recently, end-to-end deep learning models have been widely studied for ASR. However, these …

TutorNet: Towards flexible knowledge distillation for end-to-end speech recognition

JW Yoon, H Lee, HY Kim, WI Cho… - IEEE/ACM Transactions …, 2021 - ieeexplore.ieee.org
In recent years, there has been a great deal of research on developing end-to-end speech
recognition models, which simplify the traditional pipeline while achieving …

Knowledge distillation from multiple foundation models for end-to-end speech recognition

X Yang, Q Li, C Zhang, PC Woodland - arXiv preprint arXiv:2303.10917, 2023 - arxiv.org
Although large foundation models pre-trained by self-supervised learning have achieved
state-of-the-art performance in many tasks including automatic speech recognition (ASR) …

End-to-end automatic speech recognition with deep mutual learning

R Masumura, M Ihori, A Takashima… - 2020 Asia-Pacific …, 2020 - ieeexplore.ieee.org
This paper is the first study to apply deep mutual learning (DML) to end-to-end ASR models.
In DML, multiple models are trained simultaneously and collaboratively by mimicking each …
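
As a rough illustration of the deep mutual learning idea described above, the sketch below jointly trains two peer classifiers, each combining a cross-entropy loss on the reference labels with a KL term toward the other model's predictions. The tiny linear models, dummy data, and names such as model_a and model_b are placeholders for illustration only, not the paper's actual ASR architectures or loss weighting.

```python
# Minimal deep mutual learning (DML) sketch: two peer models trained jointly,
# each matching the ground truth (cross-entropy) and the other's output (KL).
import torch
import torch.nn as nn
import torch.nn.functional as F

model_a = nn.Linear(40, 10)   # peer model A (stand-in for an ASR model)
model_b = nn.Linear(40, 10)   # peer model B
opt = torch.optim.Adam(list(model_a.parameters()) + list(model_b.parameters()), lr=1e-3)

features = torch.randn(8, 40)          # dummy acoustic features
labels = torch.randint(0, 10, (8,))    # dummy frame-level targets

for step in range(100):
    logits_a = model_a(features)
    logits_b = model_b(features)

    # Supervised losses against the reference labels.
    ce_a = F.cross_entropy(logits_a, labels)
    ce_b = F.cross_entropy(logits_b, labels)

    # Mutual (mimicry) losses: each model matches the other's distribution.
    # Peer outputs are detached so each KL term only updates the imitating side.
    kl_a = F.kl_div(F.log_softmax(logits_a, dim=-1),
                    F.softmax(logits_b, dim=-1).detach(), reduction="batchmean")
    kl_b = F.kl_div(F.log_softmax(logits_b, dim=-1),
                    F.softmax(logits_a, dim=-1).detach(), reduction="batchmean")

    loss = (ce_a + kl_a) + (ce_b + kl_b)
    opt.zero_grad()
    loss.backward()
    opt.step()
```

Unlike classic teacher-student distillation, no model is frozen here: both peers update at every step, which is the core distinction DML draws from one-way knowledge transfer.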

Knowledge distillation using output errors for self-attention end-to-end models

HG Kim, H Na, H Lee, J Lee, TG Kang… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org
Most automatic speech recognition (ASR) neural network models are not suitable for mobile
devices due to their large model sizes. Therefore, the model size must be reduced to …

Knowledge transfer from pre-trained language models to CIF-based speech recognizers via hierarchical distillation

M Han, F Chen, J Shi, S Xu, B Xu - arXiv preprint arXiv:2301.13003, 2023 - arxiv.org
Large-scale pre-trained language models (PLMs) have shown great potential in natural
language processing tasks. Leveraging the capabilities of PLMs to enhance automatic …

Distilling knowledge from ensembles of acoustic models for joint CTC-attention end-to-end speech recognition

Y Gao, T Parcollet, ND Lane - 2021 IEEE Automatic Speech …, 2021 - ieeexplore.ieee.org
Knowledge distillation has been widely used to compress existing deep learning models
while preserving performance across a wide range of applications. In the specific context of …

Inter-KD: Intermediate knowledge distillation for CTC-based automatic speech recognition

JW Yoon, BJ Woo, S Ahn, H Lee… - 2022 IEEE Spoken …, 2023 - ieeexplore.ieee.org
Recently, advances in deep learning have brought considerable improvements to end-to-end
speech recognition, simplifying the traditional pipeline while producing …

Comparison of soft and hard target RNN-T distillation for large-scale ASR

D Hwang, KC Sim, Y Zhang… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Knowledge distillation is an effective machine learning technique to transfer knowledge from
a teacher model to a smaller student model, especially with unlabeled data. In this paper, we …
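
To make the soft- versus hard-target contrast in this entry concrete, the sketch below distills a frozen teacher into a smaller student on unlabeled inputs, once via the teacher's full temperature-softened distribution and once via its argmax pseudo-labels. This is a simplified frame-level classifier illustration under assumed names (teacher, student, T); the paper's actual RNN-T formulation distills over transducer lattices and is not reproduced here.

```python
# Simplified contrast of soft- vs hard-target distillation on unlabeled data.
import torch
import torch.nn as nn
import torch.nn.functional as F

teacher = nn.Linear(40, 10)   # stand-in for a large pre-trained teacher
student = nn.Linear(40, 10)   # stand-in for a smaller student
opt = torch.optim.Adam(student.parameters(), lr=1e-3)
T = 2.0                       # softmax temperature for soft targets

unlabeled = torch.randn(8, 40)   # dummy unlabeled acoustic features

with torch.no_grad():
    teacher_logits = teacher(unlabeled)   # teacher is frozen during distillation

student_logits = student(unlabeled)

# Soft targets: match the teacher's full (temperature-softened) distribution.
soft_loss = F.kl_div(F.log_softmax(student_logits / T, dim=-1),
                     F.softmax(teacher_logits / T, dim=-1),
                     reduction="batchmean") * (T * T)

# Hard targets: treat the teacher's top prediction as a pseudo-label.
pseudo_labels = teacher_logits.argmax(dim=-1)
hard_loss = F.cross_entropy(student_logits, pseudo_labels)

loss = soft_loss  # or hard_loss, or a weighted mix of the two
opt.zero_grad()
loss.backward()
opt.step()
```

Soft targets carry the teacher's uncertainty over all classes, while hard targets keep only its single best hypothesis; the paper's comparison weighs exactly this trade-off at scale.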

Incremental learning for end-to-end automatic speech recognition

L Fu, X Li, L Zi, Z Zhang, Y Wu, X He… - 2021 IEEE Automatic …, 2021 - ieeexplore.ieee.org
In this paper, we propose an incremental learning method for end-to-end Automatic Speech
Recognition (ASR) which enables an ASR system to perform well on new tasks while …