Recent advances in end-to-end automatic speech recognition

J Li - APSIPA Transactions on Signal and Information …, 2022 - nowpublishers.com
Recently, the speech community is seeing a significant trend of moving from deep neural
network based hybrid modeling to end-to-end (E2E) modeling for automatic speech …

Automatic speech recognition using advanced deep learning approaches: A survey

H Kheddar, M Hemis, Y Himeur - Information Fusion, 2024 - Elsevier
Recent advancements in deep learning (DL) have posed a significant challenge for
automatic speech recognition (ASR). ASR relies on extensive training datasets, including …

A survey on non-autoregressive generation for neural machine translation and beyond

Y Xiao, L Wu, J Guo, J Li, M Zhang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Non-autoregressive (NAR) generation, which is first proposed in neural machine translation
(NMT) to speed up inference, has attracted much attention in both machine learning and …

Paraformer: Fast and accurate parallel transformer for non-autoregressive end-to-end speech recognition

Z Gao, S Zhang, I McLoughlin, Z Yan - arXiv preprint arXiv:2206.08317, 2022 - arxiv.org
Transformers have recently dominated the ASR field. Although able to yield good
performance, they involve an autoregressive (AR) decoder to generate tokens one by one …

Relaxing the conditional independence assumption of CTC-based ASR by conditioning on intermediate predictions

J Nozaki, T Komatsu - arXiv preprint arXiv:2104.02724, 2021 - arxiv.org
This paper proposes a method to relax the conditional independence assumption of
connectionist temporal classification (CTC)-based automatic speech recognition (ASR) …

6G survey on challenges, requirements, applications, key enabling technologies, use cases, AI integration issues and security aspects

MS Akbar, Z Hussain, M Ikram, QZ Sheng… - arXiv preprint arXiv …, 2022 - arxiv.org
Fifth-generation (5G) wireless networks will likely offer high data rates, increased reliability,
and low delay for mobile, personal, and local area networks. Along with the rapid growth of …

On challenges of sixth-generation (6G) wireless networks: A comprehensive survey of requirements, applications, and security issues

MS Akbar, Z Hussain, M Ikram, QZ Sheng… - Journal of Network and …, 2024 - Elsevier
Fifth-generation (5G) wireless networks are likely to offer high data rates, increased
reliability, and low delay for mobile, personal, and local area networks. Along with the rapid …

Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language models

K Deng, Z Yang, S Watanabe, Y Higuchi… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
While Transformers have achieved promising results in end-to-end (E2E) automatic speech
recognition (ASR), their autoregressive (AR) structure becomes a bottleneck for speeding up …

Research status and prospects of Transformer in speech recognition tasks.

张晓旭, 马志强, 刘志强, 朱方圆… - Journal of Frontiers of …, 2021 - search.ebscohost.com
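As a new deep learning framework, Transformer has attracted attention from a growing number of researchers
and has become a current research hotspot. The self-attention mechanism in the Transformer model is inspired by the human tendency to focus only on important things …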

A CTC alignment-based non-autoregressive transformer for end-to-end automatic speech recognition

R Fan, W Chu, P Chang, A Alwan - IEEE/ACM Transactions on …, 2023 - ieeexplore.ieee.org
Recently, end-to-end models have been widely used in automatic speech recognition (ASR)
systems. Two of the most representative approaches are connectionist temporal …