F Boyer, Y Shinohara, T Ishii… - 2021 IEEE Automatic …, 2021 - ieeexplore.ieee.org
In this study, we present recent developments of models trained with the RNN-T loss in ESPnet. It involves the use of various architectures such as recently proposed Conformer …
We present a system for the Zero Resource Speech Challenge 2021, which combines a Contrastive Predictive Coding (CPC) with deep cluster. In deep cluster, we first prepare …
In this paper, we present an incremental domain adaptation technique to prevent catastrophic forgetting for an end-to-end automatic speech recognition (ASR) model …
Much of the recent literature on automatic speech recognition (ASR) is taking an end-to-end approach. Unlike English where the writing system is closely related to sound, Chinese …
Y Wang, L Lu, W Yang, Y Chen - International Journal of Machine …, 2024 - Springer
Transformer is widely used in natural language processing (NLP) tasks due to the parallel and modeling of long texts. However, its performance in Chinese named entity recognition …
C Wu, H Sun, K Huang, L Wu - Sensors, 2024 - mdpi.com
This study addresses the challenges of low accuracy and high computational demands in Tibetan speech recognition by investigating the application of end-to-end networks. We …
N Adiga, J Park, CS Kumar, S Singh, K Lee… - arXiv preprint arXiv …, 2023 - arxiv.org
Recently, the cascaded two-pass architecture has emerged as a strong contender for on-device automatic speech recognition (ASR). A cascade of causal and shallow non-causal …