Joint modeling of code-switched and monolingual ASR via conditional factorization

W Chen, B Yan, J Shi, Y Peng, S Maiti… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org

Multilingual Automatic Speech Recognition (ASR) models have extended the usability of
speech technologies to a wide variety of languages. With how many languages these …

Cited by 43 - Related articles - All 5 versions

[PDF] arxiv.org

Streaming end-to-end multilingual speech recognition with joint language identification

C Zhang, B Li, T Sainath, T Strohman… - arXiv preprint arXiv …, 2022 - arxiv.org

Language identification is critical for many downstream tasks in automatic speech
recognition (ASR), and is beneficial to integrate into multilingual end-to-end ASR as an …

Cited by 31 - Related articles - All 5 versions

[PDF] arxiv.org

Language-specific characteristic assistance for code-switching speech recognition

T Song, Q Xu, M Ge, L Wang, H Shi, Y Lv, Y Lin… - arXiv preprint arXiv …, 2022 - arxiv.org

Dual-encoder structure successfully utilizes two language-specific encoders (LSEs) for code-
switching speech recognition. Because LSEs are initialized by two pre-trained language …

Cited by 25 - Related articles - All 8 versions

[PDF] arxiv.org

LAE: Language-aware encoder for monolingual and multilingual ASR

J Tian, J Yu, C Zhang, C Weng, Y Zou, D Yu - arXiv preprint arXiv …, 2022 - arxiv.org

Despite the rapid progress in automatic speech recognition (ASR) research, recognizing
multilingual speech using a unified ASR system remains highly challenging. Previous works …

Cited by 23 - Related articles - All 7 versions

[PDF] arxiv.org

Language-routing mixture of experts for multilingual and code-switching speech recognition

W Wang, G Ma, Y Li, B Du - arXiv preprint arXiv:2307.05956, 2023 - arxiv.org

Multilingual speech recognition for both monolingual and code-switching speech is a
challenging task. Recently, based on the Mixture of Experts (MoE), many works have made …

Cited by 16 - Related articles - All 5 versions

[PDF] arxiv.org

Enhancing code-switching speech recognition with interactive language biases

H Liu, LP Garcia, X Zhang, AWH Khong… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org

Languages usually switch within a multilingual speech signal, especially in a bilingual
society. This phenomenon is referred to as code-switching (CS), making automatic speech …

Cited by 14 - Related articles - All 3 versions

[PDF] arxiv.org

Towards zero-shot code-switched speech recognition

B Yan, M Wiesner, O Klejch, P Jyothi… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org

In this work, we seek to build effective code-switched (CS) automatic speech recognition
systems (ASR) under the zero-shot setting where no transcribed CS speech data is …

Cited by 20 - Related articles - All 6 versions

[PDF] arxiv.org

Adapting OpenAI's Whisper for speech recognition on code-switch Mandarin-English SEAME and ASRU2019 datasets

Y Yang, Y Peng, H Huang, ES Chng… - 2024 Asia Pacific …, 2024 - ieeexplore.ieee.org

This paper reports on SOTA results achieved using OpenAI's Whisper model with adaptation
on different adaptation corpus sizes for two established code-switch Mandarin/English …

Cited by 5 - Related articles - All 2 versions

[PDF] arxiv.org

Internal language model estimation based language model fusion for cross-domain code-switching speech recognition

Y Peng, Y Liu, J Zhang, H Xu, Y He, H Huang… - arXiv preprint arXiv …, 2022 - arxiv.org

Internal Language Model Estimation (ILME) based language model (LM) fusion has been
shown to yield significantly improved recognition results over conventional shallow fusion in both …

Cited by 9 - Related articles - All 2 versions

[PDF] arxiv.org

Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation

Z Liang, Z Song, Z Ma, C Du, K Yu, X Chen - arXiv preprint arXiv …, 2023 - arxiv.org

Recently, end-to-end (E2E) automatic speech recognition (ASR) models have made great
strides and exhibit excellent performance in general speech recognition. However, there …

Cited by 6 - Related articles - All 4 versions
