Foundationtts: Text-to-speech for asr customization with generative language model

R Xue, Y Liu, L He, X Tan, L Liu, E Lin… - arXiv preprint arXiv …, 2023 - arxiv.org
Neural text-to-speech (TTS) generally consists of cascaded architecture with separately
optimized acoustic model and vocoder, or end-to-end architecture with continuous mel …

Contextualized End-to-end Automatic Speech Recognition with Intermediate Biasing Loss

M Shakeel, Y Sudo, Y Peng, S Watanabe - arXiv preprint arXiv …, 2024 - arxiv.org
Contextualized end-to-end automatic speech recognition has been an active research area,
with recent efforts focusing on the implicit learning of contextual phrases based on the final …

Locality enhanced dynamic biasing and sampling strategies for contextual ASR

MA Jalal, PP Parada, G Pavlidis… - 2023 IEEE Automatic …, 2023 - ieeexplore.ieee.org
Automatic Speech Recognition (ASR) still face challenges when recognizing time-variant
rare-phrases. Contextual biasing (CB) modules bias ASR model towards such contextually …

High-precision Voice Search Query Correction via Retrievable Speech-text Embedings

C Li, G Wang, K Kastner, H Su, A Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
Automatic speech recognition (ASR) systems can suffer from poor recall for various reasons,
such as noisy audio, lack of sufficient training data, etc. Previous work has shown that recall …

An efficient text augmentation approach for contextualized Mandarin speech recognition

N Zheng, X Wan, K Liu, Z Du, Z Huan - arXiv preprint arXiv:2406.09950, 2024 - arxiv.org
Although contextualized automatic speech recognition (ASR) systems are commonly used to
improve the recognition of uncommon words, their effectiveness is hindered by the inherent …

Contextualized Automatic Speech Recognition with Dynamic Vocabulary

Y Sudo, Y Fukumoto, M Shakeel, Y Peng… - arXiv preprint arXiv …, 2024 - arxiv.org
Deep biasing (DB) improves the performance of end-to-end automatic speech recognition
(E2E-ASR) for rare words or contextual phrases using a bias list. However, most existing …

[PDF][PDF] Combining Image and Text for Chinese Spelling Correction

S Qin, L Sha, J Li - researchgate.net
Correcting spelling mistakes is a complex task that presents significant challenges in
obtaining satisfactory solutions. In this study, we focus on Chinese spelling error correction …