A survey on automatic speech recognition systems for Portuguese language and its variations

TA de Lima, M Da Costa-Abreu - Computer Speech & Language, 2020 - Elsevier
Communication has been an essential part of being human and living in society. There are
several different languages and variations of them, so you can speak English in one place …

Multilingual speech recognition with a single end-to-end model

S Toshniwal, TN Sainath, RJ Weiss, B Li… - … on acoustics, speech …, 2018 - ieeexplore.ieee.org
Training a conventional automatic speech recognition (ASR) system to support multiple
languages is challenging because the sub-word unit, lexicon and word inventories are …

Neural natural language generation: A survey on multilinguality, multimodality, controllability and learning

E Erdem, M Kuyu, S Yagcioglu, A Frank… - Journal of Artificial …, 2022 - jair.org
Developing artificial learning systems that can understand and generate natural language
has been one of the long-standing goals of artificial intelligence. Recent decades have …

Multilingual sequence-to-sequence speech recognition: architecture, transfer learning, and language modeling

J Cho, MK Baskar, R Li, M Wiesner… - 2018 IEEE Spoken …, 2018 - ieeexplore.ieee.org
Sequence-to-sequence (seq2seq) approach for low-resource ASR is a relatively new
direction in speech research. The approach benefits by performing model training without …

Meta learning for end-to-end low-resource speech recognition

JY Hsu, YJ Chen, H Lee - ICASSP 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org
In this paper, we proposed to apply meta learning approach for low-resource automatic
speech recognition (ASR). We formulated ASR for different languages as different tasks, and …

An overview of Indian spoken language recognition from machine learning perspective

S Dey, M Sahidullah, G Saha - ACM Transactions on Asian and Low …, 2022 - dl.acm.org
Automatic spoken language identification (LID) is a very important research field in the era of
multilingual voice-command-based human-computer interaction. A front-end LID module …

Sequence-based multi-lingual low resource speech recognition

S Dalmia, R Sanabria, F Metze… - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
Techniques for multi-lingual and cross-lingual speech recognition can help in low resource
scenarios, to bootstrap systems and enable analysis of new languages and domains. End-to …

From english to more languages: Parameter-efficient model reprogramming for cross-lingual speech recognition

CHH Yang, B Li, Y Zhang, N Chen… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
In this work, we propose a new parameter-efficient learning framework based on neural
model reprogramming for cross-lingual speech recognition, which can re-purpose well …

The asru 2019 mandarin-english code-switching speech recognition challenge: Open datasets, tracks, methods and results

X Shi, Q Feng, L Xie - arXiv preprint arXiv:2007.05916, 2020 - arxiv.org
Code-switching (CS) is a common phenomenon and recognizing CS speech is challenging.
But CS speech data is scarce and there's no common testbed in relevant research. This …

Meta-adapter: Efficient cross-lingual adaptation with meta-learning

W Hou, Y Wang, S Gao… - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
Transfer learning from a multilingual model has shown favorable results on low-resource
automatic speech recognition (ASR). However, full-model fine-tuning generates a separate …