Automatic speech recognition for Uyghur, Kazakh, and Kyrgyz: An overview

W Du, Y Maimaitiyiming, M Nijat, L Li, A Hamdulla… - Applied Sciences, 2022 - mdpi.com
With the emergence of deep learning, the performance of automatic speech recognition
(ASR) systems has remarkably improved. Especially for resource-rich languages such as …

An investigation of multilingual ASR using end-to-end LF-MMI

S Tong, PN Garner, H Bourlard - ICASSP 2019-2019 IEEE …, 2019 - ieeexplore.ieee.org
The end-to-end lattice-free maximum mutual information (LF-MMI) approach has recently
been shown to be beneficial for automatic speech recognition (ASR) in general. More …

A self-supervised model for language identification integrating phonological knowledge

Q Zhan, X Xie, C Hu, H Cheng - Electronics, 2021 - mdpi.com
In this paper, a self-supervised learning pre-trained model is proposed and successfully
applied in language identification task (LID). A Transformer encoder is employed and multi …

Multilingual and crosslingual speech recognition using phonological-vector based phone embeddings

C Zhu, K An, H Zheng, Z Ou - 2021 IEEE Automatic Speech …, 2021 - ieeexplore.ieee.org
The use of phonological features (PFs) potentially allows language-specific phones to
remain linked in training, which is highly desirable for information sharing for multilingual …

Domain robust feature extraction for rapid low resource ASR development

S Dalmia, X Li, F Metze… - 2018 IEEE Spoken …, 2018 - ieeexplore.ieee.org
Developing a practical speech recognizer for a low resource language is challenging, not
only because of the (potentially unknown) properties of the language, but also because test …

Near-Optimal Active Learning for Multilingual Grapheme-to-Phoneme Conversion

D Cao, Y Zhao, L Wu - Applied Sciences, 2023 - mdpi.com
The construction of pronunciation dictionaries relies on high-quality and extensive training
data in data-driven way. However, the manual annotation of corpus for this purpose is both …

Automatic Speech Recognition without Transcribed Speech or Pronunciation Lexicons

M Wiesner - 2021 - jscholarship.library.jhu.edu
Rapid deployment of automatic speech recognition (ASR) in new languages, with very
limited data, is of great interest and importance for intelligence gathering, as well as for …

Multilingual training and adaptation in speech recognition

S Tong - 2020 - infoscience.epfl.ch
State-of-the-art acoustic models for Automatic Speech Recognition (ASR) are based on
Hidden Markov Models (HMM) and Deep Neural Networks (DNN) and often require …