Automatic Pronunciation Assessment--A Review

YE Kheir, A Ali, SA Chowdhury - arXiv preprint arXiv:2310.13974, 2023 - arxiv.org
Pronunciation assessment and its application in computer-aided pronunciation training
(CAPT) have seen impressive progress in recent years. With the rapid growth in language …

SED-MDD: Towards sentence dependent end-to-end mispronunciation detection and diagnosis

Y Feng, G Fu, Q Chen, K Chen - ICASSP 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org
A mispronunciation detection and diagnosis (MD&D) system typically consists of multiple
stages, such as an acoustic model, a language model and a Viterbi decoder. In order to …

An end-to-end mispronunciation detection system for L2 English speech leveraging novel anti-phone modeling

BC Yan, MC Wu, HT Hung, B Chen - arXiv preprint arXiv:2005.11950, 2020 - arxiv.org
Mispronunciation detection and diagnosis (MDD) is a core component of computer-assisted
pronunciation training (CAPT). Most of the existing MDD approaches focus on dealing with …

Mispronunciation detection and diagnosis using deep neural networks: a systematic review

M Lounis, B Dendani, H Bahi - Multimedia Tools and Applications, 2024 - Springer
The increased need for foreign language learning, along with advances in speech
technology, has heightened interest in computer-assisted pronunciation teaching (CAPT) …

Automatic assessment of English proficiency for Japanese learners without reference sentences based on deep neural network acoustic models

J Fu, Y Chiba, T Nose, A Ito - Speech Communication, 2020 - Elsevier
Speech-based computer-assisted language learning (CALL) systems should recognize the
utterances of the learner with high accuracy and evaluate the language proficiency of the …

Automatic speech recognition (ASR) for the diagnosis of pronunciation of speech sound disorders in Korean children

T Ahn, Y Hong, Y Im, DH Kim, D Kang… - Clinical Linguistics & …, 2024 - Taylor & Francis
This study presents a model of automatic speech recognition (ASR) that is designed to
diagnose pronunciation issues in children with speech sound disorders (SSDs) to replace …

Relative dynamic time warping comparison for pronunciation errors

C Richter, J Guðnason - ICASSP 2023-2023 IEEE International …, 2023 - ieeexplore.ieee.org
We propose using a dynamic time warping (DTW) difference-to-sum ratio to classify speech
as either matching or diverging from a linguistic standard. This measure effectively …

End-to-end Mispronunciation Detection with Simulated Error Distance.

Z Zhang, Y Wang, J Yang - INTERSPEECH, 2022 - isca-archive.org
With the development of deep learning, the performance of the mispronunciation detection
model has improved greatly. However, the annotation for mispronunciation is quite …

A universal ordinal regression for assessing phoneme-level pronunciation

S Mao, F Soong, Y Xia, J Tien - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
The efficacy and robustness of Ordinal Regression (OR) in assessing speech pronunciation
for language learning at phrase level has been shown before. However, for assessing …

Beyond orthography: Automatic recovery of short vowels and dialectal sounds in Arabic

YE Kheir, H Mubarak, A Ali, SA Chowdhury - arXiv preprint arXiv …, 2024 - arxiv.org
This paper presents a novel Dialectal Sound and Vowelization Recovery framework,
designed to recognize borrowed and dialectal sounds within phonologically diverse and …