Unsupervised discovery of an extended phoneme set in L2 English speech for mispronunciation...

YE Kheir, A Ali, SA Chowdhury - arXiv preprint arXiv:2310.13974, 2023 - arxiv.org

Pronunciation assessment and its application in computer-aided pronunciation training
(CAPT) have seen impressive progress in recent years. With the rapid growth in language …

Cited by 9 · Related articles · All 4 versions

SED-MDD: Towards sentence dependent end-to-end mispronunciation detection and diagnosis

Y Feng, G Fu, Q Chen, K Chen - ICASSP 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org

A mispronunciation detection and diagnosis (MD&D) system typically consists of multiple
stages, such as an acoustic model, a language model and a Viterbi decoder. In order to …

Cited by 75 · Related articles · All 2 versions

An end-to-end mispronunciation detection system for L2 English speech leveraging novel anti-phone modeling

BC Yan, MC Wu, HT Hung, B Chen - arXiv preprint arXiv:2005.11950, 2020 - arxiv.org

Mispronunciation detection and diagnosis (MDD) is a core component of computer-assisted
pronunciation training (CAPT). Most of the existing MDD approaches focus on dealing with …

Cited by 52 · Related articles · All 9 versions

Mispronunciation detection and diagnosis using deep neural networks: a systematic review

M Lounis, B Dendani, H Bahi - Multimedia Tools and Applications, 2024 - Springer

The increased need for foreign language learning, along with advances in speech
technology have heightened interest in computer-assisted pronunciation teaching (CAPT) …

Cited by 6 · Related articles

Automatic assessment of English proficiency for Japanese learners without reference sentences based on deep neural network acoustic models

J Fu, Y Chiba, T Nose, A Ito - Speech Communication, 2020 - Elsevier

Speech-based computer-assisted language learning (CALL) systems should recognize the
utterances of the learner with high accuracy and evaluate the language proficiency of the …

Cited by 33 · Related articles · All 4 versions

Automatic speech recognition (ASR) for the diagnosis of pronunciation of speech sound disorders in Korean children

T Ahn, Y Hong, Y Im, DH Kim, D Kang… - Clinical Linguistics & …, 2024 - Taylor & Francis

This study presents a model of automatic speech recognition (ASR) that is designed to
diagnose pronunciation issues in children with speech sound disorders (SSDs) to replace …

Cited by 2 · Related articles · All 2 versions

Relative dynamic time warping comparison for pronunciation errors

C Richter, J Guðnason - ICASSP 2023-2023 IEEE International …, 2023 - ieeexplore.ieee.org

We propose using a dynamic time warping (DTW) difference-to-sum ratio to classify speech
as either matching or diverging from a linguistic standard. This measure effectively …

Cited by 7 · Related articles · All 2 versions

End-to-end Mispronunciation Detection with Simulated Error Distance

Z Zhang, Y Wang, J Yang - INTERSPEECH, 2022 - isca-archive.org

With the development of deep learning, the performance of the mispronunciation detection
model has improved greatly. However, the annotation for mispronunciation is quite …

Cited by 7 · Related articles · All 3 versions

A universal ordinal regression for assessing phoneme-level pronunciation

S Mao, F Soong, Y Xia, J Tien - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org

The efficacy and robustness of Ordinal Regression (OR) in assessing speech pronunciation
for language learning at phrase level has been shown before. However, for assessing …

Cited by 6 · Related articles

Beyond orthography: Automatic recovery of short vowels and dialectal sounds in Arabic

YE Kheir, H Mubarak, A Ali, SA Chowdhury - arXiv preprint arXiv …, 2024 - arxiv.org

This paper presents a novel Dialectal Sound and Vowelization Recovery framework,
designed to recognize borrowed and dialectal sounds within phonologically diverse and …
