Recent progress in the CUHK dysarthric speech recognition system

S Liu, M Geng, S Hu, X Xie, M Cui, J Yu… - … on Audio, Speech …, 2021 - ieeexplore.ieee.org
Despite the rapid progress of automatic speech recognition (ASR) technologies in the past
few decades, recognition of disordered speech remains a highly challenging task to date …

Multi-stage audio-visual fusion for dysarthric speech recognition with pre-trained models

C Yu, X Su, Z Qian - IEEE Transactions on Neural Systems and …, 2023 - ieeexplore.ieee.org
Dysarthric speech recognition helps speakers with dysarthria to enjoy better communication.
However, collecting dysarthric speech is difficult. The machine learning models cannot be …

Self-supervised ASR models and features for dysarthric and elderly speech recognition

S Hu, X Xie, M Geng, Z Jin, J Deng, G Li… - … on Audio, Speech …, 2024 - ieeexplore.ieee.org
Self-supervised learning (SSL) based speech foundation models have been applied to a
wide range of ASR tasks. However, their application to dysarthric and elderly speech via …

Dysarthric Speech Recognition From Raw Waveform with Parametric CNNs

Z Yue, E Loweimi, H Christensen, J Barker… - …, 2022 - isca-archive.org
Raw waveform acoustic modelling has recently received increasing attention. Compared
with the task-blind hand-crafted features which may discard useful information …

Recent advancements in automatic disordered speech recognition: A survey paper

N Gohider, OA Basir - Natural Language Processing Journal, 2024 - Elsevier
Automatic Speech Recognition technology (ASR) has recently witnessed a
paradigm shift with respect to performance accuracy. Nevertheless, impaired speech …

Multi-modal acoustic-articulatory feature fusion for dysarthric speech recognition

Z Yue, E Loweimi, Z Cvetkovic… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Building automatic speech recognition (ASR) systems for speakers with dysarthria is a very
challenging task. Although multi-modal ASR has received increasing attention recently …

Exploiting Cross-Domain Visual Feature Generation for Disordered Speech Recognition

S Liu, X Xie, J Yu, S Hu, M Geng, R Su, SX Zhang… - Interspeech, 2020 - isca-archive.org
Audio-visual speech recognition (AVSR) technologies have been successfully applied to a
wide range of tasks. When developing AVSR systems for disordered speech characterized …

Cross-domain deep visual feature generation for Mandarin audio–visual speech recognition

R Su, X Liu, L Wang, J Yang - IEEE/ACM Transactions on …, 2019 - ieeexplore.ieee.org
There has been a long term interest in using visual information to improve automatic speech
recognition (ASR) system performance. Both audio and visual information are required in …

Exploiting Visual Features Using Bayesian Gated Neural Networks for Disordered Speech Recognition

S Liu, S Hu, Y Wang, J Yu, R Su, X Liu, H Meng - INTERSPEECH, 2019 - isca-archive.org
Automatic speech recognition (ASR) for disordered speech is a challenging task. People
with speech disorders such as dysarthria often have physical disabilities, leading to severe …

Tran-DSR: A hybrid model for dysarthric speech recognition using transformer encoder and ensemble learning

R Mahum, AM El-Sherbeeny, K Alkhaledi, H Hassan - Applied Acoustics, 2024 - Elsevier
Over the last decade, there has been a notable increase in the pervasiveness of
neurological diseases due to population growth and aging. Among individuals with …