A systematic review of the application of machine learning techniques to ultrasound tongue imaging analysis

Z Xia, R Yuan, Y Cao, T Sun, Y Xiong… - The Journal of the …, 2024 - pubs.aip.org
B-mode ultrasound has emerged as a prevalent tool for observing tongue motion in speech
production, gaining traction in speech therapy applications. However, the effective analysis …

Multi-modal co-learning for silent speech recognition based on ultrasound tongue images

M Guo, J Wei, R Zhang, Y Zhao, Q Fang - Speech Communication, 2024 - Elsevier
Silent speech recognition (SSR) is an essential task in human–computer interaction, aiming
to recognize speech from non-acoustic modalities. A key challenge in SSR is inherent input …

Optimizing the ultrasound tongue image representation for residual network-based articulatory-to-acoustic mapping

TG Csapó, G Gosztolya, L Tóth, AH Shandiz, A Markó - Sensors, 2022 - mdpi.com
Within speech processing, articulatory-to-acoustic mapping (AAM) methods can apply
ultrasound tongue imaging (UTI) as an input. (Micro)convex transducers are mostly used …