A review of recent advances on deep learning methods for audio-visual speech recognition

D Ivanko, D Ryumin, A Karpov - Mathematics, 2023 - mdpi.com
This article provides a detailed review of recent advances in audio-visual speech
recognition (AVSR) methods that have been developed over the last decade (2013–2023) …

Research progress of human–computer interaction technology based on gesture recognition

H Zhou, D Wang, Y Yu, Z Zhang - Electronics, 2023 - mdpi.com
Gesture recognition, as a core technology of human–computer interaction, has broad
application prospects and brings new technical possibilities for smart homes, medical care …

Double bistable superposition strategy for improving the performance of triboelectric nanogenerator

J Liu, H Luo, T Yang, Y Cui, K Lu, W Qin - Mechanical Systems and Signal …, 2024 - Elsevier
The output of triboelectric nanogenerator (TENG) is related to the relative motion of friction
materials, and it is difficult for traditional bistable TENG to have a large amplitude in a small …

Domain adaptation with contrastive simultaneous multi-loss training for hand gesture recognition

J Baptista, V Santos, F Silva, D Pinho - Sensors, 2023 - mdpi.com
Hand gesture recognition from images is a critical task with various real-world applications,
particularly in the field of human–robot interaction. Industrial environments, where non …

Audio–visual speech recognition based on regulated transformer and spatio–temporal fusion strategy for driver assistive systems

D Ryumin, A Axyonov, E Ryumina, D Ivanko… - Expert Systems with …, 2024 - Elsevier
This article presents a research methodology for audio–visual speech recognition (AVSR) in
driver assistive systems. These systems necessitate ongoing interaction with drivers while …

A signer-independent sign language recognition method for the single-frequency dataset

T Liu, T Tao, Y Zhao, M Li, J Zhu - Neurocomputing, 2024 - Elsevier
Currently, there are over 70 million people worldwide using more than 300 sign languages
for communication, resulting in a vast number of sign language categories. Sign language …

Speaker independent VSR: A systematic review and futuristic applications

P Nemani, GS Krishna, K Supriya, S Kumar - Image and Vision Computing, 2023 - Elsevier
Speaker-independent visual speech recognition (VSR) is a complex task that involves
identifying spoken words or phrases from video recordings of a speaker's facial movements …

ST-TGR: Spatio-Temporal Representation Learning for Skeleton-Based Teaching Gesture Recognition

Z Chen, W Huang, H Liu, Z Wang, Y Wen, S Wang - Sensors, 2024 - mdpi.com
Teaching gesture recognition is a technique used to recognize the hand movements of
teachers in classroom teaching scenarios. This technology is widely used in education …

A multi-purpose audio-visual corpus for multi-modal Persian speech recognition: The Arman-AV dataset

J Peymanfard, S Heydarian, A Lashini, H Zeinali… - Expert Systems with …, 2024 - Elsevier
Automatic lip reading has advanced significantly in recent years. However, these methods
need large-scale datasets that are scarce for many low-resource languages. In this paper …

Interpretation of Bahasa Isyarat Malaysia (BIM) Using SSD-MobileNet-V2 FPNLite and COCO mAP

IZ Saiful Bahri, S Saon, AK Mahamad, K Isa, U Fadlilah… - Information, 2023 - mdpi.com
This research proposes a study on two-way communication between deaf/mute and normal
people using an Android application. Despite advancements in technology, there is still a …