Audio-visual speech and gesture recognition by sensors of mobile devices

D Ivanko, D Ryumin, A Karpov - Mathematics, 2023 - mdpi.com

This article provides a detailed review of recent advances in audio-visual speech
recognition (AVSR) methods that have been developed over the last decade (2013–2023) …

被引用次数：9 相关文章所有 5 个版本

[PDF] mdpi.com

Research progress of human–computer interaction technology based on gesture recognition

H Zhou, D Wang, Y Yu, Z Zhang - Electronics, 2023 - mdpi.com

Gesture recognition, as a core technology of human–computer interaction, has broad
application prospects and brings new technical possibilities for smart homes, medical care …

被引用次数：5 相关文章所有 3 个版本

Double bistable superposition strategy for improving the performance of triboelectric nanogenerator

J Liu, H Luo, T Yang, Y Cui, K Lu, W Qin - Mechanical Systems and Signal …, 2024 - Elsevier

The output of triboelectric nanogenerator (TENG) is related to the relative motion of friction
materials, and it is difficult for traditional bistable TENG to have a large amplitude in a small …

被引用次数：3 相关文章所有 2 个版本

[PDF] mdpi.com

Domain adaptation with contrastive simultaneous multi-loss training for hand gesture recognition

J Baptista, V Santos, F Silva, D Pinho - Sensors, 2023 - mdpi.com

Hand gesture recognition from images is a critical task with various real-world applications,
particularly in the field of human–robot interaction. Industrial environments, where non …

被引用次数：5 相关文章所有 6 个版本

Audio–visual speech recognition based on regulated transformer and spatio–temporal fusion strategy for driver assistive systems

D Ryumin, A Axyonov, E Ryumina, D Ivanko… - Expert Systems with …, 2024 - Elsevier

This article presents a research methodology for audio–visual speech recognition (AVSR) in
driver assistive systems. These systems necessitate ongoing interaction with drivers while …

被引用次数：1 相关文章

A signer-independent sign language recognition method for the single-frequency dataset

T Liu, T Tao, Y Zhao, M Li, J Zhu - Neurocomputing, 2024 - Elsevier

Currently, there are over 70 million people worldwide using more than 300 sign languages
for communication, resulting in a vast number of sign language categories. Sign language …

被引用次数：2 相关文章

Speaker independent VSR: A systematic review and futuristic applications

P Nemani, GS Krishna, K Supriya, S Kumar - Image and Vision Computing, 2023 - Elsevier

Speaker-independent visual speech recognition (VSR) is a complex task that involves
identifying spoken words or phrases from video recordings of a speaker's facial movements …

被引用次数：1 相关文章所有 2 个版本

[PDF] mdpi.com

ST-TGR: Spatio-Temporal Representation Learning for Skeleton-Based Teaching Gesture Recognition

Z Chen, W Huang, H Liu, Z Wang, Y Wen, S Wang - Sensors, 2024 - mdpi.com

Teaching gesture recognition is a technique used to recognize the hand movements of
teachers in classroom teaching scenarios. This technology is widely used in education …

被引用次数：1 相关文章所有 6 个版本

[PDF] arxiv.org

A multi-purpose audio-visual corpus for multi-modal Persian speech recognition: The Arman-AV dataset

J Peymanfard, S Heydarian, A Lashini, H Zeinali… - Expert Systems with …, 2024 - Elsevier

Automatic lip reading has advanced significantly in recent years. However, these methods
need large-scale datasets that are scarce for many low-resource languages. In this paper …

被引用次数：3 相关文章所有 4 个版本

[PDF] mdpi.com

Interpretation of Bahasa Isyarat Malaysia (BIM) Using SSD-MobileNet-V2 FPNLite and COCO mAP

IZ Saiful Bahri, S Saon, AK Mahamad, K Isa, U Fadlilah… - Information, 2023 - mdpi.com

This research proposes a study on two-way communication between deaf/mute and normal
people using an Android application. Despite advancements in technology, there is still a …

被引用次数：5 相关文章所有 5 个版本

高级搜索

QQ 群