Challenges in multi-modal gesture recognition

S Escalera, V Athitsos, I Guyon - Gesture recognition, 2017 - Springer
This paper surveys the state of the art on multimodal gesture recognition and introduces the
JMLR special topic on gesture recognition 2011–2015. We began right at the start of the …

Searching multi-rate and multi-modal temporal enhanced networks for gesture recognition

Z Yu, B Zhou, J Wan, P Wang, H Chen… - … on Image Processing, 2021 - ieeexplore.ieee.org
Gesture recognition has attracted considerable attention owing to its great potential in
applications. Although great progress has been made recently in multi-modal learning …

ChaLearn Looking at People: A review of events and resources

S Escalera, X Baró, HJ Escalante… - 2017 International Joint …, 2017 - ieeexplore.ieee.org
This paper reviews the history of ChaLearn Looking at People (LAP) events. We started in
2011 (with the release of the first Kinect device) to run challenges related to human …

Two streams recurrent neural networks for large-scale continuous gesture recognition

X Chai, Z Liu, F Yin, Z Liu… - 2016 23rd international …, 2016 - ieeexplore.ieee.org
In this paper, we tackle the continuous gesture recognition problem with a two streams
Recurrent Neural Networks (2S-RNN) for the RGB-D data input. In our framework, the …

Isolated sign language recognition with Grassmann covariance matrices

H Wang, X Chai, X Hong, G Zhao, X Chen - ACM Transactions on …, 2016 - dl.acm.org
In this article, to utilize long-term dynamics over an isolated sign sequence, we propose a
covariance matrix–based representation to naturally fuse information from multimodal …

Multimodal human action recognition in assistive human-robot interaction

I Rodomagoulakis, N Kardaris… - … on acoustics, speech …, 2016 - ieeexplore.ieee.org
Within the context of assistive robotics we develop an intelligent interface that provides
multimodal sensory processing capabilities for human action recognition. Human action is …

Gesture and sign language recognition with temporal residual networks

L Pigou, M Van Herreweghe… - Proceedings of the …, 2017 - openaccess.thecvf.com
Gesture and sign language recognition in a continuous video stream is a challenging task,
especially with a large vocabulary. In this work, we approach this as a framewise …

Multi-modal data fusion in enhancing human-machine interaction for robotic applications: A survey

TK Mohd, N Nguyen, AY Javaid - arXiv preprint arXiv:2202.07732, 2022 - arxiv.org
Human-machine interaction has been around for several decades now, with new
applications emerging every day. One of the major goals that remain to be achieved is …

A novel sign language recognition framework using hierarchical Grassmann covariance matrix

H Wang, X Chai, X Chen - IEEE Transactions on Multimedia, 2019 - ieeexplore.ieee.org
Visual sign language recognition is an interesting and challenging problem. To create a
discriminative representation, a hierarchical Grassmann covariance matrix (HGCM) model is …

Late multimodal fusion for image and audio music transcription

M Alfaro-Contreras, JJ Valero-Mas, JM Iñesta… - Expert Systems with …, 2023 - Elsevier
Music transcription, which deals with the conversion of music sources into a structured
digital format, is a key problem for Music Information Retrieval (MIR). When addressing this …