MultiD-CNN: A multi-dimensional feature learning approach based on deep convolutional networks for gesture recognition in RGB-D image sequences

A Elboushaki, R Hannane, K Afdel, L Koutti - Expert Systems with …, 2020 - Elsevier
Human gesture recognition has become a pillar of today's intelligent Human-Computer
Interfaces as it typically provides more comfortable and ubiquitous interaction. Such expert …

Cross-modality compensation convolutional neural networks for RGB-D action recognition

J Cheng, Z Ren, Q Zhang, X Gao… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
RGB-D-based human action recognition has attracted much attention recently because it
can provide more complementary information than a single modality. However, it is difficult …

Mmnet: A model-based multimodal network for human action recognition in rgb-d videos

XB Bruce, Y Liu, X Zhang, S Zhong… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Human action recognition (HAR) in RGB-D videos has been widely investigated since the
release of affordable depth sensors. Currently, unimodal approaches (eg, skeleton-based …

RGB-D-based human motion recognition with deep learning: A survey

P Wang, W Li, P Ogunbona, J Wan… - Computer vision and image …, 2018 - Elsevier
Human motion recognition is one of the most important branches of human-centered
research activities. In recent years, motion recognition based on RGB-D data has attracted …

Multi-modality learning for human action recognition

Z Ren, Q Zhang, X Gao, P Hao, J Cheng - Multimedia Tools and …, 2021 - Springer
The multi-modality based human action recognition is an increasing topic. Multi-modality
can provide more abundant and complementary information than single modality. However …

Exploiting spatio-temporal representation for 3D human action recognition from depth map sequences

X Ji, Q Zhao, J Cheng, C Ma - Knowledge-Based Systems, 2021 - Elsevier
Human action recognition based on 3D data is attracting increasing attention because it
could provide more abundant spatial and temporal information compared with RGB videos …

Searching multi-rate and multi-modal temporal enhanced networks for gesture recognition

Z Yu, B Zhou, J Wan, P Wang, H Chen… - … on Image Processing, 2021 - ieeexplore.ieee.org
Gesture recognition has attracted considerable attention owing to its great potential in
applications. Although the great progress has been made recently in multi-modal learning …

Spatiotemporal multimodal learning with 3D CNNs for video action recognition

H Wu, X Ma, Y Li - IEEE Transactions on Circuits and Systems …, 2021 - ieeexplore.ieee.org
Extracting effective spatial-temporal information is significantly important for video-based
action recognition. Recently 3D convolutional neural networks (3D CNNs) that could …

A deeply coupled ConvNet for human activity recognition using dynamic and RGB images

T Singh, DK Vishwakarma - Neural Computing and Applications, 2021 - Springer
This work is motivated by the tremendous achievement of deep learning models for
computer vision tasks, particularly for human activity recognition. It is gaining more attention …

A comprehensive review of recent deep learning techniques for human activity recognition

VT Le, K Tran-Trung, VT Hoang - Computational Intelligence …, 2022 - Wiley Online Library
Human action recognition is an important field in computer vision that has attracted
remarkable attention from researchers. This survey aims to provide a comprehensive …