An effective fusion scheme of spatio-temporal features for human action recognition in RGB-D video

QD Tran, NQ Ly - 2013 International Conference on Control …, 2013 - ieeexplore.ieee.org
We investigate the problem of human action recognition by studying the effects of fusing
feature streams retrieved from color and depth sequences. Our main contribution is two-fold …

Human action recognition based on spatio-temporal three-dimensional scattering transform descriptor and an improved VLAD feature encoding algorithm

B Lin, B Fang, W Yang, J Qian - Neurocomputing, 2019 - Elsevier
The local spatio-temporal descriptor and feature encoding algorithm are two crucial key
steps for human action recognition based on spatio-temporal interest points (STIP). Since …

Action recognition in compressed domain using residual information

A Abdari, P Amirjan, A Mansouri - 2019 4th International …, 2019 - ieeexplore.ieee.org
Practically, action recognition using deep learning approaches is slow because of high
temporal redundancy and large size of the raw video data. One of the solutions for boosting …

Improved SSD using deep multi-scale attention spatial–temporal features for action recognition

S Zhou, J Qiu, A Solanki - Multimedia Systems, 2022 - Springer
The biggest difference between video-based action recognition and image-based action
recognition is that the former has an extra feature of time dimension. Most methods of action …

Short-Term Action Learning for Video Action Recognition

L Ting-Long - IEEE Access, 2024 - ieeexplore.ieee.org
For a long-term complex Action, it is typically composed of various short-term Actions. The
speed and importance of these short-term Actions directly affect the recognition results …

RGB-D action recognition using linear coding

H Liu, M Yuan, F Sun - Neurocomputing, 2015 - Elsevier
In this paper, we investigate action recognition using an inexpensive RGB-D sensor
(Microsoft Kinect). First, a depth spatial-temporal descriptor is developed to extract the …

Hidden Two-Stream Collaborative Learning Network for Action Recognition

S Zhou, L Chen, V Sugumaran - Computers, Materials & …, 2020 - search.ebscohost.com
The two-stream convolutional neural network exhibits excellent performance in the video
action recognition. The crux of the matter is to use the frames already clipped by the videos …

A four-stream ConvNet based on spatial and depth flow for human action classification using RGB-D data

D Srihari, PVV Kishore, EK Kumar, DA Kumar… - Multimedia Tools and …, 2020 - Springer
Appearance and depth-based action recognition has been researched exclusively for
improving recognition accuracy by considering motion and shape recovery particulars from …

Machine vision-based human action recognition using spatio-temporal motion features (STMF) with difference intensity distance group pattern (DIDGP)

J Arunnehru, S Thalapathiraj, R Dhanasekar… - Electronics, 2022 - mdpi.com
In recent years, human action recognition is modeled as a spatial-temporal video volume.
Such aspects have recently expanded greatly due to their explosively evolving real-world …

Multi modal human action recognition for video content matching

J Guo, H Bai, Z Tang, P Xu, D Gan, B Liu - Multimedia Tools and …, 2020 - Springer
Human action recognition (HAR) in videos is a challenging task in computer vision.
Conventional methods are prone to explore the spatiotemporal or optical representations for …