Spatial-temporal pyramid based convolutional neural network for action recognition

Z Zheng, G An, D Wu, Q Ruan - Neurocomputing, 2019 - Elsevier
Abstract Convolutional Neural Networks (CNNs) usually use top-level appearance features
of video frames for action recognition. However, these methods discard the implicit …

Stm: Spatiotemporal and motion encoding for action recognition

B Jiang, MM Wang, W Gan, W Wu… - Proceedings of the …, 2019 - openaccess.thecvf.com
Spatiotemporal and motion features are two complementary and crucial information for
video action recognition. Recent state-of-the-art methods adopt a 3D CNN stream to learn …

Human action recognition in video using DB-LSTM and ResNet

A Mihanpour, MJ Rashti, SE Alavi - 2020 6th International …, 2020 - ieeexplore.ieee.org
Human action recognition in video is one of the most widely applied topics in the field of
image and video processing, with many applications in surveillance (security, sports, etc.) …

[HTML][HTML] Multi-head attention-based two-stream EfficientNet for action recognition

A Zhou, Y Ma, W Ji, M Zong, P Yang, M Wu, M Liu - Multimedia Systems, 2023 - Springer
Recent years have witnessed the popularity of using two-stream convolutional neural
networks for action recognition. However, existing two-stream convolutional neural network …

Joint spatial-temporal attention for action recognition

T Yu, C Guo, L Wang, H Gu, S Xiang, C Pan - Pattern Recognition Letters, 2018 - Elsevier
In this paper, we propose a novel high-level action representation using joint spatial-
temporal attention model, with application to video-based human action recognition …

A discriminative deep model with feature fusion and temporal attention for human action recognition

J Yu, H Gao, W Yang, Y Jiang, W Chin, N Kubota… - IEEE …, 2020 - ieeexplore.ieee.org
Activity recognition which aims to accurately distinguish human actions in complex
environments plays a key role in human-robot/computer interaction. However, long-lasting …

An improved two-stream 3D convolutional neural network for human action recognition

J Chen, Y Xu, C Zhang, Z Xu, X Meng… - … on automation and …, 2019 - ieeexplore.ieee.org
In order to obtain global contextual information precisely from videos with heavy camera
motions and scene changes, this study proposes an improved spatiotemporal two-stream …

Semantic cues enhanced multimodality multistream CNN for action recognition

Z Tu, W Xie, J Dauwels, B Li… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
This paper addresses the issue of video-based action recognition by exploiting an advanced
multistream convolutional neural network (CNN) to fully use semantics-derived multiple …

[PDF][PDF] Human action recognition in videos using convolution long short-term memory network with spatio-temporal networks

A Sarabu, AK Santra - Emerging Science Journal, 2021 - research.vit.ac.in
Two-stream convolutional networks plays an essential role as a powerful feature extractor in
human action recognition in videos. Recent studies have shown the importance of two …

Sequential segment networks for action recognition

QQ Chen, YJ Zhang - IEEE Signal Processing Letters, 2017 - ieeexplore.ieee.org
Recently, deep convolutional networks (ConvNets) have achieved remarkable progress for
action recognition in videos. Most existing deep frameworks treat a video as an unordered …