Stnet: Local and global spatial-temporal modeling for action recognition

D He, Z Zhou, C Gan, F Li, X Liu, Y Li, L Wang… - Proceedings of the …, 2019 - ojs.aaai.org
Despite the success of deep learning for static image understanding, it remains unclear what
are the most effective network architectures for spatial-temporal modeling in videos. In this …

Action recognition on continuous video

YL Chang, CS Chan, P Remagnino - Neural Computing and Applications, 2021 - Springer
Video action recognition has been a challenging task over the years. The challenge herein
is not only due to the complication in increasing information in videos but also the …

A review of convolutional-neural-network-based action recognition

G Yao, T Lei, J Zhong - Pattern Recognition Letters, 2019 - Elsevier
Video action recognition is widely applied in video indexing, intelligent surveillance,
multimedia understanding, and other fields. Recently, it was greatly improved by …

Rethinking spatiotemporal feature learning: Speed-accuracy trade-offs in video classification

S Xie, C Sun, J Huang, Z Tu… - Proceedings of the …, 2018 - openaccess.thecvf.com
Despite the steady progress in video analysis led by the adoption of convolutional neural
networks (CNNs), the relative improvement has been less drastic as that in 2D static image …

Temporal–spatial mapping for action recognition

X Song, C Lan, W Zeng, J Xing, X Sun… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
Deep learning models have enjoyed great success for image related computer vision tasks
such as image classification and object detection. For video related tasks such as human …

Action recognition with 3d convnet-gru architecture

G Yao, X Liu, T Lei - Proceedings of the 3rd international conference on …, 2018 - dl.acm.org
Video action recognition is widely applied in video indexing, intelligent surveil-lance,
multimedia understanding, and other fields. Recently, it was greatly improved by …

Appearance-and-dynamic learning with bifurcated convolution neural network for action recognition

J Zhang, H Hu, Z Liu - … Transactions on Circuits and Systems for …, 2020 - ieeexplore.ieee.org
For a long time, learning spatiotemporal features with deep neural networks has been a
difficult task in the field of computer vision. In this paper, we present a novel deep …

Two-stream 3-d convnet fusion for action recognition in videos with arbitrary size and length

X Wang, L Gao, P Wang, X Sun… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
3-D convolutional neural networks (3-D-convNets) have been very recently proposed for
action recognition in videos, and promising results are achieved. However, existing 3-D …

Sequential segment networks for action recognition

QQ Chen, YJ Zhang - IEEE Signal Processing Letters, 2017 - ieeexplore.ieee.org
Recently, deep convolutional networks (ConvNets) have achieved remarkable progress for
action recognition in videos. Most existing deep frameworks treat a video as an unordered …

STA-CNN: Convolutional spatial-temporal attention learning for action recognition

H Yang, C Yuan, L Zhang, Y Sun, W Hu… - … on Image Processing, 2020 - ieeexplore.ieee.org
Convolutional Neural Networks have achieved excellent successes for object recognition in
still images. However, the improvement of Convolutional Neural Networks over the …