相关文章- 学术资源搜索

Identifying the key frames: An attention-aware sampling method for action recognition

W Dong, Z Zhang, C Song, T Tan - Pattern Recognition, 2022 - Elsevier

Deep learning based methods have achieved remarkable progress in action recognition.
Existing works mainly focus on designing novel deep architectures to learn video …

被引用次数：16 相关文章所有 3 个版本

Spatial-temporal pyramid based convolutional neural network for action recognition

Z Zheng, G An, D Wu, Q Ruan - Neurocomputing, 2019 - Elsevier

Abstract Convolutional Neural Networks (CNNs) usually use top-level appearance features
of video frames for action recognition. However, these methods discard the implicit …

被引用次数：59 相关文章所有 3 个版本

[PDF] arxiv.org

Temporal segment networks for action recognition in videos

L Wang, Y Xiong, Z Wang, Y Qiao, D Lin… - IEEE transactions on …, 2018 - ieeexplore.ieee.org

We present a general and flexible video-level framework for learning action models in
videos. This method, called temporal segment network (TSN), aims to model long-range …

被引用次数：854 相关文章所有 13 个版本

Recurrent attention network using spatial-temporal relations for action recognition

M Zhang, Y Yang, Y Ji, N Xie, F Shen - Signal Processing, 2018 - Elsevier

Action recognition in videos, which contains many complex and semantic contents, is still a
challenging task in computer vision research. In this paper, we propose a novel attention …

被引用次数：29 相关文章所有 2 个版本

[PDF] thecvf.com

Tea: Temporal excitation and aggregation for action recognition

Y Li, B Ji, X Shi, J Zhang, B Kang… - Proceedings of the …, 2020 - openaccess.thecvf.com

Temporal modeling is key for action recognition in videos. It normally considers both short-
range motions and long-range aggregations. In this paper, we propose a Temporal …

被引用次数：531 相关文章所有 11 个版本

[PDF] thecvf.com

Stm: Spatiotemporal and motion encoding for action recognition

B Jiang, MM Wang, W Gan, W Wu… - Proceedings of the …, 2019 - openaccess.thecvf.com

Spatiotemporal and motion features are two complementary and crucial information for
video action recognition. Recent state-of-the-art methods adopt a 3D CNN stream to learn …

被引用次数：482 相关文章所有 6 个版本

[PDF] arxiv.org

Sympathy for the details: Dense trajectories and hybrid classification architectures for action recognition

CR De Souza, A Gaidon, E Vig, AM López - Computer Vision–ECCV 2016 …, 2016 - Springer

Action recognition in videos is a challenging task due to the complexity of the spatio-
temporal patterns to model and the difficulty to acquire and learn on large quantities of video …

被引用次数：56 相关文章所有 11 个版本

Recurrent spatial-temporal attention network for action recognition in videos

W Du, Y Wang, Y Qiao - IEEE Transactions on Image …, 2017 - ieeexplore.ieee.org

Recent years have witnessed the popularity of using recurrent neural network (RNN) for
action recognition in videos. However, videos are of high dimensionality and contain rich …

被引用次数：215 相关文章所有 4 个版本

[PDF] aaai.org

Attention-aware sampling via deep reinforcement learning for action recognition

W Dong, Z Zhang, T Tan - Proceedings of the AAAI Conference on Artificial …, 2019 - aaai.org

Deep learning based methods have achieved remarkable progress in action recognition.
Existing works mainly focus on designing novel deep architectures to achieve video …

被引用次数：46 相关文章所有 6 个版本

[PDF] arxiv.org

Dynamic sampling networks for efficient action recognition in videos

YD Zheng, Z Liu, T Lu, L Wang - IEEE transactions on image …, 2020 - ieeexplore.ieee.org

The existing action recognition methods are mainly based on clip-level classifiers such as
two-stream CNNs or 3D CNNs, which are trained from the randomly selected clips and …

被引用次数：83 相关文章所有 3 个版本

高级搜索

QQ 群

Identifying the key frames: An attention-aware sampling method for action recognition

Spatial-temporal pyramid based convolutional neural network for action recognition

Temporal segment networks for action recognition in videos

Recurrent attention network using spatial-temporal relations for action recognition

Tea: Temporal excitation and aggregation for action recognition

Stm: Spatiotemporal and motion encoding for action recognition

Sympathy for the details: Dense trajectories and hybrid classification architectures for action recognition

Recurrent spatial-temporal attention network for action recognition in videos

Attention-aware sampling via deep reinforcement learning for action recognition

Dynamic sampling networks for efficient action recognition in videos

引用