Video you only look once: Overall temporal convolutions for action recognition

L Jing, X Yang, Y Tian - Journal of Visual Communication and Image …, 2018 - Elsevier
In this paper, we propose an efficient and straightforward approach, video you only look
once (VideoYOLO), to capture the overall temporal dynamics from an entire video in a single …

Trajectory convolution for action recognition

Y Zhao, Y Xiong, D Lin - Advances in neural information …, 2018 - proceedings.neurips.cc
How to leverage the temporal dimension is a key question in video analysis. Recent works
suggest an efficient approach to video feature learning, ie, factorizing 3D convolutions into …

Spatial-temporal pyramid based convolutional neural network for action recognition

Z Zheng, G An, D Wu, Q Ruan - Neurocomputing, 2019 - Elsevier
Abstract Convolutional Neural Networks (CNNs) usually use top-level appearance features
of video frames for action recognition. However, these methods discard the implicit …

Temporal segment networks: Towards good practices for deep action recognition

L Wang, Y Xiong, Z Wang, Y Qiao, D Lin… - European conference on …, 2016 - Springer
Deep convolutional networks have achieved great success for visual recognition in still
images. However, for action recognition in videos, the advantage over traditional methods is …

Temporal 3d convnets: New architecture and transfer learning for video classification

A Diba, M Fayyaz, V Sharma, AH Karami… - arXiv preprint arXiv …, 2017 - arxiv.org
The work in this paper is driven by the question how to exploit the temporal cues available in
videos for their accurate classification, and for human action recognition in particular? Thus …

Temporal pyramid pooling-based convolutional neural network for action recognition

P Wang, Y Cao, C Shen, L Liu… - IEEE Transactions on …, 2016 - ieeexplore.ieee.org
Encouraged by the success of convolutional neural networks (CNNs) in image classification,
recently much effort is spent on applying the CNNs to the video-based action recognition …

Learning representative temporal features for action recognition

A Javidani, A Mahmoudi-Aznaveh - Multimedia Tools and Applications, 2022 - Springer
In this paper, a novel video classification method is presented that aims to recognize
different categories of third-person videos efficiently. Our motivation is to achieve a light …

T-C3D: Temporal convolutional 3D network for real-time action recognition

K Liu, W Liu, C Gan, M Tan, H Ma - … of the AAAI conference on artificial …, 2018 - ojs.aaai.org
Video-based action recognition with deep neural networks has shown remarkable progress.
However, most of the existing approaches are too computationally expensive due to the …

Long-term temporal convolutions for action recognition

G Varol, I Laptev, C Schmid - IEEE transactions on pattern …, 2017 - ieeexplore.ieee.org
Typical human actions last several seconds and exhibit characteristic spatio-temporal
structure. Recent methods attempt to capture this structure and learn action representations …

End-to-end video-level representation learning for action recognition

J Zhu, Z Zhu, W Zou - 2018 24th international conference on …, 2018 - ieeexplore.ieee.org
From the frame/clip-level feature learning to the video-level representation building, deep
learning methods in action recognition have developed rapidly in recent years. However …