Learning to track for spatio-temporal action localization

P Weinzaepfel, Z Harchaoui… - Proceedings of the …, 2015 - openaccess.thecvf.com
We propose an effective approach for spatio-temporal action localization in realistic videos.
The approach first detects proposals at the frame-level and scores them with a combination …

Learning hierarchical invariant spatio-temporal features for action recognition with independent subspace analysis

QV Le, WY Zou, SY Yeung, AY Ng - CVPR 2011, 2011 - ieeexplore.ieee.org
Previous work on action recognition has focused on adapting hand-designed local features,
such as SIFT or HOG, from static images to the video domain. In this paper, we propose …

A large-scale benchmark dataset for event recognition in surveillance video

S Oh, A Hoogs, A Perera, N Cuntoor, CC Chen… - CVPR …, 2011 - ieeexplore.ieee.org
We introduce a new large-scale video dataset designed to assess the performance of
diverse visual event recognition algorithms with a focus on continuous visual event …

A survey of content-aware video analysis for sports

HC Shih - IEEE Transactions on circuits and systems for video …, 2017 - ieeexplore.ieee.org
Sports data analysis is becoming increasingly large scale, diversified, and shared, but
difficulty persists in rapidly accessing the most crucial information. Previous surveys have …

The THUMOS challenge on action recognition for videos “in the wild”

H Idrees, AR Zamir, YG Jiang, A Gorban… - Computer Vision and …, 2017 - Elsevier
Automatically recognizing and localizing wide ranges of human actions are crucial for video
understanding. Towards this goal, the THUMOS challenge was introduced in 2013 to serve …

Multi-region two-stream R-CNN for action detection

X Peng, C Schmid - Computer Vision–ECCV 2016: 14th European …, 2016 - Springer
We propose a multi-region two-stream R-CNN model for action detection in realistic videos.
We start from frame-level action detection based on Faster R-CNN, and make three …

Action bank: A high-level representation of activity in video

S Sadanand, JJ Corso - 2012 IEEE Conference on computer …, 2012 - ieeexplore.ieee.org
Activity recognition in video is dominated by low- and mid-level features, and while
demonstrably capable, by nature, these features carry little semantic meaning. Inspired by …

Action recognition with dynamic image networks

H Bilen, B Fernando, E Gavves… - IEEE transactions on …, 2017 - ieeexplore.ieee.org
We introduce the concept of dynamic image, a novel compact representation of videos
useful for video analysis, particularly in combination with convolutional neural networks …

MultiSports: A multi-person video dataset of spatio-temporally localized sports actions

Y Li, L Chen, R He, Z Wang, G Wu… - Proceedings of the …, 2021 - openaccess.thecvf.com
Spatio-temporal action detection is an important and challenging problem in video
understanding. The existing action detection benchmarks are limited in aspects of small …

A survey of video datasets for human action and activity recognition

JM Chaquet, EJ Carmona… - Computer Vision and …, 2013 - Elsevier
Vision-based human action and activity recognition has an increasing importance among
the computer vision community with applications to visual surveillance, video retrieval and …