Mitigating representation bias in action recognition: Algorithms and benchmarks

H Duan, Y Zhao, K Chen, Y Xiong, D Lin - European Conference on …, 2022 - Springer
Deep learning models have achieved excellent recognition results on large-scale video
benchmarks. However, they perform poorly when applied to videos with rare scenes or …

Sympathy for the details: Dense trajectories and hybrid classification architectures for action recognition

CR De Souza, A Gaidon, E Vig, AM López - Computer Vision–ECCV 2016 …, 2016 - Springer
Action recognition in videos is a challenging task due to the complexity of the spatio-
temporal patterns to model and the difficulty to acquire and learn on large quantities of video …

Dynamic normalization and relay for video action recognition

D Cai, A Yao, Y Chen - Advances in neural information …, 2021 - proceedings.neurips.cc
Abstract Convolutional Neural Networks (CNNs) have been the dominant model for video
action recognition. Due to the huge memory and compute demand, popular action …

Multi-dataset training of transformers for robust action recognition

J Liang, E Zhang, J Zhang… - Advances in Neural …, 2022 - proceedings.neurips.cc
We study the task of robust feature representations, aiming to generalize well on multiple
datasets for action recognition. We build our method on Transformers for its efficacy …

Sample less, learn more: Efficient action recognition via frame feature restoration

H Cheng, Y Guo, L Nie, Z Cheng… - Proceedings of the 31st …, 2023 - dl.acm.org
Training an effective video action recognition model poses significant computational
challenges, particularly under limited resource budgets. Current methods primarily aim to …

[图书][B] Action Recognition in Videos: Data-efficient approaches for supervised learning of human action classification models for video

CR De Souza - 2018 - ddd.uab.cat
In this dissertation, we explore different ways to perform human action recognition in video
clips. We focus on data efficiency, proposing new approaches that alleviate the need for …

Why can't i dance in the mall? learning to mitigate scene bias in action recognition

J Choi, C Gao, JCE Messou… - Advances in Neural …, 2019 - proceedings.neurips.cc
Human activities often occur in specific scene contexts, eg, playing basketball on a
basketball court. Training a model using existing video datasets thus inevitably captures and …

Interpretable spatio-temporal attention for video action recognition

L Meng, B Zhao, B Chang, G Huang… - Proceedings of the …, 2019 - openaccess.thecvf.com
Inspired by the observation that humans are able to process videos efficiently by only paying
attention where and when it is needed, we propose an interpretable and easy plug-in spatial …

Dynamic sampling networks for efficient action recognition in videos

YD Zheng, Z Liu, T Lu, L Wang - IEEE transactions on image …, 2020 - ieeexplore.ieee.org
The existing action recognition methods are mainly based on clip-level classifiers such as
two-stream CNNs or 3D CNNs, which are trained from the randomly selected clips and …

Discriminative video representation learning using support vector classifiers

J Wang, A Cherian - IEEE transactions on pattern analysis and …, 2019 - ieeexplore.ieee.org
Most popular deep models for action recognition in videos generate independent
predictions for short clips, which are then pooled heuristically to assign an action label to the …