How transferable are video representations based on synthetic data?

Y Kim, S Mishra, SY Jin, R Panda… - Advances in …, 2022 - proceedings.neurips.cc
Action recognition has improved dramatically with massive-scale video datasets. Yet, these
datasets are accompanied with issues related to curation cost, privacy, ethics, bias, and …

Synthetic-to-real domain adaptation for action recognition: A dataset and baseline performances

AV Reddy, K Shah, W Paul, R Mocharla… - … on Robotics and …, 2023 - ieeexplore.ieee.org
Human action recognition is a challenging problem, particularly when there is high
variability in factors such as subject appearance, backgrounds and viewpoint. While deep …

Rethinking CLIP-based Video Learners in Cross-Domain Open-Vocabulary Action Recognition

KY Lin, H Ding, J Zhou, YX Peng, Z Zhao… - arXiv preprint arXiv …, 2024 - arxiv.org
Contrastive Language-Image Pretraining (CLIP) has shown remarkable open-vocabulary
abilities across various image understanding tasks. Building upon this impressive success …

Factornet: Holistic actor, object, and scene factorization for action recognition in videos

N Nigam, T Dutta, HP Gupta - IEEE Transactions on Circuits …, 2021 - ieeexplore.ieee.org
The ability to recognize human actions in a video is challenging due to the complex nature
of video data and the subtlety of human actions. Human activities often get associated with …

Open set action recognition via multi-label evidential learning

C Zhao, D Du, A Hoogs, C Funk - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Existing methods for open set action recognition focus on novelty detection that assumes
video clips show a single action, which is unrealistic in the real world. We propose a new …

Spatial-temporal pyramid graph reasoning for action recognition

T Geng, F Zheng, X Hou, K Lu, GJ Qi… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Spatial-temporal relation reasoning is a significant yet challenging problem for video action
recognition. Previous works typically apply local operations like 2D or 3D CNNs to conduct …

Weakly semantic guided action recognition

T Yu, L Wang, C Da, H Gu, S Xiang… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
Action recognition plays a fundamental role in computer vision and video analysis.
Nevertheless, extracting effective spatial-temporal features remains a challenging task. This …

Learning to recognize actions on objects in egocentric video with attention dictionaries

S Sudhakaran, S Escalera… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
We present EgoACO, a deep neural architecture for video action recognition that learns to
pool action-context-object descriptors from frame level features by leveraging the verb-noun …

Why can't i dance in the mall? learning to mitigate scene bias in action recognition

J Choi, C Gao, JCE Messou… - Advances in Neural …, 2019 - proceedings.neurips.cc
Human activities often occur in specific scene contexts, eg, playing basketball on a
basketball court. Training a model using existing video datasets thus inevitably captures and …

Efficient action recognition via dynamic knowledge propagation

H Kim, M Jain, JT Lee, S Yun… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Efficient action recognition has become crucial to extend the success of action recognition to
many real-world applications. Contrary to most existing methods, which mainly focus on …