相关文章- 学术资源搜索

How transferable are video representations based on synthetic data?

Y Kim, S Mishra, SY Jin, R Panda… - Advances in …, 2022 - proceedings.neurips.cc

Action recognition has improved dramatically with massive-scale video datasets. Yet, these
datasets are accompanied with issues related to curation cost, privacy, ethics, bias, and …

被引用次数：20 相关文章所有 9 个版本

[PDF] arxiv.org

Synthetic-to-real domain adaptation for action recognition: A dataset and baseline performances

AV Reddy, K Shah, W Paul, R Mocharla… - … on Robotics and …, 2023 - ieeexplore.ieee.org

Human action recognition is a challenging problem, particularly when there is high
variability in factors such as subject appearance, backgrounds and viewpoint. While deep …

被引用次数：17 相关文章所有 4 个版本

[PDF] arxiv.org

Rethinking CLIP-based Video Learners in Cross-Domain Open-Vocabulary Action Recognition

KY Lin, H Ding, J Zhou, YX Peng, Z Zhao… - arXiv preprint arXiv …, 2024 - arxiv.org

Contrastive Language-Image Pretraining (CLIP) has shown remarkable open-vocabulary
abilities across various image understanding tasks. Building upon this impressive success …

被引用次数：5 相关文章所有 2 个版本

Factornet: Holistic actor, object, and scene factorization for action recognition in videos

N Nigam, T Dutta, HP Gupta - IEEE Transactions on Circuits …, 2021 - ieeexplore.ieee.org

The ability to recognize human actions in a video is challenging due to the complex nature
of video data and the subtlety of human actions. Human activities often get associated with …

被引用次数：11 相关文章

[PDF] thecvf.com

Open set action recognition via multi-label evidential learning

C Zhao, D Du, A Hoogs, C Funk - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Existing methods for open set action recognition focus on novelty detection that assumes
video clips show a single action, which is unrealistic in the real world. We propose a new …

被引用次数：15 相关文章所有 5 个版本

Spatial-temporal pyramid graph reasoning for action recognition

T Geng, F Zheng, X Hou, K Lu, GJ Qi… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

Spatial-temporal relation reasoning is a significant yet challenging problem for video action
recognition. Previous works typically apply local operations like 2D or 3D CNNs to conduct …

被引用次数：11 相关文章所有 5 个版本

Weakly semantic guided action recognition

T Yu, L Wang, C Da, H Gu, S Xiang… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org

Action recognition plays a fundamental role in computer vision and video analysis.
Nevertheless, extracting effective spatial-temporal features remains a challenging task. This …

被引用次数：28 相关文章所有 2 个版本

[PDF] arxiv.org

Learning to recognize actions on objects in egocentric video with attention dictionaries

S Sudhakaran, S Escalera… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

We present EgoACO, a deep neural architecture for video action recognition that learns to
pool action-context-object descriptors from frame level features by leveraging the verb-noun …

被引用次数：17 相关文章所有 12 个版本

[PDF] neurips.cc

Why can't i dance in the mall? learning to mitigate scene bias in action recognition

J Choi, C Gao, JCE Messou… - Advances in Neural …, 2019 - proceedings.neurips.cc

Human activities often occur in specific scene contexts, eg, playing basketball on a
basketball court. Training a model using existing video datasets thus inevitably captures and …

被引用次数：176 相关文章所有 13 个版本

[PDF] thecvf.com

Efficient action recognition via dynamic knowledge propagation

H Kim, M Jain, JT Lee, S Yun… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com

Efficient action recognition has become crucial to extend the success of action recognition to
many real-world applications. Contrary to most existing methods, which mainly focus on …

被引用次数：24 相关文章所有 5 个版本

高级搜索

QQ 群

How transferable are video representations based on synthetic data?

Synthetic-to-real domain adaptation for action recognition: A dataset and baseline performances

Rethinking CLIP-based Video Learners in Cross-Domain Open-Vocabulary Action Recognition

Factornet: Holistic actor, object, and scene factorization for action recognition in videos

Open set action recognition via multi-label evidential learning

Spatial-temporal pyramid graph reasoning for action recognition

Weakly semantic guided action recognition

Learning to recognize actions on objects in egocentric video with attention dictionaries

Why can't i dance in the mall? learning to mitigate scene bias in action recognition

Efficient action recognition via dynamic knowledge propagation

引用