How can objects help action recognition?

C Zhang, A Gupta, A Zisserman - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

We introduce an object-aware decoder for improving the performance of spatio-temporal
representations on ego-centric videos. The key idea is to enhance object-awareness during …

被引用次数：9 相关文章所有 9 个版本

[HTML] springer.com

[HTML][HTML] An outlook into the future of egocentric vision

C Plizzari, G Goletto, A Furnari, S Bansal… - International Journal of …, 2024 - Springer

What will the future be? We wonder! In this survey, we explore the gap between current
research in egocentric vision and the ever-anticipated future, where wearable computing …

被引用次数：15 相关文章所有 7 个版本

[PDF] thecvf.com

Object-centric video representation for long-term action anticipation

C Zhang, C Fu, S Wang, N Agarwal… - Proceedings of the …, 2024 - openaccess.thecvf.com

This paper focuses on building object-centric representations for long-term action
anticipation in videos. Our key motivation is that objects provide important cues to recognize …

被引用次数：7 相关文章所有 5 个版本

Appearance-Agnostic Representation Learning for Compositional Action Recognition

P Huang, X Shu, R Yan, Z Tu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

The discussion of compositional generalization in action recognition, ie., Compositional
Action Recognition (CAR), has recently received increasing attention. CAR challenges …

被引用次数：2 相关文章

[PDF] thecvf.com

Bi-Causal: Group Activity Recognition via Bidirectional Causality

Y Zhang, W Liu, D Xu, Z Zhou… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com

Abstract Current approaches in Group Activity Recognition (GAR) predominantly emphasize
Human Relations (HRs) while often neglecting the impact of Human-Object Interactions …

[PDF] arxiv.org

Learning Domain-Invariant Temporal Dynamics for Few-Shot Action Recognition

Y Li, G Chen, B Abramowitz, S Anzellott… - arXiv preprint arXiv …, 2024 - arxiv.org

Few-shot action recognition aims at quickly adapting a pre-trained model to the novel data
with a distribution shift using only a limited number of samples. Key challenges include how …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Diving Deep into Regions: Exploiting Regional Information Transformer for Single Image Deraining

B Li, Z Zhang, H Zheng, X Xu, Y Wei, J Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org

Transformer-based Single Image Deraining (SID) methods have achieved remarkable
success, primarily attributed to their robust capability in capturing long-range interactions …

Temporal Causal Mechanism Transfer for Few-shot Action Recognition

Y Li, G Chen, B Abramowitz, S Anzellotti, D Wei - openreview.net

The goal of few-shot action recognition is to recognize actions in video sequences for which
there exists only a few training samples. The challenge is to adapt a base model effectively …

高级搜索

QQ 群