- Academic resource search

Human action recognition from various data modalities: A review

Z Sun, Q Ke, H Rahmani, M Bennamoun… - IEEE transactions on …, 2022 - ieeexplore.ieee.org

Human Action Recognition (HAR) aims to understand human behavior and assign a label to
each action. It has a wide range of applications, and therefore has been attracting increasing …

Cited by 495 · Related articles · All 16 versions

A survey on video-based human action recognition: recent updates, datasets, challenges, and applications

P Pareek, A Thakkar - Artificial Intelligence Review, 2021 - Springer

Human Action Recognition (HAR) involves human activity monitoring task in
different areas of medical, education, entertainment, visual surveillance, video retrieval, as …

Cited by 308 · Related articles · All 5 versions

[PDF] thecvf.com

MeMViT: Memory-augmented multiscale vision transformer for efficient long-term video recognition

CY Wu, Y Li, K Mangalam, H Fan… - Proceedings of the …, 2022 - openaccess.thecvf.com

While today's video recognition systems parse snapshots or short clips accurately, they
cannot connect the dots and reason across a longer range of time yet. Most existing video …

Cited by 187 · Related articles · All 5 versions

[PDF] thecvf.com

TDN: Temporal difference networks for efficient action recognition

L Wang, Z Tong, B Ji, G Wu - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com

Temporal modeling still remains challenging for action recognition in videos. To mitigate this
issue, this paper presents a new video architecture, termed as Temporal Difference Network …

Cited by 437 · Related articles · All 8 versions

[PDF] thecvf.com

BMN: Boundary-matching network for temporal action proposal generation

T Lin, X Liu, X Li, E Ding, S Wen - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com

Temporal action proposal generation is a challenging and promising task which aims to
locate temporal regions in real-world videos where action or event may occur. Current …

Cited by 715 · Related articles · All 5 versions

[PDF] nsf.gov

Self-supervised visual feature learning with deep neural networks: A survey

L Jing, Y Tian - IEEE transactions on pattern analysis and …, 2020 - ieeexplore.ieee.org

Large-scale labeled data are generally required to train deep neural networks in order to
obtain better performance in visual feature learning from images or videos for computer …

Cited by 2021 · Related articles · All 7 versions

[PDF] thecvf.com

Temporal pyramid network for action recognition

C Yang, Y Xu, J Shi, B Dai… - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com

Visual tempo characterizes the dynamics and the temporal scale of an action. Modeling
such visual tempos of different actions facilitates their recognition. Previous works often …

Cited by 437 · Related articles · All 10 versions

[PDF] thecvf.com

Word-level deep sign language recognition from video: A new large-scale dataset and methods comparison

D Li, C Rodriguez, X Yu, H Li - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com

Vision-based sign language recognition aims at helping the hearing-impaired people to
communicate with others. However, most existing sign language datasets are limited to a …

Cited by 517 · Related articles · All 10 versions

[PDF] arxiv.org

TCGL: Temporal contrastive graph for self-supervised video representation learning

Y Liu, K Wang, L Liu, H Lan, L Lin - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

Video self-supervised learning is a challenging task, which requires significant expressive
power from the model to leverage rich spatial-temporal knowledge and generate effective …

Cited by 129 · Related articles · All 6 versions

[PDF] arxiv.org

Deep learning for spatio-temporal data mining: A survey

S Wang, J Cao, SY Philip - IEEE transactions on knowledge …, 2020 - ieeexplore.ieee.org

With the fast development of various positioning techniques such as Global Positioning System
(GPS), mobile devices and remote sensing, spatio-temporal data has become increasingly …

Cited by 659 · Related articles · All 6 versions
