Human action recognition from various data modalities: A review

Z Sun, Q Ke, H Rahmani, M Bennamoun… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Human Action Recognition (HAR) aims to understand human behavior and assign a label to
each action. It has a wide range of applications, and therefore has been attracting increasing …

A survey on video-based human action recognition: recent updates, datasets, challenges, and applications

P Pareek, A Thakkar - Artificial Intelligence Review, 2021 - Springer
Abstract Human Action Recognition (HAR) involves human activity monitoring task in
different areas of medical, education, entertainment, visual surveillance, video retrieval, as …

Memvit: Memory-augmented multiscale vision transformer for efficient long-term video recognition

CY Wu, Y Li, K Mangalam, H Fan… - Proceedings of the …, 2022 - openaccess.thecvf.com
While today's video recognition systems parse snapshots or short clips accurately, they
cannot connect the dots and reason across a longer range of time yet. Most existing video …

Tdn: Temporal difference networks for efficient action recognition

L Wang, Z Tong, B Ji, G Wu - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
Temporal modeling still remains challenging for action recognition in videos. To mitigate this
issue, this paper presents a new video architecture, termed as Temporal Difference Network …

Bmn: Boundary-matching network for temporal action proposal generation

T Lin, X Liu, X Li, E Ding, S Wen - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Temporal action proposal generation is an challenging and promising task which aims to
locate temporal regions in real-world videos where action or event may occur. Current …

Self-supervised visual feature learning with deep neural networks: A survey

L Jing, Y Tian - IEEE transactions on pattern analysis and …, 2020 - ieeexplore.ieee.org
Large-scale labeled data are generally required to train deep neural networks in order to
obtain better performance in visual feature learning from images or videos for computer …

Temporal pyramid network for action recognition

C Yang, Y Xu, J Shi, B Dai… - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
Visual tempo characterizes the dynamics and the temporal scale of an action. Modeling
such visual tempos of different actions facilitates their recognition. Previous works often …

Word-level deep sign language recognition from video: A new large-scale dataset and methods comparison

D Li, C Rodriguez, X Yu, H Li - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
Vision-based sign language recognition aims at helping the hearing-impaired people to
communicate with others. However, most existing sign language datasets are limited to a …

Tcgl: Temporal contrastive graph for self-supervised video representation learning

Y Liu, K Wang, L Liu, H Lan, L Lin - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Video self-supervised learning is a challenging task, which requires significant expressive
power from the model to leverage rich spatial-temporal knowledge and generate effective …

Deep learning for spatio-temporal data mining: A survey

S Wang, J Cao, SY Philip - IEEE transactions on knowledge …, 2020 - ieeexplore.ieee.org
With the fast development of various positioning techniques such as Global Position System
(GPS), mobile devices and remote sensing, spatio-temporal data has become increasingly …