A review of video action recognition based on 3D convolution

X Huang, Z Cai - Computers and Electrical Engineering, 2023 - Elsevier
Video action recognition is one of the topics for video understanding. Over the past decade,
video action recognition has made great progress due to the emergence of deep learning …

Cee-net: complementary end-to-end network for 3d human pose generation and estimation

H Li, CM Pun - Proceedings of the AAAI Conference on Artificial …, 2023 - ojs.aaai.org
The limited number of actors and actions in existing datasets make 3D pose estimators tend
to overfit, which can be seen from the performance degradation of the algorithm on cross …

Enhancing traffic safety with parallel dense video captioning for end-to-end event analysis

M Shoman, D Wang, A Aboah… - Proceedings of the …, 2024 - openaccess.thecvf.com
This paper introduces our solution for Track 2 in AI City Challenge 2024. The task aims to
solve traffic safety description and analysis with the dataset of Woven Traffic Safety (WTS) a …

Glimpse and focus: Global and local-scale graph convolution network for skeleton-based action recognition

X Gao, S Du, Y Yang - Neural Networks, 2023 - Elsevier
In the 3D skeleton-based action recognition task, learning rich spatial and temporal motion
patterns from body joints are two foundational yet under-explored problems. In this paper …

Spatial–temporal hypergraph based on dual-stage attention network for multi-view data lightweight action recognition

Z Wu, N Ma, C Wang, C Xu, G Xu, M Li - Pattern Recognition, 2024 - Elsevier
For the problems of irrelevant frames and high model complexity in action recognition, we
propose a Spatial–Temporal Hypergraph based on Dual-Stage Attention Network (STHG …

Boosting Rare Scenario Perception in Autonomous Driving: An Adaptive Approach With MoEs and LoRA

Y Li, Y Lin, L Zhong, R Yin, Y Ji… - IEEE Internet of …, 2024 - ieeexplore.ieee.org
Autonomous driving technology has achieved remarkable advancements, offering
substantial potential to revolutionize traffic safety and smart mobility. However, when faced …

Learning heterogeneous spatial–temporal context for skeleton-based action recognition

X Gao, Y Yang, Y Wu, S Du - IEEE Transactions on Neural …, 2023 - ieeexplore.ieee.org
Graph convolution networks (GCNs) have been widely used and achieved fruitful progress
in the skeleton-based action recognition task. In GCNs, node interaction modeling …

Pedestrian crossing intention prediction from surveillance videos for over-the-horizon safety warning

W Zhou, Y Liu, L Zhao, S Xu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Pedestrian crossing intention prediction could effectively prevent traffic injuries and improve
pedestrian safety. This paper focuses on pedestrian crossing intention prediction from …

FineSports: A Multi-person Hierarchical Sports Video Dataset for Fine-grained Action Understanding

J Xu, G Zhao, S Yin, W Zhou… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Fine-grained action analysis in multi-person sports is complex due to athletes' quick
movements and intense physical confrontations which result in severe visual obstructions in …

Blockchain-based Vision Transformer Network for Safe Driving Perception

N Sengar, I Kumari, J Lee, D Har - IEEE Sensors Journal, 2024 - ieeexplore.ieee.org
Driver distraction is a principal cause of traffic accidents. In a study conducted by the
National Highway Traffic Safety Administration, engaging in activities such as interacting …