Human action recognition and prediction: A survey

Y Kong, Y Fu - International Journal of Computer Vision, 2022 - Springer
Derived from rapid advances in computer vision and machine learning, video analysis tasks
have been moving from inferring the present state to predicting the future state. Vision-based …

Spatio-temporal attention networks for action recognition and detection

J Li, X Liu, W Zhang, M Zhang, J Song… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Recently, 3D Convolutional Neural Network (3D CNN) models have been widely studied for
video sequences and achieved satisfying performance in action recognition and detection …

Online human action detection and anticipation in videos: A survey

X Hu, J Dai, M Li, C Peng, Y Li, S Du - Neurocomputing, 2022 - Elsevier
To meet the demand for powerful models for practical applications in real time, the focus of
research on human actions has shifted from offline detection to online and real-time …

Temporal action localization in the deep learning era: A survey

B Wang, Y Zhao, L Yang, T Long… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
The temporal action localization research aims to discover action instances from untrimmed
videos, representing a fundamental step in the field of intelligent video understanding. With …

Action-centric relation transformer network for video question answering

J Zhang, J Shao, R Cao, L Gao, X Xu… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Video question answering (VideoQA) has emerged as a popular research topic in recent
years. Enormous efforts have been devoted to developing more effective fusion strategies …

Exploiting informative video segments for temporal action localization

C Sun, H Song, X Wu, Y Jia… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
We propose a novel method of exploiting informative video segments by learning segment
weights for temporal action localization in untrimmed videos. Informative video segments …

Entropy guided attention network for weakly-supervised action localization

Y Cheng, Y Sun, H Fan, T Zhuo, JH Lim… - Pattern Recognition, 2022 - Elsevier
One major challenge of Weakly-supervised Temporal Action Localization (WTAL) is to
handle diverse backgrounds in videos. To model background frames, most existing methods …

Temporal textual localization in video via adversarial bi-directional interaction networks

Z Zhang, Z Zhao, Z Zhang, Z Lin… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Given a natural language description, temporal textual localization aims to localize the most
relevant segment in an untrimmed video, which is a natural and imperative extension of …

Action coherence network for weakly-supervised temporal action localization

Y Zhai, L Wang, W Tang, Q Zhang… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Weakly-supervised Temporal Action Localization (W-TAL) aims at simultaneously classifying
and locating all action instances with only video-level supervision. However, current W-TAL …

A simple teacher behavior recognition method for massive teaching videos based on teacher set

Z Gang, Z Wenjuan, H Biling, C Jie, H Hui, X Qing - Applied Intelligence, 2021 - Springer
The analysis of teacher behavior of massive teaching videos has become a surge of
research interest recently. Traditional methods rely on accurate manual analysis, which is …