Action recognition in compressed domains: A survey

Y Ming, J Zhou, N Hu, F Feng, P Zhao, B Lyu, H Yu - Neurocomputing, 2024 - Elsevier
Human action recognition (HAR) refers to the process in which computers analyze and
process video data to obtain the categories of action presented in the video. It has a wide …

Dynamic spatial focus for efficient compressed video action recognition

Z Zheng, L Yang, Y Wang, M Zhang… - … on Circuits and …, 2023 - ieeexplore.ieee.org
Recent years have witnessed a growing interest in compressed video action recognition due
to the rapid growth of online videos. It remarkably reduces the storage by replacing raw …

Motion adaptive pose estimation from compressed videos

Z Fan, J Liu, Y Wang - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
Human pose estimation from videos has many real-world applications. Existing methods
focus on applying models with a uniform computation profile on fully decoded frames …

Representation learning for compressed video action recognition via attentive cross-modal interaction with motion enhancement

B Li, J Chen, D Zhang, X Bao, D Huang - arXiv preprint arXiv:2205.03569, 2022 - arxiv.org
Compressed video action recognition has recently drawn growing attention, since it
remarkably reduces the storage and computational cost via replacing raw videos by …

Efficient semantic segmentation by altering resolutions for compressed videos

Y Hu, Y He, Y Li, J Li, Y Han… - Proceedings of the …, 2023 - openaccess.thecvf.com
Video semantic segmentation (VSS) is a computationally expensive task due to the
per-frame prediction for videos of high frame rates. In recent work, compact models or adaptive …

Compressed video action recognition with dual-stream and dual-modal transformer

Y Mou, X Jiang, K Xu, T Sun… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Compressed video action recognition offers the advantage of reducing decoding and
inference time compared to the RGB domain. However, the compressed domain poses …

SOR-TC: Self-attentive octave ResNet with temporal consistency for compressed video action recognition

J Zhang, X Wang, Y Wan, L Wang, J Wang, PS Yu - Neurocomputing, 2023 - Elsevier
Modeling and recognizing video activities from videos are key parts of many promising
applications such as visual surveillance, human–computer interaction, and video …

Compressed video prompt tuning

B Li, J Chen, X Bao, D Huang - Advances in Neural …, 2023 - proceedings.neurips.cc
Compressed videos offer a compelling alternative to raw videos, showing the possibility to
significantly reduce the on-line computational and storage cost. However, current …

Joint feature optimization and fusion for compressed action recognition

H Li, X Jiang, B Guan, RRM Tan… - … on Image Processing, 2021 - ieeexplore.ieee.org
Recent methods including CoViAR and DMC-Net provide a new paradigm for action
recognition since they are directly targeted at compressed videos (e.g., MPEG4 files). It …

Spatiotemporal attention enhanced features fusion network for action recognition

D Zhuang, M Jiang, J Kong, T Liu - International Journal of Machine …, 2021 - Springer
In recent years, action recognition has become a popular and challenging task in computer
vision. Nowadays, two-stream networks with appearance stream and motion stream can …