Transform Domain Learning for Image Recognition

D Tan, J Zhao, S Li - IEEE Access, 2024 - ieeexplore.ieee.org
Image and video classification are distinct tasks in computer vision. Three-dimensional
convolutional neural networks (3D CNNs) are commonly employed for video classification …

A Compression and Recognition Joint Model for Structured Video Surveillance Storage

D Du, C Zhang, Y Wang, X Kuang, Y Yang… - … on Frontiers of …, 2021 - dl.acm.org
Structured data storage of surveillance video helps to reduce the time for information
retrieval. However, modern surveillance systems have to perform the compressing and …

Video mobile-former: Video recognition with efficient global spatial-temporal modeling

R Wang, Z Wu, D Chen, Y Chen, X Dai, M Liu… - arXiv preprint arXiv …, 2022 - arxiv.org
Transformer-based models have achieved top performance on major video recognition
benchmarks. Benefiting from the self-attention mechanism, these models show stronger …

Video expression recognition method based on spatiotemporal recurrent neural network and feature fusion

X Zhou - Journal of Information Processing Systems, 2021 - koreascience.kr
Automatically recognizing facial expressions in video sequences is a challenging task
because there is little direct correlation between facial features and subjective emotions in …

Manifolds-Based Low-Rank Dictionary Pair Learning for Efficient Set-Based Video Recognition

X Gao, K Wei, J Li, Z Shi, H Zhao, S Niu - Applied Sciences, 2023 - mdpi.com
As an important research direction in image and video processing, set-based video
recognition requires speed and accuracy. However, the existing static modeling methods …

Max-margin adaptive model for complex video pattern recognition

L Yu, J Shao, XS Xu, HT Shen - Multimedia Tools and Applications, 2015 - Springer
Pattern recognition models are usually used in a variety of applications ranging from video
concept annotation to event detection. In this paper we propose a new framework called the …

Teinet: Towards an efficient architecture for video recognition

Z Liu, D Luo, Y Wang, L Wang, Y Tai, C Wang… - Proceedings of the …, 2020 - ojs.aaai.org
Efficiency is an important issue in designing video architectures for action recognition. 3D
CNNs have witnessed remarkable progress in action recognition from videos. However …

Motion recognition algorithm in VR video based on dual feature fusion and adaptive promotion

K Han - IEEE Access, 2020 - ieeexplore.ieee.org
For VR video recognition in complex environments, a motion recognition algorithm based on
two-feature fusion and adaptive enhancement is proposed to solve the problems of inaccurate …

PosMLP-Video: Spatial and Temporal Relative Position Encoding for Efficient Video Recognition

Y Hao, D Zhou, Z Wang, CW Ngo, X He, M Wang - 2023 - researchsquare.com
In recent years, vision Transformers and MLPs have demonstrated remarkable performance
in image understanding tasks. However, their inherently dense computational operators …

Expanding Language-Image Pretrained Models for General Video Recognition (Supplementary Material)

B Ni, H Peng, M Chen, S Zhang, G Meng, J Fu, S Xiang… - ecva.net
This supplementary material contains additional details of the main manuscript, and
provides more experiment analysis. In Sec. 1, we present the details of our proposed …