相关文章- 学术资源搜索

Transform Domain Learning for Image Recognition

D Tan, J Zhao, S Li - IEEE Access, 2024 - ieeexplore.ieee.org

Image and video classification are distinct tasks in computer vision. Three-dimensional
convolutional neural networks (3D CNNs) are commonly employed for video classification …

A Compression and Recognition Joint Model for Structured Video Surveillance Storage

D Du, C Zhang, Y Wang, X Kuang, Y Yang… - … on Frontiers of …, 2021 - dl.acm.org

Structured data storage of surveillance video helps to reduce the time for information
retrieval. However, modern surveillance systems have to perform the compressing and …

[PDF] arxiv.org

Video mobile-former: Video recognition with efficient global spatial-temporal modeling

R Wang, Z Wu, D Chen, Y Chen, X Dai, M Liu… - arXiv preprint arXiv …, 2022 - arxiv.org

Transformer-based models have achieved top performance on major video recognition
benchmarks. Benefiting from the self-attention mechanism, these models show stronger …

被引用次数：3 相关文章所有 2 个版本

[PDF] koreascience.kr

Video expression recognition method based on spatiotemporal recurrent neural network and feature fusion

X Zhou - Journal of Information Processing Systems, 2021 - koreascience.kr

Automatically recognizing facial expressions in video sequences is a challenging task
because there is little direct correlation between facial features and subjective emotions in …

被引用次数：10 相关文章所有 7 个版本

[PDF] mdpi.com

Manifolds-Based Low-Rank Dictionary Pair Learning for Efficient Set-Based Video Recognition

X Gao, K Wei, J Li, Z Shi, H Zhao, S Niu - Applied Sciences, 2023 - mdpi.com

As an important research direction in image and video processing, set-based video
recognition requires speed and accuracy. However, the existing static modeling methods …

被引用次数：1 相关文章所有 3 个版本

[PDF] researchgate.net

Max-margin adaptive model for complex video pattern recognition

L Yu, J Shao, XS Xu, HT Shen - Multimedia Tools and Applications, 2015 - Springer

Patternrecognitionmodels are usually used in a variety of applications ranging from video
concept annotation to event detection. In this paper we propose a new framework called the …

被引用次数：3 相关文章所有 8 个版本

[PDF] aaai.org

Teinet: Towards an efficient architecture for video recognition

Z Liu, D Luo, Y Wang, L Wang, Y Tai, C Wang… - Proceedings of the …, 2020 - ojs.aaai.org

Efficiency is an important issue in designing video architectures for action recognition. 3D
CNNs have witnessed remarkable progress in action recognition from videos. However …

被引用次数：246 相关文章所有 8 个版本

[PDF] ieee.org

Motion recognition algorithm in VR video based on dual feature fusion and adaptive promotion

K Han - Ieee Access, 2020 - ieeexplore.ieee.org

VR video recognition in complex environment, a motion recognition algorithm based on two-
feature fusion and adaptive enhancement is proposed to solve the problems of inaccurate …

被引用次数：4 相关文章所有 2 个版本

[PDF] researchsquare.com

PosMLP-Video: Spatial and Temporal Relative Position Encoding for Efficient Video Recognition

Y Hao, D Zhou, Z Wang, CW Ngo, X He, M Wang - 2023 - researchsquare.com

In recent years, vision Transformers and MLPs have demonstrated remarkable performance
in image understanding tasks. However, their inherently dense computational operators …

[PDF][PDF] Expanding Language-Image Pretrained Models for General Video Recognition——–Supplementary Material——–

B Ni, H Peng, M Chen, S Zhang, G Meng, J Fu, S Xiang… - ecva.net

This supplementary material contains additional details of the main manuscript, and
provides more experiment analysis. In Sec. 1, we present the details of our proposed …

高级搜索

QQ 群