A3d: Adaptive 3d networks for video action recognition

H Cai, J Lin, Y Lin, Z Liu, H Tang, H Wang… - ACM Transactions on …, 2022 - dl.acm.org

Deep neural networks (DNNs) have achieved unprecedented success in the field of artificial
intelligence (AI), including computer vision, natural language processing, and speech …

被引用次数：84 相关文章所有 6 个版本

[PDF] thecvf.com

Adaptive focus for efficient video recognition

Y Wang, Z Chen, H Jiang, S Song… - proceedings of the …, 2021 - openaccess.thecvf.com

In this paper, we explore the spatial redundancy in video recognition with the aim to improve
the computational efficiency. It is observed that the most informative region in each frame of …

被引用次数：94 相关文章所有 7 个版本

[PDF] thecvf.com

Stcrowd: A multimodal dataset for pedestrian perception in crowded scenes

P Cong, X Zhu, F Qiao, Y Ren, X Peng… - Proceedings of the …, 2022 - openaccess.thecvf.com

Accurately detecting and tracking pedestrians in 3D space is challenging due to large
variations in rotations, poses and scales. The situation becomes even worse for dense …

被引用次数：33 相关文章所有 10 个版本

[PDF] acm.org

Shuffle-invariant network for action recognition in videos

Q Shi, HB Zhang, Z Li, JX Du, Q Lei, JH Liu - ACM Transactions on …, 2022 - dl.acm.org

The local key features in video are important for improving the accuracy of human action
recognition. However, most end-to-end methods focus on global feature learning from …

被引用次数：25 相关文章

Attention-driven appearance-motion fusion network for action recognition

S Liu, X Ma - IEEE Transactions on Multimedia, 2022 - ieeexplore.ieee.org

Recent years have witnessed the popularity of using a two-stream architecture and attention
mechanism for action recognition with videos. However, it is time-consuming to train two …

被引用次数：9 相关文章所有 2 个版本

[PDF] thecvf.com

Searching for two-stream models in multivariate space for video recognition

X Gong, H Wang, MZ Shou, M Feiszli… - Proceedings of the …, 2021 - openaccess.thecvf.com

Conventional video models rely on a single stream to capture the complex spatial-temporal
features. Recent work on two-stream video models, such as SlowFast network and …

被引用次数：8 相关文章所有 8 个版本

TLEE: Temporal-wise and Layer-wise Early Exiting Network for Efficient Video Recognition on Edge Devices

Q Wang, W Fang, NN Xiong - IEEE Internet of Things Journal, 2023 - ieeexplore.ieee.org

With the explosive growth in video streaming comes a rising demand for efficient and
scalable video understanding. State-of-the-art video recognition approaches based on …

被引用次数：1 相关文章

[HTML] frontiersin.org

[HTML][HTML] 3D network with channel excitation and knowledge distillation for action recognition

Z Hu, J Mao, J Yao, S Bi - Frontiers in Neurorobotics, 2023 - frontiersin.org

Modern action recognition techniques frequently employ two networks: the spatial stream,
which accepts input from RGB frames, and the temporal stream, which accepts input from …

被引用次数：1 相关文章所有 8 个版本

[PDF] city.ac.uk

Fruity: A Multi-modal Dataset for Fruit Recognition and 6D-Pose Estimation in Precision Agriculture

M Abdulsalam, Z Chekakta, N Aouf… - … Conference on Control …, 2023 - ieeexplore.ieee.org

The application of robotic platforms for precision agriculture is gaining traction in modern
research. However, the demand for a complete fruit dataset is still not satisfied. In this paper …

被引用次数：1 相关文章

[PDF] city.ac.uk

Artificial Intelligence based Robotic Platforms for Autonomous Precision Agriculture

M Abdulsalam - 2023 - openaccess.city.ac.uk

Robotic applications are continuously expanding into every aspect of human livelihood, it
becomes paramount to leverage this trend for precision agriculture. The agricultural sector …

高级搜索

QQ 群