Enable deep learning on mobile devices: Methods, systems, and applications

H Cai, J Lin, Y Lin, Z Liu, H Tang, H Wang… - ACM Transactions on …, 2022 - dl.acm.org
Deep neural networks (DNNs) have achieved unprecedented success in the field of artificial
intelligence (AI), including computer vision, natural language processing, and speech …

Adaptive focus for efficient video recognition

Y Wang, Z Chen, H Jiang, S Song… - proceedings of the …, 2021 - openaccess.thecvf.com
In this paper, we explore the spatial redundancy in video recognition with the aim to improve
the computational efficiency. It is observed that the most informative region in each frame of …

Stcrowd: A multimodal dataset for pedestrian perception in crowded scenes

P Cong, X Zhu, F Qiao, Y Ren, X Peng… - Proceedings of the …, 2022 - openaccess.thecvf.com
Accurately detecting and tracking pedestrians in 3D space is challenging due to large
variations in rotations, poses and scales. The situation becomes even worse for dense …

Shuffle-invariant network for action recognition in videos

Q Shi, HB Zhang, Z Li, JX Du, Q Lei, JH Liu - ACM Transactions on …, 2022 - dl.acm.org
The local key features in video are important for improving the accuracy of human action
recognition. However, most end-to-end methods focus on global feature learning from …

Attention-driven appearance-motion fusion network for action recognition

S Liu, X Ma - IEEE Transactions on Multimedia, 2022 - ieeexplore.ieee.org
Recent years have witnessed the popularity of using a two-stream architecture and attention
mechanism for action recognition with videos. However, it is time-consuming to train two …

Searching for two-stream models in multivariate space for video recognition

X Gong, H Wang, MZ Shou, M Feiszli… - Proceedings of the …, 2021 - openaccess.thecvf.com
Conventional video models rely on a single stream to capture the complex spatial-temporal
features. Recent work on two-stream video models, such as SlowFast network and …

TLEE: Temporal-wise and Layer-wise Early Exiting Network for Efficient Video Recognition on Edge Devices

Q Wang, W Fang, NN Xiong - IEEE Internet of Things Journal, 2023 - ieeexplore.ieee.org
With the explosive growth in video streaming comes a rising demand for efficient and
scalable video understanding. State-of-the-art video recognition approaches based on …

[HTML][HTML] 3D network with channel excitation and knowledge distillation for action recognition

Z Hu, J Mao, J Yao, S Bi - Frontiers in Neurorobotics, 2023 - frontiersin.org
Modern action recognition techniques frequently employ two networks: the spatial stream,
which accepts input from RGB frames, and the temporal stream, which accepts input from …

Fruity: A Multi-modal Dataset for Fruit Recognition and 6D-Pose Estimation in Precision Agriculture

M Abdulsalam, Z Chekakta, N Aouf… - … Conference on Control …, 2023 - ieeexplore.ieee.org
The application of robotic platforms for precision agriculture is gaining traction in modern
research. However, the demand for a complete fruit dataset is still not satisfied. In this paper …

Artificial Intelligence based Robotic Platforms for Autonomous Precision Agriculture

M Abdulsalam - 2023 - openaccess.city.ac.uk
Robotic applications are continuously expanding into every aspect of human livelihood, it
becomes paramount to leverage this trend for precision agriculture. The agricultural sector …