Unihcp: A unified model for human-centric perceptions

Y Ci, Y Wang, M Chen, S Tang, L Bai… - Proceedings of the …, 2023 - openaccess.thecvf.com
Human-centric perceptions (eg, pose estimation, human parsing, pedestrian detection,
person re-identification, etc.) play a key role in industrial applications of visual models. While …

Scale-aware fast R-CNN for pedestrian detection

J Li, X Liang, SM Shen, T Xu, J Feng… - IEEE transactions on …, 2017 - ieeexplore.ieee.org
In this paper, we consider the problem of pedestrian detection in natural scenes. Intuitively,
instances of pedestrians with different spatial scales may exhibit dramatically different …

Online multi-object tracking using CNN-based single object tracker with spatial-temporal attention mechanism

Q Chu, W Ouyang, H Li, X Wang… - Proceedings of the …, 2017 - openaccess.thecvf.com
In this paper, we propose a CNN-based framework for online MOT. This framework utilizes
the merits of single object trackers in adapting appearance models and searching for target …

Optical flow guided feature: A fast and robust motion representation for video action recognition

S Sun, Z Kuang, L Sheng… - Proceedings of the …, 2018 - openaccess.thecvf.com
Motion representation plays a vital role in human action recognition in videos. In this study,
we introduce a novel compact motion representation for video action recognition, named …

Video scene analysis: an overview and challenges on deep learning algorithms

Q Abbas, MEA Ibrahim, MA Jaffar - Multimedia Tools and Applications, 2018 - Springer
Video scene analysis is a recent research topic due to its vital importance in many
applications such as real-time vehicle activity tracking, pedestrian detection, surveillance …

Stct: Sequentially training convolutional networks for visual tracking

L Wang, W Ouyang, X Wang… - Proceedings of the IEEE …, 2016 - openaccess.thecvf.com
Due to the limited amount of training samples, fine-tuning pre-trained deep models online is
prone to over-fitting. In this paper, we propose a sequential training method for convolutional …

Factors in finetuning deep model for object detection with long-tail distribution

W Ouyang, X Wang, C Zhang… - Proceedings of the IEEE …, 2016 - openaccess.thecvf.com
Finetuning from a pretrained deep model is found to yield state-of-the-art performance for
many vision tasks. This paper investigates many factors that influence the performance in …

[HTML][HTML] 基于岩石图像深度学习的岩性自动识别与分类方法

张野, 李明超, 韩帅 - 岩石学报, 2018 - html.rhhz.net
岩石岩性的识别与分类对于地质分析极为重要, 采用机器学习的方法建立识别模型进行自动分类
是一条新的途径. 基于Inception-v3 深度卷积神经网络模型, 建立了岩石图像集分析的深度学习 …

Jointly learning deep features, deformable parts, occlusion and classification for pedestrian detection

W Ouyang, H Zhou, H Li, Q Li, J Yan… - IEEE transactions on …, 2017 - ieeexplore.ieee.org
Feature extraction, deformation handling, occlusion handling, and classification are four
important components in pedestrian detection. Existing methods learn or design these …

Spatio-contextual deep network-based multimodal pedestrian detection for autonomous driving

K Dasgupta, A Das, S Das… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Pedestrian Detection is the most critical module of an Autonomous Driving system. Although
a camera is commonly used for this purpose, its quality degrades severely in low-light night …