DiffusionDet: Diffusion model for object detection

S Chen, P Sun, Y Song, P Luo - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
We propose DiffusionDet, a new framework that formulates object detection as a denoising
diffusion process from noisy boxes to object boxes. During the training stage, object boxes …

A survey on vision transformer

K Han, Y Wang, H Chen, X Chen, J Guo… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Transformer, first applied to the field of natural language processing, is a type of deep neural
network mainly based on the self-attention mechanism. Thanks to its strong representation …

Sparse R-CNN: End-to-end object detection with learnable proposals

P Sun, R Zhang, Y Jiang, T Kong… - Proceedings of the …, 2021 - openaccess.thecvf.com
We present Sparse R-CNN, a purely sparse method for object detection in images.
Existing works on object detection heavily rely on dense object candidates, such as k anchor …

A survey on visual transformer

K Han, Y Wang, H Chen, X Chen, J Guo, Z Liu… - arXiv preprint arXiv …, 2020 - arxiv.org
Transformer, first applied to the field of natural language processing, is a type of deep neural
network mainly based on the self-attention mechanism. Thanks to its strong representation …

Rank-DETR for high quality object detection

Y Pu, W Liang, Y Hao, Y Yuan… - Advances in …, 2024 - proceedings.neurips.cc
Modern detection transformers (DETRs) use a set of object queries to predict a list of
bounding boxes, sort them by their classification confidence scores, and select the top …

UniHCP: A unified model for human-centric perceptions

Y Ci, Y Wang, M Chen, S Tang, L Bai… - Proceedings of the …, 2023 - openaccess.thecvf.com
Human-centric perceptions (e.g., pose estimation, human parsing, pedestrian detection,
person re-identification, etc.) play a key role in industrial applications of visual models. While …

Occlusion handling and multi-scale pedestrian detection based on deep learning: A review

F Li, X Li, Q Liu, Z Li - IEEE Access, 2022 - ieeexplore.ieee.org
Pedestrian detection is an important branch of computer vision, and has important
applications in the fields of autonomous driving, artificial intelligence and video surveillance …

Progressive end-to-end object detection in crowded scenes

A Zheng, Y Zhang, X Zhang, X Qi… - Proceedings of the …, 2022 - openaccess.thecvf.com
In this paper, we propose a new query-based detection framework for crowd detection.
Previous query-based detectors suffer from two drawbacks: first, multiple predictions will be …

From handcrafted to deep features for pedestrian detection: A survey

J Cao, Y Pang, J Xie, FS Khan… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
Pedestrian detection is an important but challenging problem in computer vision, especially
in human-centric tasks. Over the past decade, significant improvement has been witnessed …

Sparse R-CNN: An end-to-end framework for object detection

P Sun, R Zhang, Y Jiang, T Kong, C Xu… - IEEE transactions on …, 2023 - ieeexplore.ieee.org
Object detection serves as one of the most fundamental computer vision tasks. Existing works
on object detection heavily rely on dense object candidates, such as anchor boxes pre …