A survey of modern deep learning based object detection models

SSA Zaidi, MS Ansari, A Aslam, N Kanwal… - Digital Signal …, 2022 - Elsevier
Object Detection is the task of classification and localization of objects in an image or video.
It has gained prominence in recent years due to its widespread applications. This article …

[HTML][HTML] A review on deep learning in UAV remote sensing

LP Osco, JM Junior, APM Ramos… - International Journal of …, 2021 - Elsevier
Abstract Deep Neural Networks (DNNs) learn representation from data with an impressive
capability, and brought important breakthroughs for processing images, time-series, natural …

YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

CY Wang, A Bochkovskiy… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Real-time object detection is one of the most important research topics in computer vision.
As new approaches regarding architecture optimization and training optimization are …

Convolutions die hard: Open-vocabulary segmentation with single frozen convolutional clip

Q Yu, J He, X Deng, X Shen… - Advances in Neural …, 2024 - proceedings.neurips.cc
Open-vocabulary segmentation is a challenging task requiring segmenting and recognizing
objects from an open set of categories in diverse environments. One way to address this …

Towards large-scale small object detection: Survey and benchmarks

G Cheng, X Yuan, X Yao, K Yan, Q Zeng… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
With the rise of deep convolutional neural networks, object detection has achieved
prominent advances in past years. However, such prosperity could not camouflage the …

Swin transformer: Hierarchical vision transformer using shifted windows

Z Liu, Y Lin, Y Cao, H Hu, Y Wei… - Proceedings of the …, 2021 - openaccess.thecvf.com
This paper presents a new vision Transformer, called Swin Transformer, that capably serves
as a general-purpose backbone for computer vision. Challenges in adapting Transformer …

Anchor detr: Query design for transformer-based detector

Y Wang, X Zhang, T Yang, J Sun - … of the AAAI conference on artificial …, 2022 - ojs.aaai.org
In this paper, we propose a novel query design for the transformer-based object detection. In
previous transformer-based detectors, the object queries are a set of learned embeddings …

Simple copy-paste is a strong data augmentation method for instance segmentation

G Ghiasi, Y Cui, A Srinivas, R Qian… - Proceedings of the …, 2021 - openaccess.thecvf.com
Building instance segmentation models that are data-efficient and can handle rare object
categories is an important challenge in computer vision. Leveraging data augmentations is a …

Scaled-yolov4: Scaling cross stage partial network

CY Wang, A Bochkovskiy… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
We show that the YOLOv4 object detection neural network based on the CSP approach,
scales both up and down and is applicable to small and large networks while maintaining …

A normalized Gaussian Wasserstein distance for tiny object detection

J Wang, C Xu, W Yang, L Yu - arXiv preprint arXiv:2110.13389, 2021 - arxiv.org
Detecting tiny objects is a very challenging problem since a tiny object only contains a few
pixels in size. We demonstrate that state-of-the-art detectors do not produce satisfactory …