Open world object detection: A survey

Y Li, Y Wang, W Wang, D Lin, B Li… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Exploring new knowledge is a fundamental human ability that can be mirrored in the
development of deep neural networks, especially in the field of object detection. Open world …

Single-stage zero-shot object detection network based on CLIP and pseudo-labeling

J Li, S Sun, K Zhang, J Zhang, L Zhuo - International Journal of Machine …, 2024 - Springer
The detection of unknown objects is a challenging task in computer vision because,
although there are diverse real-world detection object categories, existing object-detection …

Exploring Orthogonality in Open World Object Detection

Z Sun, J Li, Y Mu - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Open world object detection aims to identify objects of unseen categories and incrementally
recognize them once their annotations are provided. In distinction to the traditional paradigm …

DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection

JS Lim, Z Chen, M Baktashmotlagh, Z Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
Class-agnostic object detection (OD) can be a cornerstone or a bottleneck for many
downstream vision tasks. Despite considerable advancements in bottom-up and multi-object …

Usd: Unknown sensitive detector empowered by decoupled objectness and segment anything model

Y He, W Chen, Y Tan, S Wang - arXiv preprint arXiv:2306.02275, 2023 - arxiv.org
Open World Object Detection (OWOD) is a novel and challenging computer vision task that
enables object detection with the ability to detect unknown objects. Existing methods …

QDETRv: Query-Guided DETR for One-Shot Object Localization in Videos

Y Kumar, S Mallick, A Mishra, S Rasipuram… - Proceedings of the …, 2024 - ojs.aaai.org
In this work, we study one-shot video object localization problem that aims to localize
instances of unseen objects in the target video using a single query image of the object …

SM4Depth: Seamless Monocular Metric Depth Estimation across Multiple Cameras and Scenes by One Model

Y Liu, F Xue, A Ming, M Zhao, H Ma… - Proceedings of the 32nd …, 2024 - dl.acm.org
In the last year, universal monocular metric depth estimation (universal MMDE) has gained
considerable attention, serving as the foundation model for various multimedia tasks, such …

Systematic Evaluation of Uncertainty Calibration in Pretrained Object Detectors

D Huseljic, M Herde, P Hahn, M Müjde… - International Journal of …, 2024 - Springer
In the field of deep learning based computer vision, the development of deep object
detection has led to unique paradigms (eg, two-stage or set-based) and architectures (eg …

Uni-YOLO: Vision-Language Model-Guided YOLO for Robust and Fast Universal Detection in the Open World

X Wang, W Ren, X Chen, H Fan, Y Tang… - Proceedings of the 32nd …, 2024 - dl.acm.org
Universal object detectors aim to detect any object in any scene without human annotation,
exhibiting superior generalization. However, the current universal object detectors show …

Sniffing Threatening Open-World Objects in Autonomous Driving by Open-Vocabulary Models

Y He, S Wang, W Chen, T Xun, Y Tan - Proceedings of the 32nd ACM …, 2024 - dl.acm.org
Autonomous driving (AD) is a typical application that requires effectively exploiting
multimedia information. For AD, it is critical to ensure safety by detecting unknown objects in …