Detclipv2: Scalable open-vocabulary object detection pre-training via word-region alignment

L Yao, J Han, X Liang, D Xu… - Proceedings of the …, 2023 - openaccess.thecvf.com
This paper presents DetCLIPv2, an efficient and scalable training framework that
incorporates large-scale image-text pairs to achieve open-vocabulary object detection …

Rethinking the value of network pruning

Z Liu, M Sun, T Zhou, G Huang, T Darrell - arXiv preprint arXiv:1810.05270, 2018 - arxiv.org
Network pruning is widely used for reducing the heavy inference cost of deep models in low-
resource settings. A typical pruning algorithm is a three-stage pipeline, ie, training (a large …

Graph r-cnn for scene graph generation

J Yang, J Lu, S Lee, D Batra… - Proceedings of the …, 2018 - openaccess.thecvf.com
We propose a novel scene graph generation model called Graph R-CNN, that is both
effective and efficient at detecting objects and their relations in images. Our model contains …

Neural baby talk

J Lu, J Yang, D Batra, D Parikh - Proceedings of the IEEE …, 2018 - openaccess.thecvf.com
We introduce a novel framework for image captioning that can produce natural language
explicitly grounded in entities that object detectors find in the image. Our approach …

Unbiased mean teacher for cross-domain object detection

J Deng, W Li, Y Chen, L Duan - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Cross-domain object detection is challenging, because object detection model is often
vulnerable to data variance, especially to the considerable domain shift between two …

Instance-conditional knowledge distillation for object detection

Z Kang, P Zhang, X Zhang, J Sun… - Advances in Neural …, 2021 - proceedings.neurips.cc
Abstract Knowledge distillation has shown great success in classification, however, it is still
challenging for detection. In a typical image for detection, representations from different …

Towards universal object detection by domain attention

X Wang, Z Cai, D Gao… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Despite increasing efforts on universal representations for visual recognition, few have
addressed object detection. In this paper, we develop an effective and efficient universal …

Differentiable learning-to-normalize via switchable normalization

P Luo, J Ren, Z Peng, R Zhang, J Li - arXiv preprint arXiv:1806.10779, 2018 - arxiv.org
We address a learning-to-normalize problem by proposing Switchable Normalization (SN),
which learns to select different normalizers for different normalization layers of a deep neural …

Probabilistic object detection: Definition and evaluation

D Hall, F Dayoub, J Skinner, H Zhang… - Proceedings of the …, 2020 - openaccess.thecvf.com
Abstract We introduce Probabilistic Object Detection, the task of detecting objects in images
and accurately quantifying the spatial and semantic uncertainties of the detections. Given …

Borrow from anywhere: Pseudo multi-modal object detection in thermal imagery

C Devaguptapu, N Akolekar… - Proceedings of the …, 2019 - openaccess.thecvf.com
Can we improve detection in the thermal domain by borrowing features from rich domains
like visual RGB? In this paper, we propose a pseudo-multimodal object detector trained on …