Clip-count: Towards text-guided zero-shot object counting

R Jiang, L Liu, C Chen - Proceedings of the 31st ACM International …, 2023 - dl.acm.org
Recent advances in visual-language models have shown remarkable zero-shot text-image
matching ability that is transferable to downstream tasks such as object detection and …

Training-free object counting with prompts

Z Shi, Y Sun, M Zhang - … of the IEEE/CVF Winter Conference …, 2024 - openaccess.thecvf.com
This paper tackles the problem of object counting in images. Existing approaches rely on
extensive training data with point annotations for each object, making data collection labor …

Point Segment and Count: A Generalized Framework for Object Counting

Z Huang, M Dai, Y Zhang, J Zhang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Class-agnostic object counting aims to count all objects in an image with respect to example
boxes or class names aka few-shot and zero-shot counting. In this paper we propose a …

Open-world text-specified object counting

N Amini-Naieni, K Amini-Naieni, T Han… - arXiv preprint arXiv …, 2023 - arxiv.org
Our objective is open-world object counting in images, where the target object class is
specified by a text description. To this end, we propose CounTX, a class-agnostic, single …

Zero-shot object counting with language-vision models

J Xu, H Le, D Samaras - arXiv preprint arXiv:2309.13097, 2023 - arxiv.org
Class-agnostic object counting aims to count object instances of an arbitrary class at test
time. It is challenging but also enables many potential applications. Current methods require …

Referring Expression Counting

S Dai, J Liu, NM Cheung - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Existing counting tasks are limited to the class level which don't account for fine-grained
details within the class. In real applications it often requires in-context or referring human …

Vlcounter: Text-aware visual representation for zero-shot object counting

S Kang, WJ Moon, E Kim, JP Heo - … of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
Zero-Shot Object Counting~(ZSOC) aims to count referred instances of arbitrary classes in a
query image without human-annotated exemplars. To deal with ZSOC, preceding studies …

DAVE-A Detect-and-Verify Paradigm for Low-Shot Counting

J Pelhan, V Zavrtanik, M Kristan - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Low-shot counters estimate the number of objects corresponding to a selected category
based on only few or no exemplars annotated in the image. The current state-of-the-art …

Semantically Enhanced Scene Captions with Physical and Weather Condition Changes

H Sakaino - Proceedings of the IEEE/CVF International …, 2023 - openaccess.thecvf.com
Abstract Vision-Language models (VLMs), ie, image-text pairs of CLIP, have boosted image-
based Deep Learning (DL). Moreover, Visual-Question-Answer (VQA) tools and open …

Counting guidance for high fidelity text-to-image synthesis

W Kang, K Galim, HI Koo - arXiv preprint arXiv:2306.17567, 2023 - arxiv.org
Recently, the quality and performance of text-to-image generation significantly advanced
due to the impressive results of diffusion models. However, text-to-image diffusion models …