Zero-shot object counting

R Jiang, L Liu, C Chen - Proceedings of the 31st ACM International …, 2023 - dl.acm.org

Recent advances in visual-language models have shown remarkable zero-shot text-image
matching ability that is transferable to downstream tasks such as object detection and …

被引用次数：38 相关文章所有 3 个版本

[PDF] thecvf.com

Training-free object counting with prompts

Z Shi, Y Sun, M Zhang - … of the IEEE/CVF Winter Conference …, 2024 - openaccess.thecvf.com

This paper tackles the problem of object counting in images. Existing approaches rely on
extensive training data with point annotations for each object, making data collection labor …

被引用次数：12 相关文章所有 5 个版本

[PDF] thecvf.com

Point Segment and Count: A Generalized Framework for Object Counting

Z Huang, M Dai, Y Zhang, J Zhang… - Proceedings of the …, 2024 - openaccess.thecvf.com

Class-agnostic object counting aims to count all objects in an image with respect to example
boxes or class names aka few-shot and zero-shot counting. In this paper we propose a …

被引用次数：2 相关文章所有 2 个版本

[PDF] arxiv.org

Open-world text-specified object counting

N Amini-Naieni, K Amini-Naieni, T Han… - arXiv preprint arXiv …, 2023 - arxiv.org

Our objective is open-world object counting in images, where the target object class is
specified by a text description. To this end, we propose CounTX, a class-agnostic, single …

被引用次数：13 相关文章所有 4 个版本

[PDF] arxiv.org

Zero-shot object counting with language-vision models

J Xu, H Le, D Samaras - arXiv preprint arXiv:2309.13097, 2023 - arxiv.org

Class-agnostic object counting aims to count object instances of an arbitrary class at test
time. It is challenging but also enables many potential applications. Current methods require …

被引用次数：4 相关文章所有 2 个版本

[PDF] thecvf.com

Referring Expression Counting

S Dai, J Liu, NM Cheung - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com

Existing counting tasks are limited to the class level which don't account for fine-grained
details within the class. In real applications it often requires in-context or referring human …

被引用次数：2 相关文章

[PDF] aaai.org

Vlcounter: Text-aware visual representation for zero-shot object counting

S Kang, WJ Moon, E Kim, JP Heo - … of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org

Zero-Shot Object Counting~(ZSOC) aims to count referred instances of arbitrary classes in a
query image without human-annotated exemplars. To deal with ZSOC, preceding studies …

被引用次数：7 相关文章所有 3 个版本

[PDF] thecvf.com

DAVE-A Detect-and-Verify Paradigm for Low-Shot Counting

J Pelhan, V Zavrtanik, M Kristan - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com

Low-shot counters estimate the number of objects corresponding to a selected category
based on only few or no exemplars annotated in the image. The current state-of-the-art …

被引用次数：4 相关文章所有 3 个版本

[PDF] thecvf.com

Semantically Enhanced Scene Captions with Physical and Weather Condition Changes

H Sakaino - Proceedings of the IEEE/CVF International …, 2023 - openaccess.thecvf.com

Abstract Vision-Language models (VLMs), ie, image-text pairs of CLIP, have boosted image-
based Deep Learning (DL). Moreover, Visual-Question-Answer (VQA) tools and open …

被引用次数：1 相关文章所有 3 个版本

[PDF] arxiv.org

Counting guidance for high fidelity text-to-image synthesis

W Kang, K Galim, HI Koo - arXiv preprint arXiv:2306.17567, 2023 - arxiv.org

Recently, the quality and performance of text-to-image generation significantly advanced
due to the impressive results of diffusion models. However, text-to-image diffusion models …

被引用次数：3 相关文章所有 2 个版本

高级搜索

QQ 群

Clip-count: Towards text-guided zero-shot object counting

Training-free object counting with prompts

Point Segment and Count: A Generalized Framework for Object Counting

Open-world text-specified object counting

Zero-shot object counting with language-vision models

Referring Expression Counting

Vlcounter: Text-aware visual representation for zero-shot object counting

DAVE-A Detect-and-Verify Paradigm for Low-Shot Counting

Semantically Enhanced Scene Captions with Physical and Weather Condition Changes

Counting guidance for high fidelity text-to-image synthesis

引用