Clip-count: Towards text-guided zero-shot object counting

R Jiang, L Liu, C Chen - Proceedings of the 31st ACM International …, 2023 - dl.acm.org
Recent advances in visual-language models have shown remarkable zero-shot text-image
matching ability that is transferable to downstream tasks such as object detection and …

Lmc: Large model collaboration with cross-assessment for training-free open-set object recognition

H Qu, X Hui, Y Cai, J Liu - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Open-set object recognition aims to identify if an object is from a class that has been
encountered during training or not. To perform open-set object recognition accurately, a key …

CLIP-Driven Open-Vocabulary 3D Scene Graph Generation via Cross-Modality Contrastive Learning

L Chen, X Wang, J Lu, S Lin… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract 3D Scene Graph Generation (3DSGG) aims to classify objects and their predicates
within 3D point cloud scenes. However current 3DSGG methods struggle with two main …

Learning to Count without Annotations

L Knobel, T Han, YM Asano - Proceedings of the IEEE/CVF …, 2024 - openaccess.thecvf.com
While recent supervised methods for reference-based object counting continue to improve
the performance on benchmark datasets they have to rely on small datasets due to the cost …

Towards zero-shot object counting via deep spatial prior cross-modality fusion

J Chen, Q Li, M Gao, W Zhai, G Jeon, D Camacho - Information Fusion, 2024 - Elsevier
Existing counting models predominantly operate on a specific category of objects, such as
crowds and vehicles. The recent emergence of multi-modal foundational models, eg …

Efficient Crowd Counting via Dual Knowledge Distillation

R Wang, Y Hao, L Hu, X Li, M Chen… - … on Image Processing, 2023 - ieeexplore.ieee.org
Most researchers focus on designing accurate crowd counting models with heavy
parameters and computations but ignore the resource burden during the model deployment …

Unlabeled scene adaptive crowd counting via meta-ensemble learning

C Ma, J Zeng, P Shao, A Qing, Y Wang - Transportation research part C …, 2024 - Elsevier
The objective of unlabeled scene adaptive crowd counting (USACC) is to adapt the crowd
counting model to a particular scene by utilizing only a handful of unlabeled images from …

[HTML][HTML] Cross-scale vision transformer for crowd localization

S Liu, Y Lian, Z Zhang, B Xiao, TS Durrani - Journal of King Saud …, 2024 - Elsevier
Crowd localization can provide the positions of individuals and the total number of people,
which has great application value for security monitoring and public management …

Enhancing Zero-shot Counting via Language-guided Exemplar Learning

M Wang, J Zhou, Y Dai, E Buys, M Gong - arXiv preprint arXiv:2402.05394, 2024 - arxiv.org
Recently, Class-Agnostic Counting (CAC) problem has garnered increasing attention owing
to its intriguing generality and superior efficiency compared to Category-Specific Counting …

Hierarchical Kernel Interaction Network for Remote Sensing Object Counting

H Wang, X Zeng, T Zhang, J Wei, X Hou… - … on Geoscience and …, 2024 - ieeexplore.ieee.org
Different from object counting in surveillance scenes, remote sensing object counting
encounters knotty challenges due to its tiny scale and cluttered background. However …