Crowdclip: Unsupervised crowd counting via vision-language model

R Jiang, L Liu, C Chen - Proceedings of the 31st ACM International …, 2023 - dl.acm.org

Recent advances in visual-language models have shown remarkable zero-shot text-image
matching ability that is transferable to downstream tasks such as object detection and …

被引用次数：25 相关文章所有 3 个版本

[PDF] neurips.cc

Lmc: Large model collaboration with cross-assessment for training-free open-set object recognition

H Qu, X Hui, Y Cai, J Liu - Advances in Neural Information …, 2024 - proceedings.neurips.cc

Open-set object recognition aims to identify if an object is from a class that has been
encountered during training or not. To perform open-set object recognition accurately, a key …

被引用次数：6 相关文章所有 4 个版本

[PDF] thecvf.com

CLIP-Driven Open-Vocabulary 3D Scene Graph Generation via Cross-Modality Contrastive Learning

L Chen, X Wang, J Lu, S Lin… - Proceedings of the …, 2024 - openaccess.thecvf.com

Abstract 3D Scene Graph Generation (3DSGG) aims to classify objects and their predicates
within 3D point cloud scenes. However current 3DSGG methods struggle with two main …

[PDF] thecvf.com

Learning to Count without Annotations

L Knobel, T Han, YM Asano - Proceedings of the IEEE/CVF …, 2024 - openaccess.thecvf.com

While recent supervised methods for reference-based object counting continue to improve
the performance on benchmark datasets they have to rely on small datasets due to the cost …

Towards zero-shot object counting via deep spatial prior cross-modality fusion

J Chen, Q Li, M Gao, W Zhai, G Jeon, D Camacho - Information Fusion, 2024 - Elsevier

Existing counting models predominantly operate on a specific category of objects, such as
crowds and vehicles. The recent emergence of multi-modal foundational models, eg …

Efficient Crowd Counting via Dual Knowledge Distillation

R Wang, Y Hao, L Hu, X Li, M Chen… - … on Image Processing, 2023 - ieeexplore.ieee.org

Most researchers focus on designing accurate crowd counting models with heavy
parameters and computations but ignore the resource burden during the model deployment …

被引用次数：2 相关文章所有 4 个版本

Unlabeled scene adaptive crowd counting via meta-ensemble learning

C Ma, J Zeng, P Shao, A Qing, Y Wang - Transportation research part C …, 2024 - Elsevier

The objective of unlabeled scene adaptive crowd counting (USACC) is to adapt the crowd
counting model to a particular scene by utilizing only a handful of unlabeled images from …

[HTML][HTML] Cross-scale vision transformer for crowd localization

S Liu, Y Lian, Z Zhang, B Xiao, TS Durrani - Journal of King Saud …, 2024 - Elsevier

Crowd localization can provide the positions of individuals and the total number of people,
which has great application value for security monitoring and public management …

[HTML] arxiv.org

Enhancing Zero-shot Counting via Language-guided Exemplar Learning

M Wang, J Zhou, Y Dai, E Buys, M Gong - arXiv preprint arXiv:2402.05394, 2024 - arxiv.org

Recently, Class-Agnostic Counting (CAC) problem has garnered increasing attention owing
to its intriguing generality and superior efficiency compared to Category-Specific Counting …

Hierarchical Kernel Interaction Network for Remote Sensing Object Counting

H Wang, X Zeng, T Zhang, J Wei, X Hou… - … on Geoscience and …, 2024 - ieeexplore.ieee.org

Different from object counting in surveillance scenes, remote sensing object counting
encounters knotty challenges due to its tiny scale and cluttered background. However …

被引用次数：1 相关文章所有 2 个版本

高级搜索

QQ 群