- 学术资源搜索

Embracing Diversity: Interpretable Zero-shot classification beyond one vector per class

M Moayeri, M Rabbat, M Ibrahim… - The 2024 ACM …, 2024 - dl.acm.org

Vision-language models enable open-world classification of objects without the need for any
retraining. While this zero-shot paradigm marks a significant advance, even today's best …

被引用次数：2 相关文章所有 3 个版本

[PDF] arxiv.org

FairerCLIP: Debiasing CLIP's Zero-Shot Predictions using Functions in RKHSs

S Dehdashtian, L Wang, VN Boddeti - arXiv preprint arXiv:2403.15593, 2024 - arxiv.org

Large pre-trained vision-language models such as CLIP provide compact and general-
purpose representations of text and images that are demonstrably effective across multiple …

被引用次数：11 相关文章所有 3 个版本

[PDF] msu.edu

[PDF][PDF] Fairerclip: Debiasing zero-shot predictions of clip in rkhss

S Dehdashtian, L Wang, VN Boddeti - International Conference on …, 2024 - hal.cse.msu.edu

Large pre-trained vision-language models such as CLIP provide compact and general-
purpose representations of text and images that are demonstrably effective across multiple …

被引用次数：5 相关文章所有 2 个版本

[PDF] arxiv.org

Learning to Prompt with Text Only Supervision for Vision-Language Models

MU Khattak, MF Naeem, M Naseer, L Van Gool… - arXiv preprint arXiv …, 2024 - arxiv.org

Foundational vision-language models such as CLIP are becoming a new paradigm in
vision, due to their excellent generalization abilities. However, adapting these models for …

被引用次数：10 相关文章所有 2 个版本

[PDF] arxiv.org

Invariant Test-Time Adaptation for Vision-Language Model Generalization

H Ma, Y Zhu, C Zhang, P Zhao, B Wu, LK Huang… - arXiv preprint arXiv …, 2024 - arxiv.org

Vision-language foundation models have exhibited remarkable success across a multitude
of downstream tasks due to their scalability on extensive image-text paired datasets …

被引用次数：3 相关文章所有 2 个版本

[PDF] arxiv.org

BendVLM: Test-Time Debiasing of Vision-Language Embeddings

W Gerych, H Zhang, K Hamidieh, E Pan… - arXiv preprint arXiv …, 2024 - arxiv.org

Vision-language model (VLM) embeddings have been shown to encode biases present in
their training data, such as societal biases that prescribe negative characteristics to …

被引用次数：1 相关文章所有 3 个版本

[PDF] openreview.net

OTTER: Effortless Label Distribution Adaptation of Zero-shot Models

C Shin, J Zhao, S Cromp, H Vishwakarma… - The Thirty-eighth …, 2024 - openreview.net

Popular zero-shot models suffer due to artifacts inherited from pretraining. One particularly
detrimental issue, caused by unbalanced web-scale pretraining data, is mismatched label …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

高级搜索

QQ 群

Embracing Diversity: Interpretable Zero-shot classification beyond one vector per class

FairerCLIP: Debiasing CLIP's Zero-Shot Predictions using Functions in RKHSs

[PDF][PDF] Fairerclip: Debiasing zero-shot predictions of clip in rkhss

Learning to Prompt with Text Only Supervision for Vision-Language Models

Invariant Test-Time Adaptation for Vision-Language Model Generalization

BendVLM: Test-Time Debiasing of Vision-Language Embeddings

OTTER: Effortless Label Distribution Adaptation of Zero-shot Models

DPA: Dual Prototypes Alignment for Unsupervised Adaptation of Vision-Language Models

CoAPT: Context Attribute words for Prompt Tuning

OTTER: Improving Zero-Shot Classification via Optimal Transport

引用