Y Wang, Z Yu, J Wang, Q Heng, H Chen, W Ye… - arXiv e …, 2023 - ui.adsabs.harvard.edu
Abstract: Vision-language models (VLMs) that use contrastive language-image pre-training
have shown promising zero-shot classification performance. However, their performance on …