Z Ren,
Y Su,
X Liu - Advances in neural information …, 2024 - proceedings.neurips.cc
The zero-shot open-vocabulary setting poses challenges for image classification.
Fortunately, utilizing a vision-language model like CLIP, pre-trained on image-textpairs …