X Li,
Z Wang,
C Xie - Advances in Neural Information …, 2024 - proceedings.neurips.cc
CLIP, one of the pioneering foundation models that connect images and text, has enabled
many recent breakthroughs in computer vision. However, its associated training cost is …