B Zhao, LP Dirac, P Varshavskaya - arXiv preprint arXiv:2409.17080, 2024 - arxiv.org
Large vision-language models (VLMs) have become state-of-the-art for many computer
vision tasks, with in-context learning (ICL) as a popular adaptation strategy for new ones. But …