Z Liu,
H Liang,
X Huang, W Xiong, Q Yu, L Sun… - arXiv preprint arXiv …, 2024 - arxiv.org
Recently, with the rise of web images, managing and understanding large-scale image
datasets has become increasingly important. Vision Large Language Models (VLLMs) have …