W Hu, Y Xu, Y Li, W Li, Z Chen, Z Tu - arXiv e-prints, 2023 - ui.adsabs.harvard.edu
Abstract Vision Language Models (VLMs), which extend Large Language Models (LLM) by
incorporating visual understanding capability, have demonstrated significant advancements …