YF Zhang, W Yu, Q Wen, X Wang, Z Zhang… - arXiv e …, 2024 - ui.adsabs.harvard.edu
In the realms of computer vision and natural language processing, Large Vision-Language
Models (LVLMs) have become indispensable tools, proficient in generating textual …