T Yu, Y Yao, H Zhang, T He, Y Han, G Cui… - arXiv e …, 2023 - ui.adsabs.harvard.edu
Abstract Multimodal Large Language Models (MLLMs) have recently demonstrated
impressive capabilities in multimodal understanding, reasoning, and interaction. However …