X Mai, Z Tao, J Lin, H Wang, Y Chang, Y Kang… - arXiv preprint arXiv …, 2024 - arxiv.org
Multimodal Large Models (MLMs) are becoming a significant research focus, combining powerful large language models with multimodal learning to perform complex tasks across …