D Go, T Whang,
C Lee, H Kim, S Park, S Ji… - arXiv preprint arXiv …, 2024 - arxiv.org
The integration of Retrieval-Augmented Generation (RAG) with Multimodal Large Language
Models (MLLMs) has expanded the scope of multimodal query resolution. However, current …