S Lee, J Youn, M Kim, SH Yoon - arXiv preprint arXiv:2310.18341, 2023 - arxiv.org
Purpose: Recent advancements in large language models (LLMs) have expanded their
capabilities in a multimodal fashion, potentially replicating the image interpretation of human …