Y Yang,
T Zhou,
K Li, D Tao, L Li… - Proceedings of the …, 2024 - openaccess.thecvf.com
While large language models (LLMs) excel in a simulated world of texts they struggle to
interact with the more realistic world without perceptions of other modalities such as visual or …