X Zhuang, Z Zhu, Z Chen, Y Xie, L Liang… - Proceedings of the …, 2024 - aclanthology.org
Abstract Large Vision-Language Models (LVLMs) may produce outputs that are unfaithful to
reality, also known as visual hallucinations (VH), which hinders their application in …