Pensieve: Retrospect-then-Compare Mitigates Visual Hallucination

D Yang, B Cao, G Chen, C Jiang - arXiv preprint arXiv:2403.14401, 2024 - arxiv.org
Multi-modal Large Language Models (MLLMs) demonstrate remarkable success across
various vision-language tasks. However, they suffer from visual hallucination, where the …

Pensieve: Retrospect-then-Compare Mitigates Visual Hallucination

D Yang, B Cao, G Chen, C Jiang - arXiv e-prints, 2024 - ui.adsabs.harvard.edu
Multi-modal Large Language Models (MLLMs) demonstrate remarkable success
across various vision-language tasks. However, they suffer from visual hallucination, where …