Object hallucination in image captioning

A Rohrbach, LA Hendricks, K Burns, T Darrell… - arXiv preprint arXiv …, 2018 - arxiv.org
… their rate of object hallucination. We analyze how captioning model architectures and
learning objectives contribute to object hallucination, explore when hallucination is likely due to …

Let there be a clock on the beach: Reducing object hallucination in image captioning

AF Biten, L Gómez, D Karatzas - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
… the object bias in image captioning models. To reduce object hallucination in image captioning,
… to be treated as ground truth to train image captioning models. By extensive analysis, we …

Do More Details Always Introduce More Hallucinations in LVLM-based Image Captioning?

M Feng, Y Tang, Z Zhang, C Xu - arXiv preprint arXiv:2406.12663, 2024 - arxiv.org
… LVLMs often suffer from a significant challenge known as hallucination (Rohrbach et al.…
actual objects present in the input image. In this work, we mainly consider object hallucination. …

Pseudo Content Hallucination for Unpaired Image Captioning

H Ben, S Wang, M Wang, R Hong - Proceedings of the 2024 …, 2024 - dl.acm.org
… For iteration refinement, we hallucinate the generated captions with objects to construct
the pseudo-matched sentences to refine the generator, which incrementally increases the …

Thinking hallucination for video captioning

N Ullah, PP Mohanta - Proceedings of the Asian …, 2022 - openaccess.thecvf.com
… is known as a hallucination [27,38,24] in the literature. Unlike image captioning, there are
two types of hallucination in the case of video captioning: object hallucination occurs when the …

[PDF][PDF] Reducing Object Hallucination in Visual Question Answering

T Banerjee - tanushreebanerjee.github.io
… Work in this paper indicates that features based on object detection model output and
image captioning model output may be a good starting point for designing such a more …

Mitigating fine-grained hallucination by fine-tuning large vision-language models with caption rewrites

L Wang, J He, S Li, N Liu, EP Lim - International Conference on Multimedia …, 2024 - Springer
… that enables instruction-tuned LVLMs to reduce fine-grained object hallucination by fine-tuning
them on additional rewritten captions derived from curated high-quality image captions. …

Hybrid attention network for image captioning

W Jiang, Q Li, K Zhan, Y Fang, F Shen - Displays, 2022 - Elsevier
… how human captioning attention can strengthen image captioning. … Recent progress on
image captioning is driven by the development of … Object hallucination in image captioning

[HTML][HTML] Explain and improve: LRP-inference fine-tuning for image captioning models

J Sun, S Lapuschkin, W Samek, A Binder - Information Fusion, 2022 - Elsevier
… that image captioning models sometimes hallucinate words … bias image captioning models
from frequently occurring object … generating frequent object words rather than hallucinate them…

Deep learning approaches on image captioning: A review

T Ghandi, H Pourreza, H Mahyar - ACM Computing Surveys, 2023 - dl.acm.org
… We address the challenges faced in this ield by emphasizing issues such as object
hallucination, missing context, illumination conditions, contextual understanding, and referring …