Denoising large-scale image captioning from alt-text data using content selection models

KR Chandu, P Sharma, S Changpinyo… - arXiv preprint arXiv …, 2020 - arxiv.org
Training large-scale image captioning (IC) models demands access to a rich and diverse set
of training examples, gathered from the wild, often from noisy alt-text data. However, recent …

Denoising Large-Scale Image Captioning from Alt-text Data Using Content Selection Models

KR Chandu, P Sharma, S Changpinyo… - Proceedings of the …, 2022 - aclanthology.org
Training large-scale image captioning (IC) models demands access to a rich and diverse set
of training examples that are expensive to curate both in terms of time and man-power …