Automatic alt-text: Computer-generated image descriptions for blind users on a social network service

S Wu, J Wieland, O Farivar, J Schiller - … of the 2017 ACM conference on …, 2017 - dl.acm.org
… The cost of system failure in our use case is much higher than in existing auto-captioning
systems – eg, blind users could be misled to make inappropriate comments about photos in …

Smallcap: lightweight image captioning prompted with retrieval augmentation

R Ramos, B Martins, D Elliott… - Proceedings of the …, 2023 - openaccess.thecvf.com
… SMALLCAP, an image captioning model, prompted with captions retrieved from an external
datastore of text, based on the input image. This formulation of image captioning enables a …

Sieve: Multimodal dataset pruning using image captioning models

A Mahmoud, M Elhoushi, A Abbas… - Proceedings of the …, 2024 - openaccess.thecvf.com
text pairs. To bridge the gap between the limited diversity of generated captions and the
high diversity of alternative text (alt-text… model pretrained on unlabeled text corpus. Using …

AutoCaption: Image captioning with neural architecture search

X Zhu, W Wang, L Guo, J Liu - arXiv preprint arXiv:2012.09742, 2020 - arxiv.org
… Our goal here is to optimize the text generation module of the image captioning model. We
used the RNN architecture as our text generation module and our goal here is to optimize the …

Image captioning as an assistive technology: Lessons learned from vizwiz 2020 challenge

P Dognin, I Melnyk, Y Mroueh, I Padhi, M Rigotti… - Journal of Artificial …, 2022 - jair.org
captioning competition. Our work provides a step towards improved assistive image captioning
… When alternative text is either missing or provides non-meaningful content, there is a …

Toward scalable social alt text: Conversational crowdsourcing as a tool for refining vision-to-language technology for the blind

E Salisbury, E Kamar, M Morris - … of the AAAI Conference on Human …, 2017 - ojs.aaai.org
… AI image captioning systems … as image labeling with objective ground truth answers,
and it is unclear how human input can be combined with automated image captioning for alt-text

Computer vision and conflicting values: Describing people with automated alt text

M Hanley, S Barocas, K Levy, S Azenkot… - Proceedings of the …, 2021 - dl.acm.org
… Microsoft, which offers image captioning services to “improve accessibility” through Azure
AI, appears to include gender in some instances and not others [44]. As of April 2021, Amazon …

Attentive linear transformation for image captioning

S Ye, J Han, N Liu - IEEE Transactions on Image Processing, 2018 - ieeexplore.ieee.org
… attentive linear transformation (ALT) for automatic generation of image captions. Instead of
… , ALT learns to attend to the high-dimensional transformation matrix from the image feature …

ImageExplorer: Multi-layered touch exploration to encourage skepticism towards imperfect AI-generated image captions

J Lee, J Herskovitz, YH Peng, A Guo - … of the 2022 CHI Conference on …, 2022 - dl.acm.org
… ’s Automatic AltText system originally aimed to generate image tags that describe the prominent
objects in an image [… -language image description, along with providing tags grouped by …

A text-guided generation and refinement model for image captioning

D Wang, Z Hu, Y Zhou, R Hong… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
… Meanwhile, we observe that captions generated by previous image captioning works [6], [14]
are logical and fluent in language, but they often produce some inaccurate details. …