alt text automatic image captioning- 学术资源搜索

Automatic alt-text: Computer-generated image descriptions for blind users on a social network service

S Wu, J Wieland, O Farivar, J Schiller - … of the 2017 ACM conference on …, 2017 - dl.acm.org

… The cost of system failure in our use case is much higher than in existing auto-captioning
systems – eg, blind users could be misled to make inappropriate comments about photos in …

被引用次数：279 相关文章

[PDF] thecvf.com

Smallcap: lightweight image captioning prompted with retrieval augmentation

R Ramos, B Martins, D Elliott… - Proceedings of the …, 2023 - openaccess.thecvf.com

… SMALLCAP, an image captioning model, prompted with captions retrieved from an external
datastore of text, based on the input image. This formulation of image captioning enables a …

被引用次数：48 相关文章所有 5 个版本

[PDF] thecvf.com

Sieve: Multimodal dataset pruning using image captioning models

A Mahmoud, M Elhoushi, A Abbas… - Proceedings of the …, 2024 - openaccess.thecvf.com

… text pairs. To bridge the gap between the limited diversity of generated captions and the
high diversity of alternative text (alt-text… model pretrained on unlabeled text corpus. Using …

被引用次数：7 相关文章所有 4 个版本

[PDF] arxiv.org

AutoCaption: Image captioning with neural architecture search

X Zhu, W Wang, L Guo, J Liu - arXiv preprint arXiv:2012.09742, 2020 - arxiv.org

… Our goal here is to optimize the text generation module of the image captioning model. We
used the RNN architecture as our text generation module and our goal here is to optimize the …

被引用次数：16 相关文章所有 2 个版本

[PDF] jair.org Full View

Image captioning as an assistive technology: Lessons learned from vizwiz 2020 challenge

P Dognin, I Melnyk, Y Mroueh, I Padhi, M Rigotti… - Journal of Artificial …, 2022 - jair.org

… captioning competition. Our work provides a step towards improved assistive image captioning
… When alternative text is either missing or provides non-meaningful content, there is a …

被引用次数：44 相关文章所有 11 个版本

[PDF] aaai.org

Toward scalable social alt text: Conversational crowdsourcing as a tool for refining vision-to-language technology for the blind

E Salisbury, E Kamar, M Morris - … of the AAAI Conference on Human …, 2017 - ojs.aaai.org

… AI image captioning systems … as image labeling with objective ground truth answers,
and it is unclear how human input can be combined with automated image captioning for alt-text …

被引用次数：89 相关文章所有 9 个版本

[PDF] arxiv.org

Computer vision and conflicting values: Describing people with automated alt text

M Hanley, S Barocas, K Levy, S Azenkot… - Proceedings of the …, 2021 - dl.acm.org

… Microsoft, which offers image captioning services to “improve accessibility” through Azure
AI, appears to include gender in some instances and not others [44]. As of April 2021, Amazon …

被引用次数：30 相关文章所有 5 个版本

Attentive linear transformation for image captioning

S Ye, J Han, N Liu - IEEE Transactions on Image Processing, 2018 - ieeexplore.ieee.org

… attentive linear transformation (ALT) for automatic generation of image captions. Instead of
… , ALT learns to attend to the high-dimensional transformation matrix from the image feature …

被引用次数：78 相关文章所有 4 个版本

ImageExplorer: Multi-layered touch exploration to encourage skepticism towards imperfect AI-generated image captions

J Lee, J Herskovitz, YH Peng, A Guo - … of the 2022 CHI Conference on …, 2022 - dl.acm.org

… ’s Automatic AltText system originally aimed to generate image tags that describe the prominent
objects in an image [… -language image description, along with providing tags grouped by …

被引用次数：30 相关文章

A text-guided generation and refinement model for image captioning

D Wang, Z Hu, Y Zhou, R Hong… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

… Meanwhile, we observe that captions generated by previous image captioning works [6], [14]
are logical and fluent in language, but they often produce some inaccurate details. …