alt text automatic image captioning- 学术资源搜索

Conceptual captions: A cleaned, hypernymed, image alt-text dataset for automatic image captioning

P Sharma, N Ding, S Goodman… - Proceedings of the 56th …, 2018 - aclanthology.org

… In order to assess the impact of the Conceptual Captions dataset, we consider several image
captioning models previously proposed in the literature. These models can be understood …

被引用次数：2185 相关文章所有 3 个版本

[PDF] arxiv.org

Denoising large-scale image captioning from alt-text data using content selection models

KR Chandu, P Sharma, S Changpinyo… - arXiv preprint arXiv …, 2020 - arxiv.org

… pieces of information consistent with the image as a skeleton. Sub… automatically extracted
from the alttext captions. We focus on language-based skeletons that are derived from captions …

被引用次数：2 相关文章所有 2 个版本

[PDF] springer.com

Evaluating the effectiveness of automatic image captioning for web accessibility

M Leotta, F Mori, M Ribaudo - Universal access in the information society, 2023 - Springer

… we associated a set of five textual descriptions: the first is the alternative text written by
the human contributors to the online encyclopedia Footnote 25 , while the other four are the …

被引用次数：5 相关文章所有 7 个版本

[PDF] cutrell.org

Caption crawler: Enabling reusable alternative text descriptions using reverse image search

D Guinness, E Cutrell, MR Morris - … of the 2018 chi conference on human …, 2018 - dl.acm.org

… Our system also extracts the alt text and image captions in the DOM while the user is
browsing a page using the browser extension. This allows the system to keep improving as more …

被引用次数：108 相关文章所有 6 个版本

[PDF] thecvf.com

Scaling up vision-language pre-training for image captioning

X Hu, Z Gan, J Wang, Z Yang, Z Liu… - Proceedings of the …, 2022 - openaccess.thecvf.com

… As some alttexts are too long, we split them up by punctuation marks, such as period and
exclamation mark, and select the longest part. To filter out some rare or misspelled words, we …

被引用次数：237 相关文章所有 5 个版本

[PDF] arxiv.org

Clipscore: A reference-free evaluation metric for image captioning

J Hessel, A Holtzman, M Forbes, RL Bras… - arXiv preprint arXiv …, 2021 - arxiv.org

… We measure CLIP-S’s capacity to reconstruct a set of 2.8K human judgments of alttext …
Each alt-text was rated on a scale of 0 to 3 in terms of its probable utility as an alt-text. While the …

被引用次数：717 相关文章所有 5 个版本

[PDF] acm.org

What's in an ALT Tag? Exploring Caption Content Priorities through Collaborative Captioning

A Muehlbradt, SK Kane - ACM Transactions on Accessible Computing …, 2022 - dl.acm.org

… In this study, we explore contextual differences in image captioning based on the domain, …
general feedback about the process of image captioning, image captions, and the study, we …

被引用次数：14 相关文章所有 2 个版本

[PDF] arxiv.org

Textcaps: a dataset for image captioning with reading comprehension

O Sidorov, R Hu, M Rohrbach, A Singh - … 23–28, 2020, Proceedings, Part II …, 2020 - Springer

… image captioning models how “to read”, ie, allow us to design and train image captioning
algorithms which are able to process and include information from the text in the image. In …

被引用次数：279 相关文章所有 4 个版本

[PDF] thecvf.com

Transform and tell: Entity-aware news image captioning

A Tran, A Mathews, L Xie - … of the IEEE/CVF conference on …, 2020 - openaccess.thecvf.com

… set a new SOTA for news image captioning. Our model can incorporate real-world knowledge
about entities across different modalities and generate text with better linguistic diversity. …

被引用次数：99 相关文章所有 7 个版本

[PDF] thecvf.com

Guiding image captioning models toward more specific captions

S Kornblith, L Li, Z Wang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

… These problems are further exacerbated when models are trained directly on image-alt
text pairs collected from the internet. In this work, we show that it is possible to generate more …

被引用次数：4 相关文章所有 5 个版本

高级搜索

QQ 群

Conceptual captions: A cleaned, hypernymed, image alt-text dataset for automatic image captioning

Denoising large-scale image captioning from alt-text data using content selection models

Evaluating the effectiveness of automatic image captioning for web accessibility

Caption crawler: Enabling reusable alternative text descriptions using reverse image search

Scaling up vision-language pre-training for image captioning

Clipscore: A reference-free evaluation metric for image captioning

What's in an ALT Tag? Exploring Caption Content Priorities through Collaborative Captioning

Textcaps: a dataset for image captioning with reading comprehension

Transform and tell: Entity-aware news image captioning

Guiding image captioning models toward more specific captions

相关搜索

引用