alt text automatic image captioning- 学术资源搜索

# PraCegoVer: A Large Dataset for Image Captioning in Portuguese

GO Santos, EL Colombini, S Avila - arXiv preprint arXiv:2103.11474, 2021 - arxiv.org

… present in the image, their attributes, … images and audio descriptions in Portuguese. As far
as we know, this is the first dataset proposed for the Image Captioning problem with captions1 …

被引用次数：4 相关文章所有 2 个版本

[PDF] unimore.it

Synthcap: Augmenting transformers with synthetic data for image captioning

D Caffagni, M Barraco, M Cornia, L Baraldi… - … Conference on Image …, 2023 - Springer

… image captioning models and using large-scale image-text … In this work, we explore an
alternative to web-crawled data … artificial pictures are useful for the task of image captioning, …

被引用次数：5 相关文章所有 5 个版本

[PDF] thesciencein.org

A review of Deep learning image captioning approaches

YA Thakare, KH Walse - Journal of Integrated Science and …, 2024 - pubs.thesciencein.org

… automatic image captioning and grounded language understanding tasks. It comprises
31,000 images … by 158 thousand human-written captions. The dataset includes detectors for …

被引用次数：2 相关文章

[PDF] thecvf.com

Large-scale bidirectional training for zero-shot image captioning

T Kim, M Marsden, P Ahn, S Kim… - Proceedings of the …, 2024 - openaccess.thecvf.com

… image captioning. However, we find that large-scale bidirectional training between image …
text enables zero-shot image captioning. In this paper, we introduce Bidirectional Image Text …

被引用次数：3 相关文章所有 3 个版本

[PDF] arxiv.org

Fine-grained image captioning with clip reward

J Cho, S Yoon, A Kale, F Dernoncourt, T Bui… - arXiv preprint arXiv …, 2022 - arxiv.org

… A More Image Captioning Examples We provide more image captioning examples using
different reward functions in Table 5. Overall, the captions from the model with CLIP-S+…

被引用次数：69 相关文章所有 4 个版本

[HTML] sciencedirect.com

[HTML][HTML] Image captioning for effective use of language models in knowledge-based visual question answering

A Salaberria, G Azkune, OL de Lacalle, A Soroa… - Expert Systems with …, 2023 - Elsevier

… a system that relies exclusively on text will allow LMs to … automatic image captioning as a
way to verbalize the information in the image, where the captions are descriptions of the images …

被引用次数：47 相关文章所有 4 个版本

[PDF] arxiv.org

Learning combinatorial prompts for universal controllable image captioning

Z Wang, J Xiao, Y Zhuang, F Gao, J Shao… - International Journal of …, 2024 - Springer

… In this section, we first introduce the preliminaries about the prompt-based image captioning
in Sect. 3.1. Then, we introduce the detailed generation process of combinatorial prompts …

被引用次数：8 相关文章所有 2 个版本

[PDF] thecvf.com

Entangled transformer for image captioning

G Li, L Zhu, P Liu, Y Yang - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com

… In image captioning, the typical attention mechanisms are … are the dominating architectures
in image captioning. However, … performance on the MSCOCO image captioning dataset. The …

被引用次数：395 相关文章所有 10 个版本

Unpaired image captioning with semantic-constrained self-learning

H Ben, Y Pan, Y Li, T Yao, R Hong… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

… of image and sentence features that is structured by visual concepts to enable image-to-text …
intermediate representation of image and sentence for unpaired image captioning. A cycle-…

被引用次数：48 相关文章

[PDF] arxiv.org

ROME: Testing Image Captioning Systems via Recursive Object Melting

B Yu, Z Zhong, J Li, Y Yang, S He, P He - Proceedings of the 32nd ACM …, 2023 - dl.acm.org

… errors of the four IC models and MS Azure API, and use them to manually test the performance
of IC systems integrated in two popular commercial software applications: alternative text …

被引用次数：5 相关文章所有 4 个版本

高级搜索

QQ 群

# PraCegoVer: A Large Dataset for Image Captioning in Portuguese

Synthcap: Augmenting transformers with synthetic data for image captioning

A review of Deep learning image captioning approaches

Large-scale bidirectional training for zero-shot image captioning

Fine-grained image captioning with clip reward

[HTML][HTML] Image captioning for effective use of language models in knowledge-based visual question answering

Learning combinatorial prompts for universal controllable image captioning

Entangled transformer for image captioning

Unpaired image captioning with semantic-constrained self-learning

ROME: Testing Image Captioning Systems via Recursive Object Melting

引用