# PraCegoVer: A Large Dataset for Image Captioning in Portuguese

GO Santos, EL Colombini, S Avila - arXiv preprint arXiv:2103.11474, 2021 - arxiv.org
… present in the image, their attributes, … images and audio descriptions in Portuguese. As far
as we know, this is the first dataset proposed for the Image Captioning problem with captions …

SynthCap: Augmenting transformers with synthetic data for image captioning

D Caffagni, M Barraco, M Cornia, L Baraldi… - … Conference on Image …, 2023 - Springer
image captioning models and using large-scale image-text … In this work, we explore an
alternative to web-crawled data … artificial pictures are useful for the task of image captioning, …

A review of Deep learning image captioning approaches

YA Thakare, KH Walse - Journal of Integrated Science and …, 2024 - pubs.thesciencein.org
automatic image captioning and grounded language understanding tasks. It comprises
31,000 images … by 158 thousand human-written captions. The dataset includes detectors for …

Large-scale bidirectional training for zero-shot image captioning

T Kim, M Marsden, P Ahn, S Kim… - Proceedings of the …, 2024 - openaccess.thecvf.com
image captioning. However, we find that large-scale bidirectional training between image …
text enables zero-shot image captioning. In this paper, we introduce Bidirectional Image Text

Fine-grained image captioning with CLIP reward

J Cho, S Yoon, A Kale, F Dernoncourt, T Bui… - arXiv preprint arXiv …, 2022 - arxiv.org
… A More Image Captioning Examples We provide more image captioning examples using
different reward functions in Table 5. Overall, the captions from the model with CLIP-S+…

Image captioning for effective use of language models in knowledge-based visual question answering

A Salaberria, G Azkune, OL de Lacalle, A Soroa… - Expert Systems with …, 2023 - Elsevier
… a system that relies exclusively on text will allow LMs to … automatic image captioning as a
way to verbalize the information in the image, where the captions are descriptions of the images

Learning combinatorial prompts for universal controllable image captioning

Z Wang, J Xiao, Y Zhuang, F Gao, J Shao… - International Journal of …, 2024 - Springer
… In this section, we first introduce the preliminaries about the prompt-based image captioning
in Sect. 3.1. Then, we introduce the detailed generation process of combinatorial prompts …

Entangled transformer for image captioning

G Li, L Zhu, P Liu, Y Yang - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com
… In image captioning, the typical attention mechanisms are … are the dominating architectures
in image captioning. However, … performance on the MSCOCO image captioning dataset. The …

Unpaired image captioning with semantic-constrained self-learning

H Ben, Y Pan, Y Li, T Yao, R Hong… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
… of image and sentence features that is structured by visual concepts to enable image-to-text
intermediate representation of image and sentence for unpaired image captioning. A cycle-…

ROME: Testing Image Captioning Systems via Recursive Object Melting

B Yu, Z Zhong, J Li, Y Yang, S He, P He - Proceedings of the 32nd ACM …, 2023 - dl.acm.org
… errors of the four IC models and MS Azure API, and use them to manually test the performance
of IC systems integrated in two popular commercial software applications: alternative text