… image" captioning methods. In this paper, we discuss various methods of imagecaptioning … most common problems and challenges of imagecaptioning. We provide a comprehensive …
… We present an imagecaptioning model that combines image features with fine-grained entities and object labels, and learns to produce fluent and informative imagecaptions. …
… in imagecaptioning has not reached a conclusive answer yet. This work aims at providing a comprehensive overview of imagecaptioning approaches, from visual encoding and text …
W Kang, J Mun, S Lee, B Roh - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
… learning and DALL·E [40] for the text-to-image generation task. This is mainly thanks to the … described in alt-texts of web-crawled data. Inspired by this, research on imagecaptioning is …
… In this section, we evaluate the robustness of the automaticimagecaptioning metrics. For this purpose, we employ the binary (two-alternative) forced choice task introduced in (Hodosh …
H Jeong, M Chun, H Lee, SY Oh, H Jung - Companion proceedings of …, 2023 - dl.acm.org
… the user enters a web page URL, and the alttext checker identifies any images without alttext. WATAA then uses an imagecaptioning model to generate automaticalttext for each …
… To assess the role of visual features extracted from CLIPlike models in imagecaptioning, … features in standard and more challenging imagecaptioning settings. We use the commonly …
Q Xia, H Huang, N Duan, D Zhang, L Ji, Z Sui… - … Processing and Chinese …, 2021 - Springer
… benchmark datasets, including COCO Captions and Flickr30k Captions. We also use XGPT to generate imagecaptions as data augmentation for the image retrieval task and achieve …
… auxiliary text, such as generating or editing an image … imagecaptioning. Note that our method does not employ the CLIP’s textual encoder, since there is no input text, and the output text …