From show to tell: A survey on deep learning-based image captioning

M Stefanini, M Cornia, L Baraldi… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Connecting Vision and Language plays an essential role in Generative Intelligence. For this
reason, large research efforts have been devoted to image captioning, ie describing images …

Good news, everyone! context driven entity-aware captioning for news images

AF Biten, L Gomez, M Rusinol… - Proceedings of the …, 2019 - openaccess.thecvf.com
Current image captioning systems perform at a merely descriptive level, essentially
enumerating the objects in the scene and their relations. Humans, on the contrary, interpret …

[PDF][PDF] A shared task on multimodal machine translation and crosslingual image description

L Specia, S Frank, K Sima'An… - Proceedings of the First …, 2016 - aclanthology.org
This paper introduces and summarises the findings of a new shared task at the intersection
of Natural Language Processing and Computer Vision: the generation of image descriptions …

Semantic interdisciplinary evaluation of image captioning models

U Sirisha, B Sai Chandana - Cogent Engineering, 2022 - Taylor & Francis
In our day-to-day life, synchronizing vision and language aspects plays a crucial role in
solving various real-time challenges. Image captioning is one of them, and it aims to …

Visual news: Benchmark and challenges in news image captioning

F Liu, Y Wang, T Wang, V Ordonez - arXiv preprint arXiv:2010.03743, 2020 - arxiv.org
We propose Visual News Captioner, an entity-aware model for the task of news image
captioning. We also introduce Visual News, a large-scale benchmark consisting of more …

Transform and tell: Entity-aware news image captioning

A Tran, A Mathews, L Xie - … of the IEEE/CVF conference on …, 2020 - openaccess.thecvf.com
We propose an end-to-end model which generates captions for images embedded in news
articles. News images present two key challenges: they rely on real-world knowledge …

Predicting economic development using geolocated wikipedia articles

E Sheehan, C Meng, M Tan, B Uzkent, N Jean… - Proceedings of the 25th …, 2019 - dl.acm.org
Progress on the UN Sustainable Development Goals (SDGs) is hampered by a persistent
lack of data regarding key social, environmental, and economic indicators, particularly in …

Boosting entity-aware image captioning with multi-modal knowledge graph

W Zhao, X Wu - IEEE Transactions on Multimedia, 2023 - ieeexplore.ieee.org
Entity-aware image captioning aims to describe named entities and events related to the
image by utilizing the background knowledge in the associated article. This task remains …

Journalistic guidelines aware news image captioning

X Yang, S Karaman, J Tetreault, A Jaimes - arXiv preprint arXiv …, 2021 - arxiv.org
The task of news article image captioning aims to generate descriptive and informative
captions for news article images. Unlike conventional image captions that simply describe …

[HTML][HTML] Automatic and intelligent content visualization system based on deep learning and genetic algorithm

M Ince - Neural Computing and Applications, 2022 - Springer
Increasing demand in distance education, e-learning, web-based learning, and other digital
sectors (eg, entertainment) has led to excessive amounts of e-content. Learning objects …