Current image captioning systems perform at a merely descriptive level, essentially enumerating the objects in the scene and their relations. Humans, on the contrary, interpret …
L Specia, S Frank, K Sima'an… - Proceedings of the First …, 2016 - aclanthology.org
This paper introduces and summarises the findings of a new shared task at the intersection of Natural Language Processing and Computer Vision: the generation of image descriptions …
U Sirisha, B Sai Chandana - Cogent Engineering, 2022 - Taylor & Francis
In our day-to-day life, synchronizing vision and language aspects plays a crucial role in solving various real-time challenges. Image captioning is one of them, and it aims to …
We propose Visual News Captioner, an entity-aware model for the task of news image captioning. We also introduce Visual News, a large-scale benchmark consisting of more …
A Tran, A Mathews, L Xie - … of the IEEE/CVF conference on …, 2020 - openaccess.thecvf.com
We propose an end-to-end model which generates captions for images embedded in news articles. News images present two key challenges: they rely on real-world knowledge …
Progress on the UN Sustainable Development Goals (SDGs) is hampered by a persistent lack of data regarding key social, environmental, and economic indicators, particularly in …
W Zhao, X Wu - IEEE Transactions on Multimedia, 2023 - ieeexplore.ieee.org
Entity-aware image captioning aims to describe named entities and events related to the image by utilizing the background knowledge in the associated article. This task remains …
The task of news article image captioning aims to generate descriptive and informative captions for news article images. Unlike conventional image captions that simply describe …
M Ince - Neural Computing and Applications, 2022 - Springer
Increasing demand in distance education, e-learning, web-based learning, and other digital sectors (e.g., entertainment) has led to excessive amounts of e-content. Learning objects …