Injecting semantic concepts into end-to-end image captioning

Z Fang, J Wang, X Hu, L Liang, Z Gan… - Proceedings of the …, 2022 - openaccess.thecvf.com
tags play an important role in improving the captioning performance. Instead of gleaning the
object tags … novel fully VIsion Transformer based image CAPtioning model, dubbed ViTCAP, …

Iconographic image captioning for artworks

E Cetinic - … ICPR International Workshops and Challenges: Virtual …, 2021 - Springer
… not structured primarily as an image captioning dataset, each … on the down-stream task of
image captioning [42]. Transformer-… of artwork images with the goal to generate image captions

Entity-Aware Multimodal Alignment Framework for News Image Captioning

J Zhang, H Zhang, X Wan - arXiv preprint arXiv:2402.19404, 2024 - arxiv.org
… Therefore, we attempt to create a specialized image-text matching task within the news
image captioning task to align vision features with entity-aware textual features. Furthermore, …

X-linear attention networks for image captioning

Y Pan, T Yao, Y Li, T Mei - … of the IEEE/CVF conference on …, 2020 - openaccess.thecvf.com
… In the standard task of image captioning, we are given an image I to be described with a
natural-language sentence Y1:T . The sentence Y1:T = 1w1, w2, ..., wT l is a sequence of T …

Cnn+ cnn: Convolutional decoders for image captioning

Q Wang, AB Chan - arXiv preprint arXiv:1805.09019, 2018 - arxiv.org
… Inspired by the applications of CNNs in the field of NLP, we develop a framework that only
employ CNNs for image captioning. The main contributions of this paper are: 1. We propose a …

Designing tools for high-quality alt text authoring

K Mack, E Cutrell, B Lee, MR Morris - Proceedings of the 23rd …, 2021 - dl.acm.org
… Recently, researchers created “Twitter A11y,” a browser extension that implements six
methods of adding alt text to images without captions on Twitter [10]. Each of these features …

Image captioning with adaptive incremental global context attention

C Wang, X Gu - Applied Intelligence, 2022 - Springer
… of subsequent words when generating a sentence for image captioning task [9, 10]. … for
image captioning, which makes full use of the global information of the generated target captions

Utilizing a Dense Video Captioning Technique for Generating Image Descriptions of Comics for People with Visual Impairments

S Kim, S Lee, K Kim, U Oh - … of the 29th International Conference on …, 2024 - dl.acm.org
Automatic Alt Text) system that utilizes computer vision technology to identify faces, objects,
and themes in images … results generated by DCC and the image captioning model, DCC was …

Image captioning with novel topics guidance and retrieval-based topics re-weighting

M Al-Qatf, X Wang, A Hawbani… - IEEE Transactions …, 2022 - ieeexplore.ieee.org
… throughout the whole captioning task without considering the … captioning network to focus
on inaccurate image objects. To tackle these challenges, we propose a novel image captioning

Adaptive text denoising network for image caption editing

M Yuan, BK Bao, Z Tan, C Xu - ACM Transactions on Multimedia …, 2023 - dl.acm.org
… We extensively evaluate our proposals on the MS-COCO image captioning dataset and
prove the effectiveness of our method when compared with the state-of-the-arts. …