Captioning images taken by people who are blind

D Gurari, Y Zhao, M Zhang, N Bhattacharya - Computer Vision–ECCV …, 2020 - Springer
While an important problem in the vision community is to design algorithms that can
automatically caption images, few publicly-available datasets for algorithm development …

Multi-modal image captioning for the visually impaired

H Ahsan, N Bhalla, D Bhatt, K Shah - arXiv preprint arXiv:2105.08106, 2021 - arxiv.org
One of the ways blind people understand their surroundings is by clicking images and
relying on descriptions generated by image captioning systems. Current work on captioning …

Image captioning as an assistive technology: Lessons learned from vizwiz 2020 challenge

P Dognin, I Melnyk, Y Mroueh, I Padhi, M Rigotti… - Journal of Artificial …, 2022 - jair.org
Image captioning has recently demonstrated impressive progress largely owing to the
introduction of neural network algorithms trained on curated dataset like MS-COCO. Often …

Understanding and evaluating racial biases in image captioning

D Zhao, A Wang… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Image captioning is an important task for benchmarking visual reasoning and for enabling
accessibility for people with vision impairments. However, as in many machine learning …

Compare and reweight: Distinctive image captioning using similar images sets

J Wang, W Xu, Q Wang, AB Chan - … , Glasgow, UK, August 23–28, 2020 …, 2020 - Springer
A wide range of image captioning models has been developed, achieving significant
improvement based on popular metrics, such as BLEU, CIDEr, and SPICE. However …

A new image captioning approach for visually impaired people

B Makav, V Kılıç - 2019 11th international conference on …, 2019 - ieeexplore.ieee.org
Automatic caption generation in natural language to describe the visual content of an image
has attracted an increasing amount of attention in the last decade due to its potential …

Describing like humans: on diversity in image captioning

Q Wang, AB Chan - … of the IEEE/CVF Conference on …, 2019 - openaccess.thecvf.com
Recently, the state-of-the-art models for image captioning have overtaken human
performance based on the most popular metrics, such as BLEU, METEOR, ROUGE and …

Good news, everyone! context driven entity-aware captioning for news images

AF Biten, L Gomez, M Rusinol… - Proceedings of the …, 2019 - openaccess.thecvf.com
Current image captioning systems perform at a merely descriptive level, essentially
enumerating the objects in the scene and their relations. Humans, on the contrary, interpret …

Boosted attention: Leveraging human attention for image captioning

S Chen, Q Zhao - … of the European conference on computer …, 2018 - openaccess.thecvf.com
Visual attention has shown usefulness in image captioning, with the goal of enabling a
caption model to selectively focus on regions of interest. Existing models typically rely on top …

Improving image captioning by leveraging knowledge graphs

Y Zhou, Y Sun, V Honavar - 2019 IEEE winter conference on …, 2019 - ieeexplore.ieee.org
We explore the use of a knowledge graphs, that capture general or commonsense
knowledge, to augment the information extracted from images by the state-of-the-art …