A comprehensive survey on image captioning: from handcrafted to deep learning-based techniques, a taxonomy and open research issues

H Sharma, D Padha - Artificial Intelligence Review, 2023 - Springer
Image captioning is a pretty modern area of the convergence of computer vision and natural
language processing and is widely used in a range of applications such as multi-modal …

NWPU-captions dataset and MLCA-net for remote sensing image captioning

Q Cheng, H Huang, Y Xu, Y Zhou, H Li… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Recently, the burgeoning demands for captioning-related applications have inspired great
endeavors in the remote sensing community. However, current benchmark datasets are …

Language Integration in Remote Sensing: Tasks, datasets, and future directions

L Bashmal, Y Bazi, F Melgani… - … and Remote Sensing …, 2023 - ieeexplore.ieee.org
The emerging field of vision–language models, which combines computer vision and natural
language processing (NLP), has gained significant interest and exploration. This integration …

Truncation cross entropy loss for remote sensing image captioning

X Li, X Zhang, W Huang, Q Wang - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Recently, remote sensing image captioning (RSIC) has drawn an increasing attention. In this
field, the encoder-decoder-based methods have become the mainstream due to their …

Change captioning: A new paradigm for multitemporal remote sensing image analysis

G Hoxha, S Chouaf, F Melgani… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Change detection (CD) is among the most important applications in remote sensing (RS)
that allows identifying the changes that occurred in a given geographical area across …

Toward remote sensing image retrieval under a deep image captioning perspective

G Hoxha, F Melgani, B Demir - IEEE Journal of Selected Topics …, 2020 - ieeexplore.ieee.org
The performance of remote sensing image retrieval (RSIR) systems depends on the
capability of the extracted features in characterizing the semantic content of images. Existing …

Learning consensus-aware semantic knowledge for remote sensing image captioning

Y Li, X Zhang, X Cheng, X Tang, L Jiao - Pattern Recognition, 2024 - Elsevier
Tremendous progresses have been made in remote sensing image captioning (RSIC) task
in recent years, yet there still some unresolved problems: (1) facing the gap between the …

GLCM: Global–local captioning model for remote sensing image captioning

Q Wang, W Huang, X Zhang, X Li - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Remote sensing image captioning (RSIC), which describes a remote sensing image with a
semantically related sentence, has been a cross-modal challenge between computer vision …

Exploring transformer and multilabel classification for remote sensing image captioning

H Kandala, S Saha, B Banerjee… - IEEE Geoscience and …, 2022 - ieeexplore.ieee.org
High-resolution remote sensing images are now available with the progress of remote
sensing technology. With respect to popular remote sensing tasks, such as scene …

Visual question generation from remote sensing images

L Bashmal, Y Bazi, F Melgani, R Ricci… - IEEE Journal of …, 2023 - ieeexplore.ieee.org
Visual question generation (VQG) is a fundamental task in vision-language understanding
that aims to generate relevant questions about the given input image. In this article, we …