Vision-language models in remote sensing: Current progress and future trends

X Li, C Wen, Y Hu, Z Yuan… - IEEE Geoscience and …, 2024 - ieeexplore.ieee.org
The remarkable achievements of ChatGPT and Generative Pre-trained Transformer 4 (GPT-
4) have sparked a wave of interest and research in the field of large language models …

Language Integration in Remote Sensing: Tasks, datasets, and future directions

L Bashmal, Y Bazi, F Melgani… - … and Remote Sensing …, 2023 - ieeexplore.ieee.org
The emerging field of vision–language models, which combines computer vision and natural
language processing (NLP), has gained significant interest and exploration. This integration …

Rrsis: Referring remote sensing image segmentation

Z Yuan, L Mou, Y Hua, XX Zhu - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Localizing desired objects from remote sensing images is of great use in practical
applications. Referring image segmentation, which aims at segmenting out the objects to …

Visual question answering on remote sensing images

S Lobry, D Tuia - Advances in Machine Learning and Image Analysis for …, 2024 - Elsevier
Remote sensing visual question answering (RSVQA) aims at predicting an answer to a
question (both in natural language) about an overhead image. Through natural language …

[HTML][HTML] A multi-scale contextual attention network for remote sensing visual question answering

J Feng, H Wang - International Journal of Applied Earth Observation and …, 2024 - Elsevier
Remote sensing visual question answering (RSVQA) is a user-friendly method used for
analyzing remote sensing images (RSIs) in various tasks. However, current methods often …

Overcoming Language Bias in Remote Sensing Visual Question Answering Via Adversarial Training

Z Yuan, L Mou, XX Zhu - IGARSS 2023-2023 IEEE International …, 2023 - ieeexplore.ieee.org
The Visual Question Answering (VQA) system offers a user-friendly interface and enables
human-computer interaction. However, VQA models commonly face the challenge of …