Vision-language models in remote sensing: Current progress and future trends

X Li, C Wen, Y Hu, Z Yuan… - IEEE Geoscience and …, 2024 - ieeexplore.ieee.org
The remarkable achievements of ChatGPT and Generative Pre-trained Transformer 4 (GPT-
4) have sparked a wave of interest and research in the field of large language models …

Remote sensing object detection in the deep learning era—a review

S Gui, S Song, R Qin, Y Tang - Remote Sensing, 2024 - mdpi.com
Given the large volume of remote sensing images collected daily, automatic object detection
and segmentation have been a consistent need in Earth observation (EO). However, objects …

Geochat: Grounded large vision-language model for remote sensing

K Kuckreja, MS Danish, M Naseer… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract Recent advancements in Large Vision-Language Models (VLMs) have shown great
promise in natural image domains allowing users to hold a dialogue about given visual …

Rsvg: Exploring data and models for visual grounding on remote sensing data

Y Zhan, Z Xiong, Y Yuan - IEEE Transactions on Geoscience …, 2023 - ieeexplore.ieee.org
In this article, we introduce the task of visual grounding for remote sensing data (RSVG).
RSVG aims to localize the referred objects in remote sensing (RS) images with the guidance …

Rsgpt: A remote sensing vision language model and benchmark

Y Hu, J Yuan, C Wen, X Lu, X Li - arXiv preprint arXiv:2307.15266, 2023 - arxiv.org
The emergence of large-scale large language models, with GPT-4 as a prominent example,
has significantly propelled the rapid advancement of artificial general intelligence and …

Parameter-efficient transfer learning for remote sensing image-text retrieval

Y Yuan, Y Zhan, Z Xiong - IEEE Transactions on Geoscience …, 2023 - ieeexplore.ieee.org
Vision-and-language pretraining (VLP) models have experienced a surge in popularity
recently. By fine-tuning them on specific datasets, significant performance improvements …

Change detection meets visual question answering

Z Yuan, L Mou, Z Xiong, XX Zhu - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
The Earth's surface is continually changing, and identifying changes plays an important role
in urban planning and sustainability. Although change detection techniques have been …

A spatial hierarchical reasoning network for remote sensing visual question answering

Z Zhang, L Jiao, L Li, X Liu, P Chen… - … on Geoscience and …, 2023 - ieeexplore.ieee.org
For visual question answering on remote sensing (RSVQA), current methods scarcely
consider geospatial objects typically with large-scale differences and positional sensitive …

Application of question answering systems for intelligent agriculture production and sustainable management: A review

T Yang, Y Mei, L Xu, H Yu, Y Chen - Resources, Conservation and …, 2024 - Elsevier
The increasing application of artificial intelligence in agriculture production and
management has generated a large amount of data, leading to a demand for processing this …

Machine-to-machine visual dialoguing with ChatGPT for enriched textual image description

R Ricci, Y Bazi, F Melgani - Remote Sensing, 2024 - mdpi.com
Image captioning is a technique that enables the automatic extraction of natural language
descriptions about the contents of an image. On the one hand, information in the form of …