Vision-language models in remote sensing: Current progress and future trends

X Li, C Wen, Y Hu, Z Yuan… - IEEE Geoscience and …, 2024 - ieeexplore.ieee.org
The remarkable achievements of ChatGPT and Generative Pre-trained Transformer 4 (GPT-
4) have sparked a wave of interest and research in the field of large language models …

Onboard information fusion for multisatellite collaborative observation: Summary, challenges, and perspectives

G Gao, L Yao, W Li, L Zhang… - IEEE Geoscience and …, 2023 - ieeexplore.ieee.org
Onboard information fusion for multisatellites, which is based on spatial computing mode,
can improve the satellites' capability, such as the spatial–temporal coverage, detection …

Rsvg: Exploring data and models for visual grounding on remote sensing data

Y Zhan, Z Xiong, Y Yuan - IEEE Transactions on Geoscience …, 2023 - ieeexplore.ieee.org
In this article, we introduce the task of visual grounding for remote sensing data (RSVG).
RSVG aims to localize the referred objects in remote sensing (RS) images with the guidance …

Rsgpt: A remote sensing vision language model and benchmark

Y Hu, J Yuan, C Wen, X Lu, X Li - arXiv preprint arXiv:2307.15266, 2023 - arxiv.org
The emergence of large-scale large language models, with GPT-4 as a prominent example,
has significantly propelled the rapid advancement of artificial general intelligence and …

Distilling knowledge from super-resolution for efficient remote sensing salient object detection

Y Liu, Z Xiong, Y Yuan, Q Wang - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Current state-of-the-art remote sensing salient object detectors always require high-
resolution spatial context to ensure excellent performance, which incurs enormous …

Parameter-efficient transfer learning for remote sensing image-text retrieval

Y Yuan, Y Zhan, Z Xiong - IEEE Transactions on Geoscience …, 2023 - ieeexplore.ieee.org
Vision-and-language pretraining (VLP) models have experienced a surge in popularity
recently. By fine-tuning them on specific datasets, significant performance improvements …

Artificial intelligence to advance Earth observation: a perspective

D Tuia, K Schindler, B Demir, G Camps-Valls… - arXiv preprint arXiv …, 2023 - arxiv.org
Earth observation (EO) is a prime instrument for monitoring land and ocean processes,
studying the dynamics at work, and taking the pulse of our planet. This article gives a bird's …

A spatial hierarchical reasoning network for remote sensing visual question answering

Z Zhang, L Jiao, L Li, X Liu, P Chen… - … on Geoscience and …, 2023 - ieeexplore.ieee.org
For visual question answering on remote sensing (RSVQA), current methods scarcely
consider geospatial objects typically with large-scale differences and positional sensitive …

Edge-Guided Remote Sensing Image Compression

P Han, B Zhao, X Li - IEEE Transactions on Geoscience and …, 2023 - ieeexplore.ieee.org
Using high-fidelity image compression makes it possible to transmit remote-sensing images
in real-time. Nevertheless, existing lossy remote-sensing image compression (RSIC) …

From image to language: A critical analysis of visual question answering (vqa) approaches, challenges, and opportunities

MF Ishmam, MSH Shovon, MF Mridha, N Dey - Information Fusion, 2024 - Elsevier
The multimodal task of Visual Question Answering (VQA) encompassing elements of
Computer Vision (CV) and Natural Language Processing (NLP), aims to generate answers …