Vision-language models in remote sensing: Current progress and future trends

X Li, C Wen, Y Hu, Z Yuan… - IEEE Geoscience and …, 2024 - ieeexplore.ieee.org
The remarkable achievements of ChatGPT and Generative Pre-trained Transformer 4 (GPT-
4) have sparked a wave of interest and research in the field of large language models …

Remote sensing object detection in the deep learning era—a review

S Gui, S Song, R Qin, Y Tang - Remote Sensing, 2024 - mdpi.com
Given the large volume of remote sensing images collected daily, automatic object detection
and segmentation have been a consistent need in Earth observation (EO). However, objects …

Attention, please! A survey of neural attention models in deep learning

A de Santana Correia, EL Colombini - Artificial Intelligence Review, 2022 - Springer
In humans, Attention is a core property of all perceptual and cognitive operations. Given our
limited ability to process competing sources, attention mechanisms select, modulate, and …

Hyperspectral image classification based on 3-D octave convolution with spatial–spectral attention network

X Tang, F Meng, X Zhang, YM Cheung… - … on Geoscience and …, 2020 - ieeexplore.ieee.org
In recent years, with the development of deep learning (DL), the hyperspectral image (HSI)
classification methods based on DL have shown superior performance. Although these DL …

RSVQA: Visual question answering for remote sensing data

S Lobry, D Marcos, J Murray… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
This article introduces the task of visual question answering for remote sensing data
(RSVQA). Remote sensing images contain a wealth of information, which can be useful for a …

Rsgpt: A remote sensing vision language model and benchmark

Y Hu, J Yuan, C Wen, X Lu, X Li - arXiv preprint arXiv:2307.15266, 2023 - arxiv.org
The emergence of large-scale large language models, with GPT-4 as a prominent example,
has significantly propelled the rapid advancement of artificial general intelligence and …

NWPU-captions dataset and MLCA-net for remote sensing image captioning

Q Cheng, H Huang, Y Xu, Y Zhou, H Li… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Recently, the burgeoning demands for captioning-related applications have inspired great
endeavors in the remote sensing community. However, current benchmark datasets are …

A decoupling paradigm with prompt learning for remote sensing image change captioning

C Liu, R Zhao, J Chen, Z Qi, Z Zou… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Remote sensing image change captioning (RSICC) is a novel task that aims to describe the
differences between bitemporal images by natural language. Previous methods ignore a …

Word–sentence framework for remote sensing image captioning

Q Wang, W Huang, X Zhang, X Li - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Remote sensing image captioning (RSIC), which aims at generating a well-formed sentence
for a remote sensing image, has attracted more attention in recent years. The general …

Truncation cross entropy loss for remote sensing image captioning

X Li, X Zhang, W Huang, Q Wang - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Recently, remote sensing image captioning (RSIC) has drawn an increasing attention. In this
field, the encoder-decoder-based methods have become the mainstream due to their …