Brain-inspired remote sensing foundation models and open problems: A comprehensive survey

L Jiao, Z Huang, X Lu, X Liu, Y Yang… - IEEE Journal of …, 2023 - ieeexplore.ieee.org
The foundation model (FM) has garnered significant attention for its remarkable transfer
performance in downstream tasks. Typically, it undergoes task-agnostic pretraining on a …

A Review of Deep Learning-Based Methods for Road Extraction from High-Resolution Remote Sensing Images

R Liu, J Wu, W Lu, Q Miao, H Zhang, X Liu, Z Lu, L Li - Remote Sensing, 2024 - mdpi.com
Road extraction from high-resolution remote sensing images has long been a focal and
challenging research topic in the field of computer vision. Accurate extraction of road …

Chatearthnet: A global-scale, high-quality image-text dataset for remote sensing

Z Yuan, Z Xiong, L Mou, XX Zhu - arXiv preprint arXiv:2402.11325, 2024 - arxiv.org
An in-depth comprehension of global land cover is essential in Earth observation, forming
the foundation for a multitude of applications. Although remote sensing technology has …

VRSBench: A Versatile Vision-Language Benchmark Dataset for Remote Sensing Image Understanding

X Li, J Ding, M Elhoseiny - arXiv preprint arXiv:2406.12384, 2024 - arxiv.org
We introduce a new benchmark designed to advance the development of general-purpose,
large-scale vision-language models for remote sensing images. Although several vision …

Towards Vision-Language Geo-Foundation Model: A Survey

Y Zhou, L Feng, Y Ke, X Jiang, J Yan, X Yang… - arXiv preprint arXiv …, 2024 - arxiv.org
Vision-Language Foundation Models (VLFMs) have made remarkable progress on various
multimodal tasks, such as image captioning, image-text retrieval, visual question answering …