Rrsis: Referring remote sensing image segmentation

L Jiao, Z Huang, X Lu, X Liu, Y Yang… - IEEE Journal of …, 2023 - ieeexplore.ieee.org

The foundation model (FM) has garnered significant attention for its remarkable transfer
performance in downstream tasks. Typically, it undergoes task-agnostic pretraining on a …

被引用次数：3 相关文章所有 3 个版本

[PDF] mdpi.com

A Review of Deep Learning-Based Methods for Road Extraction from High-Resolution Remote Sensing Images

R Liu, J Wu, W Lu, Q Miao, H Zhang, X Liu, Z Lu, L Li - Remote Sensing, 2024 - mdpi.com

Road extraction from high-resolution remote sensing images has long been a focal and
challenging research topic in the field of computer vision. Accurate extraction of road …

[PDF] arxiv.org

Chatearthnet: A global-scale, high-quality image-text dataset for remote sensing

Z Yuan, Z Xiong, L Mou, XX Zhu - arXiv preprint arXiv:2402.11325, 2024 - arxiv.org

An in-depth comprehension of global land cover is essential in Earth observation, forming
the foundation for a multitude of applications. Although remote sensing technology has …

被引用次数：3 相关文章所有 2 个版本

[PDF] arxiv.org

VRSBench: A Versatile Vision-Language Benchmark Dataset for Remote Sensing Image Understanding

X Li, J Ding, M Elhoseiny - arXiv preprint arXiv:2406.12384, 2024 - arxiv.org

We introduce a new benchmark designed to advance the development of general-purpose,
large-scale vision-language models for remote sensing images. Although several vision …

[PDF] arxiv.org

Towards Vision-Language Geo-Foundation Model: A Survey

Y Zhou, L Feng, Y Ke, X Jiang, J Yan, X Yang… - arXiv preprint arXiv …, 2024 - arxiv.org

Vision-Language Foundation Models (VLFMs) have made remarkable progress on various
multimodal tasks, such as image captioning, image-text retrieval, visual question answering …

高级搜索

QQ 群