Foundation Models Defining a New Era in Vision: a Survey and Outlook

M Awais, M Naseer, S Khan, RM Anwer… - … on Pattern Analysis …, 2025 - ieeexplore.ieee.org
Vision systems that see and reason about the compositional nature of visual scenes are
fundamental to understanding our world. The complex relations between objects and their …

Explain any concept: Segment anything meets concept-based explanation

A Sun, P Ma, Y Yuan, S Wang - Advances in Neural …, 2024 - proceedings.neurips.cc
EXplainable AI (XAI) is an essential topic to improve human understanding of deep neural
networks (DNNs) given their black-box internals. For computer vision tasks, mainstream …

Segment anything for videos: A systematic survey

C Zhang, Y Cui, W Lin, G Huang, Y Rong, L Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
The recent wave of foundation models has witnessed tremendous success in computer
vision (CV) and beyond, with the segment anything model (SAM) having sparked a passion …

Exploring semantic prompts in the segment anything model for domain adaptation

Z Wang, Y Zhang, Z Zhang, Z Jiang, Y Yu, L Li, L Li - Remote Sensing, 2024 - mdpi.com
Robust segmentation in adverse weather conditions is crucial for autonomous driving.
However, these scenes struggle with recognition and make annotations expensive, resulting …

GeoSAM: Fine-tuning SAM with sparse and dense visual prompting for automated segmentation of mobility infrastructure

RI Sultan, C Li, H Zhu, P Khanduri, M Brocanelli… - arXiv preprint arXiv …, 2023 - arxiv.org
The Segment Anything Model (SAM) has shown impressive performance when applied to
natural image segmentation. However, it struggles with geographical images like aerial and …

[HTML][HTML] UAV (Unmanned Aerial Vehicle): Diverse Applications of UAV Datasets in Segmentation, Classification, Detection, and Tracking

MM Rahman, S Siddique, M Kamal, RH Rifat… - Algorithms, 2024 - mdpi.com
Unmanned Aerial Vehicles (UAVs) have transformed the process of data collection and
analysis in a variety of research disciplines, delivering unparalleled adaptability and …

Towards underwater camouflaged object tracking: An experimental evaluation of sam and sam 2

C Zhang, L Liu, G Huang, H Wen, X Zhou… - arXiv preprint arXiv …, 2024 - arxiv.org
Over the past decade, significant progress has been made in visual object tracking, largely
due to the availability of large-scale training datasets. However, existing tracking datasets …

Progressive Representation Learning for Real-Time UAV Tracking

C Fu, X Lei, H Zuo, L Yao, G Zheng… - 2024 IEEE/RSJ …, 2024 - ieeexplore.ieee.org
Visual object tracking has significantly promoted autonomous applications for unmanned
aerial vehicles (UAVs). However, learning robust object representations for UAV tracking is …

Prompt-Driven Temporal Domain Adaptation for Nighttime UAV Tracking

C Fu, Y Wang, L Yao, G Zheng… - 2024 IEEE/RSJ …, 2024 - ieeexplore.ieee.org
Nighttime UAV tracking under low-illuminated scenarios has achieved great progress by
domain adaptation (DA). However, previous DA training-based works are deficient in …

Global Multi-Scale Optimization and Prediction Head Attentional Siamese Network for Aerial Tracking

Q Chen, J Liu, X Wang, Y Zuo, C Liu - Symmetry, 2023 - mdpi.com
Siamese-based trackers have been widely used in object tracking. However, aerial remote
tracking suffers from various challenges such as scale variation, viewpoint change …