Segment anything in high quality

L Ke, M Ye, M Danelljan, YW Tai… - Advances in Neural …, 2024 - proceedings.neurips.cc
Abstract The recent Segment Anything Model (SAM) represents a big leap in scaling up
segmentation models, allowing for powerful zero-shot capabilities and flexible prompting …

Battle of the backbones: A large-scale comparison of pretrained models across computer vision tasks

M Goldblum, H Souri, R Ni, M Shu… - Advances in …, 2024 - proceedings.neurips.cc
Neural network based computer vision systems are typically built on a backbone, a
pretrained or randomly initialized feature extractor. Several years ago, the default option was …

Lhrs-bot: Empowering remote sensing with vgi-enhanced large multimodal language model

D Muhtar, Z Li, F Gu, X Zhang, P Xiao - European Conference on …, 2024 - Springer
The revolutionary capabilities of large language models (LLMs) have paved the way for
multimodal large language models (MLLMs) and fostered diverse applications across …

Cinematic mindscapes: High-quality video reconstruction from brain activity

Z Chen, J Qing, JH Zhou - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Reconstructing human vision from brain activities has been an appealing task that helps to
understand our cognitive process. Even though recent research has seen great success in …

IRSAM: Advancing segment anything model for infrared small target detection

M Zhang, Y Wang, J Guo, Y Li, X Gao… - European Conference on …, 2024 - Springer
Abstract The recent Segment Anything Model (SAM) is a significant advancement in natural
image segmentation, exhibiting potent zero-shot performance suitable for various …

Attentionviz: A global view of transformer attention

C Yeh, Y Chen, A Wu, C Chen, F Viégas… - … on Visualization and …, 2023 - ieeexplore.ieee.org
Transformer models are revolutionizing machine learning, but their inner workings remain
mysterious. In this work, we present a new visualization technique designed to help …

Firerisk: A remote sensing dataset for fire risk assessment with benchmarks using supervised and self-supervised learning

S Shen, S Seneviratne, X Wanyan… - … Conference on Digital …, 2023 - ieeexplore.ieee.org
In recent decades, wildfires have caused tremendous property losses, fatalities, and
extensive damage to forest ecosystems. Inspired by the abundance of publicly available …

Teaching matters: Investigating the role of supervision in vision transformers

M Walmer, S Suri, K Gupta… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract Vision Transformers (ViTs) have gained significant popularity in recent years and
have proliferated into many applications. However, their behavior under different learning …

An MRI deep learning model predicts outcome in rectal cancer

X Jiang, H Zhao, OL Saldanha, S Nebelung, C Kuhl… - Radiology, 2023 - pubs.rsna.org
Background Deep learning (DL) models can potentially improve prognostication of rectal
cancer but have not been systematically assessed. Purpose To develop and validate an MRI …

Explainability of Vision Transformers: A Comprehensive Review and New Perspectives

R Kashefi, L Barekatain, M Sabokrou… - arXiv preprint arXiv …, 2023 - arxiv.org
Transformers have had a significant impact on natural language processing and have
recently demonstrated their potential in computer vision. They have shown promising results …