Image segmentation using deep learning: A survey

S Minaee, Y Boykov, F Porikli, A Plaza… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
Image segmentation is a key task in computer vision and image processing with important
applications such as scene understanding, medical image analysis, robotic perception …

Rethinking semantic segmentation: A prototype view

T Zhou, W Wang, E Konukoglu… - Proceedings of the …, 2022 - openaccess.thecvf.com
Prevalent semantic segmentation solutions, despite their different network designs (FCN
based or attention based) and mask decoding strategies (parametric softmax based or pixel …

Language-driven semantic segmentation

B Li, KQ Weinberger, S Belongie, V Koltun… - arXiv preprint arXiv …, 2022 - arxiv.org
We present LSeg, a novel model for language-driven semantic image segmentation. LSeg
uses a text encoder to compute embeddings of descriptive input labels (eg," grass" or" …

Image segmentation review: Theoretical background and recent advances

KK Brar, B Goyal, A Dogra, MA Mustafa, R Majumdar… - Information …, 2024 - Elsevier
Image segmentation is a significant topic in image refining and automated image analysis
with relevance for instance object recognition, diagnostic imaging scanning, mechanized …

Segmenter: Transformer for semantic segmentation

R Strudel, R Garcia, I Laptev… - Proceedings of the …, 2021 - openaccess.thecvf.com
Image segmentation is often ambiguous at the level of individual image patches and
requires contextual information to reach label consensus. In this paper we introduce …

Swin transformer: Hierarchical vision transformer using shifted windows

Z Liu, Y Lin, Y Cao, H Hu, Y Wei… - Proceedings of the …, 2021 - openaccess.thecvf.com
This paper presents a new vision Transformer, called Swin Transformer, that capably serves
as a general-purpose backbone for computer vision. Challenges in adapting Transformer …

Vision transformers for dense prediction

R Ranftl, A Bochkovskiy… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
We introduce dense prediction transformers, an architecture that leverages vision
transformers in place of convolutional networks as a backbone for dense prediction tasks …

Focal self-attention for local-global interactions in vision transformers

J Yang, C Li, P Zhang, X Dai, B Xiao, L Yuan… - arXiv preprint arXiv …, 2021 - arxiv.org
Recently, Vision Transformer and its variants have shown great promise on various
computer vision tasks. The ability of capturing short-and long-range visual dependencies …

Exploring cross-image pixel contrast for semantic segmentation

W Wang, T Zhou, F Yu, J Dai… - Proceedings of the …, 2021 - openaccess.thecvf.com
Current semantic segmentation methods focus only on mining" local" context, ie,
dependencies between pixels within individual images, by context-aggregation modules …

Self-support few-shot semantic segmentation

Q Fan, W Pei, YW Tai, CK Tang - European Conference on Computer …, 2022 - Springer
Existing few-shot segmentation methods have achieved great progress based on the
support-query matching framework. But they still heavily suffer from the limited coverage of …