Transformers in medical imaging: A survey

F Shamshad, S Khan, SW Zamir, MH Khan… - Medical Image …, 2023 - Elsevier
Following unprecedented success on the natural language tasks, Transformers have been
successfully applied to several computer vision problems, achieving state-of-the-art results …

LAANet: lightweight attention-guided asymmetric network for real-time semantic segmentation

X Zhang, B Du, Z Wu, T Wan - Neural Computing and Applications, 2022 - Springer
With the increasing demand for real-world scenarios such as robot navigation and
autonomous driving, how to achieve a good trade-off between segmentation accuracy …

Rethinking semantic segmentation: A prototype view

T Zhou, W Wang, E Konukoglu… - Proceedings of the …, 2022 - openaccess.thecvf.com
Prevalent semantic segmentation solutions, despite their different network designs (FCN
based or attention based) and mask decoding strategies (parametric softmax based or pixel …

SegFormer: Simple and efficient design for semantic segmentation with transformers

E Xie, W Wang, Z Yu, A Anandkumar… - Advances in neural …, 2021 - proceedings.neurips.cc
We present SegFormer, a simple, efficient yet powerful semantic segmentation framework
which unifies Transformers with lightweight multilayer perceptron (MLP) decoders …

Segmenter: Transformer for semantic segmentation

R Strudel, R Garcia, I Laptev… - Proceedings of the …, 2021 - openaccess.thecvf.com
Image segmentation is often ambiguous at the level of individual image patches and
requires contextual information to reach label consensus. In this paper we introduce …

Segvit: Semantic segmentation with plain vision transformers

B Zhang, Z Tian, Q Tang, X Chu… - Advances in Neural …, 2022 - proceedings.neurips.cc
We explore the capability of plain Vision Transformers (ViTs) for semantic segmentation and
propose the SegViT. Previous ViT-based segmentation networks usually learn a pixel-level …

Exploring cross-image pixel contrast for semantic segmentation

W Wang, T Zhou, F Yu, J Dai… - Proceedings of the …, 2021 - openaccess.thecvf.com
Current semantic segmentation methods focus only on mining" local" context, ie,
dependencies between pixels within individual images, by context-aggregation modules …

Lite-hrnet: A lightweight high-resolution network

C Yu, B Xiao, C Gao, L Yuan, L Zhang… - Proceedings of the …, 2021 - openaccess.thecvf.com
We present an efficient high-resolution network, Lite-HRNet, for human pose estimation. We
start by simply applying the efficient shuffle block in ShuffleNet to HRNet (high-resolution …

Retinex-inspired unrolling with cooperative prior architecture search for low-light image enhancement

R Liu, L Ma, J Zhang, X Fan… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Low-light image enhancement plays very important roles in low-level vision areas. Recent
works have built a great deal of deep learning models to address this task. However, these …

Monoscene: Monocular 3d semantic scene completion

AQ Cao, R De Charette - … of the IEEE/CVF Conference on …, 2022 - openaccess.thecvf.com
MonoScene proposes a 3D Semantic Scene Completion (SSC) framework, where the dense
geometry and semantics of a scene are inferred from a single monocular RGB image …