Context prior for scene segmentation

F Shamshad, S Khan, SW Zamir, MH Khan… - Medical Image …, 2023 - Elsevier

Following unprecedented success on the natural language tasks, Transformers have been
successfully applied to several computer vision problems, achieving state-of-the-art results …

被引用次数：509 相关文章所有 9 个版本

LAANet: lightweight attention-guided asymmetric network for real-time semantic segmentation

X Zhang, B Du, Z Wu, T Wan - Neural Computing and Applications, 2022 - Springer

With the increasing demand for real-world scenarios such as robot navigation and
autonomous driving, how to achieve a good trade-off between segmentation accuracy …

被引用次数：43 相关文章所有 4 个版本

[PDF] thecvf.com

Rethinking semantic segmentation: A prototype view

T Zhou, W Wang, E Konukoglu… - Proceedings of the …, 2022 - openaccess.thecvf.com

Prevalent semantic segmentation solutions, despite their different network designs (FCN
based or attention based) and mask decoding strategies (parametric softmax based or pixel …

被引用次数：262 相关文章所有 10 个版本

[PDF] neurips.cc

SegFormer: Simple and efficient design for semantic segmentation with transformers

E Xie, W Wang, Z Yu, A Anandkumar… - Advances in neural …, 2021 - proceedings.neurips.cc

We present SegFormer, a simple, efficient yet powerful semantic segmentation framework
which unifies Transformers with lightweight multilayer perceptron (MLP) decoders …

被引用次数：3782 相关文章所有 12 个版本

[PDF] thecvf.com

Segmenter: Transformer for semantic segmentation

R Strudel, R Garcia, I Laptev… - Proceedings of the …, 2021 - openaccess.thecvf.com

Image segmentation is often ambiguous at the level of individual image patches and
requires contextual information to reach label consensus. In this paper we introduce …

被引用次数：1601 相关文章所有 12 个版本

[PDF] neurips.cc

Segvit: Semantic segmentation with plain vision transformers

B Zhang, Z Tian, Q Tang, X Chu… - Advances in Neural …, 2022 - proceedings.neurips.cc

We explore the capability of plain Vision Transformers (ViTs) for semantic segmentation and
propose the SegViT. Previous ViT-based segmentation networks usually learn a pixel-level …

被引用次数：87 相关文章所有 6 个版本

[PDF] thecvf.com

Exploring cross-image pixel contrast for semantic segmentation

W Wang, T Zhou, F Yu, J Dai… - Proceedings of the …, 2021 - openaccess.thecvf.com

Current semantic segmentation methods focus only on mining" local" context, ie,
dependencies between pixels within individual images, by context-aggregation modules …

被引用次数：501 相关文章所有 8 个版本

[PDF] thecvf.com

Lite-hrnet: A lightweight high-resolution network

C Yu, B Xiao, C Gao, L Yuan, L Zhang… - Proceedings of the …, 2021 - openaccess.thecvf.com

We present an efficient high-resolution network, Lite-HRNet, for human pose estimation. We
start by simply applying the efficient shuffle block in ShuffleNet to HRNet (high-resolution …

被引用次数：362 相关文章所有 7 个版本

[PDF] thecvf.com

Retinex-inspired unrolling with cooperative prior architecture search for low-light image enhancement

R Liu, L Ma, J Zhang, X Fan… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com

Low-light image enhancement plays very important roles in low-level vision areas. Recent
works have built a great deal of deep learning models to address this task. However, these …

被引用次数：581 相关文章所有 7 个版本

[PDF] thecvf.com

Monoscene: Monocular 3d semantic scene completion

AQ Cao, R De Charette - … of the IEEE/CVF Conference on …, 2022 - openaccess.thecvf.com

MonoScene proposes a 3D Semantic Scene Completion (SSC) framework, where the dense
geometry and semantics of a scene are inferred from a single monocular RGB image …

被引用次数：168 相关文章所有 9 个版本

高级搜索

QQ 群