Bisenet: Bilateral segmentation network for real-time semantic segmentation

Y Mo, Y Wu, X Yang, F Liu, Y Liao - Neurocomputing, 2022 - Elsevier

The goal of semantic segmentation is to segment the input image according to semantic
information and predict the semantic category of each pixel from a given label set. With the …

被引用次数：318 相关文章所有 3 个版本

[PDF] arxiv.org

Image segmentation using deep learning: A survey

S Minaee, Y Boykov, F Porikli, A Plaza… - IEEE transactions on …, 2021 - ieeexplore.ieee.org

Image segmentation is a key task in computer vision and image processing with important
applications such as scene understanding, medical image analysis, robotic perception …

被引用次数：3151 相关文章所有 13 个版本

[PDF] neurips.cc

Segnext: Rethinking convolutional attention design for semantic segmentation

MH Guo, CZ Lu, Q Hou, Z Liu… - Advances in Neural …, 2022 - proceedings.neurips.cc

We present SegNeXt, a simple convolutional network architecture for semantic
segmentation. Recent transformer-based models have dominated the field of se-mantic …

被引用次数：396 相关文章所有 6 个版本

[PDF] thecvf.com

PIDNet: A real-time semantic segmentation network inspired by PID controllers

J Xu, Z Xiong, SP Bhattacharyya - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Two-branch network architecture has shown its efficiency and effectiveness in real-time
semantic segmentation tasks. However, direct fusion of high-resolution details and low …

被引用次数：188 相关文章所有 8 个版本

[PDF] thecvf.com

Diffusionclip: Text-guided diffusion models for robust image manipulation

G Kim, T Kwon, JC Ye - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com

Recently, GAN inversion methods combined with Contrastive Language-Image Pretraining
(CLIP) enables zero-shot image manipulation guided by text prompts. However, their …

被引用次数：453 相关文章所有 9 个版本

[PDF] thecvf.com

Topformer: Token pyramid transformer for mobile semantic segmentation

W Zhang, Z Huang, G Luo, T Chen… - Proceedings of the …, 2022 - openaccess.thecvf.com

Although vision transformers (ViTs) have achieved great success in computer vision, the
heavy computational cost hampers their applications to dense prediction tasks such as …

被引用次数：176 相关文章所有 6 个版本

[PDF] arxiv.org

Encoder-based domain tuning for fast personalization of text-to-image models

R Gal, M Arar, Y Atzmon, AH Bermano… - ACM Transactions on …, 2023 - dl.acm.org

Text-to-image personalization aims to teach a pre-trained diffusion model to reason about
novel, user provided concepts, embedding them into new scenes guided by natural …

被引用次数：98 相关文章所有 4 个版本

[PDF] arxiv.org

UNetFormer: A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery

L Wang, R Li, C Zhang, S Fang, C Duan, X Meng… - ISPRS Journal of …, 2022 - Elsevier

Semantic segmentation of remotely sensed urban scene images is required in a wide range
of practical applications, such as land cover mapping, urban change detection …

被引用次数：364 相关文章所有 8 个版本

Deep dual-resolution networks for real-time and accurate semantic segmentation of traffic scenes

H Pan, Y Hong, W Sun, Y Jia - IEEE Transactions on Intelligent …, 2022 - ieeexplore.ieee.org

Using light-weight architectures or reasoning on low-resolution images, recent methods
realize very fast scene parsing, even running at more than 100 FPS on a single GPU …

被引用次数：126 相关文章所有 3 个版本

[PDF] thecvf.com

Headnerf: A real-time nerf-based parametric head model

Y Hong, B Peng, H Xiao, L Liu… - Proceedings of the …, 2022 - openaccess.thecvf.com

In this paper, we propose HeadNeRF, a novel NeRF-based parametric head model that
integrates the neural radiance field to the parametric representation of the human head. It …

被引用次数：211 相关文章所有 6 个版本

高级搜索

QQ 群