A survey of convolutional neural networks: analysis, applications, and prospects

Z Li, F Liu, W Yang, S Peng… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
A convolutional neural network (CNN) is one of the most significant networks in the deep
learning field. Since CNN made impressive achievements in many areas, including but not …

Image segmentation using deep learning: A survey

S Minaee, Y Boykov, F Porikli, A Plaza… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
Image segmentation is a key task in computer vision and image processing with important
applications such as scene understanding, medical image analysis, robotic perception …

Convolutions die hard: Open-vocabulary segmentation with single frozen convolutional clip

Q Yu, J He, X Deng, X Shen… - Advances in Neural …, 2023 - proceedings.neurips.cc
Open-vocabulary segmentation is a challenging task requiring segmenting and recognizing
objects from an open set of categories in diverse environments. One way to address this …

A generalist framework for panoptic segmentation of images and videos

T Chen, L Li, S Saxena, G Hinton… - Proceedings of the …, 2023 - openaccess.thecvf.com
Panoptic segmentation assigns semantic and instance ID labels to every pixel of an image.
As permutations of instance IDs are also valid solutions, the task requires learning of high …

Max-deeplab: End-to-end panoptic segmentation with mask transformers

H Wang, Y Zhu, H Adam, A Yuille… - Proceedings of the …, 2021 - openaccess.thecvf.com
Abstract We present MaX-DeepLab, the first end-to-end model for panoptic segmentation.
Our approach simplifies the current pipeline that depends heavily on surrogate sub-tasks …

Axial-deeplab: Stand-alone axial-attention for panoptic segmentation

H Wang, Y Zhu, B Green, H Adam, A Yuille… - European conference on …, 2020 - Springer
Convolution exploits locality for efficiency at a cost of missing long range context. Self-
attention has been adopted to augment CNNs with non-local interactions. Recent works …

Object-contextual representations for semantic segmentation

Y Yuan, X Chen, J Wang - Computer Vision–ECCV 2020: 16th European …, 2020 - Springer
In this paper, we study the context aggregation problem in semantic segmentation.
Motivated by that the label of a pixel is the category of the object that the pixel belongs to, we …

Panoptic-deeplab: A simple, strong, and fast baseline for bottom-up panoptic segmentation

B Cheng, MD Collins, Y Zhu, T Liu… - Proceedings of the …, 2020 - openaccess.thecvf.com
In this work, we introduce Panoptic-DeepLab, a simple, strong, and fast system for panoptic
segmentation, aiming to establish a solid baseline for bottom-up methods that can achieve …

Cmt-deeplab: Clustering mask transformers for panoptic segmentation

Q Yu, H Wang, D Kim, S Qiao… - Proceedings of the …, 2022 - openaccess.thecvf.com
Abstract We propose Clustering Mask Transformer (CMT-DeepLab), a transformer-based
framework for panoptic segmentation designed around clustering. It rethinks the existing …

Clustseg: Clustering for universal segmentation

J Liang, T Zhou, D Liu, W Wang - arXiv preprint arXiv:2305.02187, 2023 - arxiv.org
We present CLUSTSEG, a general, transformer-based framework that tackles different
image segmentation tasks (ie, superpixel, semantic, instance, and panoptic) through a …