Gmmseg: Gaussian mixture based generative semantic segmentation models

C Liang, W Wang, J Miao… - Advances in Neural …, 2022 - proceedings.neurips.cc
Prevalent semantic segmentation solutions are, in essence, a dense discriminative classifier
of p (class| pixel feature). Though straightforward, this de facto paradigm neglects the …

Learning equivariant segmentation with instance-unique querying

W Wang, J Liang, D Liu - Advances in Neural Information …, 2022 - proceedings.neurips.cc
Prevalent state-of-the-art instance segmentation methods fall into a query-based scheme, in
which instance masks are derived by querying the image feature using a set of instance …

Transflow: Transformer as flow learner

Y Lu, Q Wang, S Ma, T Geng… - Proceedings of the …, 2023 - openaccess.thecvf.com
Optical flow is an indispensable building block for various important computer vision tasks,
including motion estimation, object tracking, and disparity measurement. In this work, we …

Not all features matter: Enhancing few-shot clip with adaptive prior refinement

X Zhu, R Zhang, B He, A Zhou… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract The popularity of Contrastive Language-Image Pre-training (CLIP) has propelled its
application to diverse downstream vision tasks. To improve its capacity on downstream …

Clustseg: Clustering for universal segmentation

J Liang, T Zhou, D Liu, W Wang - arXiv preprint arXiv:2305.02187, 2023 - arxiv.org
We present CLUSTSEG, a general, transformer-based framework that tackles different
image segmentation tasks (ie, superpixel, semantic, instance, and panoptic) through a …

Clusterfomer: clustering as a universal visual learner

J Liang, Y Cui, Q Wang, T Geng… - Advances in neural …, 2024 - proceedings.neurips.cc
This paper presents ClusterFormer, a universal vision model that is based on the Clustering
paradigm with TransFormer. It comprises two novel designs: 1) recurrent cross-attention …

Differential feature awareness network within antagonistic learning for infrared-visible object detection

R Zhang, L Li, Q Zhang, J Zhang, L Xu… - … on Circuits and …, 2023 - ieeexplore.ieee.org
The combination of infrared and visible videos aims to gather more comprehensive feature
information from multiple sources and reach superior results on various practical tasks, such …

Unified 3d segmenter as prototypical classifiers

Z Qin, C Han, Q Wang, X Nie, Y Yin… - Advances in Neural …, 2023 - proceedings.neurips.cc
The task of point cloud segmentation, comprising semantic, instance, and panoptic
segmentation, has been mainly tackled by designing task-specific network architectures …

Federated graph learning under domain shift with generalizable prototypes

G Wan, W Huang, M Ye - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
Federated Graph Learning is a privacy-preserving collaborative approach for training a
shared model on graph-structured data in the distributed environment. However, in real …

E^ 2VPT: An Effective and Efficient Approach for Visual Prompt Tuning

C Han, Q Wang, Y Cui, Z Cao, W Wang, S Qi… - arXiv preprint arXiv …, 2023 - arxiv.org
As the size of transformer-based models continues to grow, fine-tuning these large-scale
pretrained vision models for new tasks has become increasingly parameter-intensive …