Visual recognition with deep nearest centroids

C Liang, W Wang, J Miao… - Advances in Neural …, 2022 - proceedings.neurips.cc

Prevalent semantic segmentation solutions are, in essence, a dense discriminative classifier
of p(class | pixel feature). Though straightforward, this de facto paradigm neglects the …

Cited by 71 | Related articles | All 9 versions

Learning equivariant segmentation with instance-unique querying

W Wang, J Liang, D Liu - Advances in Neural Information …, 2022 - proceedings.neurips.cc

Prevalent state-of-the-art instance segmentation methods fall into a query-based scheme, in
which instance masks are derived by querying the image feature using a set of instance …

Cited by 69 | Related articles | All 5 versions

Transflow: Transformer as flow learner

Y Lu, Q Wang, S Ma, T Geng… - Proceedings of the …, 2023 - openaccess.thecvf.com

Optical flow is an indispensable building block for various important computer vision tasks,
including motion estimation, object tracking, and disparity measurement. In this work, we …

Cited by 46 | Related articles | All 6 versions

Not all features matter: Enhancing few-shot clip with adaptive prior refinement

X Zhu, R Zhang, B He, A Zhou… - Proceedings of the …, 2023 - openaccess.thecvf.com

The popularity of Contrastive Language-Image Pre-training (CLIP) has propelled its
application to diverse downstream vision tasks. To improve its capacity on downstream …

Cited by 34 | Related articles | All 5 versions

Clustseg: Clustering for universal segmentation

J Liang, T Zhou, D Liu, W Wang - arXiv preprint arXiv:2305.02187, 2023 - arxiv.org

We present CLUSTSEG, a general, transformer-based framework that tackles different
image segmentation tasks (i.e., superpixel, semantic, instance, and panoptic) through a …

Cited by 63 | Related articles | All 5 versions

ClusterFormer: clustering as a universal visual learner

J Liang, Y Cui, Q Wang, T Geng… - Advances in neural …, 2024 - proceedings.neurips.cc

This paper presents ClusterFormer, a universal vision model that is based on the Clustering
paradigm with TransFormer. It comprises two novel designs: 1) recurrent cross-attention …

Cited by 19 | Related articles | All 5 versions

Differential feature awareness network within antagonistic learning for infrared-visible object detection

R Zhang, L Li, Q Zhang, J Zhang, L Xu… - … on Circuits and …, 2023 - ieeexplore.ieee.org

The combination of infrared and visible videos aims to gather more comprehensive feature
information from multiple sources and reach superior results on various practical tasks, such …

Cited by 41 | Related articles

Unified 3d segmenter as prototypical classifiers

Z Qin, C Han, Q Wang, X Nie, Y Yin… - Advances in Neural …, 2023 - proceedings.neurips.cc

The task of point cloud segmentation, comprising semantic, instance, and panoptic
segmentation, has been mainly tackled by designing task-specific network architectures …

Cited by 11 | Related articles | All 3 versions

Federated graph learning under domain shift with generalizable prototypes

G Wan, W Huang, M Ye - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org

Federated Graph Learning is a privacy-preserving collaborative approach for training a
shared model on graph-structured data in the distributed environment. However, in real …

Cited by 8 | Related articles

E^2VPT: An Effective and Efficient Approach for Visual Prompt Tuning

C Han, Q Wang, Y Cui, Z Cao, W Wang, S Qi… - arXiv preprint arXiv …, 2023 - arxiv.org

As the size of transformer-based models continues to grow, fine-tuning these large-scale
pretrained vision models for new tasks has become increasingly parameter-intensive …

Cited by 23 | Related articles | All 5 versions
