Z Wang, Z Wu, D Agarwal, J Sun - arXiv preprint arXiv:2210.10163, 2022 - arxiv.org
Existing vision-text contrastive learning like CLIP aims to match the paired image and caption embeddings while pushing others apart, which improves representation …
Learning with few labeled data has been a longstanding problem in the computer vision and machine learning research community. In this paper, we introduced a new semi-supervised …
Benefiting from the large-scale archiving of digitized whole-slide images (WSIs), computer- aided diagnosis has been well developed to assist pathologists in decision-making. Content …
N Pu, Z Zhong, N Sebe - … of the IEEE/CVF conference on …, 2023 - openaccess.thecvf.com
Generalized category discovery (GCD) is a recently proposed open-world problem, which aims to automatically cluster partially labeled data. The main challenge is that the unlabeled …
M El Banani, K Desai… - Proceedings of the ieee …, 2023 - openaccess.thecvf.com
Although an object may appear in numerous contexts, we often describe it in a limited number of ways. Language allows us to abstract away visual variation to represent and …
H Basak, Z Yin - Proceedings of the IEEE/CVF conference …, 2023 - openaccess.thecvf.com
Although recent works in semi-supervised learning (SemiSL) have accomplished significant success in natural image segmentation, the task of learning discriminative representations …
S Zhang, S Khan, Z Shen, M Naseer… - Proceedings of the …, 2023 - openaccess.thecvf.com
Although existing semi-supervised learning models achieve remarkable success in learning with unannotated in-distribution data, they mostly fail to learn on unlabeled data sampled …
We present an efficient approach for Masked Image Modeling (MIM) with hierarchical Vision Transformers (ViTs), allowing the hierarchical ViTs to discard masked patches and operate …
Self-supervised Learning (SSL) including the mainstream contrastive learning has achieved great success in learning visual representations without data annotations. However, most of …