Abstract We present Contextualized Local Visual Embeddings (CLoVE), a self-supervised convolutional-based method that learns representations suited for dense prediction tasks …
This paper introduces a novel approach to improving the training stability of self-supervised learning (SSL) methods by leveraging a non-parametric memory of seen concepts. The …
E Mannix, H Bondell - arXiv preprint arXiv:2403.04125, 2024 - arxiv.org
Interpretable computer vision models can produce transparent predictions, where the features of an image are compared with prototypes from a training dataset and the similarity …
T Silva, AR Rivera - arXiv preprint arXiv:2310.12692, 2023 - arxiv.org
We present Consistent Assignment of Views over Random Partitions (CARP), a self- supervised clustering method for representation learning of visual features. CARP learns …