I Bica, A Ilić, M Bauer, G Erdogan, M Bošnjak… - arXiv preprint arXiv …, 2024 - arxiv.org
We introduce SPARse Fine-grained Contrastive Alignment (SPARC), a simple method for
pretraining more fine-grained multimodal representations from image-text pairs. Given that …