When does contrastive visual representation learning work?

[HTML][HTML] Self-supervised learning for medical image classification: a systematic review and implementation guidelines

SC Huang, A Pareek, M Jensen, MP Lungren… - NPJ Digital …, 2023 - nature.com

Advancements in deep learning and computer vision provide promising solutions for
medical image analysis, potentially improving healthcare and patient outcomes. However …

被引用次数：123 相关文章所有 8 个版本

[PDF] arxiv.org

On the opportunities and risks of foundation models

R Bommasani, DA Hudson, E Adeli, R Altman… - arXiv preprint arXiv …, 2021 - arxiv.org

AI is undergoing a paradigm shift with the rise of models (eg, BERT, DALL-E, GPT-3) that are
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …

被引用次数：3613 相关文章所有 2 个版本

[PDF] arxiv.org

Fine-grained image analysis with deep learning: A survey

XS Wei, YZ Song, O Mac Aodha, J Wu… - IEEE transactions on …, 2021 - ieeexplore.ieee.org

Fine-grained image analysis (FGIA) is a longstanding and fundamental problem in computer
vision and pattern recognition, and underpins a diverse set of real-world applications. The …

被引用次数：259 相关文章所有 8 个版本

[PDF] arxiv.org

Self-supervised learning is more robust to dataset imbalance

H Liu, JZ HaoChen, A Gaidon, T Ma - arXiv preprint arXiv:2110.05025, 2021 - arxiv.org

Self-supervised learning (SSL) is a scalable way to learn general visual representations
since it learns without labels. However, large-scale unlabeled datasets in the wild often have …

被引用次数：158 相关文章所有 4 个版本

[PDF] arxiv.org

Extending the wilds benchmark for unsupervised adaptation

S Sagawa, PW Koh, T Lee, I Gao, SM Xie… - arXiv preprint arXiv …, 2021 - arxiv.org

Machine learning systems deployed in the wild are often trained on a source distribution but
deployed on a different target distribution. Unlabeled data can be a powerful point of …

被引用次数：116 相关文章所有 10 个版本

[PDF] arxiv.org

SSL4EO-S12: A large-scale multimodal, multitemporal dataset for self-supervised learning in Earth observation [Software and Data Sets]

Y Wang, NAA Braham, Z Xiong, C Liu… - … and Remote Sensing …, 2023 - ieeexplore.ieee.org

Self-supervised pretraining bears the potential to generate expressive representations from
large-scale Earth observation (EO) data without human annotation. However, most existing …

被引用次数：69 相关文章所有 4 个版本

[PDF] thecvf.com

Bioclip: A vision foundation model for the tree of life

S Stevens, J Wu, MJ Thompson… - Proceedings of the …, 2024 - openaccess.thecvf.com

Images of the natural world collected by a variety of cameras from drones to individual
phones are increasingly abundant sources of biological information. There is an explosion …

被引用次数：19 相关文章所有 4 个版本

[PDF] thecvf.com

Teaching matters: Investigating the role of supervision in vision transformers

M Walmer, S Suri, K Gupta… - Proceedings of the …, 2023 - openaccess.thecvf.com

Abstract Vision Transformers (ViTs) have gained significant popularity in recent years and
have proliferated into many applications. However, their behavior under different learning …

被引用次数：24 相关文章所有 7 个版本

[PDF] arxiv.org

From global to local: Multi-scale out-of-distribution detection

J Zhang, L Gao, B Hao, H Huang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Out-of-distribution (OOD) detection aims to detect “unknown” data whose labels have not
been seen during the in-distribution (ID) training process. Recent progress in representation …

被引用次数：18 相关文章所有 7 个版本

[PDF] arxiv.org

Large-scale unsupervised semantic segmentation

S Gao, ZY Li, MH Yang, MM Cheng… - IEEE transactions on …, 2022 - ieeexplore.ieee.org

Empowered by large datasets, eg, ImageNet and MS COCO, unsupervised learning on
large-scale data has enabled significant advances for classification tasks. However, whether …

被引用次数：76 相关文章所有 12 个版本

高级搜索

QQ 群