[HTML][HTML] Self-supervised learning for medical image classification: a systematic review and implementation guidelines

SC Huang, A Pareek, M Jensen, MP Lungren… - NPJ Digital …, 2023 - nature.com
Advancements in deep learning and computer vision provide promising solutions for
medical image analysis, potentially improving healthcare and patient outcomes. However …

On the opportunities and risks of foundation models

R Bommasani, DA Hudson, E Adeli, R Altman… - arXiv preprint arXiv …, 2021 - arxiv.org
AI is undergoing a paradigm shift with the rise of models (eg, BERT, DALL-E, GPT-3) that are
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …

Fine-grained image analysis with deep learning: A survey

XS Wei, YZ Song, O Mac Aodha, J Wu… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
Fine-grained image analysis (FGIA) is a longstanding and fundamental problem in computer
vision and pattern recognition, and underpins a diverse set of real-world applications. The …

Self-supervised learning is more robust to dataset imbalance

H Liu, JZ HaoChen, A Gaidon, T Ma - arXiv preprint arXiv:2110.05025, 2021 - arxiv.org
Self-supervised learning (SSL) is a scalable way to learn general visual representations
since it learns without labels. However, large-scale unlabeled datasets in the wild often have …

Extending the wilds benchmark for unsupervised adaptation

S Sagawa, PW Koh, T Lee, I Gao, SM Xie… - arXiv preprint arXiv …, 2021 - arxiv.org
Machine learning systems deployed in the wild are often trained on a source distribution but
deployed on a different target distribution. Unlabeled data can be a powerful point of …

SSL4EO-S12: A large-scale multimodal, multitemporal dataset for self-supervised learning in Earth observation [Software and Data Sets]

Y Wang, NAA Braham, Z Xiong, C Liu… - … and Remote Sensing …, 2023 - ieeexplore.ieee.org
Self-supervised pretraining bears the potential to generate expressive representations from
large-scale Earth observation (EO) data without human annotation. However, most existing …

Bioclip: A vision foundation model for the tree of life

S Stevens, J Wu, MJ Thompson… - Proceedings of the …, 2024 - openaccess.thecvf.com
Images of the natural world collected by a variety of cameras from drones to individual
phones are increasingly abundant sources of biological information. There is an explosion …

Teaching matters: Investigating the role of supervision in vision transformers

M Walmer, S Suri, K Gupta… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract Vision Transformers (ViTs) have gained significant popularity in recent years and
have proliferated into many applications. However, their behavior under different learning …

From global to local: Multi-scale out-of-distribution detection

J Zhang, L Gao, B Hao, H Huang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Out-of-distribution (OOD) detection aims to detect “unknown” data whose labels have not
been seen during the in-distribution (ID) training process. Recent progress in representation …

Large-scale unsupervised semantic segmentation

S Gao, ZY Li, MH Yang, MM Cheng… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Empowered by large datasets, eg, ImageNet and MS COCO, unsupervised learning on
large-scale data has enabled significant advances for classification tasks. However, whether …