Shortcut learning in deep neural networks

R Geirhos, JH Jacobsen, C Michaelis… - Nature Machine …, 2020 - nature.com
Deep learning has triggered the current rise of artificial intelligence and is the workhorse of
today's machine intelligence. Numerous success stories have rapidly spread all over …

Convolutional neural networks as a model of the visual system: Past, present, and future

GW Lindsay - Journal of cognitive neuroscience, 2021 - direct.mit.edu
Convolutional neural networks (CNNs) were inspired by early findings in the study of
biological vision. They have since become successful tools in computer vision and state-of …

Beyond neural scaling laws: beating power law scaling via data pruning

B Sorscher, R Geirhos, S Shekhar… - Advances in …, 2022 - proceedings.neurips.cc
Widely observed neural scaling laws, in which error falls off as a power of the training set
size, model size, or both, have driven substantial performance improvements in deep …

Test-time training with masked autoencoders

Y Gandelsman, Y Sun, X Chen… - Advances in Neural …, 2022 - proceedings.neurips.cc
Test-time training adapts to a new test distribution on the fly by optimizing a model for each
test input using self-supervision. In this paper, we use masked autoencoders for this one …

On the opportunities and risks of foundation models

R Bommasani, DA Hudson, E Adeli, R Altman… - arXiv preprint arXiv …, 2021 - arxiv.org
AI is undergoing a paradigm shift with the rise of models (eg, BERT, DALL-E, GPT-3) that are
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …

Robust and data-efficient generalization of self-supervised machine learning for diagnostic imaging

S Azizi, L Culp, J Freyberg, B Mustafa, S Baur… - Nature Biomedical …, 2023 - nature.com
Abstract Machine-learning models for medical tasks can match or surpass the performance
of clinical experts. However, in settings differing from those of the training dataset, the …

Robust fine-tuning of zero-shot models

M Wortsman, G Ilharco, JW Kim, M Li… - Proceedings of the …, 2022 - openaccess.thecvf.com
Large pre-trained models such as CLIP or ALIGN offer consistent accuracy across a range of
data distributions when performing zero-shot inference (ie, without fine-tuning on a specific …

Benchmarking robustness of 3d object detection to common corruptions

Y Dong, C Kang, J Zhang, Z Zhu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract 3D object detection is an important task in autonomous driving to perceive the
surroundings. Despite the excellent performance, the existing 3D detectors lack the …

Lips don't lie: A generalisable and robust approach to face forgery detection

A Haliassos, K Vougioukas… - Proceedings of the …, 2021 - openaccess.thecvf.com
Although current deep learning-based face forgery detectors achieve impressive
performance in constrained scenarios, they are vulnerable to samples created by unseen …

Tent: Fully test-time adaptation by entropy minimization

D Wang, E Shelhamer, S Liu, B Olshausen… - arXiv preprint arXiv …, 2020 - arxiv.org
A model must adapt itself to generalize to new and different data during testing. In this
setting of fully test-time adaptation the model has only the test data and its own parameters …