Last layer re-training is sufficient for robustness to spurious correlations

P Kirichenko, P Izmailov, AG Wilson - arXiv preprint arXiv:2204.02937, 2022 - arxiv.org
Neural network classifiers can largely rely on simple spurious features, such as
backgrounds, to make predictions. However, even in these cases, we show that they still …

Contrastive decoding: Open-ended text generation as optimization

XL Li, A Holtzman, D Fried, P Liang, J Eisner… - arXiv preprint arXiv …, 2022 - arxiv.org
Given a language model (LM), maximum probability is a poor decoding objective for open-
ended generation, because it produces short and repetitive text. On the other hand …

Environment inference for invariant learning

E Creager, JH Jacobsen… - … Conference on Machine …, 2021 - proceedings.mlr.press
Learning models that gracefully handle distribution shifts is central to research on domain
generalization, robust optimization, and fairness. A promising formulation is domain …

Simple data balancing achieves competitive worst-group-accuracy

BY Idrissi, M Arjovsky, M Pezeshki… - … on Causal Learning …, 2022 - proceedings.mlr.press
We study the problem of learning classifiers that perform well across (known or unknown)
groups of data. After observing that common worst-group-accuracy datasets suffer from …

Deep problems with neural network models of human vision

JS Bowers, G Malhotra, M Dujmović… - Behavioral and Brain …, 2023 - cambridge.org
Deep neural networks (DNNs) have had extraordinary successes in classifying
photographic images of objects and are often described as the best models of biological …

Competency problems: On finding and removing artifacts in language data

M Gardner, W Merrill, J Dodge, ME Peters… - arXiv preprint arXiv …, 2021 - arxiv.org
Much recent work in NLP has documented dataset artifacts, bias, and spurious correlations
between input features and output labels. However, how to tell which features have" …

Towards trustworthy and aligned machine learning: A data-centric survey with causality perspectives

H Liu, M Chaudhary, H Wang - arXiv preprint arXiv:2307.16851, 2023 - arxiv.org
The trustworthiness of machine learning has emerged as a critical topic in the field,
encompassing various applications and research areas such as robustness, security …

Itsa: An information-theoretic approach to automatic shortcut avoidance and domain generalization in stereo matching networks

WQ Chuah, R Tennakoon… - Proceedings of the …, 2022 - openaccess.thecvf.com
State-of-the-art stereo matching networks trained only on synthetic data often fail to
generalize to more challenging real data domains. In this paper, we attempt to unfold an …

Selecmix: Debiased learning by contradicting-pair sampling

I Hwang, S Lee, Y Kwak, SJ Oh… - Advances in …, 2022 - proceedings.neurips.cc
Neural networks trained with ERM (empirical risk minimization) sometimes learn unintended
decision rules, in particular when their training data is biased, ie, when training labels are …

Focus on the common good: Group distributional robustness follows

V Piratla, P Netrapalli, S Sarawagi - arXiv preprint arXiv:2110.02619, 2021 - arxiv.org
We consider the problem of training a classification model with group annotated training
data. Recent work has established that, if there is distribution shift across different groups …