A too-good-to-be-true prior to reduce shortcut reliance

P Kirichenko, P Izmailov, AG Wilson - arXiv preprint arXiv:2204.02937, 2022 - arxiv.org

Neural network classifiers can largely rely on simple spurious features, such as
backgrounds, to make predictions. However, even in these cases, we show that they still …

被引用次数：215 相关文章所有 6 个版本

[PDF] arxiv.org

Contrastive decoding: Open-ended text generation as optimization

XL Li, A Holtzman, D Fried, P Liang, J Eisner… - arXiv preprint arXiv …, 2022 - arxiv.org

Given a language model (LM), maximum probability is a poor decoding objective for open-
ended generation, because it produces short and repetitive text. On the other hand …

被引用次数：133 相关文章所有 7 个版本

[PDF] mlr.press

Environment inference for invariant learning

E Creager, JH Jacobsen… - … Conference on Machine …, 2021 - proceedings.mlr.press

Learning models that gracefully handle distribution shifts is central to research on domain
generalization, robust optimization, and fairness. A promising formulation is domain …

被引用次数：318 相关文章所有 7 个版本

[PDF] mlr.press

Simple data balancing achieves competitive worst-group-accuracy

BY Idrissi, M Arjovsky, M Pezeshki… - … on Causal Learning …, 2022 - proceedings.mlr.press

We study the problem of learning classifiers that perform well across (known or unknown)
groups of data. After observing that common worst-group-accuracy datasets suffer from …

被引用次数：125 相关文章所有 4 个版本

[PDF] aston.ac.uk

Deep problems with neural network models of human vision

JS Bowers, G Malhotra, M Dujmović… - Behavioral and Brain …, 2023 - cambridge.org

Deep neural networks (DNNs) have had extraordinary successes in classifying
photographic images of objects and are often described as the best models of biological …

被引用次数：110 相关文章所有 17 个版本

[PDF] arxiv.org

Competency problems: On finding and removing artifacts in language data

M Gardner, W Merrill, J Dodge, ME Peters… - arXiv preprint arXiv …, 2021 - arxiv.org

Much recent work in NLP has documented dataset artifacts, bias, and spurious correlations
between input features and output labels. However, how to tell which features have" …

被引用次数：86 相关文章所有 7 个版本

[PDF] arxiv.org

Towards trustworthy and aligned machine learning: A data-centric survey with causality perspectives

H Liu, M Chaudhary, H Wang - arXiv preprint arXiv:2307.16851, 2023 - arxiv.org

The trustworthiness of machine learning has emerged as a critical topic in the field,
encompassing various applications and research areas such as robustness, security …

被引用次数：19 相关文章所有 3 个版本

[PDF] thecvf.com

Itsa: An information-theoretic approach to automatic shortcut avoidance and domain generalization in stereo matching networks

WQ Chuah, R Tennakoon… - Proceedings of the …, 2022 - openaccess.thecvf.com

State-of-the-art stereo matching networks trained only on synthetic data often fail to
generalize to more challenging real data domains. In this paper, we attempt to unfold an …

被引用次数：30 相关文章所有 11 个版本

[PDF] neurips.cc

Selecmix: Debiased learning by contradicting-pair sampling

I Hwang, S Lee, Y Kwak, SJ Oh… - Advances in …, 2022 - proceedings.neurips.cc

Neural networks trained with ERM (empirical risk minimization) sometimes learn unintended
decision rules, in particular when their training data is biased, ie, when training labels are …

被引用次数：17 相关文章所有 5 个版本

[PDF] arxiv.org

Focus on the common good: Group distributional robustness follows

V Piratla, P Netrapalli, S Sarawagi - arXiv preprint arXiv:2110.02619, 2021 - arxiv.org

We consider the problem of training a classification model with group annotated training
data. Recent work has established that, if there is distribution shift across different groups …

被引用次数：28 相关文章所有 5 个版本

高级搜索

QQ 群