Neural network classifiers can largely rely on simple spurious features, such as backgrounds, to make predictions. However, even in these cases, we show that they still …
A common approach to transfer learning under distribution shift is to fine-tune the last few layers of a pre-trained model, preserving learned features while also adapting to the new …
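A minimal PyTorch sketch of the strategy this snippet describes, under assumptions of our own (the snippet names no architecture; resnet18 and the 10-class head are illustrative): freeze the pre-trained backbone so its features are preserved, and train only a fresh final layer.

```python
import torch
import torchvision.models as models

# Illustrative pre-trained backbone; the snippet above does not specify one.
model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)

# Freeze every parameter to preserve the learned features.
for param in model.parameters():
    param.requires_grad = False

# Replace the final fully connected layer with a trainable head for the
# new task (a hypothetical 10-class problem).
num_classes = 10
model.fc = torch.nn.Linear(model.fc.in_features, num_classes)

# Only the new head's parameters are given to the optimizer.
optimizer = torch.optim.SGD(model.fc.parameters(), lr=1e-3, momentum=0.9)
```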
Deep classifiers are known to rely on spurious features—patterns which are correlated with the target on the training data but not inherently relevant to the learning problem, such as the …
Out-of-Distribution (OOD) detection uses a model to separate In-Distribution (ID) data from OOD data among its inputs. This problem has attracted increasing attention in the area of …
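One standard baseline for this task, sketched here as our own illustration rather than the method of the snippet above, scores each input by the model's maximum softmax probability (Hendrycks & Gimpel, 2017); inputs with low confidence are flagged as OOD. The threshold is an assumed hyperparameter.

```python
import torch
import torch.nn.functional as F

def msp_score(model, x):
    """Maximum softmax probability: higher means more ID-like."""
    with torch.no_grad():
        logits = model(x)
        probs = F.softmax(logits, dim=-1)
    return probs.max(dim=-1).values

def detect_ood(model, x, threshold=0.5):
    """Flag inputs whose confidence falls below the assumed threshold."""
    return msp_score(model, x) < threshold  # True = predicted OOD
```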
We study the problem of learning causal representations from unknown, latent interventions in a general setting, where the latent distribution is Gaussian but the mixing function is …
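A worked form of the setting this snippet describes, in notation we introduce for illustration (the symbols z, f, x are ours, not quoted from the abstract): Gaussian latents, an unknown mixing function, and latent interventions that shift one coordinate's distribution.

```latex
% Illustrative formulation (our notation, not the paper's):
% observations x arise from Gaussian latents z via an unknown mixing f.
z \sim \mathcal{N}(\mu, \Sigma), \qquad x = f(z)
% An unknown latent intervention alters the mechanism of a single
% coordinate z_i, shifting its mean and variance:
z_i \sim \mathcal{N}(\tilde{\mu}_i, \tilde{\sigma}_i^2)
```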
Shortcut Learning of Large Language Models in Natural Language Understanding. Communications of the ACM, Vol. 67, No. 1 (January 2024), p. 110.
Machine learning models often perform poorly on subgroups that are underrepresented in the training data. Yet, little is understood about the variation in mechanisms that cause …
We study neural network loss landscapes through the lens of mode connectivity, the observation that minimizers of neural networks retrieved via training on a dataset are …
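A minimal sketch of how mode connectivity is commonly probed, given as our own illustration rather than the procedure of the paper above: linearly interpolate between the weights of two independently trained networks and evaluate the loss along the path. Low loss at every interpolation point suggests the two minimizers are (linearly) connected.

```python
import copy
import torch

def interpolate_weights(model_a, model_b, alpha):
    """Return a model whose weights are (1 - alpha) * A + alpha * B."""
    model = copy.deepcopy(model_a)
    state_a, state_b = model_a.state_dict(), model_b.state_dict()
    mixed = {k: (1 - alpha) * state_a[k] + alpha * state_b[k] for k in state_a}
    model.load_state_dict(mixed)
    return model

def loss_along_path(model_a, model_b, loss_fn, x, y, steps=11):
    """Evaluate the loss at evenly spaced points between two minimizers."""
    losses = []
    for i in range(steps):
        alpha = i / (steps - 1)
        model = interpolate_weights(model_a, model_b, alpha)
        model.eval()
        with torch.no_grad():
            losses.append(loss_fn(model(x), y).item())
    return losses
```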
Recently, there has been a surge of interest in enabling machine learning systems to generalize well to Out-of-Distribution (OOD) data. Most efforts are devoted to advancing …