Overparameterization improves robustness to covariate shift in high dimensions

N Tripuraneni, B Adlam… - Advances in Neural …, 2021 - proceedings.neurips.cc
A significant obstacle in the development of robust machine learning models is covariate
shift, a form of distribution shift that occurs when the input distributions of the …

Uniform convergence of interpolators: Gaussian width, norm bounds and benign overfitting

F Koehler, L Zhou, DJ Sutherland… - Advances in Neural …, 2021 - proceedings.neurips.cc
We consider interpolation learning in high-dimensional linear regression with Gaussian
data, and prove a generic uniform convergence guarantee on the generalization error of …

The implicit bias of benign overfitting

O Shamir - Conference on Learning Theory, 2022 - proceedings.mlr.press
The phenomenon of benign overfitting, where a predictor perfectly fits noisy training data
while attaining low expected loss, has received much attention in recent years, but still …

On linear stability of SGD and input-smoothness of neural networks

C Ma, L Ying - Advances in Neural Information Processing …, 2021 - proceedings.neurips.cc
The multiplicative structure of parameters and input data in the first layer of neural networks
is explored to build a connection between the landscape of the loss function with respect to …

From tempered to benign overfitting in ReLU neural networks

G Kornowski, G Yehudai… - Advances in Neural …, 2024 - proceedings.neurips.cc
Overparameterized neural networks (NNs) are observed to generalize well even when
trained to perfectly fit noisy data. This phenomenon motivated a large body of work on "…

ResMem: Learn what you can and memorize the rest

Z Yang, M Lukasik, V Nagarajan, Z Li… - Advances in …, 2024 - proceedings.neurips.cc
The impressive generalization performance of modern neural networks is attributed in part to
their ability to implicitly memorize complex training patterns. Inspired by this, we explore a …

Covariate shift in high-dimensional random feature regression

N Tripuraneni, B Adlam, J Pennington - arXiv preprint arXiv:2111.08234, 2021 - arxiv.org
A significant obstacle in the development of robust machine learning models is covariate
shift, a form of distribution shift that occurs when the input distributions of the training and test …

How do noise tails impact on deep ReLU networks?

J Fan, Y Gu, WX Zhou - The Annals of Statistics, 2024 - projecteuclid.org
The Annals of Statistics 2024, Vol. 52, No. 4, 1845–1871. https://doi.org/10.1214/24-AOS2428
© Institute of Mathematical Statistics …

Deformed semicircle law and concentration of nonlinear random matrices for ultra-wide neural networks

Z Wang, Y Zhu - The Annals of Applied Probability, 2024 - projecteuclid.org
In this paper, we investigate a two-layer fully connected neural network of the form
f(X) = (1/√d₁) a⊤ σ(WX), where X ∈ R^{d₀×n} is a deterministic data matrix, W ∈ R^{d₁×d₀} and a ∈ R^{d₁} …
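The two-layer form above can be sketched directly in NumPy. This is a minimal illustration, not the paper's experimental setup: the abstract treats X as deterministic and studies the spectrum of σ(WX) for random W and a; here the weights are drawn Gaussian and σ = tanh is an assumed choice of activation, purely for demonstration.

```python
import numpy as np

def two_layer_network(X, W, a, sigma=np.tanh):
    """Evaluate f(X) = (1/sqrt(d1)) * a^T sigma(W X).

    X: (d0, n) data matrix, W: (d1, d0) first-layer weights,
    a: (d1,) second-layer weights, sigma: elementwise activation.
    Returns a length-n vector of network outputs.
    """
    d1 = W.shape[0]
    return (a @ sigma(W @ X)) / np.sqrt(d1)

# Illustrative dimensions; the paper's "ultra-wide" regime takes d1 >> n, d0.
rng = np.random.default_rng(0)
d0, d1, n = 5, 1000, 8
X = rng.standard_normal((d0, n))   # deterministic in the paper; random here for the demo
W = rng.standard_normal((d1, d0))  # assumed Gaussian initialization
a = rng.standard_normal(d1)

out = two_layer_network(X, W, a)   # shape (n,)
```

The 1/√d₁ factor keeps the output variance O(1) as the width d₁ grows, which is what makes the wide-network limit well defined.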

CounterCLR: Counterfactual contrastive learning with non-random missing data in recommendation

J Wang, H Li, C Zhang, D Liang, E Yu… - … Conference on Data …, 2023 - ieeexplore.ieee.org
Recommender systems are designed to learn user preferences from observed feedback and
comprise many fundamental tasks, such as rating prediction and post-click conversion rate …