Implicit regularization leads to benign overfitting for sparse linear regression

文章

学术资源搜索

获得 2 条结果（用时0.02秒）

我的图书馆

Implicit regularization leads to benign overfitting for sparse linear regression

在引用文章中搜索

[PDF] arxiv.org

Unveil benign overfitting for transformer in vision: Training dynamics, convergence, and generalization

J Jiang, W Huang, M Zhang, T Suzuki, L Nie - arXiv preprint arXiv …, 2024 - arxiv.org

Transformers have demonstrated great power in the recent development of large
foundational models. In particular, the Vision Transformer (ViT) has brought revolutionary …

被引用次数：3 相关文章所有 3 个版本

[PDF] arxiv.org

Implicit Regularization of Gradient Flow on One-Layer Softmax Attention

H Sheen, S Chen, T Wang, HH Zhou - arXiv preprint arXiv:2403.08699, 2024 - arxiv.org

We study gradient flow on the exponential loss for a classification problem with a one-layer
softmax attention model, where the key and query weight matrices are trained separately …

被引用次数：7 相关文章所有 2 个版本

高级搜索

QQ 群

Implicit regularization leads to benign overfitting for sparse linear regression

Unveil benign overfitting for transformer in vision: Training dynamics, convergence, and generalization

Implicit Regularization of Gradient Flow on One-Layer Softmax Attention

引用