On the implicit bias in deep-learning algorithms

G Vardi - Communications of the ACM, 2023 - dl.acm.org
Deep learning has been highly successful in recent years and has led to dramatic improvements in multiple domains …

Implicit regularization towards rank minimization in ReLU networks

N Timor, G Vardi, O Shamir - International Conference on …, 2023 - proceedings.mlr.press
We study the conjectured relationship between the implicit regularization in neural networks,
trained with gradient-based methods, and rank minimization of their weight matrices …
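
As a concrete handle on the quantity in question, one can train a small ReLU network with gradient descent and inspect the singular values of its weight matrices. The sketch below is my own minimal illustration, not the paper's experiments; the data, architecture, and hyperparameters are all illustrative.

```python
# Minimal sketch (illustrative only): train a two-layer ReLU network with
# full-batch gradient descent on a single-direction teacher and inspect how
# concentrated the spectrum of the first weight matrix is.
import torch

torch.manual_seed(0)
d, width, n = 20, 100, 256
X = torch.randn(n, d)
teacher = torch.randn(d, 1)            # target depends on one direction only
y = torch.relu(X @ teacher)

model = torch.nn.Sequential(
    torch.nn.Linear(d, width, bias=False),
    torch.nn.ReLU(),
    torch.nn.Linear(width, 1, bias=False),
)
opt = torch.optim.SGD(model.parameters(), lr=1e-2)
for _ in range(5000):
    opt.zero_grad()
    loss = torch.nn.functional.mse_loss(model(X), y)
    loss.backward()
    opt.step()

W1 = model[0].weight.detach()
s = torch.linalg.svdvals(W1)           # singular values, largest first
print("normalized singular values of W1:", (s[:5] / s[0]).tolist())
```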

Implicit bias of large depth networks: a notion of rank for nonlinear functions

A Jacot - The Eleventh International Conference on Learning …, 2023 - openreview.net
We show that the representation cost of fully connected neural networks with homogeneous
nonlinearities, which describes the implicit bias in function space of networks with $L_2$ …
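
For context, the representation cost referred to here is standardly defined as the minimal squared parameter norm among all depth-$L$ networks realizing a given function; the display below is a paraphrase of that definition and of the depth-normalized limit motivating the paper's notion of rank, not the paper's exact statement or assumptions.

```latex
% Representation cost of f over depth-L networks f_\theta (paraphrase; assumptions omitted):
R_L(f) \;=\; \min_{\theta \,:\, f_\theta = f} \|\theta\|_2^2,
\qquad
\frac{R_L(f)}{L} \;\xrightarrow{\,L \to \infty\,}\; \operatorname{Rank}(f).
```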

Bottleneck structure in learned features: Low-dimension vs regularity tradeoff

A Jacot - Advances in Neural Information Processing …, 2023 - proceedings.neurips.cc
Previous work has shown that DNNs with large depth $L$ and $L_2$-regularization are
biased towards learning low-dimensional representations of the inputs, which can be …

Smoothing the edges: a general framework for smooth optimization in sparse regularization using Hadamard overparametrization

C Kolb, CL Müller, B Bischl… - arXiv preprint arXiv …, 2023 - researchgate.net
This paper presents a framework for smooth optimization of objectives with $\ell_q$ and $\ell_{p,q}$
regularization for (structured) sparsity. Finding solutions to these non-smooth and possibly …
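
The identity behind this framework is that a Hadamard-product reparametrization turns a non-smooth sparsity penalty into a smooth one: writing $w = u \odot v$, one has $\min_{u \odot v = w} \tfrac{1}{2}(\|u\|_2^2 + \|v\|_2^2) = \|w\|_1$, so a squared-$\ell_2$ penalty on $(u, v)$ induces an $\ell_1$ penalty on $w$. The sketch below is my own minimal illustration of that idea for an $\ell_1$-penalized least-squares problem (the paper treats the more general $\ell_q$ and $\ell_{p,q}$ cases); all names and hyperparameters are illustrative.

```python
# Hedged sketch: smooth surrogate of the lasso via Hadamard overparametrization.
# Objective: ||X(u*v) - y||^2 / (2n) + (lam/2) * (||u||^2 + ||v||^2),
# whose minimum matches ||Xw - y||^2 / (2n) + lam * ||w||_1 with w = u*v.
import numpy as np

rng = np.random.default_rng(0)
n, d, lam = 100, 50, 0.1
X = rng.standard_normal((n, d))
w_true = np.zeros(d); w_true[:3] = [2.0, -1.5, 1.0]   # sparse ground truth
y = X @ w_true + 0.01 * rng.standard_normal(n)

u = 0.1 * np.ones(d)
v = 0.1 * np.ones(d)
lr = 1e-3
for _ in range(20000):
    w = u * v
    g = X.T @ (X @ w - y) / n          # gradient of the data term w.r.t. w
    gu = g * v + lam * u               # chain rule through w = u * v
    gv = g * u + lam * v
    u -= lr * gu
    v -= lr * gv

print("recovered w (rounded):", np.round(u * v, 2))   # near-sparse solution
```

Because the reparametrized objective is differentiable, plain gradient descent (or any smooth optimizer) applies directly, and the product $u \odot v$ ends up approximately sparse.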

Linear neural network layers promote learning single- and multiple-index models

S Parkinson, G Ongie, R Willett - arXiv preprint arXiv:2305.15598, 2023 - arxiv.org
This paper explores the implicit bias of overparameterized neural networks of depth greater
than two layers. Our framework considers a family of networks of varying depths that all have …
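
For readers unfamiliar with the terminology, single- and multiple-index models are the standard target classes in this line of work; the definitions below are textbook ones added for context, not this paper's specific setup.

```latex
% Single-index model: the response depends on one linear projection of the input.
f(x) = g(\langle w, x \rangle), \qquad w \in \mathbb{R}^d ;
% Multiple-index model: dependence on a small number k of projections.
f(x) = g(W x), \qquad W \in \mathbb{R}^{k \times d}, \; k \ll d .
```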

Feature Learning in $L_2$-regularized DNNs: Attraction/Repulsion and Sparsity

A Jacot, E Golikov, C Hongler… - Advances in Neural …, 2022 - proceedings.neurips.cc
We study the loss surface of DNNs with $L_2$ regularization. We show that the loss in
terms of the parameters can be reformulated into a loss in terms of the layerwise activations …

Path regularization: A convexity and sparsity inducing regularization for parallel ReLU networks

T Ergen, M Pilanci - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Understanding the fundamental principles behind the success of deep neural networks is
one of the most important open questions in the current literature. To this end, we study the …
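
As background, a path regularizer (path norm) sums, over all input-to-output paths in the network, the product of the absolute values of the weights along the path; for a two-layer ReLU network it reduces to the expression below. This is the standard definition, included for context; the paper's parallel-network formulation and its convexity and sparsity results are more specific.

```latex
% Standard path norm of a two-layer ReLU network f(x) = \sum_j v_j \, \sigma(w_j^\top x):
\|f\|_{\mathrm{path}} \;=\; \sum_{j} \sum_{i} |v_j|\,|w_{ji}| \;=\; \sum_{j} |v_j|\,\|w_j\|_1 .
```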

Inductive bias of multi-channel linear convolutional networks with bounded weight norm

M Jagadeesan, I Razenshteyn… - … on Learning Theory, 2022 - proceedings.mlr.press
We provide a function space characterization of the inductive bias resulting from minimizing
the $\ell_2$ norm of the weights in multi-channel convolutional neural networks with linear …

Linear Recursive Feature Machines provably recover low-rank matrices

A Radhakrishnan, M Belkin, D Drusvyatskiy - arXiv preprint arXiv …, 2024 - arxiv.org
A fundamental problem in machine learning is to understand how neural networks make
accurate predictions, while seemingly bypassing the curse of dimensionality. A possible …
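
A minimal numerical sketch of a linear recursive-feature-machine loop (my simplification, not the paper's matrix-recovery setup): alternate ridge regression on inputs reweighted by a feature matrix M with updating M to the average gradient outer product (AGOP) of the fitted predictor. For a linear predictor the gradient is constant, so the AGOP is rank one and M collapses onto the signal direction; all names and hyperparameters below are illustrative.

```python
# Schematic linear RFM loop (simplified): ridge regression on M-reweighted
# inputs, then update M to the average gradient outer product of f(x) = w^T x.
import numpy as np

rng = np.random.default_rng(0)
n, d = 200, 30
w_star = np.zeros(d); w_star[:2] = [1.0, -2.0]        # signal in two coordinates
X = rng.standard_normal((n, d))
y = X @ w_star

M = np.eye(d)
for _ in range(10):
    Xm = X @ M                                        # reweighted inputs
    a = np.linalg.solve(Xm.T @ Xm + 1e-3 * np.eye(d), Xm.T @ y)
    w = M @ a                                         # predictor f(x) = w^T x
    M = np.outer(w, w)                                # AGOP: grad f(x) = w for all x
    M /= np.trace(M) + 1e-12                          # normalize for stability

top = np.linalg.eigvalsh(M)[-3:]                      # three largest eigenvalues
align = abs((w / np.linalg.norm(w)) @ (w_star / np.linalg.norm(w_star)))
print("top eigenvalues of M:", np.round(top, 3), "alignment with signal:", round(align, 3))
```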