Provable guarantees for nonlinear feature learning in three-layer neural networks

B Moniri, D Lee, H Hassani, E Dobriban - arXiv preprint arXiv:2310.07891, 2023 - arxiv.org

Feature learning is thought to be one of the fundamental reasons for the success of deep
neural networks. It is rigorously known that in two-layer fully-connected neural networks …

Cited by 23 · Related articles · All 5 versions

[PDF] arxiv.org

Provable multi-task representation learning by two-layer ReLU neural networks

L Collins, H Hassani, M Soltanolkotabi… - arXiv preprint arXiv …, 2023 - arxiv.org

Feature learning, i.e., extracting meaningful representations of data, is quintessential to the
practical success of neural networks trained with gradient descent, yet it is notoriously …

Cited by 10 · Related articles · All 3 versions

[PDF] arxiv.org

Learning hierarchical polynomials with three-layer neural networks

Z Wang, E Nichani, JD Lee - arXiv preprint arXiv:2311.13774, 2023 - arxiv.org

We study the problem of learning hierarchical polynomials over the standard Gaussian
distribution with three-layer neural networks. We specifically consider target functions of the …

Cited by 10 · Related articles · All 4 versions

[PDF] arxiv.org

Neural network learns low-dimensional polynomials with SGD near the information-theoretic limit

JD Lee, K Oko, T Suzuki, D Wu - arXiv preprint arXiv:2406.01581, 2024 - arxiv.org

We study the problem of gradient descent learning of a single-index target function
$f_*(\boldsymbol{x})=\textstyle\sigma_*\left(\langle\boldsymbol{x},\boldsymbol …

Cited by 11 · Related articles · All 3 versions

[PDF] arxiv.org

Gradient descent induces alignment between weights and the empirical NTK for deep non-linear networks

D Beaglehole, I Mitliagkas, A Agarwala - arXiv preprint arXiv:2402.05271, 2024 - arxiv.org

Understanding the mechanisms through which neural networks extract statistics from
input-label pairs is one of the most important unsolved problems in supervised learning. Prior …

Cited by 3 · Related articles · All 2 versions

A novel domain adaptation method with physical constraints for shale gas production forecasting

L Gou, Z Yang, C Min, D Yi, X Li, B Kong - Applied Energy, 2024 - Elsevier

Effective forecasting of shale gas production is essential for optimizing exploration strategies
and guiding subsequent fracturing. However, in the new development of shale gas blocks …

Cited by 5 · Related articles

[PDF] openreview.net

Feature learning as alignment: a structural property of gradient descent in non-linear neural networks

D Beaglehole, I Mitliagkas… - Transactions on Machine …, 2024 - openreview.net

Understanding the mechanisms through which neural networks extract statistics from
input-label pairs through feature learning is one of the most important unsolved problems in …

Cited by 1 · Related articles · All 2 versions
