Deep limits of residual neural networks

M Thorpe, Y van Gennip - arXiv preprint arXiv:1810.11741, 2018 - arxiv.org
Neural networks have been very successful in many applications; we often, however, lack a
theoretical understanding of what the neural networks are actually learning. This problem …

Implicit regularization of deep residual networks towards neural ODEs

P Marion, YH Wu, ME Sander, G Biau - arXiv preprint arXiv:2309.01213, 2023 - arxiv.org
Residual neural networks are state-of-the-art deep learning models. Their continuous-depth
analogs, neural ordinary differential equations (ODEs), are also widely used. Despite their …
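
For context across several of these entries (Thorpe & van Gennip; Marion et al.; Kidger; Cohen et al.), the ResNet-to-ODE link they refer to is the Euler-discretization limit. A minimal sketch in notation of our own choosing (the 1/L step-size scaling is one common convention, not necessarily the one each paper adopts): a depth-L residual network updates its hidden state as

    x_{k+1} = x_k + (1/L) f(x_k, θ_k),   k = 0, …, L - 1,

and letting L → ∞ recovers the explicit Euler scheme for the neural ODE

    dx(t)/dt = f(x(t), θ(t)),   t ∈ [0, 1],

with x_0 the network input and x_L (resp. x(1)) the output.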

Overparameterization of deep ResNet: zero loss and mean-field analysis

Z Ding, S Chen, Q Li, SJ Wright - Journal of Machine Learning Research, 2022 - jmlr.org
Finding parameters in a deep neural network (NN) that fit training data is a nonconvex
optimization problem, but a basic first-order optimization method (gradient descent) finds a …

Generalization bounds for neural ordinary differential equations and deep residual networks

P Marion - Advances in Neural Information Processing …, 2024 - proceedings.neurips.cc
Neural ordinary differential equations (neural ODEs) are a popular family of continuous-
depth deep learning models. In this work, we consider a large family of parameterized ODEs …

Local minima in training of neural networks

G Swirszcz, WM Czarnecki, R Pascanu - arXiv preprint arXiv:1611.06310, 2016 - arxiv.org
There has been a lot of recent interest in trying to characterize the error surface of deep
models. This stems from a long-standing question. Given that deep networks are highly …

On neural differential equations

P Kidger - arXiv preprint arXiv:2202.02435, 2022 - arxiv.org
The conjoining of dynamical systems and deep learning has become a topic of great
interest. In particular, neural differential equations (NDEs) demonstrate that neural networks …

Gradient descent finds global minima of deep neural networks

S Du, J Lee, H Li, L Wang… - … Conference on Machine …, 2019 - proceedings.mlr.press
Gradient descent finds a global minimum in training deep neural networks despite the
objective function being non-convex. The current paper proves gradient descent achieves …

Scaling properties of deep residual networks

AS Cohen, R Cont, A Rossier… - … Conference on Machine …, 2021 - proceedings.mlr.press
Residual networks (ResNets) have displayed impressive results in pattern recognition and,
recently, have garnered considerable theoretical interest due to a perceived link with neural …

Deep learning without poor local minima

K Kawaguchi - Advances in Neural Information Processing …, 2016 - proceedings.neurips.cc
In this paper, we prove a conjecture published in 1989 and also partially address an open
problem announced at the Conference on Learning Theory (COLT) 2015. For an expected …

Collapse of deep and narrow neural nets

L Lu, Y Su, GE Karniadakis - arXiv preprint arXiv:1808.04947, 2018 - arxiv.org
Recent theoretical work has demonstrated that deep neural networks have superior
performance over shallow networks, but their training is more difficult; e.g., they suffer from the …