The modern mathematics of deep learning

J Berner, P Grohs, G Kutyniok… - arXiv preprint arXiv …, 2021 - cambridge.org
We describe the new field of the mathematical analysis of deep learning. This field emerged
around a list of research questions that were not answered within the classical framework of …

The unbearable shallow understanding of deep learning

A Plebe, G Grasso - Minds and Machines, 2019 - Springer
This paper analyzes the rapid and unexpected rise of deep learning within Artificial
Intelligence and its applications. It tackles the possible reasons for this remarkable success …

Deep learning without poor local minima

K Kawaguchi - Advances in neural information processing …, 2016 - proceedings.neurips.cc
In this paper, we prove a conjecture published in 1989 and also partially address an open
problem announced at the Conference on Learning Theory (COLT) 2015. For an expected …

Where is the information in a deep neural network?

A Achille, G Paolini, S Soatto - arXiv preprint arXiv:1905.12213, 2019 - arxiv.org
Whatever information a deep neural network has gleaned from training data is encoded in
its weights. How this information affects the response of the network to future data remains …

Sharp minima can generalize for deep nets

L Dinh, R Pascanu, S Bengio… - … Conference on Machine …, 2017 - proceedings.mlr.press
Despite their overwhelming capacity to overfit, deep learning architectures tend to
generalize relatively well to unseen data, allowing them to be deployed in practice …

Theoretical issues in deep networks

T Poggio, A Banburski, Q Liao - Proceedings of the …, 2020 - National Acad Sciences
While deep learning is successful in a number of applications, it is not yet well understood
theoretically. A theoretical characterization of deep learning should answer questions about …

Geometry of optimization and implicit regularization in deep learning

B Neyshabur, R Tomioka, R Salakhutdinov… - arXiv preprint arXiv …, 2017 - arxiv.org
We argue that the optimization plays a crucial role in generalization of deep learning models
through implicit regularization. We do this by demonstrating that generalization ability is not …

Depth with nonlinearity creates no bad local minima in ResNets

K Kawaguchi, Y Bengio - Neural Networks, 2019 - Elsevier
In this paper, we prove that depth with nonlinearity creates no bad local minima in a type of
arbitrarily deep ResNets with arbitrary nonlinear activation functions, in the sense that the …

Full error analysis for the training of deep neural networks

C Beck, A Jentzen, B Kuckuck - Infinite Dimensional Analysis …, 2022 - World Scientific
Deep learning algorithms have been applied very successfully in recent years to a range of
problems out of reach for classical solution paradigms. Nevertheless, there is no completely …

The principles of deep learning theory

DA Roberts, S Yaida, B Hanin - 2022 - cambridge.org
This textbook establishes a theoretical framework for understanding deep learning models
of practical relevance. With an approach that borrows from theoretical physics, Roberts and …