Despite the non-convex optimization landscape, over-parametrized shallow networks are able to achieve global convergence under gradient descent. The picture can be radically …
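To make the convergence claim concrete, here is a minimal NumPy sketch: full-batch gradient descent on a two-layer ReLU network whose width far exceeds the number of samples drives the training loss toward zero even for arbitrary targets. The width, learning rate, and step count are illustrative assumptions, not values from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d, width = 20, 5, 2000        # few samples, width >> n: over-parametrized
X = rng.standard_normal((n, d))
y = rng.standard_normal(n)       # arbitrary targets

a = rng.choice([-1.0, 1.0], width) / np.sqrt(width)  # fixed second layer
W = rng.standard_normal((width, d))                  # trained first layer
lr = 0.5                                             # illustrative step size

for step in range(5001):
    pre = X @ W.T                            # (n, width) pre-activations
    err = np.maximum(pre, 0.0) @ a - y       # network output minus target
    # gradient of the loss 0.5 * mean(err**2) with respect to W
    grad = ((err[:, None] * (pre > 0)) * a[None, :]).T @ X / n
    W -= lr * grad
    if step % 1000 == 0:
        print(step, 0.5 * np.mean(err ** 2))  # loss decays toward zero
```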
The problem of on-line learning in two-layer neural networks is studied within the framework of statistical mechanics. A fully connected committee machine with K hidden units is trained …
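For orientation (background standard in this line of work, not quoted from the entry itself): with student weights J_i, teacher weights B_n, and input dimension N, the high-dimensional dynamics closes on a few overlap order parameters, and for the activation g(x) = erf(x/sqrt(2)) the generalization error takes the closed form

```latex
% Order parameters:
%   Q_{ik} = J_i \cdot J_k / N,\quad R_{in} = J_i \cdot B_n / N,\quad T_{nm} = B_n \cdot B_m / N
\epsilon_g = \frac{1}{\pi}\Bigg[
    \sum_{i,k} \arcsin\frac{Q_{ik}}{\sqrt{(1+Q_{ii})(1+Q_{kk})}}
  + \sum_{n,m} \arcsin\frac{T_{nm}}{\sqrt{(1+T_{nn})(1+T_{mm})}}
  - 2\sum_{i,n} \arcsin\frac{R_{in}}{\sqrt{(1+Q_{ii})(1+T_{nn})}}
\Bigg]
```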
M Opper, O Winther - On-line learning in neural networks, 1999 - research.aston.ac.uk
Online learning is discussed from the viewpoint of Bayesian statistical inference. By replacing the true posterior distribution with a simpler parametric distribution, one can define …
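A concrete instance of this idea is assumed-density filtering: keep a Gaussian approximation to the posterior over the weights and, after each example, project the exact (non-Gaussian) Bayesian update back onto the Gaussian family by moment matching. The probit-perceptron sketch below is one standard construction of this type, offered as an illustration rather than as the authors' exact scheme.

```python
import numpy as np
from scipy.stats import norm

def adf_probit_update(mu, Sigma, x, y):
    """One assumed-density-filtering step for the likelihood Phi(y * w.x):
    the exact posterior N(w; mu, Sigma) * Phi(y * w.x) is replaced by the
    Gaussian matching its first two moments (standard probit moment formulas)."""
    v = x @ Sigma @ x + 1.0
    z = y * (mu @ x) / np.sqrt(v)
    r = norm.pdf(z) / norm.cdf(z)               # hazard ratio of the probit
    Sx = Sigma @ x
    mu_new = mu + (y * r / np.sqrt(v)) * Sx     # shifted mean
    Sigma_new = Sigma - (r * (z + r) / v) * np.outer(Sx, Sx)  # shrunk covariance
    return mu_new, Sigma_new

# Toy stream: labels from a fixed teacher perceptron.
rng = np.random.default_rng(0)
N = 20
w_star = rng.standard_normal(N)
mu, Sigma = np.zeros(N), np.eye(N)
for _ in range(500):
    x = rng.standard_normal(N)
    mu, Sigma = adf_probit_update(mu, Sigma, x, np.sign(w_star @ x))
print("teacher overlap:", mu @ w_star / (np.linalg.norm(mu) * np.linalg.norm(w_star)))
```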
This manuscript investigates the one-pass stochastic gradient descent (SGD) dynamics of a two-layer neural network trained on Gaussian data and labels generated by a similar …
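A minimal simulation of the setup described here, assuming the teacher-student soft committee machine with erf activation common in this literature; the sizes and learning rate are illustrative. The matrices R and Q computed along the way are the order parameters appearing in the closed-form error quoted earlier.

```python
import numpy as np
from scipy.special import erf

rng = np.random.default_rng(0)
N, K, M = 500, 3, 3              # input dim, student and teacher hidden units
eta, steps = 0.5, 200_000        # learning rate and number of one-pass steps

g  = lambda h: erf(h / np.sqrt(2))                       # hidden activation
dg = lambda h: np.sqrt(2 / np.pi) * np.exp(-h ** 2 / 2)  # its derivative

B = rng.standard_normal((M, N))          # fixed teacher
J = rng.standard_normal((K, N)) * 0.01   # student, small random init

def output(W, X):
    """Committee output: sum of erf units applied to the local fields."""
    return g(X @ W.T / np.sqrt(N)).sum(axis=-1)

X_test = rng.standard_normal((2000, N))
y_test = output(B, X_test)

for t in range(steps):
    x = rng.standard_normal(N)                  # fresh example: one-pass SGD
    h = J @ x / np.sqrt(N)                      # student local fields
    e = output(B, x[None])[0] - g(h).sum()      # label minus prediction
    # 1/N rate scaling follows the convention of this literature (time alpha = t/N)
    J += (eta / N) * e * dg(h)[:, None] * x[None, :]
    if t % 20_000 == 0:
        R, Q = J @ B.T / N, J @ J.T / N         # overlap order parameters
        eps_g = 0.5 * np.mean((output(J, X_test) - y_test) ** 2)
        print(f"alpha={t / N:6.1f}  eps_g={eps_g:.4f}")
```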
One may argue that the simplest type of neural networks beyond a single perceptron is an array of several perceptrons in parallel. In spite of their simplicity, such circuits can compute …
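Concretely, such a circuit (a committee machine) takes a majority vote over K perceptrons running in parallel; already for K = 3 it computes rules no single perceptron can, e.g. XOR. The weights below are a hand-picked illustration (the last component is a bias weight on a constant +1 input).

```python
import numpy as np

def committee(W, x):
    """Hard committee machine: majority vote over K perceptrons in parallel."""
    return np.sign(np.sign(W @ x).sum())

# Three perceptrons realizing XOR, which is not linearly separable.
W = np.array([[ 1.0, -1.0, -1.0],
              [-1.0,  1.0, -1.0],
              [ 0.0,  0.0,  1.0]])
for x1, x2 in [(1, 1), (1, -1), (-1, 1), (-1, -1)]:
    x = np.array([x1, x2, 1.0])
    print((x1, x2), "->", committee(W, x))   # +1 iff x1 != x2
```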
We describe an approach to understand the peculiar and counterintuitive generalization properties of deep neural networks. The approach involves going beyond worst-case …
P Riegler, M Biehl - Journal of Physics A: Mathematical and General, 1995 - iopscience.iop.org
We present an exact analysis of learning a rule by on-line gradient descent in a two-layered neural network with adjustable hidden-to-output weights (backpropagation of error). Results …
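In sketch form, the setting analyzed here adds gradient updates for the hidden-to-output weights v on top of the first-layer updates; everything else follows the teacher-student conventions of the earlier sketch. Sizes, rates, and the 1/N rate scaling (chosen so both layers evolve on the time scale alpha = t/N) are assumptions for illustration.

```python
import numpy as np
from scipy.special import erf

rng = np.random.default_rng(1)
N, K = 500, 3                    # input dimension, hidden units (illustrative)
eta_J, eta_v = 0.5, 0.1          # per-layer learning rates (illustrative)

g  = lambda h: erf(h / np.sqrt(2))
dg = lambda h: np.sqrt(2 / np.pi) * np.exp(-h ** 2 / 2)

B, u = rng.standard_normal((K, N)), np.ones(K)   # fixed teacher, both layers
J = rng.standard_normal((K, N)) * 0.01           # student first layer
v = rng.standard_normal(K)                       # student second layer

for t in range(200_000):
    x = rng.standard_normal(N)                   # fresh example each step
    h = J @ x / np.sqrt(N)
    e = u @ g(B @ x / np.sqrt(N)) - v @ g(h)     # label minus prediction
    v += (eta_v / N) * e * g(h)                  # hidden-to-output update
    J += (eta_J / N) * e * (v * dg(h))[:, None] * x[None, :]  # backpropagated
    if t % 40_000 == 0:
        print(t, 0.5 * e ** 2)   # noisy one-example estimate of the error
```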
M Biehl, P Riegler, C Wöhler - Journal of Physics A: Mathematical and General, 1996 - iopscience.iop.org
The dynamics of on-line learning in neural networks with continuous units is dominated by plateaux in the time dependence of the generalization error. Using tools from statistical …
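The plateau reflects a symmetric phase in which each student unit has nearly the same overlap with every teacher unit; escaping it means specialization. A crude diagnostic that could be dropped into the simulation sketches above (the summary statistic is my own choice, not from the paper):

```python
import numpy as np

def specialization(J, B):
    """Spread of the student-teacher overlaps R = J B^T / N across student units.
    Near zero during the symmetric plateau (all rows of R alike); grows once
    each student unit locks onto a distinct teacher unit."""
    R = J @ B.T / J.shape[1]
    return R.std(axis=0).mean()
```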
I propose a general model of on-line learning from random examples which, when applied to a smooth realizable stochastic rule, yields the same asymptotic generalization error rate …