[图书][B] Statistical mechanics of learning

A Engel - 2001 - books.google.com
Learning is one of the things that humans do naturally, and it has always been a challenge
for us to understand the process. Nowadays this challenge has another dimension as we try …

Phase diagram of stochastic gradient descent in high-dimensional two-layer neural networks

R Veiga, L Stephan, B Loureiro… - Advances in …, 2022 - proceedings.neurips.cc
Despite the non-convex optimization landscape, over-parametrized shallow networks are
able to achieve global convergence under gradient descent. The picture can be radically …

On-line learning in soft committee machines

D Saad, SA Solla - Physical Review E, 1995 - APS
The problem of on-line learning in two-layer neural networks is studied within the framework
of statistical mechanics. A fully connected committee machine with K hidden units is trained …

A Bayesian approach to on-line learning

M Opper, O Winther - On-line learning in neural networks, 1999 - research.aston.ac.uk
Online learning is discussed from the viewpoint of Bayesian statistical inference. By
replacing the true posterior distribution with a simpler parametric distribution, one can define …

From high-dimensional & mean-field dynamics to dimensionless odes: A unifying approach to sgd in two-layers networks

L Arnaboldi, L Stephan, F Krzakala… - The Thirty Sixth …, 2023 - proceedings.mlr.press
This manuscript investigates the one-pass stochastic gradient descent (SGD) dynamics of a
two-layer neural network trained on Gaussian data and labels generated by a similar …

A learning rule for very simple universal approximators consisting of a single layer of perceptrons

P Auer, H Burgsteiner, W Maass - Neural networks, 2008 - Elsevier
One may argue that the simplest type of neural networks beyond a single perceptron is an
array of several perceptrons in parallel. In spite of their simplicity, such circuits can compute …

Rethinking generalization requires revisiting old ideas: statistical mechanics approaches and complex learning behavior

CH Martin, MW Mahoney - arXiv preprint arXiv:1710.09553, 2017 - arxiv.org
We describe an approach to understand the peculiar and counterintuitive generalization
properties of deep neural networks. The approach involves going beyond worst-case …

On-line backpropagation in two-layered neural networks

P Riegler, M Biehl - Journal of Physics A: Mathematical and …, 1995 - iopscience.iop.org
We present an exact analysis of learning a rule by on-line gradient descent in a two-layered
neural network with adjustable hidden-to-output weights (backpropagation of error). Results …

Transient dynamics of on-line learning in two-layered neural networks

M Biehl, P Riegler, C Wöhler - Journal of Physics A: Mathematical …, 1996 - iopscience.iop.org
The dynamics of on-line learning in neural networks with continuous units is dominated by
plateaux in the time dependence of the generalization error. Using tools from statistical …

On-line versus off-line learning from random examples: General results

M Opper - Physical review letters, 1996 - APS
I propose a general model of on-line learning from random examples which, when applied
to a smooth realizable stochastic rule, yields the same asymptotic generalization error rate …