K Hajjar, L Chizat - arXiv preprint arXiv:2211.08771, 2022 - arxiv.org
We consider the idealized setting of gradient flow on the population risk for infinitely wide
two-layer ReLU neural networks (without bias), and study the effect of symmetries on the …
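
As a rough illustration of the setting named in this snippet, the following is a minimal finite-width sketch of a two-layer ReLU network without bias, trained by small gradient-descent steps. It is not the authors' method. Everything specific here is an assumption made for illustration: the finite width m, the 1/m output scaling, the squared loss, the Gaussian inputs, the hypothetical target relu(x[0]), the finite sample standing in for the population risk, and the discrete steps standing in for gradient flow.

```python
import random

def relu(z):
    return z if z > 0.0 else 0.0

def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))

def predict(a, W, x):
    # f(x) = (1/m) * sum_j a_j * relu(<w_j, x>); there is no bias term.
    # The 1/m output scaling is an assumption of this sketch.
    m = len(a)
    return sum(aj * relu(dot(wj, x)) for aj, wj in zip(a, W)) / m

def risk(a, W, X, y):
    # Empirical squared-loss risk, used as a stand-in for the population risk.
    return 0.5 * sum((predict(a, W, x) - t) ** 2 for x, t in zip(X, y)) / len(X)

def gradient_step(a, W, X, y, lr):
    # One explicit gradient-descent step, a crude discretization of gradient flow.
    m, n, d = len(a), len(X), len(W[0])
    grad_a = [0.0] * m
    grad_W = [[0.0] * d for _ in range(m)]
    for x, t in zip(X, y):
        err = predict(a, W, x) - t
        for j in range(m):
            pre = dot(W[j], x)
            if pre > 0.0:  # ReLU passes gradient only where it is active
                grad_a[j] += err * pre / (m * n)
                for i in range(d):
                    grad_W[j][i] += err * a[j] * x[i] / (m * n)
    step = lr * m  # rescale so each neuron moves at a width-independent rate
    new_a = [a[j] - step * grad_a[j] for j in range(m)]
    new_W = [[W[j][i] - step * grad_W[j][i] for i in range(d)] for j in range(m)]
    return new_a, new_W

# Hypothetical toy problem: Gaussian inputs in dimension 2, target relu(x[0]).
rng = random.Random(0)
d, m, n = 2, 20, 50
X = [[rng.gauss(0.0, 1.0) for _ in range(d)] for _ in range(n)]
y = [relu(x[0]) for x in X]
a = [rng.uniform(-1.0, 1.0) for _ in range(m)]
W = [[rng.gauss(0.0, 1.0) for _ in range(d)] for _ in range(m)]

initial_risk = risk(a, W, X, y)
for _ in range(100):
    a, W = gradient_step(a, W, X, y, lr=0.05)
final_risk = risk(a, W, X, y)
print(initial_risk, final_risk)
```

Because there is no bias, the network in this sketch maps the zero input to zero and is positively homogeneous in its input: scaling x by a positive constant scales the output by the same constant.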