O Neumann, C Gros - arXiv preprint arXiv:2412.11979, 2024 - arxiv.org
Neural scaling laws are observed in a range of domains, to date with no clear understanding
of why they occur. Recent theories suggest that loss power laws arise from Zipf's law, a …