A Letcher, D Balduzzi, S Racaniere, J Martens… - arXiv preprint arXiv …, 2019 - arxiv.org
Deep learning is built on the foundational guarantee that gradient descent on an objective
function converges to local minima. Unfortunately, this guarantee fails in settings, such as …