Y Li, Y Yuan - Advances in neural information processing …, 2017 - proceedings.neurips.cc
In recent years, stochastic gradient descent (SGD) based techniques have become the
standard tools for training neural networks. However, formal theoretical understanding of …