A Daniely - Advances in neural information processing …, 2017 - proceedings.neurips.cc
We show that the standard stochastic gradient descent (SGD) algorithm is guaranteed to
learn, in polynomial time, a function that is competitive with the best function in the conjugate …