M Telgarsky - Conference on learning theory, 2016 - proceedings.mlr.press
For any positive integer k, there exist neural networks with Θ (k^ 3) layers, Θ (1) nodes per
layer, and Θ (1) distinct parameters which can not be approximated by networks with O (k) …