is proposed. The learning problem is posed such that the open loop value function is its
optimal solution. This infinite dimensional, function space, problem, is approximated by a
polynomial ansatz and its convergence is analyzed. An $\ell_1 $ penalty term is employed,
which combined with the proximal point method, allows to find sparse solutions for the
learning problem. The approach requires multiple evaluations of the elements of the …