a popular solver of constrained convex optimization problems in signal processing and
machine learning, thanks to its low complexity. The present contribution broadens its scope
by replacing the gradient per FW subproblem with a weighted average of gradients. This
generalization speeds up the convergence of FW by alleviating its zigzag behavior. A
geometric interpretation for the averaged gradients is provided, and convergence …