R Yu,
Y Zhang,
J Kwok - Forty-first International Conference on Machine … - openreview.net
Sharpness-Aware Minimization (SAM), which performs gradient descent on adversarially
perturbed weights, can improve generalization by identifying flatter minima. However, recent …