YL Sung, V Nair, C Raffel - … of the 35th International Conference on …, 2021 - dl.acm.org
During typical gradient-based training of deep neural networks, all of the model's
parameters are updated at each iteration. Recent work has shown that it is possible to …