Y Tian, Y Zhang, H Zhang - Mathematics, 2023 - mdpi.com
In the age of artificial intelligence, finding the best approach to handling huge amounts of data is a tremendously motivating and hard problem. Among machine learning models, stochastic …
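For orientation, the plain SGD iteration that surveys like this one build on can be written as follows (notation ours, not taken from the snippet):

```latex
% One SGD step: x_k is the iterate, \alpha_k the stepsize, and
% \nabla f_{i_k}(x_k) a stochastic gradient on a sampled example i_k.
x_{k+1} = x_k - \alpha_k \, \nabla f_{i_k}(x_k)
```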
Y Liu, Y Gao, W Yin - Advances in Neural Information …, 2020 - proceedings.neurips.cc
SGD with momentum (SGDM) is widely used in many machine learning tasks, and it is often run with dynamic stepsizes and momentum weights tuned in a stagewise …
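A minimal sketch of SGDM with a stagewise stepsize schedule of the kind described; the `grad(x, rng)` interface and the concrete parameter values are assumptions, not taken from the paper:

```python
import numpy as np

def sgdm_stagewise(grad, x0, stages, beta=0.9, seed=0):
    """SGD with momentum (SGDM), run in stages with per-stage stepsizes.

    grad(x, rng) -- returns a stochastic gradient at x (assumed interface)
    stages       -- list of (stepsize, num_iters) pairs; a stagewise schedule
                    typically shrinks the stepsize from one stage to the next
    """
    rng = np.random.default_rng(seed)
    x = np.asarray(x0, dtype=float)
    v = np.zeros_like(x)
    for alpha, num_iters in stages:
        for _ in range(num_iters):
            v = beta * v + grad(x, rng)   # momentum buffer from past gradients
            x = x - alpha * v             # step along the buffer
    return x
```

For example, `stages=[(0.1, 1000), (0.01, 1000)]` runs two stages with a tenfold stepsize drop between them.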
W Liu, L Chen, Y Chen, W Zhang - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Federated learning (FL) provides a communication-efficient approach to solving machine learning problems over distributed data, without sending raw data to a central server …
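A toy sketch of the communication pattern FL relies on, with FedAvg-style local steps followed by server averaging; the function names and the least-squares local objective are illustrative, not the paper's method:

```python
import numpy as np

def local_steps(x, data, lr=0.1, steps=5):
    """A few local gradient steps on one client's least-squares data (A, b)."""
    A, b = data
    for _ in range(steps):
        x = x - lr * A.T @ (A @ x - b) / len(b)   # raw data never leaves the client
    return x

def federated_round(x_global, clients, lr=0.1, steps=5):
    """One FL round: each client updates locally, the server averages the models.
    Only model vectors cross the network, which is the communication saving."""
    updates = [local_steps(x_global.copy(), d, lr, steps) for d in clients]
    return np.mean(updates, axis=0)
```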
H Yu, R Jin, S Yang - International Conference on Machine …, 2019 - proceedings.mlr.press
Recent developments in large-scale distributed machine learning applications, e.g., deep neural networks, benefit enormously from advances in distributed non-convex …
X Chen, S Liu, R Sun, M Hong - arXiv preprint arXiv:1808.02941, 2018 - arxiv.org
This paper studies a class of adaptive gradient-based momentum algorithms that update the search directions and learning rates simultaneously using past gradients. This class, which …
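The class referred to maintains a momentum direction and a per-coordinate scaling, both built from past gradients; a minimal Adam-style instance, with the usual default constants assumed rather than quoted from the paper:

```python
import numpy as np

def adam_type_step(x, g, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One step of an Adam-type method at iteration t >= 1: m is the search
    direction, v the second-moment accumulator that adapts the learning
    rate coordinate-wise."""
    m = beta1 * m + (1 - beta1) * g        # direction from past gradients
    v = beta2 * v + (1 - beta2) * g**2     # adaptive scaling from past gradients
    m_hat = m / (1 - beta1**t)             # usual bias corrections
    v_hat = v / (1 - beta2**t)
    return x - lr * m_hat / (np.sqrt(v_hat) + eps), m, v
```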
Games generalize the single-objective optimization paradigm by introducing different objective functions for different players. Differentiable games often proceed by simultaneous …
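A toy illustration of the simultaneous updates mentioned, on the bilinear game f(x, y) = xy where player 1 minimizes over x and player 2 maximizes over y (the game and stepsize are chosen here purely for illustration):

```python
# Simultaneous gradient steps in a two-player differentiable game f(x, y) = x*y.
x, y, lr = 1.0, 1.0, 0.1
for _ in range(100):
    gx, gy = y, x                       # partial derivatives at the current point
    x, y = x - lr * gx, y + lr * gy     # both players update at the same time
# On this game, simultaneous play spirals away from the equilibrium (0, 0)
# instead of converging, a standard motivation for momentum-aware game dynamics.
```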
N Loizou, P Richtárik - Computational Optimization and Applications, 2020 - Springer
In this paper, we study several classes of stochastic optimization algorithms enriched with heavy ball momentum. Among the methods studied are stochastic gradient descent …
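The heavy-ball update in its classic two-iterate form; a sketch under illustrative parameters, with `grad` standing in for either a full or a stochastic gradient oracle:

```python
import numpy as np

def heavy_ball(grad, x0, alpha=0.05, beta=0.9, iters=500):
    """Heavy-ball iteration x_{k+1} = x_k - alpha*grad(x_k) + beta*(x_k - x_{k-1});
    plugging in a stochastic gradient gives the SGD-with-momentum variant
    studied in this line of work."""
    x_prev = x = np.asarray(x0, dtype=float)
    for _ in range(iters):
        x, x_prev = x - alpha * grad(x) + beta * (x - x_prev), x
    return x
```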
R Xin, UA Khan - IEEE Transactions on Automatic Control, 2019 - ieeexplore.ieee.org
We study distributed optimization to minimize a sum of smooth and strongly convex functions. Recent work on this problem uses gradient tracking to achieve linear convergence …
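A toy sketch of the gradient-tracking mechanism the snippet mentions: each agent mixes iterates over the network and maintains a tracker estimating the average gradient. The mixing matrix, stepsize, and interfaces below are assumptions, not the paper's code:

```python
import numpy as np

def gradient_tracking(grads, x0, W, alpha=0.05, iters=300):
    """Distributed minimization of sum_i f_i over n agents.

    grads -- list of per-agent gradient functions for the f_i
    W     -- doubly stochastic mixing matrix of the communication graph
    """
    n = len(grads)
    x = np.tile(np.asarray(x0, dtype=float), (n, 1))     # one row per agent
    g = np.stack([grads[i](x[i]) for i in range(n)])
    y = g.copy()                                         # gradient trackers
    for _ in range(iters):
        x = W @ x - alpha * y                            # consensus + descent step
        g_new = np.stack([grads[i](x[i]) for i in range(n)])
        y = W @ y + g_new - g                            # y tracks the average gradient
        g = g_new
    return x.mean(axis=0)
```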
This is a handbook of simple proofs of the convergence of gradient and stochastic gradient descent type methods. We consider functions that are Lipschitz, smooth, convex, strongly …
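A representative single step from such proofs, for an L-smooth f and the gradient step x_{k+1} = x_k - α∇f(x_k) (a standard smoothness argument, not quoted from the handbook):

```latex
% Descent lemma applied to x_{k+1} = x_k - \alpha \nabla f(x_k):
f(x_{k+1}) \le f(x_k) - \alpha\left(1 - \tfrac{L\alpha}{2}\right)\|\nabla f(x_k)\|^2
% so any stepsize \alpha \le 1/L guarantees monotone decrease of f.
```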