Recent advances in stochastic gradient descent in deep learning

Y Tian, Y Zhang, H Zhang - Mathematics, 2023 - mdpi.com
In the age of artificial intelligence, finding the best approach to handling huge amounts of data is a
tremendously motivating and hard problem. Among machine learning models, stochastic …
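
For context on the family of methods this survey covers, here is a minimal sketch of plain stochastic gradient descent on a finite-sum objective. The quadratic loss, synthetic data, and fixed step size are illustrative assumptions of mine, not details from the paper.

```python
import numpy as np

def sgd(grad_fn, w0, data, lr=0.01, epochs=10, seed=0):
    """Plain SGD: step with the gradient of one randomly drawn example at a time."""
    rng = np.random.default_rng(seed)
    w = w0.copy()
    n = len(data)
    for _ in range(epochs):
        for i in rng.permutation(n):
            w -= lr * grad_fn(w, data[i])  # single-sample gradient step
    return w

# Toy least-squares example: f_i(w) = 0.5 * (x_i @ w - y_i)^2
X = np.random.randn(100, 5)
w_true = np.arange(5, dtype=float)
y = X @ w_true
grad = lambda w, d: d[0] * (d[0] @ w - d[1])
w_hat = sgd(grad, np.zeros(5), list(zip(X, y)))
print(np.round(w_hat, 2))
```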

Problem formulations and solvers in linear SVM: a review

VK Chauhan, K Dahiya, A Sharma - Artificial Intelligence Review, 2019 - Springer
Support vector machine (SVM) is an optimal margin-based classification technique in
machine learning. SVM is a binary linear classifier which has been extended to non-linear …
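
As a reminder of the formulation such a review surveys, the soft-margin linear SVM primal problem is commonly written as below; the notation (penalty parameter C, hinge loss) is the standard convention rather than something taken from this particular paper.

```latex
\min_{w,\,b}\;\; \frac{1}{2}\lVert w\rVert^2 \;+\; C \sum_{i=1}^{n} \max\bigl(0,\; 1 - y_i (w^\top x_i + b)\bigr)
```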

Federated learning of a mixture of global and local models

F Hanzely, P Richtárik - arXiv preprint arXiv:2002.05516, 2020 - arxiv.org
We propose a new optimization formulation for training federated learning models. The
standard formulation has the form of an empirical risk minimization problem constructed to …
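
If memory serves, the mixed global/local formulation proposed here gives each client its own model and penalizes its deviation from the clients' average; the symbols below (λ, x̄) follow the usual presentation and should be checked against the paper itself.

```latex
\min_{x_1,\dots,x_n}\;\; \frac{1}{n}\sum_{i=1}^{n} f_i(x_i)
\;+\; \frac{\lambda}{2n}\sum_{i=1}^{n}\bigl\lVert x_i - \bar{x}\bigr\rVert^2,
\qquad \bar{x} = \frac{1}{n}\sum_{i=1}^{n} x_i .
```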

Federated optimization: Distributed machine learning for on-device intelligence

J Konečný, HB McMahan, D Ramage… - arXiv preprint arXiv …, 2016 - arxiv.org
We introduce a new and increasingly relevant setting for distributed optimization in machine
learning, where the data defining the optimization are unevenly distributed over an …
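
The federated objective in this line of work is typically a client-weighted empirical risk over K devices; the weighting by local sample counts n_k shown below is the standard convention, assumed here rather than quoted from the paper.

```latex
\min_{w}\; F(w) \;=\; \sum_{k=1}^{K} \frac{n_k}{n}\, F_k(w),
\qquad F_k(w) \;=\; \frac{1}{n_k}\sum_{i \in \mathcal{P}_k} f_i(w).
```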

Recent advances in stochastic gradient descent algorithms

史加荣, 王丹, 尚凡华, 张鹤于 - 自动化学报 (Acta Automatica Sinica), 2021 - aas.net.cn
In machine learning, gradient descent is the most important and fundamental method for solving optimization problems. As the scale of data keeps
growing, traditional gradient descent can no longer solve large-scale machine learning problems effectively. Stochastic gradient descent, at each iteration …
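
The contrast this survey opens with can be summarized by the two update rules below: full-batch gradient descent touches all n examples per step, while SGD uses a single sampled index i_t. The notation is standard, not taken from the article.

```latex
\text{GD:}\quad w_{t+1} = w_t - \eta_t\, \frac{1}{n}\sum_{i=1}^{n}\nabla f_i(w_t),
\qquad
\text{SGD:}\quad w_{t+1} = w_t - \eta_t\, \nabla f_{i_t}(w_t),\quad i_t \sim \mathrm{Unif}\{1,\dots,n\}.
```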

SARAH: A novel method for machine learning problems using stochastic recursive gradient

LM Nguyen, J Liu, K Scheinberg… - … conference on machine …, 2017 - proceedings.mlr.press
In this paper, we propose a StochAstic Recursive grAdient algoritHm (SARAH), as well as its
practical variant SARAH+, as a novel approach to the finite-sum minimization problems …
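
SARAH's defining ingredient is a recursive (biased) gradient estimator maintained inside each outer loop, which starts from a full gradient. As best I recall, the inner update takes the form below; step-size conditions and the SARAH+ stopping rule should be taken from the paper.

```latex
v_0 = \nabla F(w_0), \qquad
v_t = \nabla f_{i_t}(w_t) - \nabla f_{i_t}(w_{t-1}) + v_{t-1}, \qquad
w_{t+1} = w_t - \eta\, v_t .
```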

Coresets for data-efficient training of machine learning models

B Mirzasoleiman, J Bilmes… - … Conference on Machine …, 2020 - proceedings.mlr.press
Incremental gradient (IG) methods, such as stochastic gradient descent and its variants, are
commonly used for large scale optimization in machine learning. Despite the sustained effort …
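
The core idea is to pick a small weighted subset whose gradient tracks the full training gradient, so that IG methods run on the subset alone. Roughly, the selection problem has the form below; this is my paraphrase of the paper's formulation, so the exact constraints and the submodular surrogate used to solve it may differ.

```latex
S^{*} \in \arg\min_{S \subseteq V,\ \gamma \ge 0} \; |S|
\quad \text{s.t.} \quad
\max_{w \in \mathcal{W}} \Bigl\lVert \sum_{i \in V} \nabla f_i(w) \;-\; \sum_{j \in S} \gamma_j \nabla f_j(w) \Bigr\rVert \;\le\; \epsilon .
```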

Only train once: A one-shot neural network training and pruning framework

T Chen, B Ji, T Ding, B Fang, G Wang… - Advances in …, 2021 - proceedings.neurips.cc
Structured pruning is a commonly used technique in deploying deep neural networks
(DNNs) onto resource-constrained devices. However, the existing pruning methods are …

Katyusha: The first direct acceleration of stochastic gradient methods

Z Allen-Zhu - Journal of Machine Learning Research, 2018 - jmlr.org
Nesterov's momentum trick is famously known for accelerating gradient descent, and has
been proven useful in building fast iterative algorithms. However, in the stochastic setting …
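
Katyusha couples SVRG-style variance reduction with an extra "negative momentum" anchor at the snapshot point x̃. A sketch of the inner iteration, written from memory and only indicative of the structure (parameter choices such as τ₂ = 1/2 and the step 1/(3L) should be verified against the paper), is:

```latex
x_{k+1} = \tau_1 z_k + \tau_2 \tilde{x} + (1 - \tau_1 - \tau_2)\, y_k, \qquad
\tilde{\nabla}_{k+1} = \nabla f(\tilde{x}) + \nabla f_{i_k}(x_{k+1}) - \nabla f_{i_k}(\tilde{x}),
```
```latex
y_{k+1} = x_{k+1} - \tfrac{1}{3L}\,\tilde{\nabla}_{k+1}, \qquad
z_{k+1} = z_k - \alpha\, \tilde{\nabla}_{k+1}.
```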

SAGA: A fast incremental gradient method with support for non-strongly convex composite objectives

A Defazio, F Bach… - Advances in neural …, 2014 - proceedings.neurips.cc
In this work we introduce a new fast incremental gradient method SAGA, in the spirit of SAG,
SDCA, MISO and SVRG. SAGA improves on the theory behind SAG and SVRG, with better …
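
For reference, SAGA maintains a table of the most recent gradient evaluated at each data point and uses it to de-bias the stochastic gradient. A minimal sketch, assuming a smooth unregularized objective (the paper also handles a proximal/composite term), follows; the function names and loop structure are mine.

```python
import numpy as np

def saga(grad_fn, w0, n, lr=0.01, iters=1000, seed=0):
    """SAGA sketch: variance-reduced step using a table of stored per-example gradients."""
    rng = np.random.default_rng(seed)
    w = w0.copy()
    table = np.array([grad_fn(w, i) for i in range(n)])  # stored gradient for each example
    avg = table.mean(axis=0)                              # running average of the table
    for _ in range(iters):
        j = rng.integers(n)
        g_new = grad_fn(w, j)
        # unbiased estimator: fresh gradient minus the stale one, plus the table average
        w -= lr * (g_new - table[j] + avg)
        avg += (g_new - table[j]) / n                     # keep the average consistent
        table[j] = g_new
    return w
```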