Gradient statistics aware power control for over-the-air federated learning

N Zhang, M Tao - IEEE Transactions on Wireless …, 2021 - ieeexplore.ieee.org
… control with unknown gradient statistics: We propose an adaptive power control algorithm
that estimates the gradient statistics based on the historical aggregated gradients and then …

Gradient statistics aware power control for over-the-air federated learning in fading channels

N Zhang, M Tao - 2020 IEEE International Conference on …, 2020 - ieeexplore.ieee.org
… over-the-air federated learning over fading channels by taking the gradient distribution into
account. The structure of the optimal power control for SGD learning mainly depends on the …

FedH2L: Federated learning with model and statistical heterogeneity

Y Li, W Zhou, H Wang, H Mi, TM Hospedales - arXiv preprint arXiv …, 2021 - arxiv.org
… being desired to maximise performance under heterogeneous data statistics across peers. …
data statistics across peers we introduce a new optimization strategy to find the gradient

Accelerating federated learning via momentum gradient descent

W Liu, L Chen, Y Chen, W Zhang - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
… Motivated by the above observations, we propose a new federated learning design of
Momentum Federated Learning (MFL) in this paper. In the proposed MFL design, we introduce …

Network gradient descent algorithm for decentralized federated learning

S Wu, D Huang, H Wang - … of Business & Economic Statistics, 2023 - Taylor & Francis
… We study a fully decentralized federated learning algorithm, which is a novel gradient … it as
a network gradient descent (NGD) method. In the NGD method, only statistics (eg, parameter …

FedPAGE: A fast local stochastic gradient method for communication-efficient federated learning

H Zhao, Z Li, P Richtárik - arXiv preprint arXiv:2108.04755, 2021 - arxiv.org
… We propose a new federated learning algorithm, FedPAGE, able to further reduce the … than
previous local methods for both federated convex and nonconvex optimization. Concretely, 1) …

Federated learning with compression: Unified analysis and sharp guarantees

F Haddadpour, MM Kamani… - … and Statistics, 2021 - proceedings.mlr.press
… In this section, we propose a generalized version of the local stochastic gradient descent (SGD)
method for federated learning which uses compressed signals to reduce the overall …

Linear convergence in federated learning: Tackling client heterogeneity and sparse gradients

A Mitra, R Jaafar, GJ Pappas… - Advances in Neural …, 2021 - proceedings.neurips.cc
federated learning (FL) setup where a group of clients periodically coordinate with a central
server to train a statistical … convergence rates under aggressive gradient sparsification, and …

Fast composite optimization and statistical recovery in federated learning

Y Bao, M Crawshaw, S Luo… - … on Machine Learning, 2022 - proceedings.mlr.press
As a prevalent distributed learning paradigm, Federated Learning (FL) trains a global model
on a massive amount of devices with infrequent communication. This paper investigates a …

On the convergence of local descent methods in federated learning

F Haddadpour, M Mahdavi - arXiv preprint arXiv:1910.14425, 2019 - arxiv.org
… to learn from extremely large number of devices in federated setting… efficient algorithms for
federated learning with provable … of local Gradient Descent (GD) and local Stochastic Gradient