Don't use large mini-batches, use local sgd T Lin, SU Stich, KK Patel, M Jaggi arXiv preprint arXiv:1808.07217, 2018 | 473 | 2018 |
Is local SGD better than minibatch SGD? B Woodworth, KK Patel, S Stich, Z Dai, B Bullins, B Mcmahan, O Shamir, ... International Conference on Machine Learning, 10334-10343, 2020 | 273 | 2020 |
Minibatch vs local sgd for heterogeneous distributed learning BE Woodworth, KK Patel, N Srebro Advances in Neural Information Processing Systems 33, 6281-6292, 2020 | 204 | 2020 |
Communication trade-offs for local-sgd with large step size KK Patel, A Dieuleveut Advances In Neural Information Processing Systems 32 (32), 2825-2830, 2019 | 83* | 2019 |
Corruption-tolerant bandit learning S Kapoor, KK Patel, P Kar Machine Learning 108 (4), 687-715, 2019 | 60 | 2019 |
Towards Optimal Communication Complexity in Distributed Non-Convex Optimization KK Patel, L Wang, B Woodworth, B Bullins, N Srebro Advances in Neural Information Systems 36, 2022 | 15 | 2022 |
A stochastic newton algorithm for distributed convex optimization B Bullins, K Patel, O Shamir, N Srebro, BE Woodworth Advances in Neural Information Processing Systems 34, 26818-26830, 2021 | 15 | 2021 |
On Convexity and Linear Mode Connectivity in Neural Networks D Yunis, KK Patel, PHP Savarese, G Vardi, J Frankle, M Walter, K Livescu, ... OPT 2022: Optimization for Machine Learning (NeurIPS 2022 Workshop), 2022 | 13 | 2022 |
Federated online and bandit convex optimization KK Patel, L Wang, A Saha, N Srebro International Conference on Machine Learning, 27439-27460, 2023 | 11* | 2023 |
The limits and potentials of local sgd for distributed heterogeneous learning with intermittent communication KK Patel, M Glasgow, A Zindari, L Wang, SU Stich, Z Cheng, N Joshi, ... arXiv preprint arXiv:2405.11667, 2024 | 8* | 2024 |
On the Effect of Defections in Federated Learning and How to Prevent Them M Han, KK Patel, H Shao, L Wang arXiv preprint arXiv:2311.16459, 2023 | 4 | 2023 |
Online Combinatorial Optimization with Group Fairness Constraints N Golrezaei, R Niazadeh, KK Patel, F Susan Available at SSRN 4824251, 2024 | | 2024 |
Grokking, Rank Minimization and Generalization in Deep Learning D Yunis, KK Patel, S Wheeler, PHP Savarese, G Vardi, K Livescu, ... ICML 2024 Workshop on Mechanistic Interpretability, 2024 | | 2024 |
Urban Heat Effect in Ghent: a Time Series Analysis KK Patel, S Roels | | 2018 |
Rank Minimization, Alignment and Weight Decay in Neural Networks D Yunis, KK Patel, S Wheeler, PHP Savarese, G Vardi, K Livescu, ... High-dimensional Learning Dynamics 2024: The Emergence of Structure and …, 0 | | |
Personalization Mitigates the Perils of Local SGD for Heterogeneous Distributed Learning KK Patel, N Gazagnadou, L Wang, L Lyu | | |
One Shot Learning in Humans: A Non-Parametric Bayesian Model KK Patel | | |
Paraphrase Generation Using Deep Generative Models Final Project Report CS-772 N Asnani, K Gandhi, KK Patel | | |
Efficient Private Federated Non-Convex Optimization With Shuffled Model L Wang, X Zhou, KK Patel, L Tang, A Saha Privacy Regulation and Protection in Machine Learning, 0 | | |
Private Overparameterized Linear Regression without Suffering in High Dimensions L Wang, D Zou, KK Patel, J Wu, N Srebro | | |