Central Limit Theorem for Two-Timescale Stochastic Approximation with Markovian Noise: Theory and Applications

J Hu, V Doshi - International Conference on Artificial …, 2024 - proceedings.mlr.press
Two-timescale stochastic approximation (TTSA) is among the most general frameworks for
iterative stochastic algorithms. This includes well-known stochastic optimization methods …

Effectiveness of Constant Stepsize in Markovian LSA and Statistical Inference

DL Huo, Y Chen, Q Xie - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
In this paper, we study the effectiveness of using a constant stepsize in statistical inference
via linear stochastic approximation (LSA) algorithms with Markovian data. After establishing …

[PDF][PDF] Asymptotic convergence rate and statistical inference for stochastic sequential quadratic programming

S Na, MW Mahoney - arXiv: 2205.13687 v1, 2022 - par.nsf.gov
We apply a stochastic sequential quadratic programming (StoSQP) algorithm to solve
constrained nonlinear optimization problems, where the objective is stochastic and the …

Online bootstrap inference with nonconvex stochastic gradient descent estimator

Y Zhong, T Kuffner, S Lahiri - arXiv preprint arXiv:2306.02205, 2023 - arxiv.org
In this paper, we investigate the theoretical properties of stochastic gradient descent (SGD)
for statistical inference in the context of nonconvex optimization problems, which have been …

Stochastic approximation mcmc, online inference, and applications in optimization of queueing systems

X Li, J Liang, X Chen, Z Zhang - arXiv preprint arXiv:2309.09545, 2023 - arxiv.org
Stochastic approximation Markov Chain Monte Carlo (SAMCMC) algorithms are a class of
online algorithms having wide-ranging applications, particularly within Markovian systems …

Accelerating Distributed Stochastic Optimization via Self-Repellent Random Walks

J Hu, V Doshi, DY Eun - arXiv preprint arXiv:2401.09665, 2024 - arxiv.org
We study a family of distributed stochastic optimization algorithms where gradients are
sampled by a token traversing a network of agents in random-walk fashion. Typically, these …

Accelerated Multi-Time-Scale Stochastic Approximation: Optimal Complexity and Applications in Reinforcement Learning and Multi-Agent Games

S Zeng, TT Doan - arXiv preprint arXiv:2409.07767, 2024 - arxiv.org
Multi-time-scale stochastic approximation is an iterative algorithm for finding the fixed point
of a set of $ N $ coupled operators given their noisy samples. It has been observed that due …

Statistical Inference for Temporal Difference Learning with Linear Function Approximation

W Wu, G Li, Y Wei, A Rinaldo - arXiv preprint arXiv:2410.16106, 2024 - arxiv.org
Statistical inference with finite-sample validity for the value function of a given policy in
Markov decision processes (MDPs) is crucial for ensuring the reliability of reinforcement …

Asymptotic Time-Uniform Inference for Parameters in Averaged Stochastic Approximation

C Xie, K Jin, J Liang, Z Zhang - arXiv preprint arXiv:2410.15057, 2024 - arxiv.org
We study time-uniform statistical inference for parameters in stochastic approximation (SA),
which encompasses a bunch of applications in optimization and machine learning. To that …

Does Worst-Performing Agent Lead the Pack? Analyzing Agent Dynamics in Unified Distributed SGD

J Hu, YT Ma, DY Eun - arXiv preprint arXiv:2409.17499, 2024 - arxiv.org
Distributed learning is essential to train machine learning algorithms across heterogeneous
agents while maintaining data privacy. We conduct an asymptotic analysis of Unified …