Optimal stochastic non-smooth non-convex optimization through online-to-non-convex conversion

A Cutkosky, H Mehta… - … Conference on Machine …, 2023 - proceedings.mlr.press
We present new algorithms for optimizing non-smooth, non-convex stochastic objectives
based on a novel analysis technique. This improves the current best-known complexity for …

Adaptivity and non-stationarity: Problem-dependent dynamic regret for online convex optimization

P Zhao, YJ Zhang, L Zhang, ZH Zhou - Journal of Machine Learning …, 2024 - jmlr.org
We investigate online convex optimization in non-stationary environments and choose
dynamic regret as the performance measure, defined as the difference between cumulative …

Non-stationary online learning with memory and non-stochastic control

P Zhao, YH Yan, YX Wang, ZH Zhou - The Journal of Machine Learning …, 2023 - dl.acm.org
We study the problem of Online Convex Optimization (OCO) with memory, which allows loss
functions to depend on past decisions and thus captures temporal effects of learning …

Dynamic regret of online markov decision processes

P Zhao, LF Li, ZH Zhou - International Conference on …, 2022 - proceedings.mlr.press
Abstract We investigate online Markov Decision Processes (MDPs) with adversarially
changing loss functions and known transitions. We choose dynamic regret as the …

Unconstrained dynamic regret via sparse coding

Z Zhang, A Cutkosky… - Advances in Neural …, 2024 - proceedings.neurips.cc
Motivated by the challenge of nonstationarity in sequential decision making, we study Online
Convex Optimization (OCO) under the coupling of two problem structures: the domain is …

Optimistic online mirror descent for bridging stochastic and adversarial online convex optimization

S Chen, YJ Zhang, WW Tu, P Zhao, L Zhang - Journal of Machine Learning …, 2024 - jmlr.org
The stochastically extended adversarial (SEA) model, introduced by Sachs et al.(2022),
serves as an interpolation between stochastic and adversarial online convex optimization …

Optimal dynamic regret in LQR control

D Baby, YX Wang - Advances in Neural Information …, 2022 - proceedings.neurips.cc
We consider the problem of nonstochastic control with a sequence of quadratic losses, ie,
LQR control. We provide an efficient online algorithm that achieves an optimal dynamic …

Adaptive, doubly optimal no-regret learning in strongly monotone and exp-concave games with gradient feedback

M Jordan, T Lin, Z Zhou - Operations Research, 2024 - pubsonline.informs.org
Online gradient descent (OGD) is well-known to be doubly optimal under strong convexity or
monotonicity assumptions:(1) in the single-agent setting, it achieves an optimal regret of Θ …

Online label shift: Optimal dynamic regret meets practical algorithms

D Baby, S Garg, TC Yen… - Advances in …, 2024 - proceedings.neurips.cc
This paper focuses on supervised and unsupervised online label shift, where the class
marginals $ Q (y) $ variesbut the class-conditionals $ Q (x| y) $ remain invariant. In the …

Fast rates in time-varying strongly monotone games

YH Yan, P Zhao, ZH Zhou - International Conference on …, 2023 - proceedings.mlr.press
Multi-player online games depict the interaction of multiple players with each other over
time. Strongly monotone games are of particular interest since they have benign properties …