Beyond black-box advice: learning-augmented algorithms for MDPs with Q-value predictions

T Li, Y Lin, S Ren, A Wierman - Advances in Neural …, 2024 - proceedings.neurips.cc
We study the tradeoff between consistency and robustness in the context of a
single-trajectory time-varying Markov Decision Process (MDP) with untrusted machine-learned …

Online switching control with stability and regret guarantees

Y Li, JA Preiss, N Li, Y Lin… - … for Dynamics and …, 2023 - proceedings.mlr.press
This paper considers online switching control with a finite candidate controller pool, an
unknown dynamical system, and unknown cost functions. The candidate controllers can be …

Online stabilization of unknown linear time-varying systems

J Yu, V Gupta, A Wierman - arXiv preprint arXiv:2304.02878, 2023 - arxiv.org
This paper studies the problem of online stabilization of an unknown discrete-time linear
time-varying (LTV) system under bounded non-stochastic (potentially adversarial) …

Online adversarial stabilization of unknown networked systems

J Yu, D Ho, A Wierman - Proceedings of the ACM on Measurement and …, 2023 - dl.acm.org
We investigate the problem of stabilizing an unknown networked linear system under
communication constraints and adversarial disturbances. We propose the first provably …

Online Adversarial Stabilization of Unknown Linear Time-Varying Systems

J Yu, V Gupta, A Wierman - 2023 62nd IEEE Conference on …, 2023 - ieeexplore.ieee.org
This paper studies the problem of online stabilization of an unknown discrete-time linear
time-varying (LTV) system under bounded non-stochastic (potentially adversarial) …

Online convex optimization with unbounded memory

R Kumar, S Dean, R Kleinberg - Advances in Neural …, 2024 - proceedings.neurips.cc
Online convex optimization (OCO) is a widely used framework in online learning. In each
round, the learner chooses a decision in a convex set and an adversary chooses a convex …

Online Control for Linear Dynamics: A Data-Driven Approach

Z Liu, Y Chen - arXiv preprint arXiv:2308.08138, 2023 - arxiv.org
This paper considers an online control problem over a linear time-invariant system with
unknown dynamics, bounded disturbance, and adversarial cost. We propose a data-driven …