M Jin - arXiv preprint arXiv:2405.11397, 2024 - arxiv.org
Operating safely and reliably despite continual distribution shifts is vital for high-stakes machine learning applications. This paper builds upon the transformative concept …
We study online policy optimization in nonlinear time-varying dynamical systems where the true dynamical models are unknown to the controller. This problem is challenging because …
M Nonhoff, E Dall'Anese, MA Müller - arXiv preprint arXiv:2401.04487, 2024 - arxiv.org
This article investigates the problem of controlling linear time-invariant systems subject to time-varying and a priori unknown cost functions, state and input constraints, and …
This paper is concerned with the online bandit control problem, which aims to learn the best stabilizing controller from a pool of stabilizing and destabilizing controllers of unknown types …