Certifying black-box policies with stability for nonlinear control

T Li, Y Lin, S Ren, A Wierman - Advances in Neural …, 2024 - proceedings.neurips.cc

We study the tradeoff between consistency and robustness in the context of a single-
trajectory time-varying Markov Decision Process (MDP) with untrusted machine-learned …

被引用次数：2 相关文章所有 6 个版本

[PDF] neurips.cc

Anytime-competitive reinforcement learning with policy prior

J Yang, P Li, T Li, A Wierman… - Advances in Neural …, 2024 - proceedings.neurips.cc

This paper studies the problem of Anytime-Competitive Markov Decision Process (A-CMDP).
Existing works on Constrained Markov Decision Processes (CMDPs) aim to optimize the …

被引用次数：1 相关文章所有 6 个版本

[PDF] neurips.cc

Online adaptive policy selection in time-varying systems: No-regret via contractive perturbations

Y Lin, JA Preiss, E Anand, Y Li… - Advances in Neural …, 2024 - proceedings.neurips.cc

We study online adaptive policy selection in systems with time-varying costs and dynamics.
We develop the Gradient-based Adaptive Policy Selection (GAPS) algorithm together with a …

被引用次数：5 相关文章所有 5 个版本

[PDF] arxiv.org

Gray-box nonlinear feedback optimization

Z He, S Bolognani, M Muehlebach, F Dörfler - arXiv preprint arXiv …, 2024 - arxiv.org

Feedback optimization enables autonomous optimality seeking of a dynamical system
through its closed-loop interconnection with iterative optimization algorithms. Among various …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Learning-Augmented Scheduling for Solar-Powered Electric Vehicle Charging

T Li - arXiv preprint arXiv:2311.05941, 2023 - arxiv.org

We tackle the complex challenge of scheduling the charging of electric vehicles (EVs)
equipped with solar panels and batteries, particularly under out-of-distribution (OOD) …

高级搜索

QQ 群