Anytime-competitive reinforcement learning with policy prior

J Yang, P Li, T Li, A Wierman… - Advances in Neural …, 2024 - proceedings.neurips.cc
This paper studies the problem of Anytime-Competitive Markov Decision Process (A-CMDP).
Existing works on Constrained Markov Decision Processes (CMDPs) aim to optimize the …

Robust learning for smoothed online convex optimization with feedback delay

P Li, J Yang, A Wierman, S Ren - Advances in Neural …, 2024 - proceedings.neurips.cc
We study a general form of Smoothed Online Convex Optimization, aka SOCO, including
multi-step switching costs and feedback delay. We propose a novel machine learning (ML) …

Learning-augmented decentralized online convex optimization in networks

P Li, J Yang, A Wierman, S Ren - … of the ACM on Measurement and …, 2024 - dl.acm.org
This paper studies learning-augmented decentralized online convex optimization in a
networked multi-agent system, a challenging setting that has remained under-explored. We …

Online Allocation with Replenishable Budgets: Worst Case and Beyond

J Yang, P Li, MJ Islam, S Ren - … of the ACM on Measurement and …, 2024 - dl.acm.org
This paper studies online resource allocation with replenishable budgets, where budgets
can be replenished on top of the initial budget and an agent sequentially chooses online …

Autoscaling via Online Optimization With Switching Cost Constraints

Z Shi, J Tan - IEEE Transactions on Networking, 2025 - ieeexplore.ieee.org
In cloud services, autoscaling is one of the most important features, which enables system
intelligence to adaptively assign computing resources for users according to real-time …

[图书][B] Learning-Augmented Online Decision Making With Guaranteed Trustworthiness

J Yang - 2023 - search.proquest.com
Many mission-critical systems need to solve online decision-making problems such as
workload scheduling in datacenters, power allocation in edge computing, battery …