FLASH: Fast model adaptation in ML-centric cloud platforms

H Qiu, W Mao, A Patke, S Cui, C Wang… - Proceedings of …, 2024 - proceedings.mlsys.org
The emergence of ML in various cloud system management tasks (e.g., workload autoscaling
and job scheduling) has become a core driver of ML-centric cloud platforms. However, there …

Cloud systems management with efficient and robust online learning

H Qiu - 2024 - haoran-qiu.com
Large-scale cloud computing systems rely heavily on decision-making algorithms for critical
system management tasks such as resource allocation, job scheduling, and power …

Multi-agent reinforcement learning for nonzero-sum Markov games

W Mao - 2024 - ideals.illinois.edu
In recent years, multi-agent reinforcement learning (MARL) has shown remarkable
capabilities in addressing sequential decision-making problems that involve the strategic …