A mean-field game approach to cloud resource management with function approximation

W Mao, H Qiu, C Wang, H Franke… - Advances in …, 2022 - proceedings.neurips.cc
Advances in Neural Information Processing Systems, 2022proceedings.neurips.cc
Reinforcement learning (RL) has gained increasing popularity for resource management in
cloud services such as serverless computing. As self-interested users compete for shared
resources in a cluster, the multi-tenancy nature of serverless platforms necessitates multi-
agent reinforcement learning (MARL) solutions, which often suffer from severe scalability
issues. In this paper, we propose a mean-field game (MFG) approach to cloud resource
management that is scalable to a large number of users and applications and incorporates …
Abstract
Reinforcement learning (RL) has gained increasing popularity for resource management in cloud services such as serverless computing. As self-interested users compete for shared resources in a cluster, the multi-tenancy nature of serverless platforms necessitates multi-agent reinforcement learning (MARL) solutions, which often suffer from severe scalability issues. In this paper, we propose a mean-field game (MFG) approach to cloud resource management that is scalable to a large number of users and applications and incorporates function approximation to deal with the large state-action spaces in real-world serverless platforms. Specifically, we present an online natural actor-critic algorithm for learning in MFGs compatible with various forms of function approximation. We theoretically establish its finite-time convergence to the regularized Nash equilibrium under linear function approximation and softmax parameterization. We further implement our algorithm using both linear and neural-network function approximations, and evaluate our solution on an open-source serverless platform, OpenWhisk, with real-world workloads from production traces. Experimental results demonstrate that our approach is scalable to a large number of users and significantly outperforms various baselines in terms of function latency and resource utilization efficiency.
proceedings.neurips.cc
以上显示的是最相近的搜索结果。 查看全部搜索结果