DeepPower: Deep Reinforcement Learning based Power Management for Latency Critical Applications in Multi-core Systems

J Zhang, G Yu, Z He, L Ai, P Chen - Proceedings of the 52nd …, 2023 - dl.acm.org
J Zhang, G Yu, Z He, L Ai, P Chen
Proceedings of the 52nd International Conference on Parallel Processing, 2023dl.acm.org
Latency-critical (LC) applications are widely deployed in modern datacenters. Effective
power management for LC applications can yield significant cost savings. However, it poses
a significant challenge in maintaining the desired Service Level Aggrement (SLA) levels.
Prior researches have mainly emphasized predicting the service time of request and utilize
heuristic algorithms for CPU frequency adjustment. Unfortunately, the control granularity is
limited to the request level and manual feature selection is needed. This paper proposes …
Latency-critical (LC) applications are widely deployed in modern datacenters. Effective power management for LC applications can yield significant cost savings. However, it poses a significant challenge in maintaining the desired Service Level Aggrement (SLA) levels. Prior researches have mainly emphasized predicting the service time of request and utilize heuristic algorithms for CPU frequency adjustment. Unfortunately, the control granularity is limited to the request level and manual feature selection is needed.
This paper proposes DeepPower, a deep reinforcement learning (DRL) based power management solution for LC applications. DeepPower comprises two key components, a DRL agent for monitoring the system load changes and a thread controller for CPU frequency adjustment. Considering the high overhead of the neural network and the short service time of requests, it is infeasible to employ DRL for direct adjustment of CPU frequency at the request level. Instead, DeepPower proposes a hierarchical control mechanism. That means the DRL agent adjusts the parameter of thread controller with longer intervals, and thread controller adjusts the CPU frequency with shorter intervals. This control mechanism enables DeepPower to adapt to dynamic workloads and achieves fine-grained frequency adjustments. We evaluate DeepPower with some common LC applications under dynamic workload. The experimental results show that DeepPower saves up to 28.4% power compared with state-of-the-art methods and reduces the percentage of request timeout.
ACM Digital Library
以上显示的是最相近的搜索结果。 查看全部搜索结果