[HTML][HTML] Energy-aware scheduling for high-performance computing systems: A survey

B Kocot, P Czarnul, J Proficz - Energies, 2023 - mdpi.com
High-performance computing (HPC), according to its name, is traditionally oriented toward
performance, especially the execution time and scalability of the computations. However …

Dynamic GPU energy optimization for machine learning training workloads

F Wang, W Zhang, S Lai, M Hao… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
GPUs are widely used to accelerate the training of machine learning workloads. As modern
machine learning models become increasingly larger, they require a longer time to train …

SYnergy: Fine-grained Energy-Efficient Heterogeneous Computing for Scalable Energy Saving

K Fan, M D'Antonio, L Carpentieri, B Cosenza… - Proceedings of the …, 2023 - dl.acm.org
Energy-efficient computing uses power management techniques such as frequency scaling
to save energy. Implementing energy-efficient techniques on large-scale computing systems …

Fine-grained powercap allocation for power-constrained systems based on multi-objective machine learning

M Hao, W Zhang, Y Wang, G Lu, F Wang… - … on Parallel and …, 2020 - ieeexplore.ieee.org
Power capping is an important solution to keep the system within a fixed power constraint.
However, for the over-provisioned and power-constrained systems, especially the future …

Online power management for multi-cores: A reinforcement learning based approach

Y Wang, W Zhang, M Hao… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Power and energy is the first-class design constraint for multi-core processors and is a
limiting factor for future-generation supercomputers. While modern processor design …

An automated and portable method for selecting an optimal GPU frequency

G Ali, M Side, S Bhalachandra, NJ Wright… - Future Generation …, 2023 - Elsevier
Power consumption poses a significant challenge in current and emerging graphics
processing unit (GPU) enabled high-performance computing systems. In modern GPUs …

Sustaining performance while reducing energy consumption: a control theory approach

S Cerf, R Bleuse, V Reis, S Perarnau… - Euro-Par 2021: Parallel …, 2021 - Springer
Production high-performance computing systems continue to grow in complexity and size.
As applications struggle to make use of increasingly heterogeneous compute nodes …

Optimal GPU frequency selection using multi-objective approaches for hpc systems

G Ali, S Bhalachandra, NJ Wright… - 2022 IEEE High …, 2022 - ieeexplore.ieee.org
Power consumption poses a significant challenge in current and emerging GPU-enabled
high-performance computing (HPC) systems. In modern GPUs, controls like dynamic voltage …

[HTML][HTML] JuMonC: A RESTful tool for enabling monitoring and control of simulations at scale

C Witzler, FSM Guimarães, D Mira, H Anzt… - Future Generation …, 2025 - Elsevier
As systems and simulations grow in size and complexity, it is challenging to maintain
efficient use of resources and avoid failures. In this scenario, monitoring becomes even …

A reinforcement learning approach for performance-aware reduction in power consumption of data center compute nodes

A Raj, S Perarnau, A Gokhale - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
As Exascale computing becomes a reality, the energy needs of compute nodes in cloud data
centers will continue to grow. A common approach to reducing this energy demand is to limit …