Reducing energy bloat in large model training

JW Chung, Y Gu, I Jang, L Meng, N Bansal… - Proceedings of the …, 2024 - dl.acm.org
Training large AI models on numerous GPUs consumes a massive amount of energy,
making power delivery one of the largest limiting factors in building and operating …

SmartOClock: Workload-and risk-aware overclocking in the cloud

J Stojkovic, PA Misra, Í Goiri, S Whitlock… - 2024 ACM/IEEE 51st …, 2024 - ieeexplore.ieee.org
Operating server components beyond their voltage and power design limit (ie, overclocking)
enables improving performance and lowering cost for cloud workloads. However …

Dynamic Idle Resource Leasing To Safely Oversubscribe Capacity At Meta

N Gupta, I Narayanan, S Handa, S Chakraborti… - Proceedings of the …, 2024 - dl.acm.org
Meta maintains additional capacity within its infrastructure to ensure high availability for
business workloads, accommodating user growth, temporal traffic variations, and …

A Framework for Carbon-aware Real-Time Workload Management in Clouds using Renewables-driven Cores

TB Hewage, S Ilager, MA Rodriguez… - arXiv preprint arXiv …, 2024 - arxiv.org
Cloud platforms commonly exploit workload temporal flexibility to reduce their carbon
emissions. They suspend/resume workload execution for when and where the energy is …

PADS: Power Budgeting with Diagonal Scaling for Performance-Aware Cloud Workloads

M Savasci, A Souza, D Irwin… - 2024 IEEE 15th …, 2024 - ieeexplore.ieee.org
Cloud platforms' rapid growth raises significant concerns about their electricity consumption
and resulting carbon emissions. Power capping is a known technique for limiting the power …

FLAPS: fluctuation-aware power auction strategy for reducing the power overload probability

X Cai, H Zhao, X Hou, W Cui, Q Chen, C Li… - Frontiers of Computer …, 2025 - Springer
Conclusion In this paper, we propose a Fluctuation-Aware Power Auction Strategy (FLAPS),
which aims to reduce power usage overload. FLAPS identifies previously overlooked job …

A Thermal Reduced Order Model for Power Throttling Simulations of a 3D IC

D Geb, S Deodhar, N Netake… - International …, 2024 - asmedigitalcollection.asme.org
Power throttling with dynamic voltage and frequency scaling (DVFS) is important to the
dynamic thermal management (DTM) of processors. With increasing numbers of on-chip …

Software-Oriented Hardware Prefetching and Vector Execution

N Adit - 2024 - search.proquest.com
The hardware-software abstraction enables programmers to write high-level algorithms
without delving into low-level microarchitectural details. Compilers, positioned at the …

Self-aware Memory Management for Emerging Architectures

B Maity - 2023 - escholarship.org
The ever-increasing demands of data-intensive applications and the rapid evolution of
computer architectures have posed significant challenges in memory performance and …