Gpu-nest: Characterizing energy efficiency of multi-gpu inference servers

A Jahanshahi, HZ Sabzi, C Lau… - IEEE Computer …, 2020 - ieeexplore.ieee.org
Cloud inference systems have recently emerged as a solution to the ever-increasing
integration of AI-powered applications into the smart devices around us. The wide adoption …

μdpm: Dynamic power management for the microsecond era

CH Chou, LN Bhuyan, D Wong - 2019 IEEE International …, 2019 - ieeexplore.ieee.org
The complex, distributed nature of data centers have spawned the adoption of distributed,
multi-tiered software architectures, consisting of many inter-connected microservices. These …

Energy efficiency of VM consolidation in IaaS clouds

F Teng, L Yu, T Li, D Deng, F Magoulès - The Journal of Supercomputing, 2017 - Springer
The energy efficiency of cloud computing has recently attracted a great deal of attention. As
a result of raised expectations, cloud providers such as Amazon and Microsoft have started …

KRISP: Enabling kernel-wise right-sizing for spatial partitioned gpu inference servers

M Chow, A Jahanshahi, D Wong - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Machine learning (ML) inference workloads present significantly different challenges than
ML training workloads. Typically, inference workloads are shorter running and under-utilize …

Frequency regulation service provision in data center with computational flexibility

W Wang, A Abdolrashidi, N Yu, D Wong - Applied Energy, 2019 - Elsevier
The rapid adoption of cloud storage and computing services led to unprecedented growth of
data centers in the world. As bulk energy consumers, large-scale data centers in the US rack …

WASP: Workload adaptive energy-latency optimization in server farms using server low-power states

F Yao, J Wu, S Subramaniam… - 2017 IEEE 10th …, 2017 - ieeexplore.ieee.org
With the growing energy demands from server farms, it becomes necessary to understand
the tradeoffs between energy consumption and application performance. Typically, server …

Peak efficiency aware scheduling for highly energy proportional servers

D Wong - ACM SIGARCH Computer Architecture News, 2016 - dl.acm.org
Energy proportionality of data center severs have improved drastically over the past decade
to the point where near ideal energy proportional servers are now common. These highly …

The effect of server energy proportionality on data center power oversubscription

S Malla, K Christensen - Future Generation Computer Systems, 2020 - Elsevier
Modern data centers improve resource utilization with power oversubscription. The power
hierarchy in a data center is oversubscribed by installing more servers than allowed by the …

Energy proportional servers: Where are we in 2016?

C Jiang, Y Wang, D Ou, B Luo… - 2017 IEEE 37th …, 2017 - ieeexplore.ieee.org
The huge energy consumption in data centers produces not only high electricity bill but also
tremendous carbon footprints. Although today's servers and data centers of leading internet …

Ts-batpro: Improving energy efficiency in data centers by leveraging temporal–spatial batching

F Yao, J Wu, G Venkataramani… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
With the rapid scaling of data centers, understanding their power characteristics and
optimizing data center energy consumption is a critical task. Typically, data centers are …