Modeling power and energy usage of HPC kernels

A Tiwari, MA Laurenzano, L Carrington… - 2012 IEEE 26th …, 2012 - ieeexplore.ieee.org
Compute intensive kernels make up the majority of execution time in HPC applications.
Therefore, many of the power draw and energy consumption traits of HPC applications can …

Using hardware performance counters to speed up autotuning convergence on GPUs

J Filipovič, J Hozzová, A Nezarat, J Ol'ha… - Journal of Parallel and …, 2022 - Elsevier
Nowadays, GPU accelerators are commonly used to speed up general-purpose computing
tasks on a variety of hardware. However, due to the diversity of GPU architectures and …

Auto-tuning for energy usage in scientific applications

A Tiwari, MA Laurenzano, L Carrington… - Euro-Par 2011: Parallel …, 2012 - Springer
The power wall has become a dominant impeding factor in the realm of exascale system
design. It is therefore important to understand how to most effectively create software to …

Online inertia-based temperature estimation for reliability enhancement

M Chhablani, I Koren… - Journal of Low Power …, 2016 - ingentaconnect.com
With the advent of technology scaling and the increased use of high performance
multi-cores in life-critical applications, reliability has become an increasingly pressing issue. High …

Cross-architecture prediction based scheduling for energy efficient execution on single-ISA heterogeneous chip-multiprocessors

Y Zhang, L Duan, B Li, L Peng, S Sadagopan - Microprocessors and …, 2015 - Elsevier
In recent years, single-ISA heterogeneous chip multiprocessors (CMP) consisting of big
high-performance cores and small power-saving cores on the same die have been proposed for …

Evaluating linear regression for temperature modeling at the core level

D Upton, K Hazelwood - Workshop on Duplication, Deconstructing, and …, 2011 - Citeseer
Temperature issues have become a first-order concern for modern computing systems.
There are several approaches for dynamic thermal management, including reacting based …

Energy efficiency via the n-way model

R Cledat, S Pande - PESPMA 2010-Workshop on Parallel Execution …, 2010 - inria.hal.science
With core counts as well as heterogeneity on the rise, the sequential components of
applications are becoming the major bottleneck in performance scaling as predicted by …

Pre-execution power consumption prediction of computational multithreaded workloads

H Fadishei, H Deldari, M Naghibzadeh - Cluster computing, 2014 - Springer
Power management in large-scale computational environments can significantly benefit
from predictive models. Such models provide information about the power consumption …

Enabling efficient online profiling of homogeneous and heterogeneous multicore systems

D Upton - Dissertation, University of Virginia, August, 2011 - Citeseer
Using profiling tools is a common way to understand computer systems and software and to
achieve the best performance. Profiling becomes more important as computing technology …

Helper Thread Prefetching Control Framework on Chip Multi-processor

J Zhang, Z Gu, Y Huang, N Zheng, X Hu - International Journal of Parallel …, 2015 - Springer
Helper thread prefetching can improve performance of irregular data-intensive applications.
However, helper thread prefetching quality depends on the values of control parameters …