A survey of power and energy efficient techniques for high performance numerical linear algebra operations

L Tan, S Kothapalli, L Chen, O Hussaini, R Bissiri… - Parallel Computing, 2014 - Elsevier
Extreme scale supercomputers available before the end of this decade are expected to have
100 million to 1 billion computing cores. The power and energy efficiency issue has become …

Energy efficiency vs. performance of the numerical solution of PDEs: An application study on a low-power ARM-based cluster

D Göddeke, D Komatitsch, M Geveler… - Journal of …, 2013 - Elsevier
Power consumption and energy efficiency are becoming critical aspects in the design and
operation of large scale HPC facilities, and it is unanimously recognised that future exascale …

Performance and energy impact of parallelization and vectorization techniques in modern microprocessors

JM Cebrián, L Natvig, JC Meyer - Computing, 2014 - Springer
While Moore's law states that the number of transistors is approximately doubled every 2
years, powering these transistors simultaneously is only possible as long as Dennard …

Improving energy efficiency through parallelization and vectorization on intel core i5 and i7 processors

JM Cebrian, L Natvig, JC Meyer - 2012 SC Companion: High …, 2012 - ieeexplore.ieee.org
Driven by the utilization wall and the Dark Silicon effect, energy efficiency has become a key
research area in microprocessor design. Vectorization, parallelization, specialization and …

Power measurement methods for energy efficient applications

G Calandrini, A Gardel, I Bravo, P Revenga, JL Lázaro… - Sensors, 2013 - mdpi.com
Energy consumption constraints on computing systems are more important than ever.
Maintenance costs for high performance systems are limiting the applicability of processing …

Reformulated conjugate gradient for the energy-aware solution of linear systems on GPUs

JI Aliaga, J Pérez, ES Quintana-Ortí… - 2013 42nd International …, 2013 - ieeexplore.ieee.org
In this paper we introduce a redesign of the conjugate gradient method for the iterative
solution of sparse linear systems on heterogeneous systems accelerated by graphics …

Asynchronous and multiprecision linear solvers-scalable and fault-tolerant numerics for energy efficient high performance computing

H Anzt - 2012 - publikationen.bibliothek.kit.edu
Asynchronous methods minimize idle times by removing synchronization barriers, and
therefore allow the efficient usage of computer systems. The implied high tolerance with …

Case studies of multi-core energy efficiency in task based programs

H Lien, L Natvig, A Al Hasib, JC Meyer - ICT as Key Technology against …, 2012 - Springer
In this paper, we present three performance and energy case studies of benchmark
applications in the OmpSs environment for task based programming. Different parallel and …

Sparse matrix-vector multiply on the keystone ii digital signal processor

Y Gao, F Zhang, JD Bakos - 2014 IEEE High Performance …, 2014 - ieeexplore.ieee.org
In this paper we describe an implementation of sparse matrix-vector multiply (SpMV) on the
Texas Instruments (TI) Keystone II architecture. The Keystone II is an eight core Digital …

A Performance and Energy Study of GPU-Resident Preconditioners for Conjugate Gradient Solvers: In the Context of Existing and Novel Approaches

K Świrydowicz, J Firoz, J Manzano… - 2024 IEEE 36th …, 2024 - ieeexplore.ieee.org
Optimizing a particular subprogram out of the set of Basic (sparse) Linear Algebra
Subprograms (BLAS) for a given architecture is a common topic of research. In applications …