Energy‐aware high‐performance computing: survey of state‐of‐the‐art tools, techniques, and environments

P Czarnul, J Proficz, A Krzywaniak - Scientific Programming, 2019 - Wiley Online Library
The paper presents state of the art of energy‐aware high‐performance computing (HPC), in
particular identification and classification of approaches by system and device types …

[图书][B] Parallel programming for modern high performance computing systems

P Czarnul - 2018 - books.google.com
In view of the growing presence and popularity of multicore and manycore processors,
accelerators, and coprocessors, as well as clusters using such computing devices, the …

MERPSYS: an environment for simulation of parallel application execution on large scale HPC systems

P Czarnul, J Kuchta, M Matuszek, J Proficz… - … Modelling Practice and …, 2017 - Elsevier
In this paper we present a new environment called MERPSYS that allows simulation of
parallel application execution time on cluster-based systems. The environment offers a …

DEPO: A dynamic energy‐performance optimizer tool for automatic power capping for energy efficient high‐performance computing

A Krzywaniak, P Czarnul… - Software: Practice and …, 2022 - Wiley Online Library
In the article we propose an automatic power capping software tool DEPO that allows one to
perform runtime optimization of performance and energy related metrics. For an assumed …

Analyzing energy/performance trade-offs with power capping for parallel applications on modern multi and many core processors

A Krzywaniak, J Proficz… - … Federated conference on …, 2018 - ieeexplore.ieee.org
In the paper we present extensive results from analyzing energy/performance trade-offs with
power capping observed on four different modern CPUs, for three different parallel …

Extended investigation of performance-energy trade-offs under power capping in HPC environments

A Krzywaniak, P Czarnul… - … Conference on High …, 2019 - ieeexplore.ieee.org
In the paper we present investigation of performance-energy trade-offs under power capping
using modern processors. The results are presented for systems targeted at both server and …

Improving all-reduce collective operations for imbalanced process arrival patterns

J Proficz - The Journal of Supercomputing, 2018 - Springer
Two new algorithms for the all-reduce operation optimized for imbalanced process arrival
patterns (PAPs) are presented:(1) sorted linear tree,(2) pre-reduced ring as well as a new …

Modeling energy consumption of parallel applications

P Czarnul, J Kuchta, P Rościszewski… - … on Computer Science …, 2016 - ieeexplore.ieee.org
The paper presents modeling and simulation of energy consumption of two types of parallel
applications: geometric Single Program Multiple Data (SPMD) and divide-and-conquer …

Process arrival pattern aware algorithms for acceleration of scatter and gather operations

J Proficz - Cluster Computing, 2020 - Springer
Imbalanced process arrival patterns (PAPs) are ubiquitous in many parallel and distributed
systems, especially in HPC ones. The collective operations, eg in MPI, are designed for …

All-gather Algorithms Resilient to Imbalanced Process Arrival Patterns

J Proficz - ACM Transactions on Architecture and Code …, 2021 - dl.acm.org
Two novel algorithms for the all-gather operation resilient to imbalanced process arrival
patterns (PATs) are presented. The first one, Background Disseminated Ring (BDR), is …