The Mont-Blanc prototype: an alternative approach for HPC systems

N Rajovic, A Rico, F Mantovani, D Ruiz… - SC'16: Proceedings …, 2016 - ieeexplore.ieee.org
High-performance computing (HPC) is recognized as one of the pillars for further progress in
science, industry, medicine, and education. Current HPC systems are being developed to …

A structured approach to performance analysis

M Wagner, S Mohr, J Giménez, J Labarta - Tools for High Performance …, 2019 - Springer
Performance analysis tools are essential in the process of understanding application
behavior, identifying critical performance issues and adapting applications to new …

Performance analysis of complex engineering frameworks

M Wagner, J Jägersküpper, D Molka… - … of the 12th and of the 13th …, 2021 - Springer
Many engineering applications require complex frameworks to simulate the intricate and
extensive sub-problems involved. However, performance analysis tools can struggle when …

A parallelism profiler with what-if analyses for OpenMP programs

N Boushehrinejadmoradi, A Yoga… - … Conference for High …, 2018 - ieeexplore.ieee.org
This paper proposes OMP-WhIP, a profiler that measures inherent parallelism in the
program for a given input and provides what-if analyses to estimate improvements in …

Cost-efficient elastic stream processing using application-agnostic performance prediction

S Imai, S Patterson, CA Varela - 2016 16th IEEE/ACM …, 2016 - ieeexplore.ieee.org
Cloud computing adds great on-demand scalability to stream processing systems with its
pay-per-use cost model. However, to promise service level agreements to users while …

On-the-Fly Calculation of Model Factors for Multi-paradigm Applications

J Protze, F Orland, K Haldar, T Koritzius… - European Conference on …, 2022 - Springer
Model factors provide initial insight into fundamental issues of parallel applications.
These metrics elaborate beyond conventional HPC metrics to indicate whether an …

Understanding the role of GPGPU-accelerated SoC-based ARM clusters

R Azimi, T Fox, S Reda - 2017 IEEE International Conference …, 2017 - ieeexplore.ieee.org
The last few years saw the emergence of 64-bit ARM SoCs targeted for mobile systems and
servers. Mobile-class SoCs rely on the heterogeneous integration of a mix of CPU cores …

Performance analysis and optimization of the FFTXlib on the Intel Knights Landing architecture

M Wagner, V López, J Morillo… - 2017 46th …, 2017 - ieeexplore.ieee.org
In this paper, we address the decreasing performance of the FFTXlib, the Fast Fourier
Transformation (FFT) kernel of Quantum ESPRESSO, when scaling to a full KNL node. An …

Towards performance and scalability analysis of distributed memory programs on large-scale clusters

S Medya, L Cherkasova, G Magalhaes… - Proceedings of the 7th …, 2016 - dl.acm.org
Many HPC and modern Big Data processing applications belong to a class of so-called
scale-out applications, where the application dataset is partitioned and processed by a …

A run control framework to streamline profiling, porting, and tuning simulation runs and provenance tracking of geoscientific applications

W Sharples, I Zhukov, M Geimer… - Geoscientific model …, 2018 - gmd.copernicus.org
Geoscientific modeling is constantly evolving, with next-generation geoscientific models and
applications placing large demands on high-performance computing (HPC) resources …