S4BXI: the MPI-ready Portals 4 Simulator

J Emmanuel, M Moy, L Henrio… - 2021 29th International …, 2021 - ieeexplore.ieee.org
We present a simulator for High Performance Computing (HPC) interconnection networks. It
models Portals 4, a standard low-level API for communication, and it allows running …

Fast and faithful performance prediction of MPI applications: the HPL case study

T Cornebize, A Legrand… - 2019 IEEE International …, 2019 - ieeexplore.ieee.org
Finely tuning MPI applications (number of processes, granularity, collective operation
algorithms, topology and process placement) is critical to obtain good performance on …

Design of a simulation model for high performance LINPACK in hybrid CPU-GPU systems

Y Hu, L Lu - The Journal of Supercomputing, 2021 - Springer
High performance LINPACK (HPL) benchmark is used to evaluate the maximum floating-
point performance of a computer cluster. Since the performance of the graphics processing …

Modeling, prediction and optimization of energy consumption of MPI applications using SimGrid

F Heinrich - 2019 - theses.hal.science
The High-Performance Computing (HPC) community is currently undergoingdisruptive
technology changes in almost all fields, including a switch towardsmassive parallelism with …

ROCK-CNN: a distributed RockPro64-based convolutional neural network cluster for IoT. Verification and performance analysis

R Khaydarova, V Fishchenko… - … 26th Conference of …, 2020 - ieeexplore.ieee.org
The paper is dedicated to optimization of machine learning and neural networks
applications by replacing common servers with Single Board Computer (SBC) clusters to …

Immersion cooled ARM-based computer clusters towards low-cost high-performance computing

H Bostanci, MA Shareef… - 2020 19th IEEE Intersociety …, 2020 - ieeexplore.ieee.org
This study aimed to investigate performance of ARM-based computer clusters using two-
phase immersion cooling (IC) approach, and demonstrate its potential benefits over the air …

Benchmarking lammps: sensitivity to task location under cpu-based weak-scaling

JA Moríñigo, P García-Muller, AJ Rubio-Montero… - … Computing: 5th Latin …, 2019 - Springer
This investigation summarizes a set of executions completed on the supercomputers
Stampede at TACC (USA), Helios at IFERC (Japan), and Eagle at PSNC (Poland), with the …

Performance drop at executing communication-intensive parallel algorithms

JA Moríñigo, P García-Muller, AJ Rubio-Montero… - The Journal of …, 2020 - Springer
This work summarizes the results of a set of executions completed on three fat-tree network
supercomputers: Stampede at TACC (USA), Helios at IFERC (Japan) and Eagle at PSNC …

Évaluation d'algorithmes d'ordonnancement par simulation réaliste

A Faure, M Poquet, O Richard - 2018 - inria.hal.science
La diversité des plateformes de calcul à haute performance ne fait qu'augmenter. Le gestion-
naire de ressources et de tâches (ou RJMS pour Resources and Jobs Management …

[图书][B] High Performance Computing

E Meneses, H Castro, CJB Hernández… - 2019 - Springer
The use and development of high-performance computing (HPC) in Latin America is steadily
growing. New challenges come from the capabilities provided by clusters, grids, and …