A survey of communication performance models for high-performance computing

JA Rico-Gallego, JC Díaz-Martín… - ACM Computing …, 2019 - dl.acm.org
This survey aims to present the state of the art in analytic communication performance
models, providing sufficiently detailed descriptions of particularly noteworthy efforts …

Bi-objective optimization of data-parallel applications on heterogeneous HPC platforms for performance and energy through workload distribution

H Khaleghzadeh, M Fahad, A Shahid… - … on Parallel and …, 2020 - ieeexplore.ieee.org
Performance and energy are the two most important objectives for optimization on modern
parallel platforms. In this article, we show that moving from single-objective optimization for …

Обзор моделей параллельных вычислений

НА Ежова, ЛБ Соколинский - Вестник Южно-Уральского …, 2019 - cyberleninka.ru
Цель данного обзора дать максимально полное представление о достижениях и
современном состоянии дел в разработке аналитических моделей параллельных …

BSF: A parallel computation model for scalability estimation of iterative numerical algorithms on cluster computing systems

LB Sokolinsky - Journal of Parallel and Distributed Computing, 2021 - Elsevier
This paper examines a novel parallel computation model called bulk synchronous farm
(BSF) that focuses on estimating the scalability of compute-intensive iterative algorithms …

[HTML][HTML] Model-based selection of optimal MPI broadcast algorithms for multi-core clusters

E Nuriyev, JA Rico-Gallego, A Lastovetsky - Journal of Parallel and …, 2022 - Elsevier
The performance of collective communication operations determines the overall
performance of MPI applications. Different algorithms have been developed and …

Correlation of performance optimizations and energy consumption for stencil-based application on Intel Xeon scalable processors

L Szustak, R Wyrzykowski, T Olas… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
This article provides a comprehensive study of the impact of performance optimizations on
the energy efficiency of a real-world CFD application called MPDATA, as well as an …

Exploration of OpenCL heterogeneous programming for porting solidification modeling to CPU‐GPU platforms

K Halbiniak, L Szustak, T Olas… - Concurrency and …, 2021 - Wiley Online Library
This article provides a comprehensive study of OpenCL heterogeneous programming for
porting applications to CPU–GPU computing platforms, with a real‐life application for the …

Sparse FCM-based map-reduce framework for distributed parallel data clustering in E-Khool learning platform

A Suki Antely, P Jegatheeswari… - … Journal of Uncertainty …, 2023 - World Scientific
Parallel clustering serves as a platform for handling big data. The literature displays a
number of clustering algorithms using a map-reduce framework, but they did not assure the …

Assessment of offload-based programming environments for hybrid CPU–MIC platforms in numerical modeling of solidification

K Halbiniak, R Wyrzykowski, L Szustak… - … Modelling Practice and …, 2018 - Elsevier
Heterogeneous (or hybrid) computing platforms with Intel Xeon Phi accelerators offer
potential advantages of energy efficient, massively parallel computing, while supporting …

Performance portable parallel programming of heterogeneous stencils across shared-memory platforms with modern Intel processors

L Szustak, P Bratek - The International Journal of High …, 2019 - journals.sagepub.com
In this work, we take up the challenge of performance portable programming of
heterogeneous stencil computations across a wide range of modern shared-memory …