MPIX Stream: An explicit solution to hybrid MPI+ X programming

H Zhou, K Raffenetti, Y Guo, R Thakur - … of the 29th European MPI Users' …, 2022 - dl.acm.org
The hybrid MPI+ X programming paradigm, where X refers to threads or GPUs, has gained
prominence in the high-performance computing arena. This corresponds to a trend of …

Implementation and evaluation of MPI 4.0 partitioned communication libraries

MGF Dosanjh, A Worley, D Schafer, P Soundararajan… - Parallel Computing, 2021 - Elsevier
Partitioned point-to-point communication primitives provide a performance-oriented
mechanism to support a hybrid parallel programming model and have been included in the …

How I learned to stop worrying about user-visible endpoints and love MPI

R Zambre, A Chandramowliswharan… - Proceedings of the 34th …, 2020 - dl.acm.org
MPI+ threads is gaining prominence as an alternative to the traditional" MPI everywhere"
model in order to better handle the disproportionate increase in the number of cores …

Partitioned collective communication

DJ Holmes, A Skjellum, J Jaeger… - 2021 Workshop on …, 2021 - ieeexplore.ieee.org
Partitioned point-to-point communication and persistent collective communication were both
recently standardized in MPI-4.0. Each offers performance and scalability advantages over …

Lessons learned on MPI+ threads communication

R Zambre… - … Conference for High …, 2022 - ieeexplore.ieee.org
Hybrid MPI+ threads programming is gaining prominence, but, in practice, applications
perform slower with it compared to the MPI everywhere model. The most critical challenge to …

Exampi: A modern design and implementation to accelerate message passing interface innovation

A Skjellum, M Rüfenacht, N Sultana, D Schafer… - … Computing: 6th Latin …, 2020 - Springer
The difficulty of deep experimentation with Message Passing Interface (MPI)
implementations—which are quite large and complex—substantially raises the cost and …

Callback-based completion notification using MPI Continuations

J Schuchart, P Samfass, C Niethammer, J Gracia… - Parallel Computing, 2021 - Elsevier
Asynchronous programming models (APM) are gaining more and more traction, allowing
applications to expose the available concurrency to a runtime system tasked with …

Cmb: a configurable messaging benchmark to explore fine-grained communication

WP Marts, DA Kruse, MGF Dosanjh… - 2024 IEEE 24th …, 2024 - ieeexplore.ieee.org
Modern communication APIs provide increased ability to specify when, where, and how to
send data between processes. One recent innovation is fine-grained communication, where …

Micro-benchmarking mpi partitioned point-to-point communication

Y Hassan Temucin, RE Grant, A Afsahi - Proceedings of the 51st …, 2022 - dl.acm.org
Modern High-Performance Computing (HPC) architectures have developed the need for
scalable hybrid programming models. The latest Message Passing Interface (MPI) 4.0 …

A Dynamic Network-Native MPI Partitioned Aggregation Over InfiniBand Verbs

YH Temuçin, S Levy, W Schonbein… - 2023 IEEE …, 2023 - ieeexplore.ieee.org
Modern HPC systems require efficient hybrid programming model to utilize their hardware
resources effectively. The Message Passing Interface (MPI) has accommodated next …