Pipelining and overlapping for MPI collective operations

J Worringen - 28th Annual IEEE International Conference on …, 2003 - ieeexplore.ieee.org
Collective operations are an important aspect of the currently most important message-
passing programming model MPI (message passing interface). Many MPI applications make …

Exploiting hierarchy in parallel computer networks to optimize collective operation performance

NT Karonis, BR De Supinski, I Foster… - Proceedings 14th …, 2000 - ieeexplore.ieee.org
The efficient implementation of collective communication operations has received much
attention. Initial efforts modeled network communication and produced" optimal" trees based …

Implementation and performance analysis of non-blocking collective operations for MPI

T Hoefler, A Lumsdaine, W Rehm - Proceedings of the 2007 ACM/IEEE …, 2007 - dl.acm.org
Collective operations and non-blocking point-to-point operations have always been part of
MPI. Although non-blocking collective operations are an obvious extension to MPI, there …

Adaptive recursive doubling algorithm for collective communication

O Arap, M Swany, G Brown… - 2015 IEEE International …, 2015 - ieeexplore.ieee.org
Process arrival times at MPI collective operations differ significantly. Addressing this fact with
special handling for popular collective communication algorithms can yield performance …

Automatically tuned collective communications

SS Vadhiyar, GE Fagg… - SC'00: Proceedings of the …, 2000 - ieeexplore.ieee.org
The performance of the MPI's collective communications is critical in most MPI-based
applications. A general algorithm for a given collective communication operation may not …

Mpi collective communication operations on large shared memory systems

M Bernaschi, G Richelli - Proceedings Ninth Euromicro …, 2001 - ieeexplore.ieee.org
Collective communication performance is critical in a number of MPI applications yet
relatively few results are available to assess the performance of MPI implementations …

Efficient mpi collective operations for clusters in long-and-fast networks

M Matsuda, T Kudoh, Y Kodama… - 2006 IEEE …, 2006 - ieeexplore.ieee.org
Several MPI systems for grid environment, in which clusters are connected by wide-area
networks, have been proposed. However, the algorithms of collective communication in …

Collective communication and communicators in mpi++

D Kafura, L Huang - Proceedings. Second MPI Developer's …, 1996 - ieeexplore.ieee.org
The paper describes the current version of mpi++, a C++ language binding for MPI, that
includes all of the collective services, and services for contexts, groups and communicators …

Improving collective I/O performance using threads

PM Dickens, R Thakur - … and 10th Symposium on Parallel and …, 1999 - ieeexplore.ieee.org
Massively parallel computers are increasingly being used to solve large, I/O intensive
applications in many different fields. For such applications, the I/O requirements quite often …

Multi-core aware optimization for MPI collectives

B Tu, M Zou, J Zhan, X Zhao… - 2008 IEEE International …, 2008 - ieeexplore.ieee.org
MPI collective operations on multi-core clusters should be multi-core aware. In this paper,
collective algorithms with hierarchical virtual topology focus on the performance difference …