Open MPI: Goals, concept, and design of a next generation MPI implementation

E Gabriel, GE Fagg, G Bosilca, T Angskun… - Recent Advances in …, 2004 - Springer
A large number of MPI implementations are currently available, each of which emphasizes
different aspects of high-performance computing or are intended to solve a specific research …

Optimization of collective communication operations in MPICH

R Thakur, R Rabenseifner… - The International Journal …, 2005 - journals.sagepub.com
We describe our work on improving the performance of collective communication operations
in MPICH for clusters connected by switched networks. For each collective operation, we …

When children are more logical than adults: Experimental investigations of scalar implicature

IA Noveck - Cognition, 2001 - Elsevier
A conversational implicature is an inference that consists of attributing to a speaker an
implicit meaning that goes beyond the explicit linguistic meaning of an utterance. This paper …

DeepDive: Transparently identifying and managing performance interference in virtualized environments

D Novaković, N Vasić, S Novaković, D Kostić… - 2013 USENIX Annual …, 2013 - usenix.org
We describe the design and implementation of DeepDive, a system for transparently
identifying and managing performance interference between virtual machines (VMs) co …

Blink: Fast and generic collectives for distributed ML

G Wang, S Venkataraman… - Proceedings of …, 2020 - proceedings.mlsys.org
Model parameter synchronization across GPUs introduces high overheads for
data-parallel training at scale. Existing parameter synchronization protocols cannot effectively …

The Globus striped GridFTP framework and server

W Allcock, J Bresnahan, R Kettimuthu… - SC'05: Proceedings of …, 2005 - ieeexplore.ieee.org
The GridFTP extensions to the File Transfer Protocol define a general-purpose mechanism
for secure, reliable, high-performance data movement. We report here on the Globus striped …

A medium-scale distributed system for computer science research: Infrastructure for the long term

H Bal, D Epema, C de Laat, R Van Nieuwpoort… - Computer, 2016 - ieeexplore.ieee.org
The Dutch Advanced School for Computing and Imaging has built five generations of a 200-
node distributed system over nearly two decades while remaining aligned with the shifting …

Collective communication: theory, practice, and experience

E Chan, M Heimlich, A Purkayastha… - Concurrency and …, 2007 - Wiley Online Library
We discuss the design and high‐performance implementation of collective communications
operations on distributed‐memory computer architectures. Using a combination of known …

MPICH-G2: A grid-enabled implementation of the message passing interface

NT Karonis, B Toonen, I Foster - Journal of Parallel and Distributed …, 2003 - Elsevier
Application development for distributed-computing “Grids” can benefit from tools that
variously hide or enable application-level management of critical aspects of the …

Performance analysis of MPI collective operations

J Pješivac-Grbović, T Angskun, G Bosilca, GE Fagg… - Cluster …, 2007 - Springer
Previous studies of application usage show that the performance of collective
communications is critical for high-performance computing. Despite active research in the …