Fast collective operations using shared and remote memory access protocols on clusters

V Tipparaju, J Nieplocha… - … International Parallel and …, 2003 - ieeexplore.ieee.org
This paper describes a novel methodology for implementing a common set of collective
communication operations on clusters based on symmetric multiprocessor (SMP) nodes …

Fast and scalable MPI-level broadcast using InfiniBand's hardware multicast support

J Liu, AR Mamidala, DK Panda - 18th International Parallel and …, 2004 - ieeexplore.ieee.org
Summary form only given. Modern high performance applications require efficient and
scalable collective communication operations. Currently, most collective operations are …

A practical Approach to the Rating of Barrier Algorithms using the LogP Model and Open MPI

T Hoefler, L Cerquetti, T Mehlan… - 2005 International …, 2005 - ieeexplore.ieee.org
Large-scale parallel applications performing global synchronization may spend a significant
amount of execution time waiting for the completion of a barrier operation. Consequently …

Hardware implementation of MPI_Barrier on an FPGA cluster

S Gao, AG Schmidt, R Sass - 2009 International Conference on …, 2009 - ieeexplore.ieee.org
Message-Passing is the dominant programming model for distributed memory parallel
computers and Message-Passing Interface (MPI) is the standard. Along with point-to-point …

[PDF][PDF] Fast barrier synchronization for InfiniBand

DIT Hoefler - 2006 - pdfs.semanticscholar.org
Fast Barrier Synchronization for InfiniBand™ Page 1 Introduction Modelling Barrier Algorithms
Summary Fast Barrier Synchronization for InfiniBandTM Torsten Höfler Department of Computer …

Highly efficient synchronization based on active memory operations

L Zhang, Z Fang, JB Carter - 18th International Parallel and …, 2004 - ieeexplore.ieee.org
Summary form only given. Synchronization is a crucial operation in many parallel
applications. As network latency approaches thousands of processor cycles for large scale …

Fast and scalable barrier using rdma and multicast mechanisms for infiniband-based clusters

SP Kini, J Liu, J Wu, P Wyckoff, DK Panda - European Parallel Virtual …, 2003 - Springer
This paper describes a methodology for efficiently implementing the barrier operation, on
clusters with the emerging InfiniBand Architecture (IBA). IBA provides hardware level …

An efficient approach to detect concept drifts in data streams

A Jadhav, L Deshpande - 2017 IEEE 7th International Advance …, 2017 - ieeexplore.ieee.org
Due to the presence of data streams in many applications like banking, sensor networks,
and telecommunication, data stream mining has gained increased attention. Data stream is …

Efficient collective operations using remote memory operations on VIA-based clusters

R Gupta, P Balaji, DK Panda… - … Parallel and Distributed …, 2003 - ieeexplore.ieee.org
High performance scientific applications require efficient and fast collective communication
operations. Most collective communication operations have been built on top of point-to …

Fast synchronization on shared-memory multiprocessors: An architectural approach

Z Fang, L Zhang, JB Carter, L Cheng… - Journal of Parallel and …, 2005 - Elsevier
Synchronization is a crucial operation in many parallel applications. Conventional
synchronization mechanisms are failing to keep up with the increasing demand for efficient …