Cheetah: A framework for scalable hierarchical collective operations

R Graham, MG Venkata, J Ladd… - 2011 11th IEEE/ACM …, 2011 - ieeexplore.ieee.org
Collective communication operations, used by many scientific applications, tend to limit
overall parallel application performance and scalability. Computer systems are becoming …

ConnectX-2 CORE-Direct enabled asynchronous broadcast collective communications

MG Venkata, RL Graham, JS Ladd… - … on Parallel and …, 2011 - ieeexplore.ieee.org
This paper describes the design and implementation of InfiniBand (IB) CORE-Direct based
blocking and nonblocking broadcast operations within the Cheetah collective operation …

Designing an offloaded nonblocking MPI_Allgather collective using CORE-Direct

G Inozemtsev, A Afsahi - 2012 IEEE International Conference …, 2012 - ieeexplore.ieee.org
Collective communication operations in the Message Passing Interface (MPI) consume a
significant amount of time at scale, degrading the performance of scientific applications …

Non-blocking PMI extensions for fast MPI startup

S Chakraborty, H Subramoni, A Moody… - 2015 15th IEEE/ACM …, 2015 - ieeexplore.ieee.org
An efficient implementation of the Process Management Interface (PMI) is crucial to enable
fast start-up of MPI jobs. We propose three extensions to the PMI specification: 1) a blocking …

Can network-offload based non-blocking neighborhood MPI collectives improve communication overheads of irregular graph algorithms?

K Kandalla, A Buluç, H Subramoni… - 2012 IEEE …, 2012 - ieeexplore.ieee.org
Graph-based computations are commonly used across various data intensive computing
domains ranging from social networks to biological systems. On distributed memory …

The co-design architecture for exascale systems, a novel approach for scalable designs

G Shainer, T Wilde, P Lui, T Liu, M Kagan… - … Science-Research and …, 2013 - Springer
High performance computing (HPC) has begun scaling beyond the Petaflop range towards
the Exaflop (1000 Petaflops) mark. One of the major concerns throughout the development …

Design and implementation of broadcast algorithms for extreme-scale systems

P Shamis, R Graham, MG Venkata… - 2011 IEEE International …, 2011 - ieeexplore.ieee.org
The scalability and performance of collective communication operations limit the scalability
and performance of many scientific applications. This paper presents two new blocking and …

The mathematics of shape-geometry approach to the analysis of curve profile

F Khosrowshahi - 1999 IEEE International Conference on …, 1999 - ieeexplore.ieee.org
It has been previously shown that the visual approach to the analysis of the behaviour of
growth patterns can provide a viable solution to the understanding of the physical behaviour …

Adding Process-driven collaboration support in Moodle

R Perez-Rodriguez… - 2009 39th IEEE …, 2009 - ieeexplore.ieee.org
Moodle is a well-known open-source LMS. Moodle approach to collaborative learning is just
limited to "putting people around a table". It provides a means for participants to interact …

Simple fault-tolerant computing for field solvers

A Degro, R Löhner - International Journal of Computational Fluid …, 2020 - Taylor & Francis
Fault-tolerant computing options based on the use of restart information stored on and off
node and the use of reserve processes have been developed, implemented and tested in a …