An integrated tutorial on InfiniBand, verbs, and MPI

P MacArthur, Q Liu, RD Russell… - … Surveys & Tutorials, 2017 - ieeexplore.ieee.org
This tutorial presents the details of the interconnection network utilized in many high
performance computing (HPC) systems today.“InfiniBand” is the hardware interconnect …

Throttling for bandwidth imbalanced data transfers

T Schneider, KD Underwood, M Flajslik, S Sur… - US Patent …, 2020 - Google Patents
Techniques are disclosed to throttle bandwidth imbalanced data transfers. In some
examples, an example computer-implemented method may include splitting a payload of a …

Toward lower-diameter large-scale HPC and data center networks with co-packaged optics

P Maniotis, L Schares, BG Lee… - Journal of Optical …, 2021 - opg.optica.org
We investigate the advantages of using co-packaged optics for building low-diameter, large-
scale high-performance computing (HPC) and data center networks. The increased escape …

[PDF][PDF] Solving hot spot contention using infiniband architecture congestion control

G Pfister, M Gusat, W Denzel, D Craddock… - Proceedings HP-IPC …, 2005 - researchgate.net
Since at least 1985 [1] it has been known that certain traffic patterns in multistage
interconnection networks, hot spots, can cause catastrophic congestion and loss of …

Revisiting Congestion Control for Lossless Ethernet

Y Zhang, Q Meng, C Hu, F Ren - 21st USENIX Symposium on …, 2024 - usenix.org
Congestion control is a key enabler for lossless Ethernet at scale. In this paper, we revisit
this classic topic from a new perspective, ie, understanding and exploiting the intrinsic …

A new proposal to deal with congestion in InfiniBand-based fat-trees

J Escudero-Sahuquillo, PJ Garcia, FJ Quiles… - Journal of Parallel and …, 2014 - Elsevier
The overall performance of High-Performance Computing applications may depend largely
on the performance achieved by the network interconnecting the end-nodes; thus high …

Exploration of congestion control techniques on dragonfly-class hpc networks through simulation

N McGlohon, CD Carothers… - … and Simulation of …, 2021 - ieeexplore.ieee.org
Ensuring optimal communication latency in High Performance Computing (HPC) networks is
of critical importance to the efficient operation of facilitated applications. Different application …

Latency and throughput optimization in modern networks: a comprehensive survey

A Mirzaeinnia, M Mirzaeinia, A Rezgui - arXiv preprint arXiv:2009.03715, 2020 - arxiv.org
Modern applications are highly sensitive to communication delays and throughput. This
paper surveys major attempts on reducing latency and increasing the throughput. These …

Noise injection techniques to expose subtle and unintended message races

K Sato, DH Ahn, I Laguna, GL Lee, M Schulz… - Proceedings of the …, 2017 - dl.acm.org
Debugging intermittently occurring bugs within MPI applications is challenging, and
message races, a condition in which two or more sends race to match with a receive, are …

Hot-spot avoidance with multi-pathing over infiniband: An mpi perspective

A Vishnu, M Koop, A Moody… - … Computing and the …, 2007 - ieeexplore.ieee.org
Large scale InfiniBand clusters are becoming increasingly popular, as reflected by the TOP
500 supercomputer rankings. At the same time, fat tree has become a popular …