Predicting the performance impact of different fat-tree configurations

N Jain, A Bhatele, LH Howell, D Böhme… - Proceedings of the …, 2017 - dl.acm.org
The fat-tree topology is one of the most commonly used network topologies in HPC systems.
Vendors support several options that can be configured when deploying fat-tree networks on …

Bandwidth steering in HPC using silicon nanophotonics

G Michelogiannakis, Y Shen, MY Teh, X Meng… - Proceedings of the …, 2019 - dl.acm.org
As bytes-per-FLOP ratios continue to decline, communication is becoming a bottleneck for
performance scaling. This paper describes bandwidth steering in HPC using emerging …

Performance trade-offs in reconfigurable networks for HPC

MY Teh, Z Wu, M Glick, S Rumley… - Journal of Optical …, 2022 - opg.optica.org
Designing efficient interconnects to support high-bandwidth and low-latency communication
is critical toward realizing high performance computing (HPC) and data center (DC) systems …

Dynamic reliability modeling of cyber-physical edge computing network

KC Okafor - International Journal of Computers and Applications, 2021 - Taylor & Francis
Recently, large scale cyber physical systems (LS-CPS) leverage network-cores provided by
application providers (APs) to carry out analytics. These CPS-APs use the automated cloud …

Evaluation of an interference-free node allocation policy on fat-tree clusters

SD Pollard, N Jain, S Herbein… - … Conference for High …, 2018 - ieeexplore.ieee.org
Interference between jobs competing for network bandwidth on a fat-tree cluster can cause
significant variability and degradation in performance. These performance issues can be …

HyperX topology: First at-scale implementation and comparison to the fat-tree

J Domke, S Matsuoka, IR Ivanov, Y Tsushima… - Proceedings of the …, 2019 - dl.acm.org
The de-facto standard topology for modern HPC systems and data-centers are Folded Clos
networks, commonly known as Fat-Trees. The number of network endpoints in these …

Performance optimality or reproducibility: that is the question

T Patki, JJ Thiagarajan, A Ayala, TZ Islam - Proceedings of the …, 2019 - dl.acm.org
The era of extremely heterogeneous supercomputing brings with itself the devil of increased
performance variation and reduced reproducibility. There is a lack of understanding in the …

TLB: Traffic-aware load balancing with adaptive granularity in data center networks

J Hu, J Huang, W Lv, W Li, J Wang, T He - Proceedings of the 48th …, 2019 - dl.acm.org
Modern datacenter topologies typically are multi-rooted trees consisting of multiple paths
between any given pair of hosts. Recent load balancing designs focus on making full use of …

A High-Performance Design, Implementation, Deployment, and Evaluation of The Slim Fly Network

N Blach, M Besta, D De Sensi, J Domke… - … USENIX Symposium on …, 2024 - usenix.org
Novel low-diameter network topologies such as Slim Fly (SF) offer significant cost and power
advantages over the established Fat Tree, Clos, or Dragonfly. To spearhead the adoption of …

Fine-grained load balancing with traffic-aware rerouting in datacenter networks

T Zhang, Y Lei, Q Zhang, S Zou, J Huang… - Journal of Cloud …, 2021 - Springer
Modern datacenters provide a wide variety of application services, which generate a mix of
delay-sensitive short flows and throughput-oriented long flows, transmitting in the multi-path …