Cerberus: The power of choices in datacenter topology design-a throughput perspective

C Griner, J Zerwas, A Blenk, M Ghobadi… - Proceedings of the …, 2021 - dl.acm.org
The bandwidth and latency requirements of modern datacenter applications have led
researchers to propose various topology designs using static, dynamic demand-oblivious …

A throughput-centric view of the performance of datacenter topologies

P Namyar, S Supittayapornpong, M Zhang… - Proceedings of the …, 2021 - dl.acm.org
While prior work has explored many proposed datacenter designs, only two designs, Clos-
based and expander-based, are generally considered practical because they can scale …

Watch out for the bully! job interference study on dragonfly network

X Yang, J Jenkins, M Mubarak… - SC'16: Proceedings of …, 2016 - ieeexplore.ieee.org
High-radix, low-diameter dragonfly networks will be a common choice in next-generation
supercomputers. Preliminary studies show that random job placement with adaptive routing …

Analyzing network health and congestion in dragonfly-based supercomputers

A Bhatele, N Jain, Y Livnat, V Pascucci… - 2016 IEEE …, 2016 - ieeexplore.ieee.org
The dragonfly topology is a popular choice for building high-radix, low-diameter, hierarchical
networks with high-bandwidth links. On Cray installations of the dragonfly network, job …

Evaluating HPC networks via simulation of parallel workloads

N Jain, A Bhatele, S White, T Gamblin… - SC'16: Proceedings of …, 2016 - ieeexplore.ieee.org
This paper presents an evaluation and comparison of three topologies that are popular for
building interconnection networks in large-scale supercomputers: torus, fat-tree, and …

Flexfly: Enabling a reconfigurable dragonfly through silicon photonics

K Wen, P Samadi, S Rumley, CP Chen… - SC'16: Proceedings …, 2016 - ieeexplore.ieee.org
The Dragonfly topology provides low-diameter connectivity for high-performance computing
with all-to-all global links at the inter-group level. Our traffic matrix characterization of various …

Predicting the performance impact of different fat-tree configurations

N Jain, A Bhatele, LH Howell, D Böhme… - Proceedings of the …, 2017 - dl.acm.org
The fat-tree topology is one of the most commonly used network topologies in HPC systems.
Vendors support several options that can be configured when deploying fat-tree networks on …

Study of workload interference with intelligent routing on dragonfly

Y Kang, X Wang, Z Lan - SC22: International Conference for …, 2022 - ieeexplore.ieee.org
Dragonfly interconnect is a crucial network technol-ogy for supercomputers. To support
exascale systems, network resources are shared such that links and routers are not …

Megafly: A topology for exascale systems

M Flajslik, E Borch, MA Parker - … 2018, Frankfurt, Germany, June 24-28 …, 2018 - Springer
In this paper we explore network topologies suitable for future exascale systems that need to
support over fifty thousand endpoints. With the increased necessity to use optics at higher …

Q-adaptive: A multi-agent reinforcement learning based routing on dragonfly network

Y Kang, X Wang, Z Lan - … of the 30th International Symposium on High …, 2021 - dl.acm.org
High-radix interconnects such as Dragonfly and its variants rely on adaptive routing to
balance network traffic for optimum performance. Ideally, adaptive routing attempts to …