An analysis of long-tailed network latency distribution and background traffic on dragonfly+

M Salimi Beni, B Cosenza - International Symposium on Benchmarking …, 2022 - Springer
Modern computing systems are highly affected by large performance variability, resulting in
a long tail in the distribution of the network latency. For communication-intensive …

An analysis of performance variability on dragonfly+ topology

MS Beni, B Cosenza - 2022 IEEE International Conference on …, 2022 - ieeexplore.ieee.org
Large-scale compute clusters are highly affected by performance variability that originates
from different sources. Among these sources, the network plays an essential role as a …

Efficient task placement and routing of nearest neighbor exchanges in dragonfly networks

B Prisacari, G Rodriguez, P Heidelberger… - Proceedings of the 23rd …, 2014 - dl.acm.org
Dragonflies are recent network designs that are one of the most promising topologies for the
Exascale effort due to their scalability and cost. While being able to achieve very high …

Measuring network latency variation impacts to high performance computing application performance

R Underwood, J Anderson, A Apon - Proceedings of the 2018 ACM …, 2018 - dl.acm.org
In this paper, we study the impacts of latency variation versus latency mean on application
runtime, library performance, and packet delivery. Our contributions include the design and …

Watch out for the bully! job interference study on dragonfly network

X Yang, J Jenkins, M Mubarak… - SC'16: Proceedings of …, 2016 - ieeexplore.ieee.org
High-radix, low-diameter dragonfly networks will be a common choice in next-generation
supercomputers. Preliminary studies show that random job placement with adaptive routing …

The effect of system utilization on application performance variability

B Li, S Chunduri, K Harms, Y Fan, Z Lan - Proceedings of the 9th …, 2019 - dl.acm.org
Application performance variability caused by network contention is a major issue on
dragonfly based systems. This work-in-progress study makes two contributions. First, we …

[PDF][PDF] Analyzing inter-job contention in dragonfly networks

S Smith, D Lowenthal, A Bhatele, J Thiagarajan… - 2016 - cs.arizona.edu
Interconnection networks are increasing in importance as node counts increase in high-end
machines. To achieve better application performance, newer supercomputers frequently …

Evaluating quality of service traffic classes on the megafly network

M Mubarak, N McGlohon, M Musleh, E Borch… - … Conference, ISC High …, 2019 - Springer
An emerging trend in High Performance Computing (HPC) systems that use hierarchical
topologies (such as dragonfly) is that the applications are increasingly exhibiting high run-to …

Workload Interference Analysis and Mitigation on Dragonfly Class Networks

Y Kang - 2022 - search.proquest.com
Dragonfly class of networks are promising interconnect topologies that support current and
next-generation high-performance computing (HPC) systems. Serving as the" central …

Analysis and prediction of performance variability in large-scale computing systems

M Salimi Beni, S Hunold, B Cosenza - The Journal of Supercomputing, 2024 - Springer
The development of new exascale supercomputers has dramatically increased the need for
fast, high-performance networking technology. Efficient network topologies, such as …