End-to-end I/O monitoring on leading supercomputers

B Yang, W Xue, T Zhang, S Liu, X Ma, X Wang… - ACM Transactions on …, 2023 - dl.acm.org
This paper offers a solution to overcome the complexities of production system I/O
performance monitoring. We present Beacon, an end-to-end I/O resource monitoring and …

InfiniBand network monitoring: Challenges and possibilities

K Hintze, S Graham, S Dunlap, P Sweeney - … Infrastructure Protection XV …, 2022 - Springer
The InfiniBand architecture is among the leading interconnects that support high
performance computing. The high bandwidth and low latency provided by InfiniBand are …

Tacit knowledge as a promoter of success in technology firms

KU Koskinen - Proceedings of the 34th Annual Hawaii …, 2001 - ieeexplore.ieee.org
Addresses the question of whether tacit knowledge can be a promoter of success in
technology enterprises. Tacit knowledge is illustrated by focusing on its foundations, on how …

Monitoring large scale supercomputers: A case study with the lassen supercomputer

T Patki, A Bertsch, I Karlin, DH Ahn… - 2021 IEEE …, 2021 - ieeexplore.ieee.org
Scalable management of user workloads on large-scale supercomputers remains a
challenge due to the tradeoff between capturing adequate detail for analysis from various …

Provision of docker and Infiniband in high performance computing

MT Chung, A Le, N Quang-Hung… - 2016 International …, 2016 - ieeexplore.ieee.org
High Performance Computing (HPC) is playing an important role in a variety of domains with
the demand of high-level computational capacity. Besides, HPC provides services for a …

Sonar: Automated communication characterization for hpc applications

S Lammel, F Zahn, H Fröning - … , E-MuCoCoS, HPC-IODC, IXPUG, IWOPH …, 2016 - Springer
Future computing systems will need to operate within hard power and energy constraints,
this is particularly true for Exascale-class systems. These constraints are hard for technical …

Improving communication performance through topology and congestion awareness in HPC systems

S Mirsadeghi - 2017 - qspace.library.queensu.ca
Abstract High-Performance Computing (HPC) represents the flagship domain in providing
high-end computing capabilities that play a critical role in helping humanity solve its hardest …

A scalable infiniband network topology-aware performance analysis tool for mpi

H Subramoni, J Vienne, DK Panda - European Conference on Parallel …, 2012 - Springer
Over the last decade, InfiniBand (IB) has become an increasingly popular interconnect for
deploying modern supercomputing systems. As supercomputing systems grow in size and …

A review of applications and approaches of network monitoring

A Sultana, BG Jairam - … Journal of Innovative Research in Computer …, 2019 - papers.ssrn.com
Monitoring the network forms an important part of the Network Management, assisting in
visualization of the network behaviour in real time. Networks are growing extensively and …

[PDF][PDF] A Benchmark to Understand Communication Performance in Hybrid MPI and GPU Applications.

K Haskins, P Bridges, K Ferreira, S Levy - 2021 - osti.gov
Analyzing MPI communication costs on extremescale high-performance computing systems
is critical to ensuring optimal performance. Several factors such as scalability and the …