Deep configuration performance learning: A systematic survey and taxonomy

J Gong, T Chen - ACM Transactions on Software Engineering and …, 2024 - dl.acm.org
Performance is arguably the most crucial attribute that reflects the quality of a configurable
software system. However, given the increasing scale and complexity of modern software …

I/O bottleneck detection and tuning: Connecting the dots using interactive log analysis

JL Bez, H Tang, B Xie… - 2021 IEEE/ACM …, 2021 - ieeexplore.ieee.org
Using parallel file systems efficiently is a tricky problem due to inter-dependencies among
multiple layers of I/O software, including high-level I/O libraries (HDF5, netCDF, etc.), MPI-IO …

Extracting and characterizing I/O behavior of HPC workloads

H Devarajan, K Mohror - 2022 IEEE International Conference …, 2022 - ieeexplore.ieee.org
System administrators set default storage-system configuration parameters with the goal of
providing high performance for their system's I/O workloads. However, this generalized …

Drilling Down I/O Bottlenecks with Cross-layer I/O Profile Exploration

H Ather, JL Bez, Y Xia, S Byna - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
I/O performance monitoring tools such as Darshan and Recorder collect I/O-related metrics
on production systems and help understand the applications' behavior. However, some …

Drishti: Guiding end-users in the I/O optimization journey

JL Bez, H Ather, S Byna - 2022 IEEE/ACM International Parallel …, 2022 - ieeexplore.ieee.org
The complex software and hardware I/O stack of HPC platforms makes it challenging for
end-users to obtain superior I/O performance and to understand the root causes of I/O …

Scheduling distributed I/O resources in HPC systems

A Bandet, F Boito, G Pallez - European Conference on Parallel Processing, 2024 - Springer
This paper presents a comprehensive investigation on optimizing I/O performance in the
access to distributed I/O resources in high-performance computing (HPC) environments. I/O …

Illuminating the I/O optimization path of scientific applications

H Ather, JL Bez, B Norris, S Byna - International Conference on High …, 2023 - Springer
The existing parallel I/O stack is complex and difficult to tune due to the interdependencies
among multiple factors that impact the performance of data movement between storage and …

Design and implementation of I/O performance prediction scheme on HPC systems through large-scale log analysis

S Kim, A Sim, K Wu, S Byna, Y Son - Journal of Big Data, 2023 - Springer
Large-scale high performance computing (HPC) systems typically consist of many
thousands of CPUs and storage units used by hundreds to thousands of users …

Tarazu: An Adaptive End-to-end I/O Load-balancing Framework for Large-scale Parallel File Systems

AK Paul, S Neuwirth, B Wadhwa, F Wang… - ACM Transactions on …, 2024 - dl.acm.org
The imbalanced I/O load on large parallel file systems affects the parallel I/O performance of
high-performance computing (HPC) applications. One of the main reasons for I/O …

Improving the I/O performance of applications with predictive modeling based auto-tuning

A Bağbaba, X Wang, C Niethammer… - … on Engineering and …, 2021 - ieeexplore.ieee.org
Parallel I/O is an essential part of scientific applications running on high performance
computing systems. Typically, parallel I/O stacks offer many parameters that need to be …