Hpcgcn: A predictive framework on high performance computing cluster log data using graph convolutional networks

A Bose, H Yang, WH Hsu… - 2021 IEEE International …, 2021 - ieeexplore.ieee.org
This paper presents a novel use case of Graph Convolutional Network (GCN) learning
representations for predictive data mining, specifically from user/task data in the domain of …

Optimizing performance and energy across problem sizes through a search space exploration and machine learning

L Scravaglieri, M Popov, LL Pilla, A Guermouche… - Journal of Parallel and …, 2023 - Elsevier
HPC systems expose configuration options to assist optimization. Configurations such as
parallelism, thread and data mapping, or prefetching have been explored but with a limited …

Scheduling distributed I/O resources in HPC systems

A Bandet, F Boito, G Pallez - European Conference on Parallel Processing, 2024 - Springer
This paper presents a comprehensive investigation on optimizing I/O performance in the
access to distributed I/O resources in high-performance computing (HPC) environments. I/O …

Investigating HPC Job Resource Requests and Job Efficiency Reporting

T Jakobsche, N Lachiche… - 2023 22nd International …, 2023 - ieeexplore.ieee.org
High Performance Computing (HPC) systems are ever-evolving, increasing in complexity
and heterogeneity, while providing high computing power to scientific and data analysis …

Optimal checkpointing strategies for iterative applications

Y Du, L Marchal, G Pallez… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
This work provides an optimal checkpointing strategy to protect iterative applications from
fail-stop errors. We consider a general framework, where the application repeats the same …

Allocation and Placement Algorithms for Scheduling Distributed I/O Resources in HPC Systems

A Bandet, F Boito, G Pallez - 2024 - hal.science
This paper presents a comprehensive investigation on optimizing I/O performance in the
access to distributed I/O resources in high-performance computing (HPC) environments. I/O …