A checkpoint of research on parallel i/o for high-performance computing

FZ Boito, EC Inacio, JL Bez, POA Navaux… - ACM Computing …, 2018 - dl.acm.org
We present a comprehensive survey on parallel I/O in the high-performance computing
(HPC) context. This is an important field for HPC because of the historic gap between …

Mochi: Composing data services for high-performance computing environments

RB Ross, G Amvrosiadis, P Carns, CD Cranor… - Journal of Computer …, 2020 - Springer
Technology enhancements and the growing breadth of application workflows running on
high-performance computing (HPC) platforms drive the development of new data services …

Adaptive numerical simulations with Trixi. jl: A case study of Julia for scientific computing

H Ranocha, M Schlottke-Lakemper, AR Winters… - arXiv preprint arXiv …, 2021 - arxiv.org
We present Trixi. jl, a Julia package for adaptive high-order numerical simulations of
hyperbolic partial differential equations. Utilizing Julia's strengths, Trixi. jl is extensible, easy …

Pmix: process management for exascale environments

RH Castain, D Solt, J Hursey, A Bouteiller - Proceedings of the 24th …, 2017 - dl.acm.org
High-Performance Computing (HPC) applications have historically executed in static
resource allocations, using programming models that ran independently from the resident …

A massively parallel infrastructure for adaptive multiscale simulations: modeling RAS initiation pathway for cancer

F Di Natale, H Bhatia, TS Carpenter, C Neale… - Proceedings of the …, 2019 - dl.acm.org
Computational models can define the functional dynamics of complex systems in
exceptional detail. However, many modeling studies face seemingly incommensurate …

Flux: A next-generation resource management framework for large HPC centers

DH Ahn, J Garlick, M Grondona, D Lipari… - 2014 43rd …, 2014 - ieeexplore.ieee.org
Resource and job management software is crucial to High Performance Computing (HPC)
for efficient application execution. However, current systems and approaches can no longer …

Evaluating and extending user-level fault tolerance in MPI applications

I Laguna, DF Richards, T Gamblin… - … Journal of High …, 2016 - journals.sagepub.com
The user-level failure mitigation (ULFM) interface has been proposed to provide fault-
tolerant semantics in the Message Passing Interface (MPI). Previous work presented …

Mapping out the HPC dependency chaos

F Zakaria, TRW Scogland, T Gamblin… - … Conference for High …, 2022 - ieeexplore.ieee.org
High Performance Computing (HPC) software stacks have become complex, with the
dependencies of some applications numbering in the hundreds. Packaging, distributing, and …

Serpens: A high performance faas platform for network functions

H Yu, H Zhang, J Shen, Y Geng, J Wang… - … on Parallel and …, 2023 - ieeexplore.ieee.org
More and more enterprises deploy applications on Function-as-a-Service (FaaS) platforms
to improve resource efficiency and save monetary costs. Network Functions (NFs) suffer from …

Methodology for the rapid development of scalable HPC data services

M Dorier, P Carns, K Harms, R Latham… - 2018 IEEE/ACM 3rd …, 2018 - ieeexplore.ieee.org
Growing evidence in the scientific computing community indicates that parallel file systems
are not sufficient for all HPC storage workloads. This realization has motivated extensive …