[PDF] Lulesh programming model and performance ports overview

I Karlin - 2012 - osti.gov
This document gives a description of various versions of the LULESH proxy application
described in [1]. All the codes described in this document are available at …

Affinity-aware checkpoint restart

A Saini, A Rezaei, F Mueller, P Hargrove… - Proceedings of the 15th …, 2014 - dl.acm.org
Current checkpointing techniques employed to overcome faults for HPC applications result
in inferior application performance after restart from a checkpoint for a number of …

[Book] Fault Resilience for Next Generation HPC Systems

A Rezaei - 2016 - search.proquest.com
Fault resilience techniques enable application completion with correct results despite the
existence of multiple sources of faults and their possible occurrence in the system. HPC …

[PDF] Exploring use-cases for non-volatile memories in support of hpc resilience

O Patil, S Hukerikar, F Mueller… - SC Poster …, 2017 - sc17.supercomputing.org
The exascale supercomputer will be different than its predecessors in many ways. Exaflop
computation capabilities will be realized by a vast ensemble of processors, co-processors …

[Citation] Fault Resilience for Next Generation HPC Systems

R Arash - 2016