Exploration of lossy compression for application-level checkpoint/restart

N Sasaki, K Sato, T Endo… - 2015 IEEE international …, 2015 - ieeexplore.ieee.org
The scale of high performance computing (HPC) systems is exponentially growing,
potentially causing prohibitive shrinkage of mean time between failures (MTBF) while the …

Exploring the feasibility of lossy compression for pde simulations

J Calhoun, F Cappello, LN Olson… - … Journal of High …, 2019 - journals.sagepub.com
Checkpoint restart plays an important role in high-performance computing (HPC)
applications, allowing simulation runtime to extend beyond a single job allocation and …

Efficient encoding and reconstruction of HPC datasets for checkpoint/restart

J Zhang, X Zhuo, A Moon, H Liu… - 2019 35th Symposium …, 2019 - ieeexplore.ieee.org
As the amount of data produced by HPC applications reaches the exabyte range,
compression techniques are often adopted to reduce the checkpoint time and volume. Since …

Arc: An automated approach to resiliency for lossy compressed data via error correcting codes

D Fulp, A Poulos, R Underwood… - Proceedings of the 30th …, 2021 - dl.acm.org
Progress in high-performance computing (HPC) systems has led to complex applications
that stress the I/O subsystem by creating vast amounts of data. Lossy compression reduces …

Bit-error aware quantization for dct-based lossy compression

J Zhang, J Chen, A Moon, X Zhuo… - 2020 IEEE High …, 2020 - ieeexplore.ieee.org
Scientific simulations run by high-performance computing (HPC) systems produce a large
amount of data, which causes an extreme I/O bottleneck and a huge storage burden …

Towards Guaranteeing Error Bound in DCT-based Lossy Compression

J Chen, A Moon, SW Son - … Conference on Big Data (Big Data), 2022 - ieeexplore.ieee.org
High-performance computing (HPC) systems that run scientific simulations of significance
produce a large amount of data during runtime. Transferring or storing such big datasets …

Analyzing the impact of lossy compressor variability on checkpointing scientific simulations

P Triantafyllides, T Reza… - 2019 IEEE International …, 2019 - ieeexplore.ieee.org
Lossy compression algorithms are effective tools to reduce the size of high-performance
computing data sets. As established lossy compressors such as SZ and ZFP evolve, they …

Analyzing the performance and accuracy of lossy checkpointing on sub-iteration of nwchem

T Reza, J Calhoun, K Keipert, S Di… - 2019 IEEE/ACM 5th …, 2019 - ieeexplore.ieee.org
Future exascale systems are expected to be characterized by more frequent failures than
current petascale systems. This places increased importance on the application to minimize …

PSNR-Aware Quantization for DCT-based Lossy Compression

J Chen, SW Son - 2023 IEEE International Conference on Big …, 2023 - ieeexplore.ieee.org
Recent years have witnessed a wide adoption of various lossy compression techniques to
alleviate the burden on high-performance computing (HPC) systems that run large-scale …

Software Support for Non-Volatile Memory (NVM) Programming

DT Aksun - 2021 - infoscience.epfl.ch
Abstract Non-Volatile Memory (NVM) is an emerging type of memory device that provides
fast, byte-addressable, and high-capacity durable storage. NVM sits on the memory bus and …