ZHW: A numerical codec for big data scientific computation

M Barrow, Z Wu, S Lloyd, M Gokhale… - … Conference on Field …, 2022 - ieeexplore.ieee.org
Distributed big data in scientific computing presents a major I/O performance bottleneck
when exploiting data parallelism. Consumer and producer compute nodes are often …

Increasing FPGA accelerators memory bandwidth with a burst-friendly memory layout

C Ferry, T Yuki, S Derrien… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Offloading compute-intensive kernels to hardware accelerators relies on the large degree of
parallelism offered by these platforms. However, the effective bandwidth of the memory …

A compression-based memory-efficient optimization for out-of-core GPU stencil computation

J Shen, L Long, X Deng, M Okita, F Ino - The Journal of Supercomputing, 2023 - Springer
A code for out-of-core stencil computation manages data that exceeds the memory capacity
of a GPU. However, such a code necessitates frequent data transfers between the CPU and …

ZFP: A compressed array representation for numerical computations

P Lindstrom, J Hittinger, J Diffenderfer… - … Journal of High …, 2025 - journals.sagepub.com
HPC trends favor algorithms and implementations that reduce data motion relative to
FLOPS. We investigate the use of lossy compressed data arrays in place of traditional IEEE …

A data compressor for FPGA-based state vector quantum simulators

K Wei, H Amano, R Niwase, Y Yamaguchi - Proceedings of the 14th …, 2024 - dl.acm.org
A quantum computer simulator is a tool that simulates the operation of a quantum computer
using classical computers. Researchers widely adopt the state-vector-based simulator to …

A Synergy between On-and Off-Chip Data Reuse for GPU-based Out-of-Core Stencil Computation

J Shen, L Long, J Zhang, W Shen, M Okita… - arXiv preprint arXiv …, 2023 - arxiv.org
Stencil computation is an extensively-utilized class of scientific-computing applications that
can be efficiently accelerated by graphics processing units (GPUs). Out-of-core approaches …

Automating the derivation of memory allocations for acceleration of polyhedral programs

C Ferry, S Rajopadhye, S Derrien, S Pasricha… - 2024 - api.mountainscholar.org
As processors' compute power keeps increasing, so do their demands in memory accesses:
some computations will require a higher bandwidth and exhibit regular memory access …

Compression-Based Optimizations for Out-of-Core GPU Stencil Computation

J Shen, X Deng, Y Wu, M Okita, F Ino - arXiv preprint arXiv:2204.11315, 2022 - arxiv.org
An out-of-core stencil computation code handles large data whose size is beyond the
capacity of GPU memory. However, such a code requires streaming data to and from the …

Zfp

P Lindstrom - … Livermore National Laboratory.[Online]. Available: https …, 2015 - ipo.llnl.gov
The zfp software library provides a comprehensive solution to both lossy and lossless data
compression. zfp reduces the storage space of high-precision floating-point data without …