Model-driven data layout selection for improving read performance

J Liu, S Byna, B Dong, K Wu… - 2014 IEEE International …, 2014 - ieeexplore.ieee.org
Performance of reading scientific data from a parallel file system depends on the
organization of data on physical storage devices. Data is often immutable after producers of …

Optimizing fastquery performance on lustre file system

KW Lin, S Byna, J Chou, K Wu - … of the 25th International Conference on …, 2013 - dl.acm.org
FastQuery is a parallel indexing and querying system we developed for accelerating
analysis and visualization of scientific data. We have applied it to a wide variety of HPC …

Tuning object-centric data management systems for large scale scientific applications

H Tang, S Byna, S Bailey, Z Lukic, J Liu… - 2019 IEEE 26th …, 2019 - ieeexplore.ieee.org
Efficient management of scientific data on high-performance computing (HPC) systems has
been a challenge, as it often requires knowledge of various hardware and software …

Apply block index technique to scientific data analysis and I/O systems

T Wu, J Chou, N Podhorszki, J Gu… - 2017 17th IEEE/ACM …, 2017 - ieeexplore.ieee.org
Scientific discoveries are increasingly relying on analysis of massive amounts of data. The
ability to directly access the most relevant data records through query, without shifting …

Design of locality-aware MPI-IO for scalable shared file write performance

K Sugihara, O Tatebe - 2020 IEEE International Parallel and …, 2020 - ieeexplore.ieee.org
Difficult and challenging I/O access pattern among applications is N-1 access pattern such
that multiple N processes share and access a single file. This I/O pattern is commonly and …

Terabyte-scale particle data analysis: an arrayudf case study

B Dong, P Kilian, X Li, F Guo, S Byna… - Proceedings of the 31st …, 2019 - dl.acm.org
A prime question for plasma physicists is how a fraction of charged particles is accelerated
to very high energy. To answer this question, physicists simulate trillions of particles with …

Evaluating Asynchronous Parallel I/O on HPC Systems

J Ravi, S Byna, Q Koziol, H Tang… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Parallel I/O is an effective method to optimize data movement between memory and storage
for many scientific applications. Poor performance of traditional disk-based file systems has …

New directions in measurement for software quality control

P Krause, B Freimut, W Suryn - 10th International Workshop on …, 2002 - ieeexplore.ieee.org
Assessing and controlling software quality is still an immature discipline. One of the reasons
for this is that many of the concepts and terms that are used in discussing and describing …

In-memory query system for scientific dataseis

HT Chiu, J Chou, V Vishwanath… - 2015 IEEE 21st …, 2015 - ieeexplore.ieee.org
The growing gap between compute performance and I/O bandwidth coupled with the
increasing data volumes has resulted in a bottleneck to the traditional post-simulation data …

The challenges of in situ analysis for multiple simulations

A Ribés, B Raffin - ISAV'20 In Situ Infrastructures for Enabling Extreme …, 2020 - dl.acm.org
In situ analysis and visualization have mainly been applied to the output of a single large-
scale simulation. However, topics involving the execution of multiple simulations in …