Survey of storage systems for high-performance computing

J Lüttgau, M Kuhn, K Duwe, Y Alforov… - Supercomputing …, 2018 - centaur.reading.ac.uk
In current supercomputers, storage is typically provided by parallel distributed file systems
for hot data and tape archives for cold data. These file systems are often compatible with …

Gufi: fast, secure file system metadata search for both privileged and unprivileged users

D Manno, J Lee, P Challa, Q Zheng… - … Conference for High …, 2022 - ieeexplore.ieee.org
Modern High-Performance Computing (HPC) data centers routinely store massive data sets
resulting in millions of directories and billions of files. To efficiently search and sift through …

Datastates: Towards lightweight data models for deep learning

B Nicolae - … Scientific and Engineering Discoveries Through the …, 2020 - Springer
A key emerging pattern in deep learning applications is the need to capture intermediate
DNN model snapshots and preserve or clone them in explore a large number of alternative …

Faodel: Data management for next-generation application workflows

C Ulmer, S Mukherjee, G Templet, S Levy… - Proceedings of the 9th …, 2018 - dl.acm.org
Composition of computational science applications, whether into ad hoc pipelines for
analysis of simulation data or into well-defined and repeatable workflows, is becoming …

Using a robust metadata management system to accelerate scientific discovery at extreme scales

M Lawson, J Lofstead - … Workshop on Parallel Data Storage & …, 2018 - ieeexplore.ieee.org
Our previous work, which can be referred to as EMPRESS 1.0, showed that rich metadata
management provides a relatively low-overhead approach to facilitating insight from scale …

Coupling storage systems and self-describing data formats for global metadata management

M Kuhn, K Duwe - 2020 International Conference on …, 2020 - ieeexplore.ieee.org
Traditional I/O stacks feature a strict separation of layers, which provides portability benefits
but makes it impossible for storage systems to understand the structure of data. Coupling …

Tintenfisch: file system namespace schemas and generators

MA Sevilla, R Nasirigerdeh, C Maltzahn… - 10th USENIX Workshop …, 2018 - usenix.org
The file system metadata service is the scalability bottleneck for many of today's workloads.
Common approaches for attacking this" metadata scaling wall" include: caching inodes on …

The Next Generation of EMPRESS: A Metadata Management System for Accelerated Scientific Discovery at Exascale

MR Lawson - 2018 - digitalcommons.dartmouth.edu
Scientific data sets have grown rapidly in recent years, outpacing the growth in memory and
network bandwidths. This I/O bottleneck has made it increasingly difficult for scientists to …

[图书][B] MITRA: Robust Architecture for Distributed Metadata Indexing

S Thakkar - 2021 - search.proquest.com
In the post-exascale era storage systems, a fundamental challenge faced by the research
community is the efficient and scalable access to the stored information while meeting the …

Adventures in NoSQL for metadata management

J Lofstead, A Ryan, M Lawson - … , Frankfurt, Germany, June 16-20, 2019 …, 2019 - Springer
This paper describes an attempt to use a NoSQL database engine to manage custom
metadata using a rich query interface as motivating and descriptive examples of what kind of …