EMPRESS: Accelerating Scientific Discovery through Descriptive Metadata Management

M Lawson, W Gropp, J Lofstead - ACM Transactions on Storage, 2022 - dl.acm.org
High-performance computing scientists are producing unprecedented volumes of data that
take a long time to load for analysis. However, many analyses only require loading in the …

[PDF] SkyhookDM: Data processing in Ceph with programmable storage

J LeFevre, C Maltzahn - USENIX ;login:, 2020 - usenix.org
With ever larger data sets and cloud-based storage systems, it becomes increasingly more
attractive to move computation to data, a common principle in big data systems. Historically …

A Moveable Beast: Partitioning Data and Compute for Computational Storage

A Montana, Y Xue, J LeFevre, C Maltzahn… - arXiv preprint arXiv …, 2022 - arxiv.org
Over the years, hardware trends have introduced various heterogeneous compute units
while also bringing network and storage bandwidths within an order of magnitude of …

Popper pitfalls: Experiences following a reproducibility convention

MA Sevilla, C Maltzahn - Proceedings of the First International Workshop …, 2018 - dl.acm.org
We describe the four publications we have tried to make reproducible and discuss how each
paper has changed our workflows, practices, and collaboration policies. The fundamental …

The Next Generation of EMPRESS: A Metadata Management System for Accelerated Scientific Discovery at Exascale

MR Lawson - 2018 - digitalcommons.dartmouth.edu
Scientific data sets have grown rapidly in recent years, outpacing the growth in memory and
network bandwidths. This I/O bottleneck has made it increasingly difficult for scientists to …

Adventures in NoSQL for metadata management

J Lofstead, A Ryan, M Lawson - … , Frankfurt, Germany, June 16-20, 2019 …, 2019 - Springer
This paper describes an attempt to use a NoSQL database engine to manage custom
metadata using a rich query interface as motivating and descriptive examples of what kind of …

Programmable storage

N Watkins - 2018 - escholarship.org
Storage system solutions have historically been dominated by proprietary offerings
designed around a fixed set of common interfaces such as the POSIX file abstraction …

[BOOK] Scalable, Global Namespaces with Programmable Storage

MA Sevilla - 2018 - search.proquest.com
Global file system namespaces are difficult to scale because of the overheads of POSIX IO
metadata management. The file system metadata IO created by today's workloads subjects …

[CITATION] The Next Generation of EMPRESS

SDA Exascale - 2018 - Dartmouth College