Orion: Interference-aware, Fine-grained GPU Sharing for ML Applications

F Strati, X Ma, A Klimovic - … of the Nineteenth European Conference on …, 2024 - dl.acm.org
GPUs are critical for maximizing the throughput-per-Watt of deep neural network (DNN)
applications. However, DNN applications often underutilize GPUs, even when using large …

What modern NVMe storage can do, and how to exploit it: high-performance I/O for high-performance storage engines

G Haas, V Leis - Proceedings of the VLDB Endowment, 2023 - dl.acm.org
NVMe SSDs based on flash are cheap and offer high throughput. Combining several of
these devices into a single server enables 10 million I/O operations per second or more. Our …

DecLog: Decentralized Logging in Non-Volatile Memory for Time Series Database Systems

B Zheng, Y Gao, J Wan, L Yan, L Hu, B Liu… - Proceedings of the …, 2023 - dl.acm.org
Growing demands for the efficient processing of extreme-scale time series workloads call for
more capable time series database management systems (TSDBMS). Specifically, to …

Database Kernels: Seamless Integration of Database Systems and Fast Storage via CXL

S Lee, A Lerner, P Bonnet, P Cudré-Mauroux - CIDR, 2024 - exascale.info
Flash memory is the de facto standard for data persistence in data-intensive systems. Despite
its benefits, this type of memory has at least one severe disadvantage: it is offered only as …

Redesigning high-performance LSM-based key-value stores with persistent CPU caches

Y Zhong, Z Shen, Z Yu, J Shu - 2023 IEEE 39th International …, 2023 - ieeexplore.ieee.org
By providing non-volatility with DRAM-comparable performance, the emerging persistent
memory (PMem) is propelling new key-value (KV) store designs. The recently released Intel …

DDS: DPU-optimized disaggregated storage

Q Zhang, P Bernstein, B Chandramouli, J Hu… - arXiv preprint arXiv …, 2024 - arxiv.org
This extended report presents DDS, a novel disaggregated storage architecture enabled by
emerging networking hardware, namely DPUs (Data Processing Units). DPUs can optimize …

Data flow architectures for data processing on modern hardware

A Lerner, G Alonso - 2024 IEEE 40th International Conference …, 2024 - ieeexplore.ieee.org
The requirements arising from ever growing amounts of data and tight performance
constraints as well as the limitations encountered in improving conventional CPU …

Delilah: eBPF-offload on Computational Storage

N Hedam, M Tychsen Clausen, P Bonnet… - Proceedings of the 19th …, 2023 - dl.acm.org
The idea of pushing computation to storage devices has been explored for decades, without
widespread adoption so far. The definition of Computational Programs namespaces in …

Time-constrained persistent deletion for key-value store engine on ZNS SSD

S Nie, T Lei, J Niu, Q Hu, S Liu, W Wu - Future Generation Computer …, 2024 - Elsevier
The inherent out-of-place update characteristic of the Log-Structured Merge tree (LSM tree)
cannot guarantee persistent deletion within a specific time window, leading to potential data …

TEngine: A Native Distributed Table Storage Engine

X Fan, S Yan, Y Huang, C Weng - 2024 IEEE 40th International …, 2024 - ieeexplore.ieee.org
With the rapid development of storage and network technology, emerging high-performance
hardware is being widely applied to the distributed storage cluster. However, existing …