Sibyl: Adaptive and extensible data placement in hybrid storage systems using online reinforcement learning

G Singh, R Nadig, J Park, R Bera, N Hajinazar… - Proceedings of the 49th …, 2022 - dl.acm.org
Hybrid storage systems (HSS) use multiple different storage devices to provide high and
scalable storage capacity at high performance. Data placement across different devices is …

DaeMon: Architectural support for efficient data movement in fully disaggregated systems

C Giannoula, K Huang, J Tang, N Koziris… - Proceedings of the …, 2023 - dl.acm.org
Resource disaggregation offers a cost effective solution to resource scaling, utilization, and
failure-handling in data centers by physically separating hardware devices in a server …

Harnessing integrated CPU-GPU system memory for HPC: a first look into Grace Hopper

G Schieffer, J Wahlgren, J Ren, J Faj… - Proceedings of the 53rd …, 2024 - dl.acm.org
Memory management across discrete CPU and GPU physical memory is traditionally
achieved through explicit GPU allocations and data copy or unified virtual memory. The …

Enabling Large Dynamic Neural Network Training with Learning-based Memory Management

J Ren, D Xu, S Yang, J Zhao, Z Li… - … Symposium on High …, 2024 - ieeexplore.ieee.org
Dynamic neural network (DyNN) enables high computational efficiency and strong
representation capability. However, training DyNN can face a memory capacity problem …

CachedArrays: Optimizing Data Movement for Heterogeneous Memory Systems

M Hildebrand, J Lowe-Power… - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
We propose a new framework called CachedArrays and a set of APIs to address the data
tiering problem in large scale heterogeneous and disaggregated memory systems. The …

MTM: Rethinking Memory Profiling and Migration for Multi-Tiered Large Memory

J Ren, D Xu, J Ryu, K Shin, D Kim, D Li - Proceedings of the Nineteenth …, 2024 - dl.acm.org
Multi-terabyte large memory systems are often characterized by more than two memory tiers
with different latency and bandwidth. Multi-tiered large memory systems call for rethinking of …

PARL: Page Allocation in hybrid main memory using Reinforcement Learning

E Karimov, T Evenblij, SA Chamazcoti… - Journal of Systems …, 2025 - Elsevier
Hybrid Main Memory introduces emerging non-volatile memory technologies and
reduces the DRAM footprint to address the increasing capacity demands of modern …

StarNUMA: Mitigating NUMA Challenges with Memory Pooling

A Cho, A Daglis - … 57th IEEE/ACM International Symposium on …, 2024 - ieeexplore.ieee.org
Large multi-socket machines are mission-critical high-performance systems for workloads
requiring massive memory shared by hundreds of processors. Beyond eight sockets, such …

Coeus: Clustering (a)like patterns for practical machine intelligent hybrid memory management

TD Doudali, A Gavrilovska - 2022 22nd IEEE International …, 2022 - ieeexplore.ieee.org
Emerging workloads benefit from massive memory capacities provided by hybrid memory
platforms. Recent system-level hybrid memory management solutions integrate machine …

FAM-Graph: Graph analytics on disaggregated memory

D Zahka, A Gavrilovska - 2022 IEEE International Parallel and …, 2022 - ieeexplore.ieee.org
Disaggregated memory is being proposed as a way to provide efficient memory scaling for
data intensive applications. High performance interconnect technologies, such as CXL …