Memtis: Efficient memory tiering with dynamic page classification and page size determination

T Lee, SK Monga, C Min, YI Eom - … of the 29th Symposium on Operating …, 2023 - dl.acm.org
The evergrowing memory demand fueled by datacenter workloads is the driving force
behind new memory technology innovations (eg, NVM, CXL). Tiered memory is a promising …

Victima: Drastically Increasing Address Translation Reach by Leveraging Underutilized Cache Resources

K Kanellopoulos, HC Nam, N Bostanci, R Bera… - Proceedings of the 56th …, 2023 - dl.acm.org
Address translation is a performance bottleneck in data-intensive workloads due to large
datasets and irregular access patterns that lead to frequent high-latency page table walks …

Reconsidering os memory optimizations in the presence of disaggregated memory

S Bergman, P Faldu, B Grot, L Vilanova… - Proceedings of the 2022 …, 2022 - dl.acm.org
Tiered memory systems introduce an additional memory level with higher-than-local-DRAM
access latency and require sophisticated memory management mechanisms to achieve cost …

CachedArrays: Optimizing Data Movement for Heterogeneous Memory Systems

M Hildebrand, J Lowe-Power… - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
We propose a new framework called CachedArrays and a set of APIs to address the data
tiering problem in large scale heterogeneous and disaggregated memory systems. The …

Utopia: Fast and Efficient Address Translation via Hybrid Restrictive & Flexible Virtual-to-Physical Address Mappings

K Kanellopoulos, R Bera, K Stojiljkovic… - Proceedings of the 56th …, 2023 - dl.acm.org
Conventional virtual memory (VM) frameworks enable a virtual address to flexibly map to
any physical address. This flexibility necessitates large data structures to store virtual-to …

Telescope: telemetry for gargantuan memory footprint applications

A Nair, S Kumar, A Prasad, Y Huang, A Rudoff… - 2024 USENIX Annual …, 2024 - usenix.org
Data-hungry applications that require terabytes of memory have become widespread in
recent years. To meet the memory needs of these applications, data centers are embracing …

Pinning Page Structure Entries to Last-Level Cache for Fast Address Translation

O Kwon, Y Lee, S Hong - IEEE Access, 2022 - ieeexplore.ieee.org
As the memory footprint of emerging applications continues to increase, the address
translation becomes a critical performance bottleneck owing to frequent misses on the …

Accelerating performance of gpu-based workloads using cxl

M Arif, A Maurya, MM Rafique - Proceedings of the 13th Workshop on AI …, 2023 - dl.acm.org
High-performance computing (HPC) workloads such as scientific simulations and deep
learning (DL) running across multi-GPU systems are memory and data-intensive, relying on …

Optimizing the page hotness measurement with re-fault latency for tiered memory systems

T Lee, YI Eom - 2022 IEEE International Conference on Big …, 2022 - ieeexplore.ieee.org
Constructing tiered memory systems becomes prevalent in cloud and datacenter servers
due to its benefits on large capacity and low total cost of ownership. Since such tiered …

Performance characterization of autonuma memory tiering on graph analytics

D Moura, D Mossé, V Petrucci - 2022 IEEE International …, 2022 - ieeexplore.ieee.org
Non-Volatile Memory (NVM) can deliver higher density and lower cost per bit when
compared with DRAM. Its main drawback is that it is slower than DRAM. On the other hand …