TPP: Transparent page placement for CXL-enabled tiered-memory

HA Maruf, H Wang, A Dhanotia, J Weiner… - Proceedings of the 28th …, 2023 - dl.acm.org
The increasing demand for memory in hyperscale applications has led to memory becoming
a large portion of the overall datacenter spend. The emergence of coherent interfaces like …

HeMem: Scalable tiered memory management for big data applications and real NVM

A Raybuck, T Stamler, W Zhang, M Erez… - Proceedings of the ACM …, 2021 - dl.acm.org
High-capacity non-volatile memory (NVM) is a new main memory tier. Tiered DRAM+NVM
servers increase total memory capacity by up to 8x, but can diminish memory bandwidth by …

Towards an adaptable systems architecture for memory tiering at warehouse-scale

P Duraisamy, W Xu, S Hare, R Rajwar… - Proceedings of the 28th …, 2023 - dl.acm.org
Fast DRAM increasingly dominates infrastructure spend in large scale computing
environments and this trend will likely worsen without an architectural shift. The cost of …

HM-ANN: Efficient billion-point nearest neighbor search on heterogeneous memory

J Ren, M Zhang, D Li - Advances in Neural Information …, 2020 - proceedings.neurips.cc
The state-of-the-art approximate nearest neighbor search (ANNS) algorithms face a
fundamental tradeoff between query latency and accuracy, because of small main memory …

Sentinel: Efficient tensor migration and allocation on heterogeneous memory systems for deep learning

J Ren, J Luo, K Wu, M Zhang… - 2021 IEEE International …, 2021 - ieeexplore.ieee.org
Memory capacity is a major bottleneck for training deep neural networks (DNN).
Heterogeneous memory (HM) combining fast and slow memories provides a promising …

Kleio: A hybrid memory page scheduler with machine intelligence

TD Doudali, S Blagodurov, A Vishnu… - Proceedings of the 28th …, 2019 - dl.acm.org
The increasing demand of big data analytics for more main memory capacity in datacenters
and exascale computing environments is driving the integration of heterogeneous memory …

Hermes: a heterogeneous-aware multi-tiered distributed I/O buffering system

A Kougkas, H Devarajan, XH Sun - Proceedings of the 27th International …, 2018 - dl.acm.org
Modern High-Performance Computing (HPC) systems are adding extra layers to the memory
and storage hierarchy named deep memory and storage hierarchy (DMSH), to increase I/O …

Memory disaggregation: advances and open challenges

H Al Maruf, M Chowdhury - ACM SIGOPS Operating Systems Review, 2023 - dl.acm.org
Compute and memory are tightly coupled within each server in traditional datacenters.
Large-scale datacenter operators have identified this coupling as a root cause behind …

Evaluating emerging CXL-enabled memory pooling for HPC systems

J Wahlgren, M Gokhale, IB Peng - 2022 IEEE/ACM Workshop …, 2022 - ieeexplore.ieee.org
Current HPC systems provide memory resources that are statically configured and tightly
coupled with compute nodes. However, workloads on HPC systems are evolving. Diverse …

Runtime data management on non-volatile memory-based heterogeneous memory for task-parallel programs

K Wu, J Ren, D Li - SC18: International Conference for High …, 2018 - ieeexplore.ieee.org
Non-volatile memory (NVM) provides a scalable solution to replace DRAM as main memory.
Because of relatively high latency and low bandwidth of NVM (comparing with DRAM), NVM …