Graphpim: Enabling instruction-level pim offloading in graph computing frameworks

L Nai, R Hadidi, J Sim, H Kim… - … symposium on high …, 2017 - ieeexplore.ieee.org
With the emergence of data science, graph computing has become increasingly important
these days. Unfortunately, graph computing typically suffers from poor performance when …

Nimble page management for tiered memory systems

Z Yan, D Lustig, D Nellans… - Proceedings of the Twenty …, 2019 - dl.acm.org
Software-controlled heterogeneous memory systems have the potential to increase the
performance and cost efficiency of computing systems. However they can only deliver on …

Exploring the design space of page management for {Multi-Tiered} memory systems

J Kim, W Choe, J Ahn - … Annual Technical Conference (USENIX ATC 21), 2021 - usenix.org
With the arrival of tiered memory systems comprising various types of memory, such as
DRAM and SCM, the operating system support for memory management is becoming …

Memtis: Efficient memory tiering with dynamic page classification and page size determination

T Lee, SK Monga, C Min, YI Eom - … of the 29th Symposium on Operating …, 2023 - dl.acm.org
The evergrowing memory demand fueled by datacenter workloads is the driving force
behind new memory technology innovations (eg, NVM, CXL). Tiered memory is a promising …

Flatflash: Exploiting the byte-accessibility of ssds within a unified memory-storage hierarchy

A Abulila, VS Mailthody, Z Qureshi, J Huang… - Proceedings of the …, 2019 - dl.acm.org
Using flash-based solid state drives (SSDs) as main memory has been proposed as a
practical solution towards scaling memory capacity for data-intensive applications. However …

Design and Analysis of an APU for Exascale Computing

T Vijayaraghavan, Y Eckert, GH Loh… - … Symposium on High …, 2017 - ieeexplore.ieee.org
The challenges to push computing to exaflop levels are difficult given desired targets for
memory capacity, memory bandwidth, power efficiency, reliability, and cost. This paper …

Rethinking TLB designs in virtualized environments: A very large part-of-memory TLB

JH Ryoo, N Gulur, S Song, LK John - ACM SIGARCH Computer …, 2017 - dl.acm.org
With increasing deployment of virtual machines for cloud services and server applications,
memory address translation overheads in virtualized environments have received great …

A survey of operating system support for persistent memory

M Cai, H Huang - Frontiers of Computer Science, 2021 - Springer
Emerging persistent memory technologies, like PCM and 3D XPoint, offer numerous
advantages, such as higher density, larger capacity, and better energy efficiency, compared …

Banshee: Bandwidth-efficient DRAM caching via software/hardware cooperation

X Yu, CJ Hughes, N Satish, O Mutlu… - Proceedings of the 50th …, 2017 - dl.acm.org
Placing the DRAM in the same package as a processor enables several times higher
memory bandwidth than conventional off-package DRAM. Yet, the latency of in-package …

Space: locality-aware processing in heterogeneous memory for personalized recommendations

H Kal, S Lee, G Ko, WW Ro - 2021 ACM/IEEE 48th Annual …, 2021 - ieeexplore.ieee.org
Personalized recommendation systems have become a major AI application in modern data
centers. The main challenges in processing personalized recommendation inferences are …