Sentinel: Efficient tensor migration and allocation on heterogeneous memory systems for deep learning

J Ren, J Luo, K Wu, M Zhang… - 2021 IEEE International …, 2021 - ieeexplore.ieee.org
Memory capacity is a major bottleneck for training deep neural networks (DNN).
Heterogeneous memory (HM) combining fast and slow memories provides a promising …

Sparta: High-performance, element-wise sparse tensor contraction on heterogeneous memory

J Liu, J Ren, R Gioiosa, D Li, J Li - … on Principles and Practice of Parallel …, 2021 - dl.acm.org
Sparse tensor contractions appear commonly in many applications. Efficiently computing a
two sparse tensor product is challenging: It not only inherits the challenges from common …

A survey of non-volatile main memory technologies: State-of-the-arts, practices, and future directions

HK Liu, D Chen, H Jin, XF Liao, B He, K Hu… - Journal of Computer …, 2021 - Springer
Non-Volatile Main Memories (NVMMs) have recently emerged as a promising
technology for future memory systems. Generally, NVMMs have many desirable properties …

vTMM: Tiered memory management for virtual machines

S Sha, C Li, Y Luo, X Wang, Z Wang - Proceedings of the Eighteenth …, 2023 - dl.acm.org
The memory demand of virtual machines (VMs) is increasing, while the traditional
DRAM-only memory system has limited capacity and high power consumption. The tiered memory …

Hardware-Software Collaborative Tiered-Memory Management Framework for Virtualization

S Sha, C Li, X Wang, Z Wang, Y Luo - ACM Transactions on Computer …, 2024 - dl.acm.org
The tiered-memory system can effectively expand the memory capacity for virtual machines
(VMs). However, virtualization introduces new challenges specifically in enforcing …

MTM: Rethinking Memory Profiling and Migration for Multi-Tiered Large Memory

J Ren, D Xu, J Ryu, K Shin, D Kim, D Li - Proceedings of the Nineteenth …, 2024 - dl.acm.org
Multi-terabyte large memory systems are often characterized by more than two memory tiers
with different latency and bandwidth. Multi-tiered large memory systems call for rethinking of …

Athena: High-performance sparse tensor contraction sequence on heterogeneous memory

J Liu, D Li, R Gioiosa, J Li - Proceedings of the 35th ACM International …, 2021 - dl.acm.org
Sparse tensor contraction sequence has been widely employed in many fields, such as
chemistry and physics. However, how to efficiently implement the sequence faces multiple …

Optimizing large-scale plasma simulations on persistent memory-based heterogeneous memory with effective data placement across memory hierarchy

J Ren, J Luo, I Peng, K Wu, D Li - Proceedings of the ACM International …, 2021 - dl.acm.org
Particle simulations of plasma are important for understanding plasma dynamics in space
weather and fusion devices. However, production simulations that use billions and even …

Active data replica recovery for quality-assurance Big Data analysis in IC-IoT

S Wang, J Yuan, X Li, Z Qian, F Arena, I You - IEEE Access, 2019 - ieeexplore.ieee.org
QoS-aware big data analysis is critical in Information-Centric Internet of Things (IC-IoT)
system to support various applications like smart city, smart grid, smart health, intelligent …

Design guidelines for high-performance SCM hierarchies

D Ustiugov, A Daglis, J Picorel, M Sutherland… - Proceedings of the …, 2018 - dl.acm.org
With emerging storage-class memory (SCM) nearing commercialization, there is evidence
that it will deliver the much-anticipated high density and access latencies within only a few …