Pond: Cxl-based memory pooling systems for cloud platforms

H Li, DS Berger, L Hsu, D Ernst, P Zardoshti… - Proceedings of the 28th …, 2023 - dl.acm.org
Public cloud providers seek to meet stringent performance requirements and low hardware
cost. A key driver of performance and cost is main memory. Memory pooling promises to …

Sok: The challenges, pitfalls, and perils of using hardware performance counters for security

S Das, J Werner, M Antonakakis… - … IEEE Symposium on …, 2019 - ieeexplore.ieee.org
Hardware Performance Counters (HPCs) have been available in processors for more than a
decade. These counters can be used to monitor and measure events that occur at the CPU …

DCAPS: Dynamic cache allocation with partial sharing

Y Xiang, X Wang, Z Huang, Z Wang, Y Luo… - Proceedings of the …, 2018 - dl.acm.org
In a multicore system, effective management of shared last level cache (LLC), such as
hardware/software cache partitioning, has attracted significant research attention. Some …

Harvesting Memory-bound {CPU} Stall Cycles in Software with {MSH}

Z Luo, S Son, S Ratnasamy, S Shenker - 18th USENIX Symposium on …, 2024 - usenix.org
Memory-bound stalls account for a significant portion of CPU cycles in datacenter
workloads, which makes harvesting them to execute other useful work highly valuable …

Numamma: Numa memory analyzer

F Trahay, M Selva, L Morel, K Marquet - Proceedings of the 47th …, 2018 - dl.acm.org
Non Uniform Memory Access (NUMA) architectures are nowadays common for running High-
Performance Computing (HPC) applications. In such architectures, several distinct physical …

Profiling dataflow systems on multiple abstraction levels

A Beischl, T Kersten, M Bandle, J Giceva… - Proceedings of the …, 2021 - dl.acm.org
Dataflow graphs are a popular abstraction for describing computation, used in many
systems for high-level optimization. For execution, dataflow graphs are lowered and …

On the precision of precise event based sampling

J Yi, B Dong, M Dong, H Chen - Proceedings of the 11th ACM SIGOPS …, 2020 - dl.acm.org
Many performance studies rely on Intel's Precise Event Based Sampling (PEBS) to collect
processor events, where precision is a key for the reliability of analysis. In this paper, we …

Precise event sampling on amd versus intel: Quantitative and qualitative comparison

MA Sasongko, M Chabbi, PHJ Kelly… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Precise event sampling is a profiling feature in commodity processors that can sample
hardware events and accurately locate the instructions that trigger the events. This feature …

TMC: Near-Optimal Resource Allocation for Tiered-Memory Systems

Y Ni, P Mehra, E Miller, H Litz - … of the 2023 ACM Symposium on Cloud …, 2023 - dl.acm.org
Main memory dominates data center server cost, and hence data center operators are
exploring alternative technologies such as CXL-attached and persistent memory to improve …

Specularizer : Detecting Speculative Execution Attacks via Performance Tracing

W Wang, G Chen, Y Cheng, Y Zhang, Z Lin - Detection of Intrusions and …, 2021 - Springer
This paper presents Specularizer, a framework for uncovering speculative execution attacks
using performance tracing features available in commodity processors. It is motivated by the …