Paragon: QoS-aware scheduling for heterogeneous datacenters

C Delimitrou, C Kozyrakis - ACM SIGPLAN Notices, 2013 - dl.acm.org
Large-scale datacenters (DCs) host tens of thousands of diverse applications each day.
However, interference between colocated workloads and the difficulty to match applications …

ZSim: Fast and accurate microarchitectural simulation of thousand-core systems

D Sanchez, C Kozyrakis - ACM SIGARCH Computer architecture news, 2013 - dl.acm.org
Architectural simulation is time-consuming, and the trend towards hundreds of cores is
making sequential simulation even slower. Existing parallel simulation techniques either …

GARNET: A detailed on-chip network model inside a full-system simulator

N Agarwal, T Krishna, LS Peh… - 2009 IEEE international …, 2009 - ieeexplore.ieee.org
Until very recently, microprocessor designs were computation-centric. On-chip
communication was frequently ignored. This was because of fast, single-cycle on-chip …

CPI2 CPU performance isolation for shared compute clusters

X Zhang, E Tune, R Hagmann, R Jnagal… - Proceedings of the 8th …, 2013 - dl.acm.org
Performance isolation is a key challenge in cloud computing. Unfortunately, Linux has few
defenses against performance interference in shared resources such as processor caches …

{DeepDive}: Transparently identifying and managing performance interference in virtualized environments

D Novaković, N Vasić, S Novaković, D Kostić… - 2013 USENIX Annual …, 2013 - usenix.org
We describe the design and implementation of DeepDive, a system for transparently
identifying and managing performance interference between virtual machines (VMs) co …

System-level performance metrics for multiprogram workloads

S Eyerman, L Eeckhout - IEEE micro, 2008 - ieeexplore.ieee.org
Assessing the performance of multiprogram workloads running on multithreaded hardware
is difficult because it involves a balance between single-program performance and overall …

Ubik: Efficient cache sharing with strict QoS for latency-critical workloads

H Kasture, D Sanchez - ACM Sigplan Notices, 2014 - dl.acm.org
Chip-multiprocessors (CMPs) must often execute workload mixes with different performance
requirements. On one hand, user-facing, latency-critical applications (eg, web search) need …

Cooperative caching for chip multiprocessors

J Chang, GS Sohi - ACM SIGARCH Computer Architecture News, 2006 - dl.acm.org
This paper presents CMP Cooperative Caching, a unified framework to manage a CMP's
aggregate on-chip cache resources. Cooperative caching combines the strengths of private …

The ZCache: Decoupling ways and associativity

D Sanchez, C Kozyrakis - 2010 43rd Annual IEEE/ACM …, 2010 - ieeexplore.ieee.org
The ever-increasing importance of main memory latency and bandwidth is pushing CMPs
towards caches with higher capacity and associativity. Associativity is typically improved by …

A survey of cache simulators

H Brais, R Kalayappan, PR Panda - ACM Computing Surveys (CSUR), 2020 - dl.acm.org
Computer architecture simulation tools are essential for implementing and evaluating new
ideas in the domain and can be useful for understanding the behavior of programs and …