IPC considered harmful for multiprocessor workloads

C Delimitrou, C Kozyrakis - ACM SIGPLAN Notices, 2013 - dl.acm.org

Large-scale datacenters (DCs) host tens of thousands of diverse applications each day.
However, interference between colocated workloads and the difficulty to match applications …

被引用次数：983 相关文章所有 16 个版本

[PDF] mit.edu

ZSim: Fast and accurate microarchitectural simulation of thousand-core systems

D Sanchez, C Kozyrakis - ACM SIGARCH Computer architecture news, 2013 - dl.acm.org

Architectural simulation is time-consuming, and the trend towards hundreds of cores is
making sequential simulation even slower. Existing parallel simulation techniques either …

被引用次数：723 相关文章所有 27 个版本

[PDF] mit.edu

GARNET: A detailed on-chip network model inside a full-system simulator

N Agarwal, T Krishna, LS Peh… - 2009 IEEE international …, 2009 - ieeexplore.ieee.org

Until very recently, microprocessor designs were computation-centric. On-chip
communication was frequently ignored. This was because of fast, single-cycle on-chip …

被引用次数：952 相关文章所有 15 个版本

[PDF] googleapis.com

CPI² CPU performance isolation for shared compute clusters

X Zhang, E Tune, R Hagmann, R Jnagal… - Proceedings of the 8th …, 2013 - dl.acm.org

Performance isolation is a key challenge in cloud computing. Unfortunately, Linux has few
defenses against performance interference in shared resources such as processor caches …

被引用次数：422 相关文章所有 11 个版本

[PDF] usenix.org

{DeepDive}: Transparently identifying and managing performance interference in virtualized environments

D Novaković, N Vasić, S Novaković, D Kostić… - 2013 USENIX Annual …, 2013 - usenix.org

We describe the design and implementation of DeepDive, a system for transparently
identifying and managing performance interference between virtual machines (VMs) co …

被引用次数：351 相关文章所有 18 个版本

[PDF] researchgate.net

System-level performance metrics for multiprogram workloads

S Eyerman, L Eeckhout - IEEE micro, 2008 - ieeexplore.ieee.org

Assessing the performance of multiprogram workloads running on multithreaded hardware
is difficult because it involves a balance between single-program performance and overall …

被引用次数：524 相关文章所有 13 个版本

[PDF] mit.edu

Ubik: Efficient cache sharing with strict QoS for latency-critical workloads

H Kasture, D Sanchez - ACM Sigplan Notices, 2014 - dl.acm.org

Chip-multiprocessors (CMPs) must often execute workload mixes with different performance
requirements. On one hand, user-facing, latency-critical applications (eg, web search) need …

被引用次数：226 相关文章所有 8 个版本

[PDF] wisc.edu

Cooperative caching for chip multiprocessors

J Chang, GS Sohi - ACM SIGARCH Computer Architecture News, 2006 - dl.acm.org

This paper presents CMP Cooperative Caching, a unified framework to manage a CMP's
aggregate on-chip cache resources. Cooperative caching combines the strengths of private …

被引用次数：575 相关文章所有 36 个版本

[PDF] gatech.edu

The ZCache: Decoupling ways and associativity

D Sanchez, C Kozyrakis - 2010 43rd Annual IEEE/ACM …, 2010 - ieeexplore.ieee.org

The ever-increasing importance of main memory latency and bandwidth is pushing CMPs
towards caches with higher capacity and associativity. Associativity is typically improved by …

被引用次数：280 相关文章所有 23 个版本

[PDF] iitdh.ac.in

A survey of cache simulators

H Brais, R Kalayappan, PR Panda - ACM Computing Surveys (CSUR), 2020 - dl.acm.org

Computer architecture simulation tools are essential for implementing and evaluating new
ideas in the domain and can be useful for understanding the behavior of programs and …

被引用次数：29 相关文章所有 6 个版本

高级搜索

QQ 群