A survey of techniques for cache partitioning in multicore processors

S Mittal - ACM Computing Surveys (CSUR), 2017 - dl.acm.org
As the number of on-chip cores and memory demands of applications increase, judicious
management of cache resources has become not merely attractive but imperative. Cache …

Cooperative caching for chip multiprocessors

J Chang, GS Sohi - ACM SIGARCH Computer Architecture News, 2006 - dl.acm.org
This paper presents CMP Cooperative Caching, a unified framework to manage a CMP's
aggregate on-chip cache resources. Cooperative caching combines the strengths of private …

Jigsaw: Scalable software-defined caches

N Beckmann, D Sanchez - Proceedings of the 22nd …, 2013 - ieeexplore.ieee.org
Shared last-level caches, widely used in chip-multi-processors (CMPs), face two
fundamental limitations. First, the latency and energy of shared caches degrade as the …

Jenga: Software-defined cache hierarchies

PA Tsai, N Beckmann, D Sanchez - Proceedings of the 44th Annual …, 2017 - dl.acm.org
Caches are traditionally organized as a rigid hierarchy, with multiple levels of progressively
larger and slower memories. Hierarchy allows a simple, fixed design to benefit a wide range …

[图书][B] Multi-core cache hierarchies

R Balasubramonian, NP Jouppi, N Muralimanohar - 2011 - books.google.com
A key determinant of overall system performance and power dissipation is the cache
hierarchy since access to off-chip memory consumes many more cycles and energy than on …

Whirlpool: Improving dynamic cache management with static data classification

A Mukkara, N Beckmann, D Sanchez - ACM SIGARCH Computer …, 2016 - dl.acm.org
Cache hierarchies are increasingly non-uniform and difficult to manage. Several techniques,
such as scratchpads or reuse hints, use static information about how programs access data …

CloudCache: Expanding and shrinking private caches

H Lee, S Cho, BR Childers - 2011 IEEE 17th International …, 2011 - ieeexplore.ieee.org
The number of cores in a single chip multiprocessor is expected to grow in coming years.
Likewise, aggregate on-chip cache capacity is increasing fast and its effective utilization is …

Jumanji: The case for dynamic nuca in the datacenter

BC Schwedock, N Beckmann - 2020 53rd Annual IEEE/ACM …, 2020 - ieeexplore.ieee.org
The datacenter introduces new challenges for computer systems around tail latency and
security. This paper argues that dynamic NUCA techniques are a better solution to these …

Selective replication in memory-side GPU caches

X Zhao, M Jahre, L Eeckhout - 2020 53rd Annual IEEE/ACM …, 2020 - ieeexplore.ieee.org
Data-intensive applications put immense strain on the memory systems of Graphics
Processing Units (GPUs). To cater to this need, GPU memory systems distribute requests …

SLIP: reducing wire energy in the memory hierarchy

S Das, TM Aamodt, WJ Dally - Proceedings of the 42nd Annual …, 2015 - dl.acm.org
Wire energy has become the major contributor to energy in large lower level caches. While
wire energy is related to wire latency its costs are exposed differently in the memory …