Coarse-grain coherence tracking: RegionScout and region coherence arrays

JF Cantin, JE Smith, MH Lipasti, A Moshovos… - IEEE Micro, 2006 - ieeexplore.ieee.org
Cache-coherent shared-memory multiprocessors have wide-ranging applications, from
commercial transaction processing and database services to large-scale scientific …

Coherence domain restriction on large scale systems

Y Fu, TM Nguyen, D Wentzlaff - … of the 48th International Symposium on …, 2015 - dl.acm.org
Designing massive scale cache coherence systems has been an elusive goal. Whether it be
on large-scale GPUs, future thousand-core chips, or across million-core warehouse scale …

Virtual tree coherence: Leveraging regions and in-network multicast trees for scalable cache coherence

NDE Jerger, LS Peh, MH Lipasti - 2008 41st IEEE/ACM …, 2008 - ieeexplore.ieee.org
Scalable cache coherence solutions are imperative to drive the many-core revolution
forward. To fully realize the massive computation power of these many-core architectures …

Improving multiprocessor performance with coarse-grain coherence tracking

JF Cantin, MH Lipasti, JE Smith - … International Symposium on …, 2005 - ieeexplore.ieee.org
To maintain coherence in conventional shared-memory multiprocessor systems, processors
first check other processors' caches before obtaining data from memory. This coherence …

Cache coherence in large-scale shared-memory multiprocessors: Issues and comparisons

DJ Lilja - ACM Computing Surveys (CSUR), 1993 - dl.acm.org
Due to data spreading among processors and due to the cache coherence problem, private
data caches have not been as effective in reducing the average memory delay in …

[PDF][PDF] Cache coherence techniques for multicore processors

MR Marty - 2008 - research.cs.wisc.edu
The cache coherence mechanisms are a key component towards achieving the goal of
continuing exponential performance growth through widespread thread-level parallelism …

SCD: A scalable coherence directory with flexible sharer set encoding

D Sanchez, C Kozyrakis - IEEE International Symposium on …, 2012 - ieeexplore.ieee.org
Large-scale CMPs with hundreds of cores require a directory-based protocol to maintain
cache coherence. However, previously proposed coherence directories are hard to scale …

Compiler and hardware support for cache coherence in large-scale multiprocessors: Design considerations and performance study

L Choi, PC Yew - ACM SIGARCH Computer Architecture News, 1996 - dl.acm.org
In this paper, we study a hardware-supported, compiler directed (HSCD) cache coherence
scheme, which can be implemented on a large-scale multiprocessor using off-the-shelf …

WAYPOINT: scaling coherence to thousand-core architectures

JH Kelm, MR Johnson, SS Lumettta… - Proceedings of the 19th …, 2010 - dl.acm.org
In this paper, we evaluate a set of coherence architectures in the context of a 1024-core chip
multiprocessor (CMP) tailored to throughput-oriented parallel workloads. Based on our …

Flask coherence: A morphable hybrid coherence protocol to balance energy, performance and scalability

LG Menezo, V Puente… - 2015 IEEE 21st …, 2015 - ieeexplore.ieee.org
This work proposes a mechanism to hybridize the benefits of snoop-based and directory-
based coherence protocols in a single construct. A non-inclusive sparse-directory is used to …