SCORPIO: A 36-core research chip demonstrating snoopy coherence on a scalable mesh NoC with in-network ordering

BK Daya, CHO Chen, S Subramanian… - ACM SIGARCH …, 2014 - dl.acm.org
In the many-core era, scalable coherence and on-chip interconnects are crucial for shared
memory processors. While snoopy coherence is common in small multicore systems …

Netrace: dependency-driven trace-based network-on-chip simulation

J Hestness, B Grot, SW Keckler - … of the Third International Workshop on …, 2010 - dl.acm.org
Chip multiprocessors (CMPs) and systems-on-chip (SOCs) are expected to grow in core
count from a few today to hundreds or more. Since efficient on-chip communication is a …

Towards the ideal on-chip fabric for 1-to-many and many-to-1 communication

T Krishna, LS Peh, BM Beckmann… - Proceedings of the 44th …, 2011 - dl.acm.org
The prevalence of multicore architectures has accentuated the need for scalable cache
coherence solutions. Many of the proposed designs use a mix of 1-to-1, 1-to-many (1-to-M) …

An abacus turn model for time/space-efficient reconfigurable routing

B Fu, Y Han, J Ma, H Li, X Li - Proceedings of the 38th annual …, 2011 - dl.acm.org
Applications' traffic tends to be bursty and the location of hot-spot nodes moves as time goes
by. This will significantly aggravate the blocking problem of wormhole-routed Network-on …

Virtual channels vs. multiple physical networks: a comparative analysis

YJ Yoon, N Concer, M Petracca, L Carloni - Proceedings of the 47th …, 2010 - dl.acm.org
Packet-switched networks-on-chip (NoC) have been proposed as an efficient
communication infrastructure for multi-core architectures. Adding virtual channels to a NoC …

Virtual channels and multiple physical networks: Two alternatives to improve NoC performance

YJ Yoon, N Concer, M Petracca… - IEEE Transactions on …, 2013 - ieeexplore.ieee.org
Virtual channels (VC) and multiple physical (MP) networks are two alternative methods to
provide better performance, support quality-of-service, and avoid protocol deadlocks in …

WiDir: A wireless-enabled directory cache coherence protocol

A Franques, A Kokolis, S Abadal… - … Symposium on High …, 2021 - ieeexplore.ieee.org
As the core count in shared-memory manycores keeps increasing, it is becoming
increasingly harder to design cache-coherence protocols that deliver high performance …

Subspace snooping: Filtering snoops with operating system support

D Kim, J Ahn, J Kim, J Huh - … of the 19th international conference on …, 2010 - dl.acm.org
Although snoop-based coherence protocols provide fast cache-to-cache transfers with a
simple and robust coherence mechanism, scaling the protocols has been difficult due to the …

In-network coherence filtering: Snoopy coherence without broadcasts

N Agarwal, LS Peh, NK Jha - Proceedings of the 42nd Annual IEEE/ACM …, 2009 - dl.acm.org
With transistor miniaturization leading to an abundance of on-chip resources and
uniprocessor designs providing diminishing returns, the industry has moved beyond single …

Efficient sequential consistency in GPUs via relativistic cache coherence

X Ren, M Lis - 2017 IEEE International Symposium on High …, 2017 - ieeexplore.ieee.org
Recent work has argued that sequential consistency (SC) in GPUs can perform on par with
weak memory models, provided ordering stalls are made less frequent by relaxing ordering …