A case for bufferless routing in on-chip networks

T Moscibroda, O Mutlu - Proceedings of the 36th annual international …, 2009 - dl.acm.org
Buffers in on-chip networks consume significant energy, occupy chip area, and increase
design complexity. In this paper, we make a case for a new approach to designing on-chip …

Predicting inter-thread cache contention on a chip multi-processor architecture

D Chandra, F Guo, S Kim… - … Symposium on High …, 2005 - ieeexplore.ieee.org
This paper studies the impact of L2 cache sharing on threads that simultaneously share the
cache, on a chip multi-processor (CMP) architecture. Cache sharing impacts threads …

Cache coherence for GPU architectures

I Singh, A Shriraman, WWL Fung… - 2013 IEEE 19th …, 2013 - ieeexplore.ieee.org
While scalable coherence has been extensively studied in the context of general purpose
chip multiprocessors (CMPs), GPU architectures present a new set of challenges …

Token coherence: Decoupling performance and correctness

MMK Martin, MD Hill, DA Wood - ACM SIGARCH Computer Architecture …, 2003 - dl.acm.org
Many future shared-memory multiprocessor servers will both target commercial workloads
and use highly-integrated" glueless" designs. Implementing low-latency cache coherence in …

SCORPIO: A 36-core research chip demonstrating snoopy coherence on a scalable mesh NoC with in-network ordering

BK Daya, CHO Chen, S Subramanian… - ACM SIGARCH …, 2014 - dl.acm.org
In the many-core era, scalable coherence and on-chip interconnects are crucial for shared
memory processors. While snoopy coherence is common in small multicore systems …

Variability in architectural simulations of multi-threaded workloads

AR Alameldeen, DA Wood - The Ninth International …, 2003 - ieeexplore.ieee.org
Multi-threaded commercial workloads implement many important Internet services.
Consequently, these workloads are increasingly used to evaluate the performance of …

Improving multiprocessor performance with coarse-grain coherence tracking

JF Cantin, MH Lipasti, JE Smith - … International Symposium on …, 2005 - ieeexplore.ieee.org
To maintain coherence in conventional shared-memory multiprocessor systems, processors
first check other processors' caches before obtaining data from memory. This coherence …

Cyber physical social systems: Towards deeply integrated hybridized systems

F Dressler - 2018 International Conference on Computing …, 2018 - ieeexplore.ieee.org
Research on Cyber Physical Systems (CPS) has led to quite a number of astonishing
technical solutions that are becoming standard in many application domains affecting our …

In-network snoop ordering (INSO): Snoopy coherence on unordered interconnects

N Agarwal, LS Peh, NK Jha - 2009 IEEE 15th International …, 2009 - ieeexplore.ieee.org
Realizing scalable cache coherence in the many-core era comes with a whole new set of
constraints and opportunities. It is widely believed that multi-hop, unordered on-chip …

Widir: A wireless-enabled directory cache coherence protocol

A Franques, A Kokolis, S Abadal… - … Symposium on High …, 2021 - ieeexplore.ieee.org
As the core count in shared-memory manycores keeps increasing, it is becoming
increasingly harder to design cache-coherence protocols that deliver high performance …