Design options for optical ring interconnect in future client devices

P Grani, S Bartolini - ACM Journal on Emerging Technologies in …, 2014 - dl.acm.org
Nanophotonic is a promising solution for on-chip interconnection due to its intrinsic low-
latency and low-power features. Future tiled chip multiprocessors (CMPs) for rich client …

A direct coherence protocol for many-core chip multiprocessors

A Ros, ME Acacio, JM Garcia - IEEE Transactions on Parallel …, 2010 - ieeexplore.ieee.org
Future many-core CMP designs that will integrate tens of processor cores on-chip will be
constrained by area and power. Area constraints make impractical the use of a bus or a …

Dynamic cache clustering for chip multiprocessors

M Hammoud, S Cho, R Melhem - … of the 23rd international conference on …, 2009 - dl.acm.org
This paper proposes DCC (Dynamic Cache Clustering), a novel distributed cache
management scheme for large-scale chip multiprocessors. Using DCC, a per-core cache …

A simple on-chip optical interconnection for improving performance of coherency traffic in cmps

S Bartolini, P Grani - 2012 15th Euromicro Conference on …, 2012 - ieeexplore.ieee.org
Nanophotonic interconnection is a promising solution for inter-core communication in future
chip multiprocessors (CMPs). Main benefits derive from its intrinsic low-latency and high …

Victim retention for reducing cache misses in tiled chip multiprocessors

S Das, HK Kapoor - Microprocessors and Microsystems, 2014 - Elsevier
This paper presents CMP-VR (Chip-Multiprocessor with Victim Retention), an approach to
improve cache performance by reducing the number of off-chip memory accesses. The …

Cache coherence protocols for many-core CMPs

A Ros, ME Acacio, JM Garcıa - Parallel and Distributed …, 2010 - books.google.com
Multi-core architectures have emerged as the best alternative to take advantage of the
increasing number of transistors currently offered in a single die. For example, the dual-core …

A scalable organization for distributed directories

A Ros, ME Acacio, JM García - Journal of Systems Architecture, 2010 - Elsevier
Although directory-based cache-coherence protocols are the best choice when designing
chip multiprocessors with tens of cores on-chip, the memory overhead introduced by the …

The Tag Filter Architecture: An energy-efficient cache and directory design

JJ Valls, A Ros, ME Gómez, J Sahuquillo - Journal of Parallel and …, 2017 - Elsevier
Power consumption in current high-performance chip multiprocessors (CMPs) has become
a major design concern that aggravates with the current trend of increasing the core count. A …

Dynamic associativity management in tiled CMPs by runtime adaptation of fellow sets

S Das, HK Kapoor - IEEE Transactions on Parallel and …, 2017 - ieeexplore.ieee.org
The non-uniform distribution of memory accesses among the cache sets results in some sets
being used heavily while certain others remaining underutilized. Dynamic associativity …

A directory cache with dynamic private-shared partitioning

JJ Valls, ME Gómez, A Ros… - 2016 IEEE 23rd …, 2016 - ieeexplore.ieee.org
As the core counts increase in each chip multiprocessor generation, coherence protocols
should improve scalability inperformance, area, and energy consumption to meet the …