REVAMP: A systematic framework for heterogeneous CGRA realization

TK Bandara, D Wijerathne, T Mitra, LS Peh - Proceedings of the 27th …, 2022 - dl.acm.org
Coarse-Grained Reconfigurable Architectures (CGRAs) provide an excellent balance
between performance, energy efficiency, and flexibility. However, increasingly sophisticated …

Evaluation of hardware data prefetchers on server processors

M Bakhshalipour, S Tabaeiaghdaei… - ACM Computing …, 2019 - dl.acm.org
Data prefetching, ie, the act of predicting an application's future memory accesses and
fetching those that are not in the on-chip caches, is a well-known and widely used approach …

Adapt-noc: A flexible network-on-chip design for heterogeneous manycore architectures

H Zheng, K Wang, A Louri - 2021 IEEE international symposium …, 2021 - ieeexplore.ieee.org
The increased computational capability in heterogeneous manycore architectures facilitates
the concurrent execution of many applications. This requires, among other things, a flexible …

Energy-efficient networks-on-chip architectures: Design and run-time optimization

SK Mandal, A Krishnakumar, UY Ogras - Network-on-Chip Security and …, 2021 - Springer
Abstract Networks-on-Chip (NoC) architectures have become the mainstream
communication backbone of high-end processors and systems-on-chip (SoCs) after their …

A versatile and flexible chiplet-based system design for heterogeneous manycore architectures

H Zheng, K Wang, A Louri - 2020 57th ACM/IEEE Design …, 2020 - ieeexplore.ieee.org
Heterogeneous manycore architectures are deployed to simultaneously run multiple and
diverse applications. This requires various computing capabilities (CPUs, GPUs, and …

Slim noc: A low-diameter on-chip network topology for high energy efficiency and scalability

M Besta, SM Hassan, S Yalamanchili… - ACM SIGPLAN …, 2018 - dl.acm.org
Emerging chips with hundreds and thousands of cores require networks with unprecedented
energy/area efficiency and scalability. To address this, we propose Slim NoC (SN): a new on …

Aries: Accelerating distributed training in chiplet-based systems via flexible interconnects

L Yin, A Ghazizadeh, A Louri… - 2023 IEEE/ACM …, 2023 - ieeexplore.ieee.org
Large-scale deep learning models are widely deployed in many application domains with
remarkable performance improvements. However, training these models with immense …

TAMA: turn-aware mapping and architecture–a power-efficient network-on-chip approach

R Aligholipour, M Baharloo, B Farzaneh… - ACM Transactions on …, 2021 - dl.acm.org
Nowadays, static power consumption in chip multiprocessor (CMP) is the most crucial
concern of chip designers. Power-gating is an effective approach to mitigate static power …

Stitch: Fusible heterogeneous accelerators enmeshed with many-core architecture for wearables

C Tan, M Karunaratne, T Mitra… - 2018 ACM/IEEE 45th …, 2018 - ieeexplore.ieee.org
Wearable devices are now leveraging multi-core processors to cater to the increasing
computational demands of the applications via multi-threading. However, the power …

Task mapping on SMART NoC: Contention matters, not the distance

L Yang, W Liu, P Chen, N Guan, M Li - Proceedings of the 54th Annual …, 2017 - dl.acm.org
On-chip communication is the bottleneck of system performance for NoC-based MPSoCs.
SMART, a recently proposed NoC architecture, enables single-cycle multi-hop …