A survey on coarse-grained reconfigurable architectures from a performance perspective

A Podobas, K Sano, S Matsuoka - IEEE Access, 2020 - ieeexplore.ieee.org
With the end of both Dennard's scaling and Moore's law, computer users and researchers
are aggressively exploring alternative forms of computing in order to continue the …

Dsagen: Synthesizing programmable spatial accelerators

J Weng, S Liu, V Dadu, Z Wang, P Shah… - 2020 ACM/IEEE 47th …, 2020 - ieeexplore.ieee.org
Domain-specific hardware accelerators can provide orders of magnitude speedup and
energy efficiency over general purpose processors. However, they require extensive manual …

Hardware acceleration of sparse and irregular tensor computations of ml models: A survey and insights

S Dave, R Baghdadi, T Nowatzki… - Proceedings of the …, 2021 - ieeexplore.ieee.org
Machine learning (ML) models are widely used in many important domains. For efficiently
processing these computational-and memory-intensive applications, tensors of these …

Fifer: Practical acceleration of irregular applications on reconfigurable architectures

QM Nguyen, D Sanchez - MICRO-54: 54th Annual IEEE/ACM …, 2021 - dl.acm.org
Coarse-grain reconfigurable arrays (CGRAs) can achieve much higher performance and
efficiency than general-purpose cores, approaching the performance of a specialized design …

Polygraph: Exposing the value of flexibility for graph processing accelerators

V Dadu, S Liu, T Nowatzki - 2021 ACM/IEEE 48th Annual …, 2021 - ieeexplore.ieee.org
Because of the importance of graph workloads and the limitations of CPUs/GPUs, many
graph processing accelerators have been proposed. The basic approach of prior …

OverGen: Improving FPGA usability through domain-specific overlay generation

S Liu, J Weng, D Kupsh… - 2022 55th IEEE/ACM …, 2022 - ieeexplore.ieee.org
FPGAs have been proven to be powerful computational accelerators across many types of
workloads. The mainstream programming approach is high level synthesis (HLS), which …

Riptide: A programmable, energy-minimal dataflow compiler and architecture

G Gobieski, S Ghosh, M Heule, T Mowry… - 2022 55th IEEE/ACM …, 2022 - ieeexplore.ieee.org
Emerging sensing applications create an unprecedented need for energy efficiency in
programmable processors. To achieve useful multi-year deployments on a small battery or …

The sparse abstract machine

O Hsu, M Strange, R Sharma, J Won… - Proceedings of the 28th …, 2023 - dl.acm.org
We propose the Sparse Abstract Machine (SAM), an abstract machine model for targeting
sparse tensor algebra to reconfigurable and fixed-function spatial dataflow accelerators …

Symphony: Orchestrating sparse and dense tensors with hierarchical heterogeneous processing

M Pellauer, J Clemons, V Balaji, N Crago… - ACM Transactions on …, 2023 - dl.acm.org
Sparse tensor algorithms are becoming widespread, particularly in the domains of deep
learning, graph and data analytics, and scientific computing. Current high-performance …

Taskstream: Accelerating task-parallel workloads by recovering program structure

V Dadu, T Nowatzki - Proceedings of the 27th ACM International …, 2022 - dl.acm.org
Reconfigurable accelerators, like CGRAs and dataflow architectures, have come to
prominence for addressing data-processing problems. However, they are largely limited to …