A survey of coarse-grained reconfigurable architecture and design: Taxonomy, challenges, and applications

L Liu, J Zhu, Z Li, Y Lu, Y Deng, J Han, S Yin… - ACM Computing …, 2019 - dl.acm.org
As general-purpose processors have hit the power wall and chip fabrication cost escalates
alarmingly, coarse-grained reconfigurable architectures (CGRAs) are attracting increasing …

SPINN: synergistic progressive inference of neural networks over device and cloud

S Laskaridis, SI Venieris, M Almeida… - Proceedings of the 26th …, 2020 - dl.acm.org
Despite the soaring use of convolutional neural networks (CNNs) in mobile applications,
uniformly sustaining high-performance inference on mobile has been elusive due to the …

A configurable cloud-scale DNN processor for real-time AI

J Fowers, K Ovtcharov, M Papamichael… - 2018 ACM/IEEE 45th …, 2018 - ieeexplore.ieee.org
Interactive AI-powered services require low-latency evaluation of deep neural network
(DNN) models-aka"" real-time AI"". The growing demand for computationally expensive …

A cloud-scale acceleration architecture

AM Caulfield, ES Chung, A Putnam… - 2016 49th Annual …, 2016 - ieeexplore.ieee.org
Hyperscale datacenter providers have struggled to balance the growing need for
specialized hardware (efficiency) with the economic benefits of homogeneity …

Serving dnns in real time at datacenter scale with project brainwave

E Chung, J Fowers, K Ovtcharov, M Papamichael… - iEEE Micro, 2018 - ieeexplore.ieee.org
To meet the computational demands required of deep learning, cloud operators are turning
toward specialized hardware for improved efficiency and performance. Project Brainwave …

FPGA-based remote power side-channel attacks

M Zhao, GE Suh - 2018 IEEE Symposium on Security and …, 2018 - ieeexplore.ieee.org
The rapid adoption of heterogeneous computing has driven the integration of Field
Programmable Gate Arrays (FPGAs) into cloud datacenters and flexible System-on-Chips …

Plasticine: A reconfigurable architecture for parallel paterns

R Prabhakar, Y Zhang, D Koeplinger… - ACM SIGARCH …, 2017 - dl.acm.org
Reconfigurable architectures have gained popularity in recent years as they allow the
design of energy-efficient accelerators. Fine-grain fabrics (eg FPGAs) have traditionally …

Spatial: A language and compiler for application accelerators

D Koeplinger, M Feldman, R Prabhakar… - Proceedings of the 39th …, 2018 - dl.acm.org
Industry is increasingly turning to reconfigurable architectures like FPGAs and CGRAs for
improved performance and energy efficiency. Unfortunately, adoption of these architectures …

Energy-efficient CNN implementation on a deeply pipelined FPGA cluster

C Zhang, D Wu, J Sun, G Sun, G Luo… - Proceedings of the 2016 …, 2016 - dl.acm.org
Recently, FPGA-based CNN accelerators have demonstrated superior energy efficiency
compared to high-performance devices like GPGPUs. However, due to the constrained on …

Kv-direct: High-performance in-memory key-value store with programmable nic

B Li, Z Ruan, W Xiao, Y Lu, Y Xiong, A Putnam… - Proceedings of the 26th …, 2017 - dl.acm.org
Performance of in-memory key-value store (KVS) continues to be of great importance as
modern KVS goes beyond the traditional object-caching workload and becomes a key …