A Survey of Design and Optimization for Systolic Array-based DNN Accelerators

R Xu, S Ma, Y Guo, D Li - ACM Computing Surveys, 2023 - dl.acm.org
In recent years, it has been witnessed that the systolic array is a successful architecture for
DNN hardware accelerators. However, the design of systolic arrays also encountered many …

AutoSA: A polyhedral compiler for high-performance systolic arrays on FPGA

J Wang, L Guo, J Cong - The 2021 ACM/SIGDA International Symposium …, 2021 - dl.acm.org
While systolic array architectures have the potential to deliver tremendous performance, it is
notoriously challenging to customize an efficient systolic array processor for a target …

Cryptensor: A resource-shared co-processor to accelerate convolutional neural network and polynomial convolution

JC See, HF Ng, HK Tan, JJ Chang… - … on Computer-Aided …, 2023 - ieeexplore.ieee.org
Practical deployment of convolutional neural network (CNN) and cryptography algorithm on
constrained devices are challenging due to the huge computation and memory requirement …

An efficient hardware design for accelerating sparse CNNs with NAS-based models

Y Liang, L Lu, Y Jin, J Xie, R Huang… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Deep convolutional neural networks (CNNs) have achieved remarkable performance at the
cost of huge computation. As the CNN models become more complex and deeper …

Tensorlib: A spatial accelerator generation framework for tensor algebra

L Jia, Z Luo, L Lu, Y Liang - 2021 58th ACM/IEEE Design …, 2021 - ieeexplore.ieee.org
Tensor algebra finds applications in various domains, and these applications, especially
when accelerated on spatial hardware accelerators, can deliver high performance and low …

Most resource efficient matrix vector multiplication on FPGAs

A Lehnert, P Holzinger, S Pfenning, R Müller… - IEEE …, 2023 - ieeexplore.ieee.org
Fast and resource-efficient inference in artificial neural networks (ANNs) is of utmost
importance and drives many new developments in the area of new hardware architectures …

High-frequency systolic array-based transformer accelerator on field programmable gate arrays

Y Chen, T Li, X Chen, Z Cai, T Su - Electronics, 2023 - mdpi.com
The systolic array is frequently used in accelerators for neural networks, including
Transformer models that have recently achieved remarkable progress in natural language …

Automatic generation of spatial accelerator for tensor algebra

L Jia, Z Luo, L Lu, Y Liang - IEEE Transactions on Computer …, 2022 - ieeexplore.ieee.org
Tensor algebra finds applications in various domains, including machine learning
applications, data analytics, and others. Spatial hardware accelerators are widely used to …

Tensor networks for simulating quantum circuits on FPGAs

M Levental - arXiv preprint arXiv:2108.06831, 2021 - arxiv.org
Most research in quantum computing today is performed against simulations of quantum
computers rather than true quantum computers. Simulating a quantum computer entails …

A survey on system-on-a-chip design using chisel hw construction language

M Käyrä, TD Hämäläinen - IECON 2021–47th Annual …, 2021 - ieeexplore.ieee.org
This paper presents a survey of functional programming languages in System-on-a-Chip
(SoC) design. The motivation is improving the design productivity by better source code …