VWA: Hardware efficient vectorwise accelerator for convolutional neural network

KW Chang, TS Chang - … Transactions on Circuits and Systems I …, 2019 - ieeexplore.ieee.org
… This paper proposes a hardware efficient CNN accelerator that can execute different
convolution filters efficiently, such as 3 × 3, 4 × 4, 5 × 5, 3 × 3 stride 2, depthwise, and 1 × 1 …

Throughput-optimized FPGA accelerator for deep convolutional neural networks

Z Liu, Y Dou, J Jiang, J Xu, S Li, Y Zhou… - ACM Transactions on …, 2017 - dl.acm.org
… of an FPGA-based accelerator under given resources. Our main contributions are the
following: • We propose a parallel framework for FPGA-based CNN accelerators that exploits four …

A high performance FPGA-based accelerator for large-scale convolutional neural networks

H Li, X Fan, L Jiao, W Cao, X Zhou… - 2016 26th International …, 2016 - ieeexplore.ieee.org
… This work proposes an end-to-end FPGAbased CNN accelerator with all the layers mapped
on one chip so that different layers can work concurrently in a pipelined structure to increase …

An evaluation of edge tpu accelerators for convolutional neural networks

A Yazdanbakhsh, K Seshadri, B Akin… - arXiv preprint arXiv …, 2021 - research.google
… Edge TPUs are a domain of accelerators for low-power,edge … across 423K uniqueconvolutional
neural networks. Building upon … metrics ofEdge TPU accelerators. These learned models …

A GPU-outperforming FPGA accelerator architecture for binary convolutional neural networks

Y Li, Z Liu, K Xu, H Yu, F Ren - ACM Journal on Emerging Technologies …, 2018 - dl.acm.org
… hardware accelerators for convolutional neural networks (… mapped FPGA accelerator
architecture tailored for bitwise convolution and … A key advantage of the FPGA accelerator is that …

Designing novel AAD pooling in hardware for a convolutional neural network accelerator

K Khalil, O Eldash, A Kumar… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
… reconfigurable accelerator for deep convolutional neural net… on eight-layer convolutional
neural network with leaky rectified … based on the convolution neural network with rank based …

GoSPA: An energy-efficient high-performance globally optimized sparse convolutional neural network accelerator

C Deng, Y Sui, S Liao, X Qian… - 2021 ACM/IEEE 48th …, 2021 - ieeexplore.ieee.org
… in convolutional neural network (CNN) models makes sparsity-aware CNN hardware designs
very attractive. The existing sparse CNN accelerators … sparse 2-D convolution, this paper …

A precision-scalable energy-efficient convolutional neural network accelerator

W Liu, J Lin, Z Wang - … Transactions on Circuits and Systems I …, 2020 - ieeexplore.ieee.org
… requirements for accelerators in various aspects. Precision-fixed accelerators lack the …
Kim, “BitBlade: Area and energyefficient precision-scalable neural network accelerator with …

An efficient and flexible accelerator design for sparse convolutional neural networks

X Xie, J Lin, Z Wang, J Wei - … on Circuits and Systems I: Regular …, 2021 - ieeexplore.ieee.org
… Plenty of existing accelerators are built for dense CNNs or … sparse accelerators on FPGAs,
the proposed accelerator can … Compared to prior dense accelerators, this accelerator can …

GCNAX: A flexible and energy-efficient accelerator for graph convolutional neural networks

J Li, A Louri, A Karanth… - 2021 IEEE International …, 2021 - ieeexplore.ieee.org
… in these neural network accelerators. Although we can extend CNN accelerators to run SpMMs
accelerators since they are specialized for convolutions rather than matrix multiplications. …