FPGA-based accelerators of deep learning networks for learning and classification: A review

A Shawahna, SM Sait, A El-Maleh - ieee Access, 2018 - ieeexplore.ieee.org
Due to recent advances in digital technologies, and availability of credible data, an area of
artificial intelligence, deep learning, has emerged and has demonstrated its ability and …

Evaluating fast algorithms for convolutional neural networks on FPGAs

L Lu, Y Liang, Q Xiao, S Yan - 2017 IEEE 25th annual …, 2017 - ieeexplore.ieee.org
In recent years, Convolutional Neural Networks (CNNs) have become widely adopted for
computer vision tasks. FPGAs have been adequately explored as a promising hardware …

Exploring heterogeneous algorithms for accelerating deep convolutional neural networks on FPGAs

Q Xiao, Y Liang, L Lu, S Yan, YW Tai - Proceedings of the 54th Annual …, 2017 - dl.acm.org
Convolutional neural network (CNN) finds applications in a variety of computer vision
applications ranging from object recognition and detection to scene understanding owing to …

CHARM: C omposing H eterogeneous A ccele R ators for M atrix Multiply on Versal ACAP Architecture

J Zhuang, J Lau, H Ye, Z Yang, Y Du, J Lo… - Proceedings of the …, 2023 - dl.acm.org
Dense matrix multiply (MM) serves as one of the most heavily used kernels in deep learning
applications. To cope with the high computation demands of these applications …

Evaluating fast algorithms for convolutional neural networks on FPGAs

Y Liang, L Lu, Q Xiao, S Yan - IEEE Transactions on Computer …, 2019 - ieeexplore.ieee.org
In recent years, convolutional neural networks (CNNs) have become widely adopted for
computer vision tasks. Field-programmable gate arrays (FPGAs) have been adequately …

Rosetta: A realistic high-level synthesis benchmark suite for software programmable FPGAs

Y Zhou, U Gupta, S Dai, R Zhao, N Srivastava… - Proceedings of the …, 2018 - dl.acm.org
Modern high-level synthesis (HLS) tools greatly reduce the turn-around time of designing
and implementing complex FPGA-based accelerators. They also expose various …

REQ-YOLO: A resource-aware, efficient quantization framework for object detection on FPGAs

C Ding, S Wang, N Liu, K Xu, Y Wang… - proceedings of the 2019 …, 2019 - dl.acm.org
Deep neural networks (DNNs), as the basis of object detection, will play a key role in the
development of future autonomous systems with full autonomy. The autonomous systems …

SpWA: An efficient sparse winograd convolutional neural networks accelerator on FPGAs

L Lu, Y Liang - Proceedings of the 55th Annual Design Automation …, 2018 - dl.acm.org
FPGAs have been an efficient accelerator for CNN inference due to its high performance,
flexibility, and energy-efficiency. To improve the performance of CNNs on FPGAs, fast …

Decoding small surface codes with feedforward neural networks

S Varsamopoulos, B Criger… - Quantum Science and …, 2017 - iopscience.iop.org
Surface codes reach high error thresholds when decoded with known algorithms, but the
decoding time will likely exceed the available time budget, especially for near-term …

Predictable accelerator design with time-sensitive affine types

R Nigam, S Atapattu, S Thomas, Z Li, T Bauer… - Proceedings of the 41st …, 2020 - dl.acm.org
Field-programmable gate arrays (FPGAs) provide an opportunity to co-design applications
with hardware accelerators, yet they remain difficult to program. High-level synthesis (HLS) …