Efficient acceleration of deep learning inference on resource-constrained edge devices: A review

MMH Shuvo, SK Islam, J Cheng… - Proceedings of the …, 2022 - ieeexplore.ieee.org
Successful integration of deep neural networks (DNNs) or deep learning (DL) has resulted
in breakthroughs in many areas. However, deploying these highly accurate models for data …

Accelerating neural network inference on FPGA-based platforms—A survey

R Wu, X Guo, J Du, J Li - Electronics, 2021 - mdpi.com
The breakthrough of deep learning has started a technological revolution in various areas
such as object identification, image/video recognition and semantic segmentation. Neural …

AddNet: Deep neural networks using FPGA-optimized multipliers

J Faraone, M Kumm, M Hardieck, P Zipf… - … Transactions on Very …, 2019 - ieeexplore.ieee.org
Low-precision arithmetic operations to accelerate deep-learning applications on field-
programmable gate arrays (FPGAs) have been studied extensively, because they offer the …

Real-time super-resolution system of 4k-video based on deep learning

Y Cao, C Wang, C Song, Y Tang… - 2021 IEEE 32nd …, 2021 - ieeexplore.ieee.org
Video super-resolution (VSR) technology excels in reconstructing low-quality video,
avoiding unpleasant blur effect caused by interpolation-based algorithms. However, vast …

A reconfigurable convolutional neural network-accelerated coprocessor based on RISC-V instruction set

N Wu, T Jiang, L Zhang, F Zhou, F Ge - Electronics, 2020 - mdpi.com
As a typical artificial intelligence algorithm, the convolutional neural network (CNN) is widely
used in the Internet of Things (IoT) system. In order to improve the computing ability of an IoT …

Fast algorithms for quaternion-valued convolutional neural networks

A Cariow, G Cariowa - IEEE Transactions on Neural Networks …, 2020 - ieeexplore.ieee.org
In this article, we analyze algorithmic ways to reduce the arithmetic complexity of calculating
quaternion-valued linear convolution and also synthesize a new algorithm for calculating …

FA-LAMP: fpga-accelerated learned approximate matrix profile for time series similarity prediction

A Kalantar, Z Zimmerman, P Brisk - 2021 IEEE 29th Annual …, 2021 - ieeexplore.ieee.org
With the proliferation of low-cost sensors and the Internet-of-Things (IoT), the rate of
producing data far exceeds the compute and storage capabilities of today's infrastructure …

Fpga-based acceleration of time series similarity prediction: From cloud to edge

A Kalantar, Z Zimmerman, P Brisk - ACM Transactions on …, 2022 - dl.acm.org
With the proliferation of low-cost sensors and the Internet of Things, the rate of producing
data far exceeds the compute and storage capabilities of today's infrastructure. Much of this …

Efficient implementation of 2D and 3D sparse deconvolutional neural networks with a uniform architecture on FPGAs

D Wang, J Shen, M Wen, C Zhang - Electronics, 2019 - mdpi.com
Three-dimensional (3D) deconvolution is widely used in many computer vision applications.
However, most previous works have only focused on accelerating two-dimensional (2D) …

Comparative study: AutoDPR-SEM for enhancing CNN reliability in SRAM-based FPGAs through autonomous reconfiguration

H Tian, Y Ibrahim, R Chen, Y Wang, C Jin… - Microelectronics …, 2024 - Elsevier
Convolutional neural networks (CNNs) are widely adopted in safety-critical systems,
including space applications and autonomous vehicles. Field-programmable gate arrays …