A survey of FPGA-based accelerators for convolutional neural networks

S Mittal - Neural computing and applications, 2020 - Springer
Deep convolutional neural networks (CNNs) have recently shown very high accuracy in a
wide range of cognitive tasks, and due to this, they have received significant interest from the …

FPGA-based implementation of classification techniques: A survey

A Saidi, SB Othman, M Dhouibi, SB Saoud - Integration, 2021 - Elsevier
Recently, a number of classification techniques have been introduced. However, processing
large datasets in a reasonable time has become a major challenge. This made classification …

Res-DNN: A residue number system-based DNN accelerator unit

N Samimi, M Kamal, A Afzali-Kusha… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
In this article, a technique based on the Residue Number System (RNS) is suggested to
improve the energy efficiency of Deep Neural Networks (DNNs). In the DNN architecture …

Review of FPGA-based accelerators of deep convolutional neural networks

NM Philip, NM Sivamangai - 2022 6th International Conference …, 2022 - ieeexplore.ieee.org
Recent research has shown that Deep Convolutional Neural Networks (CNNs) are
extremely accurate in a big range of cognitive tasks. This has piqued the interest of …

An energy-efficient inference method in convolutional neural networks based on dynamic adjustment of the pruning level

MA Maleki, A Nabipour-Meybodi, M Kamal… - ACM Transactions on …, 2021 - dl.acm.org
In this article, we present a low-energy inference method for convolutional neural networks
in image classification applications. The lower energy consumption is achieved by using a …

An Irredundant and Compressed Data Layout to Optimize Bandwidth Utilization of FPGA Accelerators

C Ferry, N Derumigny, S Derrien… - arXiv preprint arXiv …, 2024 - arxiv.org
Memory bandwidth is known to be a performance bottleneck for FPGA accelerators,
especially when they deal with large multi-dimensional data-sets. A large body of work …

Hardware-efficient template-based deep CNNs accelerator design

A Alhussain, M Lin - 2022 IEEE International Conference on …, 2022 - ieeexplore.ieee.org
Acceleration of Convolutional Neural Network (CNN) on edge devices has recently
achieved a remarkable performance in image classification and object detection …

Heterogeneous Multi-core Array-based DNN Accelerator

MA Maleki, M Kamal, A Afzali-Kusha - arXiv preprint arXiv:2206.12605, 2022 - arxiv.org
In this article, we investigate the impact of architectural parameters of array-based DNN
accelerators on the accelerator's energy consumption and performance in a wide variety of …

High throughput hardware/software heterogeneous system for RRPN-based scene text detection

Y Xin, D Chen, C Zeng, W Zhang… - IEEE Transactions …, 2021 - ieeexplore.ieee.org
Rotation Region Proposal Networks (RRPN) are used to generate rotated proposals with the
information of text angle for arbitrary oriented scene text detection (STD). However, the …

Automating the derivation of memory allocations for acceleration of polyhedral programs

C Ferry, S Rajopadhye, S Derrien, S Pasricha… - 2024 - api.mountainscholar.org
As processors' compute power keeps increasing, so do their demands in memory accesses:
some computations will require a higher bandwidth and exhibit regular memory access …