A survey on hardware accelerators and optimization techniques for RNNs

S Mittal, S Umesh - Journal of Systems Architecture, 2021 - Elsevier
Abstract “Recurrent neural networks”(RNNs) are powerful artificial intelligence models that
have shown remarkable effectiveness in several tasks such as music generation, speech …

Accelerating deep neural networks implementation: A survey

M Dhouibi, AK Ben Salem, A Saidi… - IET Computers & …, 2021 - Wiley Online Library
Abstract Recently, Deep Learning (DL) applications are getting more and more involved in
different fields. Deploying such Deep Neural Networks (DNN) on embedded devices is still a …

A reconfigurable CNN-based accelerator design for fast and energy-efficient object detection system on mobile FPGA

VH Kim, KK Choi - IEEE Access, 2023 - ieeexplore.ieee.org
In limited-resource edge computing circumstances such as on mobile devices, IoT devices,
and electric vehicles, the energy-efficient optimized convolutional neural network (CNN) …

Hybrid CNN-SVM Inference Accelerator on FPGA Using HLS

B Liu, Y Zhou, L Feng, H Fu, P Fu - Electronics, 2022 - mdpi.com
Convolution neural networks (CNN), support vector machine (SVM) and hybrid CNN-SVM
algorithms are widely applied in many fields, including image processing and fault …

A CPU-FPGA heterogeneous acceleration system for scene text detection network

J Jiang, M Jiang, J Zhang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Scene text detection network, such as Connectionist Text Proposal Network (CTPN), takes
CNN-RNN hybrid neural network as the main body, which can effectively recognize the text …

Efficient FPGA Implementation of Convolutional Neural Networks and Long Short-Term Memory for Radar Emitter Signal Recognition

B Wu, X Wu, P Li, Y Gao, J Si, N Al-Dhahir - Sensors, 2024 - mdpi.com
In recent years, radar emitter signal recognition has enjoyed a wide range of applications in
electronic support measure systems and communication security. More and more deep …

ROSETTA: A Resource and Energy-Efficient Inference Processor for Recurrent Neural Networks Based on Programmable Data Formats and Fine Activation Pruning

J Kim, TH Kim - IEEE Transactions on Emerging Topics in …, 2022 - ieeexplore.ieee.org
Recurrent neural networks (RNNs) are extensively employed to perform inference based on
the temporal features of the input data. However, their computational workload and power …

An OpenCL-based hybrid CNN-RNN inference accelerator on FPGA

Y Sun, B Liu, X Xu - 2019 International Conference on Field …, 2019 - ieeexplore.ieee.org
Recently, Convolution Neural Networks (CNNs), Recurrent Neural Networks (RNNs), and
CNN-RNN hybrid networks have demonstrated great success in many deep learning …

AERO: A 1.28 MOP/s/LUT reconfigurable inference processor for recurrent neural networks in a resource-limited FPGA

J Kim, J Kim, TH Kim - Electronics, 2021 - mdpi.com
This study presents a resource-efficient reconfigurable inference processor for recurrent
neural networks (RNN), named AERO. AERO is programmable to perform inference on RNN …

A Heterogeneous Architecture for the Vision Processing Unit with a Hybrid Deep Neural Network Accelerator

P Liu, Z Yang, L Kang, J Wang - Micromachines, 2022 - mdpi.com
The vision chip is widely used to acquire and process images. It connects the image sensor
directly with the vision processing unit (VPU) to execute the vision tasks. Modern vision tasks …