A novel FPGA accelerator design for real-time and ultra-low power deep convolutional neural networks compared with titan X GPU

S Li, Y Luo, K Sun, N Yadav, KK Choi - IEEE Access, 2020 - ieeexplore.ieee.org
Convolutional neural networks (CNNs) based deep learning algorithms require high data
flow and computational intensity. For real-time industrial applications, they need to …

A reconfigurable CNN-based accelerator design for fast and energy-efficient object detection system on mobile FPGA

VH Kim, KK Choi - IEEE Access, 2023 - ieeexplore.ieee.org
In limited-resource edge computing circumstances such as on mobile devices, IoT devices,
and electric vehicles, the energy-efficient optimized convolutional neural network (CNN) …

FPGA‐accelerated deep convolutional neural networks for high throughput and energy efficiency

Y Qiao, J Shen, T Xiao, Q Yang… - Concurrency and …, 2017 - Wiley Online Library
Recent breakthroughs in the deep convolutional neural networks (CNNs) have led to great
improvements in the accuracy of both vision and auditory systems. Characterized by their …

Throughput-optimized FPGA accelerator for deep convolutional neural networks

Z Liu, Y Dou, J Jiang, J Xu, S Li, Y Zhou… - ACM Transactions on …, 2017 - dl.acm.org
Deep convolutional neural networks (CNNs) have gained great success in various computer
vision applications. State-of-the-art CNN models for large-scale applications are …

Power efficient design of high-performance convolutional neural networks hardware accelerator on FPGA: A case study with GoogLeNet

AJ Abd El-Maksoud, M Ebbed, AH Khalil… - IEEE Access, 2021 - ieeexplore.ieee.org
Convolutional neural networks (CNNs) have dominated image recognition and object
detection models in the last few years. They can achieve the highest accuracies with several …

A power-efficient optimizing framework fpga accelerator based on winograd for yolo

C Bao, T Xie, W Feng, L Chang, C Yu - Ieee Access, 2020 - ieeexplore.ieee.org
Accelerating deep learning networks in edge computing based on power-efficient and highly
parallel FPGA platforms is an important goal. Combined with deep learning theory, an …

A High-speed Low-power Deep Neural Network on an FPGA based on the Nested RNS: Applied to an Object Detector

H Nakahara, T Sasao - 2018 IEEE international symposium on …, 2018 - ieeexplore.ieee.org
A pre-trained convolutional deep neural network (CNN) is the feed-forward computation
perspective, and it is widely used for the embedded vision systems. One of the applications …

A dedicated hardware accelerator for real-time acceleration of YOLOv2

K Xu, X Wang, X Liu, C Cao, H Li, H Peng… - Journal of Real-Time …, 2021 - Springer
In recent years, dedicated hardware accelerators for the acceleration of the convolutional
neural network (CNN) have been extensively studied. Although many studies have …

Briefly Analysis about CNN Accelerator based on FPGA

Z Wang, H Li, X Yue, L Meng - Procedia Computer Science, 2022 - Elsevier
Since convolutional computation in deep learning is a large and time-consuming
computation, researchers often use GPU or FPGA to accelerate these computation. This …

An fpga-based reconfigurable cnn accelerator for yolo

S Zhang, J Cao, Q Zhang, Q Zhang… - 2020 IEEE 3rd …, 2020 - ieeexplore.ieee.org
Convolutional neural network (CNN) has been widely used in image processing fields.
Object detection models based on CNN, such as YOLO and SSD, have been proved to be …