Design possibilities and challenges of DNN models: a review on the perspective of end devices

H Hussain, PS Tamizharasan, CS Rahul - Artificial Intelligence Review, 2022 - Springer
Abstract Deep Neural Network (DNN) models for both resource-rich environments and
resource-constrained devices have become abundant in recent years. As of now, the …

FPGA implementation for CNN-based optical remote sensing object detection

N Zhang, X Wei, H Chen, W Liu - Electronics, 2021 - mdpi.com
In recent years, convolutional neural network (CNN)-based methods have been widely used
for optical remote sensing object detection and have shown excellent performance. Some …

High-speed YOLOv4-tiny hardware accelerator for self-driving automotive

Z Valadanzoj, H Daryanavard, A Harifi - The Journal of Supercomputing, 2024 - Springer
Object detection is an important area in self-driving automotive. The YOLO algorithm and its
well-embedded implementation is a promising solution for object detection. In this paper, a …

Resource-constrained FPGA implementation of YOLOv2

Z Zhang, MAP Mahmud, AZ Kouzani - Neural Computing and Applications, 2022 - Springer
Progress is being made to deploy convolutional neural networks (CNNs) into the Internet of
Things (IoT) edge devices for handling image analysis tasks locally. These tasks require low …

Efficient binary 3D convolutional neural network and hardware accelerator

G Li, M Zhang, Q Zhang, Z Lin - Journal of Real-Time Image Processing, 2022 - Springer
The three-dimensional convolutional neural networks have abundant parameters and
computational costs. It is urgent to compress the three-dimensional convolutional neural …

An OpenCL-based FPGA accelerator for Faster R-CNN

J An, D Zhang, K Xu, D Wang - Entropy, 2022 - mdpi.com
In recent years, convolutional neural network (CNN)-based object detection algorithms have
made breakthroughs, and much of the research corresponds to hardware accelerator …

Let Coarse-Grained Resources Be Shared: Mapping Entire Neural Networks on FPGAs

TH Juang, C Schlaak, C Dubach - ACM Transactions on Embedded …, 2023 - dl.acm.org
Traditional High-Level Synthesis (HLS) provides rapid prototyping of hardware accelerators
without coding with Hardware Description Languages (HDLs). However, such an approach …

A low-power hardware architecture for real-time CNN computing

X Liu, C Cao, S Duan - Sensors, 2023 - mdpi.com
Convolutional neural network (CNN) is widely deployed on edge devices, performing tasks
such as objective detection, image recognition and acoustic recognition. However, the …

A compression pipeline for one-stage object detection model

Z Li, Y Sun, G Tian, L Xie, Y Liu, H Su, Y He - Journal of Real-Time Image …, 2021 - Springer
Deep neural networks (DNNs) have strong fitting ability on a variety of computer vision tasks,
but they also require intensive computing power and large storage space, which are not …

FPGA-Based Feature Extraction and Tracking Accelerator for Real-Time Visual SLAM

J Zhang, S Xiong, C Liu, Y Geng, W Xiong, S Cheng… - Sensors, 2023 - mdpi.com
Due to its advantages of low latency, low power consumption, and high flexibility, FPGA-
based acceleration technology has been more and more widely studied and applied in the …