An efficient real-time object detection framework on resource-constricted hardware devices via software and hardware co-design

M Liu, S Luo, K Han, B Yuan… - 2021 IEEE 32nd …, 2021 - ieeexplore.ieee.org
The fast development of object detection techniques has attracted attention to developing
efficient Deep Neural Networks (DNNs). However, the current state-of-the-art DNN models …

High-performance FPGA-based CNN accelerator with block-floating-point arithmetic

X Lian, Z Liu, Z Song, J Dai, W Zhou… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
Convolutional neural networks (CNNs) are widely used and have achieved great success in
computer vision and speech processing applications. However, deploying the large-scale …

Efficientrep: An efficient Repvgg-style convnets with hardware-aware neural network design

K Weng, X Chu, X Xu, J Huang, X Wei - arXiv preprint arXiv:2302.00386, 2023 - arxiv.org
We present a hardware-efficient architecture of convolutional neural network, which has a
repvgg-like architecture. Flops or parameters are traditional metrics to evaluate the efficiency …

Hardware Acceleration for Object Detection using YOLOv5 Deep Learning Algorithm on Xilinx Zynq FPGA Platform

T Saidani, R Ghodhbani, A Alhomoud… - … , Technology & Applied …, 2024 - etasr.com
Object recognition presents considerable difficulties within the domain of computer vision.
Field-Programmable Gate Arrays (FPGAs) offer a flexible hardware platform, having …

Boosting convolutional neural networks performance based on FPGA accelerator

O Al-Shamma, MA Fadhel, RA Hameed… - … Systems Design and …, 2020 - Springer
Abstract Convolutional Neural Network (CNN) has been extensively used for image
recognition due to its great accuracy. This accuracy is achieved through emulating the optic …

Accelerating low bit-width convolutional neural networks with embedded FPGA

L Jiao, C Luo, W Cao, X Zhou… - 2017 27th international …, 2017 - ieeexplore.ieee.org
Convolutional Neural Networks (CNNs) can achieve high classification accuracy while they
require complex computation. Binarized Neural Networks (BNNs) with binarized weights …

A high-efficiency FPGA-based accelerator for convolutional neural networks using Winograd algorithm

Y Huang, J Shen, Z Wang, M Wen… - Journal of Physics …, 2018 - iopscience.iop.org
Convolutional neural networks (CNNs) are widely used in many computer vision
applications. Previous FPGA implementations of CNNs are mainly based on the …

Design flow of accelerating hybrid extremely low bit-width neural network in embedded FPGA

J Wang, Q Lou, X Zhang, C Zhu, Y Lin… - … conference on field …, 2018 - ieeexplore.ieee.org
Neural network accelerators with low latency and low energy consumption are desirable for
edge computing. To create such accelerators, we propose a design flow for accelerating the …

Evaluating fast algorithms for convolutional neural networks on FPGAs

Y Liang, L Lu, Q Xiao, S Yan - IEEE Transactions on Computer …, 2019 - ieeexplore.ieee.org
In recent years, convolutional neural networks (CNNs) have become widely adopted for
computer vision tasks. Field-programmable gate arrays (FPGAs) have been adequately …

Binary complex neural network acceleration on fpga

H Peng, S Zhou, S Weitze, J Li, S Islam… - 2021 IEEE 32nd …, 2021 - ieeexplore.ieee.org
Being able to learn from complex data with phase information is imperative for many signal
processing applications. Today's real-valued deep neural networks (DNNs) have shown …