FPGA HLS today: successes, challenges, and opportunities

J Cong, J Lau, G Liu, S Neuendorffer, P Pan… - ACM Transactions on …, 2022 - dl.acm.org
The year 2011 marked an important transition for FPGA high-level synthesis (HLS), as it
went from prototyping to deployment. A decade later, in this article, we assess the progress …

Sparse-YOLO: Hardware/software co-design of an FPGA accelerator for YOLOv2

Z Wang, K Xu, S Wu, L Liu, L Liu, D Wang - IEEE Access, 2020 - ieeexplore.ieee.org
Convolutional neural network (CNN) based object detection algorithms are becoming
dominant in many application fields due to their superior accuracy advantage over …

Codenet: Efficient deployment of input-adaptive object detection on embedded fpgas

Q Huang, D Wang, Z Dong, Y Gao, Y Cai, T Li… - The 2021 ACM/SIGDA …, 2021 - dl.acm.org
Deploying deep learning models on embedded systems for computer vision tasks has been
challenging due to limited compute resources and strict energy budgets. The majority of …

A survey of FPGA-based vision systems for autonomous cars

D Castells-Rufas, V Ngo, J Borrego-Carazo… - IEEE …, 2022 - ieeexplore.ieee.org
On the road to making self-driving cars a reality, academic and industrial researchers are
working hard to continue to increase safety while meeting technical and regulatory …

Towards agile dnn accelerator design using incremental synthesis on FPGAs

Q Xiao, Y Liang - Proceedings of the 2022 ACM/SIGDA International …, 2022 - dl.acm.org
Hardware-software co-design is the new trend for deep neural network and FPGA
accelerator development, which iteratively revises and tunes the full system. The bottleneck …

[PDF][PDF] 基于改进型YOLOv4 的焊缝图像检测与识别

程松, 戴金涛, 杨洪刚, 陈云霞 - Laser & Optoelectronics …, 2022 - researching.cn
摘要针对YOLOv4 在焊缝X 射线探伤缺陷图中检测精度与召回率低的问题, 设计了YOLOv4-cs
算法. 该算法改进了YOLOv4 的卷积方式, 使得模型训练参数大大减小, 其次通过去除下采样及在 …

Neural networks and FPGA hardware accelerators for millimeter-wave radio-over-fiber systems

J Lee, J He, K Wang - 2020 22nd International Conference on …, 2020 - ieeexplore.ieee.org
High speed data streaming has been highly demanded by mobile end users and millimetre-
wave (mm-wave) radio-over-fiber (RoF) optical communications have been studied to satisfy …

[HTML][HTML] Sparse Convolution FPGA Accelerator Based on Multi-Bank Hash Selection

J Xu, H Pu, D Wang - Micromachines, 2024 - mdpi.com
Reconfigurable processor-based acceleration of deep convolutional neural network (DCNN)
algorithms has emerged as a widely adopted technique, with particular attention on sparse …

Maximizing Data and Hardware Reuse for HLS with Early-Stage Symbolic Partitioning

TH Juang, C Dubach - ACM Transactions on Architecture and Code …, 2025 - dl.acm.org
While traditional HLS (High-Level Synthesis) converts “high-level” C-like programs into
hardware automatically, producing high-performance designs still requires hardware …

FPGA-based neural network accelerators for millimeter-wave radio-over-fiber systems

J Lee, J He, K Wang - Optics Express, 2020 - opg.optica.org
With rapidly developing high-speed wireless communications, the 60 GHz millimeter-wave
(mm-wave) frequency range has attracted extensive interests, and radio-over-fiber (RoF) …