Enabling design methodologies and future trends for edge AI: Specialization and codesign

C Hao, J Dotzel, J Xiong, L Benini, Z Zhang… - IEEE Design & …, 2021 - ieeexplore.ieee.org
This work is an introduction and a survey for the Special Issue on Machine Intelligence at the
Edge. The authors argue that workloads that were formerly performed in the cloud are …

Accelerating low bit-width convolutional neural networks with embedded FPGA

L Jiao, C Luo, W Cao, X Zhou… - 2017 27th international …, 2017 - ieeexplore.ieee.org
Convolutional Neural Networks (CNNs) can achieve high classification accuracy while they
require complex computation. Binarized Neural Networks (BNNs) with binarized weights …

Machine learning on FPGAs to face the IoT revolution

X Zhang, A Ramachandran, C Zhuge… - 2017 IEEE/ACM …, 2017 - ieeexplore.ieee.org
FPGAs have been rapidly adopted for acceleration of Deep Neural Networks (DNNs) with
improved latency and energy efficiency compared to CPU and GPU-based implementations …

IMCE: Energy-efficient bit-wise in-memory convolution engine for deep neural network

S Angizi, Z He, F Parveen, D Fan - 2018 23rd Asia and South …, 2018 - ieeexplore.ieee.org
In this paper, we pave a novel way towards the concept of bit-wise In-Memory Convolution
Engine (IMCE) that could implement the dominant convolution computation of Deep …

Boosting the performance of CNN accelerators with dynamic fine-grained channel gating

W Hua, Y Zhou, C De Sa, Z Zhang… - Proceedings of the 52nd …, 2019 - dl.acm.org
This paper proposes a new fine-grained dynamic pruning technique for CNN inference,
named channel gating, and presents an accelerator architecture that can effectively exploit …

An ultra-low power binarized convolutional neural network-based speech recognition processor with on-chip self-learning

S Zheng, P Ouyang, D Song, X Li, L Liu… - … on Circuits and …, 2019 - ieeexplore.ieee.org
Always-on speech interfaces are prevailing in human-machine interaction, especially on
wearable devices, Internet of Things, etc., which benefits from the recent breakthroughs in …

UGEMM: Unary computing architecture for GEMM applications

D Wu, J Li, R Yin, H Hsiao, Y Kim… - 2020 ACM/IEEE 47th …, 2020 - ieeexplore.ieee.org
General matrix multiplication (GEMM) is universal in various applications, such as signal
processing, machine learning, and computer vision. Conventional GEMM hardware …

An energy-efficient reconfigurable processor for binary-and ternary-weight neural networks with flexible data bit width

S Yin, P Ouyang, J Yang, T Lu, X Li… - IEEE Journal of Solid …, 2018 - ieeexplore.ieee.org
Due to less memory requirement, low computation overhead and negligible accuracy
degradation, deep neural networks with binary/ternary weights (BTNNs) have been widely …

An FPGA-based energy-efficient reconfigurable depthwise separable convolution accelerator for image recognition

L Xuan, KF Un, CS Lam… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
With the advances in massive computing ability and big data science, deep neural network
(DNN) has been developing rapidly for different applications. However, due to its extensive …

A binarized neural network approach to accelerate in-vehicle network intrusion detection

L Zhang, X Yan, D Ma - IEEE Access, 2022 - ieeexplore.ieee.org
Controller Area Network (CAN) is the de facto standard for in-vehicle networks. However, it
is inherently vulnerable to various attacks due to the lack of security features. Intrusion …