An FPGA-based energy-efficient reconfigurable convolutional neural network accelerator for object recognition applications

J Li, KF Un, WH Yu, PI Mak… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
The computational efficiency is the prime concern of a computation-intensive deep
convolutional neural network (CNN). In this Brief, we report an FPGA-based computation …

Towards energy-efficient and secure edge AI: A cross-layer framework ICCAD special session paper

M Shafique, A Marchisio, RVW Putra… - 2021 IEEE/ACM …, 2021 - ieeexplore.ieee.org
The security and privacy concerns along with the amount of data that is required to be
processed on regular basis has pushed processing to the edge of the computing systems …

GreenTPU: Improving timing error resilience of a near-threshold tensor processing unit

P Pandey, P Basu, K Chakraborty, S Roy - Proceedings of the 56th …, 2019 - dl.acm.org
The emergence of hardware accelerators has brought about several orders of magnitude
improvement in the speed of the deep neural-network (DNN) inference. Among such DNN …

A robust digital RRAM-based convolutional block for low-power image processing and learning applications

E Giacomin, T Greenberg-Toledo… - … on Circuits and …, 2018 - ieeexplore.ieee.org
Currently, there is a growing attention toward developing efficient hardware convolutional
blocks for several applications such as computer vision or image processing. Recent works …

Base-2 softmax function: Suitability for training and efficient hardware implementation

Y Zhang, Y Zhang, L Peng, L Quan… - … on Circuits and …, 2022 - ieeexplore.ieee.org
The softmax function is widely used in deep neural networks (DNNs), its hardware
performance plays an important role in the training and inference of DNN accelerators …

A lightweight CNN-based algorithm and implementation on embedded system for real-time face recognition

Z Chen, J Chen, G Ding, H Huang - Multimedia systems, 2023 - Springer
Deep learning has become the main solution for face recognition applications due to its high
accuracy and robustness. In recent years, a batch of research on lightweight convolutional …

Test and yield loss reduction of AI and deep learning accelerators

M Sadi, U Guin - … Transactions on Computer-Aided Design of …, 2021 - ieeexplore.ieee.org
With data-driven analytics becoming mainstream, the global demand for dedicated artificial
intelligence (AI) and deep learning accelerator chips is soaring. These accelerators …

HyCA: A hybrid computing architecture for fault-tolerant deep learning

C Liu, C Chu, D Xu, Y Wang, Q Wang… - … on Computer-Aided …, 2021 - ieeexplore.ieee.org
Hardware faults on the regular 2-D computing array of a typical deep learning accelerator
(DLA) can lead to dramatic prediction accuracy loss. Prior redundancy design approaches …

Fefet-based binarized neural networks under temperature-dependent bit errors

M Yayla, S Buschjäger, A Gupta… - IEEE Transactions …, 2021 - ieeexplore.ieee.org
Ferroelectric FET (FeFET) is a highly promising emerging non-volatile memory (NVM)
technology, especially for binarized neural network (BNN) inference on the low-power edge …

In-place zero-space memory protection for cnn

H Guan, L Ning, Z Lin, X Shen… - Advances in Neural …, 2019 - proceedings.neurips.cc
Abstract Convolutional Neural Networks (CNN) are being actively explored for safety-critical
applications such as autonomous vehicles and aerospace, where it is essential to ensure …