Efficient processing of deep neural networks: A tutorial and survey

V Sze, YH Chen, TJ Yang, JS Emer - Proceedings of the IEEE, 2017 - ieeexplore.ieee.org
Deep neural networks (DNNs) are currently widely used for many artificial intelligence (AI)
applications including computer vision, speech recognition, and robotics. While DNNs …

[HTML][HTML] A survey on hardware accelerators: Taxonomy, trends, challenges, and perspectives

B Peccerillo, M Mannino, A Mondelli… - Journal of Systems …, 2022 - Elsevier
In recent years, the limits of the multicore approach emerged in the so-called “dark silicon”
issue and diminishing returns of an ever-increasing core count. Hardware manufacturers …

In-datacenter performance analysis of a tensor processing unit

NP Jouppi, C Young, N Patil, D Patterson… - Proceedings of the 44th …, 2017 - dl.acm.org
Many architects believe that major improvements in cost-energy-performance must now
come from domain-specific hardware. This paper evaluates a custom ASIC---called a Tensor …

PUMA: A programmable ultra-efficient memristor-based accelerator for machine learning inference

A Ankit, IE Hajj, SR Chalamalasetti, G Ndu… - Proceedings of the …, 2019 - dl.acm.org
Memristor crossbars are circuits capable of performing analog matrix-vector multiplications,
overcoming the fundamental energy efficiency limitations of digital logic. They have been …

Enable deep learning on mobile devices: Methods, systems, and applications

H Cai, J Lin, Y Lin, Z Liu, H Tang, H Wang… - ACM Transactions on …, 2022 - dl.acm.org
Deep neural networks (DNNs) have achieved unprecedented success in the field of artificial
intelligence (AI), including computer vision, natural language processing, and speech …

ISAAC: A convolutional neural network accelerator with in-situ analog arithmetic in crossbars

A Shafiee, A Nag, N Muralimanohar… - ACM SIGARCH …, 2016 - dl.acm.org
A number of recent efforts have attempted to design accelerators for popular machine
learning algorithms, such as those involving convolutional and deep neural networks (CNNs …

Eyeriss: A spatial architecture for energy-efficient dataflow for convolutional neural networks

YH Chen, J Emer, V Sze - ACM SIGARCH computer architecture news, 2016 - dl.acm.org
Deep convolutional neural networks (CNNs) are widely used in modern AI systems for their
superior accuracy but at the cost of high computational complexity. The complexity comes …

Understanding error propagation in deep learning neural network (DNN) accelerators and applications

G Li, SKS Hari, M Sullivan, T Tsai… - Proceedings of the …, 2017 - dl.acm.org
Deep learning neural networks (DNNs) have been successful in solving a wide range of
machine learning problems. Specialized hardware accelerators have been proposed to …

[图书][B] Efficient processing of deep neural networks

V Sze, YH Chen, TJ Yang, JS Emer - 2020 - Springer
This book provides a structured treatment of the key principles and techniques for enabling
efficient processing of deep neural networks (DNNs). DNNs are currently widely used for …

Angel-eye: A complete design flow for mapping CNN onto embedded FPGA

K Guo, L Sui, J Qiu, J Yu, J Wang, S Yao… - IEEE transactions on …, 2017 - ieeexplore.ieee.org
Convolutional neural network (CNN) has become a successful algorithm in the region of
artificial intelligence and a strong candidate for many computer vision algorithms. But the …