Efficient acceleration of deep learning inference on resource-constrained edge devices: A review

MMH Shuvo, SK Islam, J Cheng… - Proceedings of the …, 2022 - ieeexplore.ieee.org
Successful integration of deep neural networks (DNNs) or deep learning (DL) has resulted
in breakthroughs in many areas. However, deploying these highly accurate models for data …

Deep learning for IoT big data and streaming analytics: A survey

M Mohammadi, A Al-Fuqaha, S Sorour… - … Surveys & Tutorials, 2018 - ieeexplore.ieee.org
In the era of the Internet of Things (IoT), an enormous amount of sensing devices collect
and/or generate various sensory data over time for a wide range of fields and applications …

Artificial optic-neural synapse for colored and color-mixed pattern recognition

S Seo, SH Jo, S Kim, J Shim, S Oh, JH Kim… - Nature …, 2018 - nature.com
The priority of synaptic device researches has been given to prove the device potential for
the emulation of synaptic dynamics and not to functionalize further synaptic devices for more …

A configurable cloud-scale DNN processor for real-time AI

J Fowers, K Ovtcharov, M Papamichael… - 2018 ACM/IEEE 45th …, 2018 - ieeexplore.ieee.org
Interactive AI-powered services require low-latency evaluation of deep neural network
(DNN) models-aka"" real-time AI"". The growing demand for computationally expensive …

Efficient processing of deep neural networks: A tutorial and survey

V Sze, YH Chen, TJ Yang, JS Emer - Proceedings of the IEEE, 2017 - ieeexplore.ieee.org
Deep neural networks (DNNs) are currently widely used for many artificial intelligence (AI)
applications including computer vision, speech recognition, and robotics. While DNNs …

Bit fusion: Bit-level dynamically composable architecture for accelerating deep neural network

H Sharma, J Park, N Suda, L Lai… - 2018 ACM/IEEE 45th …, 2018 - ieeexplore.ieee.org
Hardware acceleration of Deep Neural Networks (DNNs) aims to tame their enormous
compute intensity. Fully realizing the potential of acceleration in this domain requires …

[图书][B] Efficient processing of deep neural networks

V Sze, YH Chen, TJ Yang, JS Emer - 2020 - Springer
This book provides a structured treatment of the key principles and techniques for enabling
efficient processing of deep neural networks (DNNs). DNNs are currently widely used for …

Scaling for edge inference of deep neural networks

X Xu, Y Ding, SX Hu, M Niemier, J Cong, Y Hu… - Nature Electronics, 2018 - nature.com
Deep neural networks offer considerable potential across a range of applications, from
advanced manufacturing to autonomous cars. A clear trend in deep neural networks is the …

UNPU: An energy-efficient deep neural network accelerator with fully variable weight bit precision

J Lee, C Kim, S Kang, D Shin, S Kim… - IEEE Journal of Solid …, 2018 - ieeexplore.ieee.org
An energy-efficient deep neural network (DNN) accelerator, unified neural processing unit
(UNPU), is proposed for mobile deep learning applications. The UNPU can support both …

Conv-RAM: An energy-efficient SRAM with embedded convolution computation for low-power CNN-based machine learning applications

A Biswas, AP Chandrakasan - 2018 IEEE International Solid …, 2018 - ieeexplore.ieee.org
Convolutional neural networks (CNN) provide state-of-the-art results in a wide variety of
machine learning (ML) applications, ranging from image classification to speech recognition …