Model compression and hardware acceleration for neural networks: A comprehensive survey

L Deng, G Li, S Han, L Shi, Y Xie - Proceedings of the IEEE, 2020 - ieeexplore.ieee.org
Domain-specific hardware is becoming a promising topic in the backdrop of improvement
slow down for general-purpose processors due to the foreseeable end of Moore's Law …

Patdnn: Achieving real-time dnn execution on mobile devices with pattern-based weight pruning

W Niu, X Ma, S Lin, S Wang, X Qian, X Lin… - Proceedings of the …, 2020 - dl.acm.org
With the emergence of a spectrum of high-end mobile devices, many applications that
formerly required desktop-level computation capability are being transferred to these …

Autocompress: An automatic dnn structured pruning framework for ultra-high compression rates

N Liu, X Ma, Z Xu, Y Wang, J Tang, J Ye - Proceedings of the AAAI …, 2020 - ojs.aaai.org
Structured weight pruning is a representative model compression technique of DNNs to
reduce the storage and computation requirements and accelerate inference. An automatic …

Optimized dual fire attention network and medium-scale fire classification benchmark

H Yar, T Hussain, M Agarwal, ZA Khan… - … on Image Processing, 2022 - ieeexplore.ieee.org
Vision-based fire detection systems have been significantly improved by deep models;
however, higher numbers of false alarms and a slow inference speed still hinder their …

An empirical study of the impact of hyperparameter tuning and model optimization on the performance properties of deep neural networks

L Liao, H Li, W Shang, L Ma - ACM Transactions on Software …, 2022 - dl.acm.org
Deep neural network (DNN) models typically have many hyperparameters that can be
configured to achieve optimal performance on a particular dataset. Practitioners usually tune …

Enhancing real-time fire detection: an effective multi-attention network and a fire benchmark

T Khan, ZA Khan, C Choi - Neural Computing and Applications, 2023 - Springer
Over the past decades, fire has been considered one of the most serious natural disasters
because of its devastating nature, rapid spread, and high impact on the ecology, economy …

A survey on the optimization of neural network accelerators for micro-ai on-device inference

AN Mazumder, J Meng, HA Rashid… - IEEE Journal on …, 2021 - ieeexplore.ieee.org
Deep neural networks (DNNs) are being prototyped for a variety of artificial intelligence (AI)
tasks including computer vision, data analytics, robotics, etc. The efficacy of DNNs coincides …

High performance CNN accelerators based on hardware and algorithm co-optimization

T Yuan, W Liu, J Han, F Lombardi - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Convolutional neural networks (CNNs) have been widely used in image classification and
recognition due to their effectiveness; however, CNNs use a large volume of weight data that …

Improving neural network efficiency via post-training quantization with adaptive floating-point

F Liu, W Zhao, Z He, Y Wang, Z Wang… - Proceedings of the …, 2021 - openaccess.thecvf.com
Abstract Model quantization has emerged as a mandatory technique for efficient inference
with advanced Deep Neural Networks (DNN). It converts the model parameters in full …

Machine learning in real-time Internet of Things (IoT) systems: A survey

J Bian, A Al Arafat, H Xiong, J Li, L Li… - IEEE Internet of …, 2022 - ieeexplore.ieee.org
Over the last decade, machine learning (ML) and deep learning (DL) algorithms have
significantly evolved and been employed in diverse applications, such as computer vision …