Residue-Net: Multiplication-free neural network by in-situ no-loss migration to residue number systems

S Salamat, S Shubhi, B Khaleghi… - … of the 26th Asia and South …, 2021 - dl.acm.org
Deep neural networks are widely deployed on embedded devices to solve a wide range of
problems from edge-sensing to autonomous driving. The accuracy of these networks is …

Flexible-width bit-level compressor for convolutional neural network

J Zhu, X Chen, L Du, H Geng, Y Bai, Y Li… - 2021 IEEE 3rd …, 2021 - ieeexplore.ieee.org
In this paper, a weight compression technique named Flexible-width Bit-level (FWBL) coding
is proposed to compress convolutional neural networks (CNN) models without re-training …

[图书][B] Fast and Energy Efficient Big Data Processing on FPGAs

S Salamat - 2021 - search.proquest.com
With the rapid development of the Internet of things (IoT), networks, software, and computing
platforms, the size of the generated data is dramatically increasing, bringing the dawn of the …

Optimizing Data Compression: Enhanced Golomb-Rice Encoding with Parallel Decoding Strategies for TinyML Models

M Vaddeboina, A Yilmayer… - 2024 27th Euromicro …, 2024 - ieeexplore.ieee.org
Deep Neural Networks (DNNs) offer possibilities for tackling practical challenges and
broadening the scope of Artificial Intelligence (AI) applications. The demanding memory …

Deep learning acceleration: from quantization to in-memory computing

S Zhu - 2022 - dr.ntu.edu.sg
Deep learning has demonstrated high accuracy and efficiency in various applications. For
example, Convolutional Neural Networks (CNNs) widely adopted in Computer Vision (CV) …

Weight data compression method, weight data decompression method, weight data compression device, and weight data decompression device

Y Hashimoto - US Patent 11,700,014, 2023 - Google Patents
A weight data compression method includes: generating a 4-bit data string of 4-bit data
items each expressed as any one of nine 4-bit values, by dividing ternary weight data into …