Sequential models achieve state-of-the-art results in audio, visual and textual domains with respect to both estimating the data distribution and generating desired samples. Efficient …
Abstract Model compression techniques, such as pruning and quantization, are becoming increasingly important to reduce the memory footprints and the amount of computations …
J Li, A Louri, A Karanth… - 2021 IEEE International …, 2021 - ieeexplore.ieee.org
Convolutional neural networks (CNNs) are at the core of many state-of-the-art deep learning models in computer vision, speech, and text processing. Training and deploying such CNN …
D Lee, P Kapoor, B Kim - arXiv preprint arXiv:1810.12823, 2018 - arxiv.org
Model compression has been introduced to reduce the required hardware resources while maintaining the model accuracy. Lots of techniques for model compression, such as …
D Lee, SJ Kwon, B Kim, GY Wei - arXiv preprint arXiv:1905.10145, 2019 - arxiv.org
Low-rank approximation is an effective model compression technique to not only reduce parameter storage requirements, but to also reduce computations. For convolutional neural …
Quantization based on the binary codes is gaining attention because each quantized bit can be directly utilized for computations without dequantization using look-up tables. Previous …
J Lee, S Kim, S Kim, W Jo, HJ Yoo - arXiv preprint arXiv:2101.09650, 2021 - arxiv.org
Deep reinforcement learning (DRL) has shown remarkable success in sequential decision- making problems but suffers from a long training time to obtain such good performance …
D Ahn, D Lee, T Kim, JJ Kim - International Conference on Learning …, 2018 - openreview.net
Weight pruning has been introduced as an efficient model compression technique. Even though pruning removes significant amount of weights in a network, memory requirement …
X Liu, W Li, J Huo, L Yao, Y Gao - Proceedings of the AAAI Conference on …, 2020 - aaai.org
Deep neural network compression is important and increasingly developed especially in resource-constrained environments, such as autonomous drones and wearable devices …