QGAN: Quantized generative adversarial networks

P Wang, D Wang, Y Ji, X Xie, H Song, XX Liu… - arXiv preprint arXiv …, 2019 - arxiv.org
arXiv preprint arXiv:1901.08263, 2019arxiv.org
The intensive computation and memory requirements of generative adversarial neural
networks (GANs) hinder its real-world deployment on edge devices such as smartphones.
Despite the success in model reduction of CNNs, neural network quantization methods have
not yet been studied on GANs, which are mainly faced with the issues of both the
effectiveness of quantization algorithms and the instability of training GAN models. In this
paper, we start with an extensive study on applying existing successful methods to quantize …
The intensive computation and memory requirements of generative adversarial neural networks (GANs) hinder its real-world deployment on edge devices such as smartphones. Despite the success in model reduction of CNNs, neural network quantization methods have not yet been studied on GANs, which are mainly faced with the issues of both the effectiveness of quantization algorithms and the instability of training GAN models. In this paper, we start with an extensive study on applying existing successful methods to quantize GANs. Our observation reveals that none of them generates samples with reasonable quality because of the underrepresentation of quantized values in model weights, and the generator and discriminator networks show different sensitivities upon quantization methods. Motivated by these observations, we develop a novel quantization method for GANs based on EM algorithms, named as QGAN. We also propose a multi-precision algorithm to help find the optimal number of bits of quantized GAN models in conjunction with corresponding result qualities. Experiments on CIFAR-10 and CelebA show that QGAN can quantize GANs to even 1-bit or 2-bit representations with results of quality comparable to original models.
arxiv.org
以上显示的是最相近的搜索结果。 查看全部搜索结果