作者
Ameya Prabhu, Vishal Batchu, Rohit Gajawada, Sri Aurobindo Munagala, Anoop Namboodiri
发表日期
2018/3/12
研讨会论文
2018 IEEE Winter Conference on Applications of Computer Vision (WACV)
页码范围
821-829
出版商
IEEE
简介
Binarization is an extreme network compression approach that provides large computational speedups along with energy and memory savings, albeit at significant accuracy costs. We investigate the question of where to binarize inputs at layer-level granularity and show that selectively binarizing the inputs to specific layers in the network could lead to significant improvements in accuracy while preserving most of the advantages of binarization. We analyze the binarization tradeoff using a metric that jointly models the input binarization-error and computational cost and introduce an efficient algorithm to select layers whose inputs are to be binarized. Practical guidelines based on insights obtained from applying the algorithm to a variety of models are discussed. Experiments on Imagenet dataset using AlexNet and ResNet-18 models show 3-4% improvements in accuracy over fully binarized networks with minimal …
引用总数
2018201920202021202220232024126431
学术搜索中的文章
A Prabhu, V Batchu, R Gajawada, SA Munagala… - 2018 IEEE Winter Conference on Applications of …, 2018