作者
Xiaolong Ma, Geng Yuan, Sheng Lin, Caiwen Ding, Fuxun Yu, Tao Liu, Wujie Wen, Xiang Chen, Yanzhi Wang
发表日期
2020/1/13
研讨会论文
25th Asia and South Pacific Design Automation Conference (ASP-DAC)
出版商
IEEE
简介
The memristor crossbar array has emerged as an intrinsically suitable matrix computation and low-power acceleration framework for DNN applications. Many techniques such as memristor-based weight pruning and memristor-based quantization have been studied. However, the high accuracy solution for the above techniques is still waiting for unraveling. In this paper, we propose a memristor-based DNN framework which combines both structured weight pruning and quantization by incorporating ADMM algorithm for better pruning and quantization performance. We also discover the non-optimality of the ADMM solution in weight pruning and the unused data path in a structured pruned model. We design a software-hardware co-optimization framework which contains the first proposed Network Purification and Unused Path Removal algorithms targeting on post-processing a structured pruned model after ADMM …
引用总数
202020212022202320248151989
学术搜索中的文章
X Ma, G Yuan, S Lin, C Ding, F Yu, T Liu, W Wen… - 2020 25th Asia and South Pacific design automation …, 2020