Reform: Static and dynamic resource-aware dnn reconfiguration framework for mobile device

Z Xu, F Yu, C Liu, X Chen - Proceedings of the 56th Annual Design …, 2019 - dl.acm.org
Proceedings of the 56th Annual Design Automation Conference 2019, 2019dl.acm.org
Although the Deep Neural Network (DNN) technique has been widely applied in various
applications, the DNN-based applications are still too computationally intensive for the
resource-constrained mobile devices. Many works have been proposed to optimize the DNN
computation performance, but most of them are limited in an algorithmic perspective,
ignoring certain computing issues in practical deployment. To achieve the comprehensive
DNN performance enhancement in practice, the expected DNN optimization works should …
Although the Deep Neural Network (DNN) technique has been widely applied in various applications, the DNN-based applications are still too computationally intensive for the resource-constrained mobile devices. Many works have been proposed to optimize the DNN computation performance, but most of them are limited in an algorithmic perspective, ignoring certain computing issues in practical deployment. To achieve the comprehensive DNN performance enhancement in practice, the expected DNN optimization works should closely cooperate with specific hardware and system constraints (i.e. computation capacity, energy cost, memory occupancy, and inference latency). Therefore, in this work, we propose ReForm -- a resource-aware DNN optimization framework. Through thorough mobile DNN computing analysis and innovative model reconfiguration schemes (i.e. ADMM based static model fine-tuning, dynamically selective computing), ReForm can efficiently and effectively reconfigure a pre-trained DNN model for practical mobile deployment with regards to various static and dynamic computation resource constraints. Experiments show that ReForm has ~3.5× faster optimization speed than state-of-the-art resource-aware optimization method. Also, ReForm can effective reconfigure a DNN model to different mobile devices with distinct resource constraints. Moreover, ReForm achieves satisfying computation cost reduction with ignorable accuracy drop in both static and dynamic computing scenarios (at most 18% workload, 16.23% latency, 48.63% memory, and 21.5% energy enhancement).
ACM Digital Library
以上显示的是最相近的搜索结果。 查看全部搜索结果