Deepx: A software accelerator for low-power deep learning inference on mobile devices

ND Lane, S Bhattacharya, P Georgiev… - 2016 15th ACM/IEEE …, 2016 - ieeexplore.ieee.org
… from deep learning [1] – an innovative area of machine learning that is … The range of inference
tasks impacted by deep learning … However surprisingly, even though such inferences are …

Laconic deep learning inference acceleration

S Sharify, AD Lascorz, M Mahmoud, M Nikolic… - Proceedings of the 46th …, 2019 - dl.acm.org
… computations during inference with Deep Learning models. … needed by multiplications during
inference can be potentially … energy efficiency for inference with Deep Learning Networks. …

Deep learning inference service at microsoft

J Soifer, J Li, M Li, J Zhu, Y Li, Y He, E Zheng… - … Machine Learning  …, 2019 - usenix.org
… This paper introduces the Deep Learning Inference Service, an online production service at
… -latency deep neural network model inference. We present the system architecture and deep

Gene expression inference with deep learning

Y Chen, Y Li, R Narayan, A Subramanian, X Xie - Bioinformatics, 2016 - academic.oup.com
… Here, we present a deep learning method for gene expression inference (D-GEX). D-GEX is
… different strategies and tried to interpret the advantages of deep learning compared with LR. …

Deep learning type inference

VJ Hellendoorn, C Bird, ET Barr… - … of the 2018 26th acm joint …, 2018 - dl.acm.org
… We study the accuracy and behavior of deep learning networks when applied to type
inference across a range of metrics (see Section 4.4). Our proposed model enhances a …

Efficient acceleration of deep learning inference on resource-constrained edge devices: A review

MMH Shuvo, SK Islam, J Cheng… - Proceedings of the …, 2022 - ieeexplore.ieee.org
Successful integration of deep neural networks (DNNs) or deep learning (DL) has resulted
in breakthroughs in many areas. However, deploying these highly accurate models for data-…

Simba: Scaling deep-learning inference with multi-chip-module-based architecture

YS Shao, J Clemons, R Venkatesan, B Zimmer… - Proceedings of the …, 2019 - dl.acm.org
… to high-performance DNN inference algorithms has not been … inference accelerators as
semiconductor scaling slows. This paper presents Simba, a scalable deep-learning inference ac…

Deep learning inference in facebook data centers: Characterization, performance optimizations and hardware implications

J Park, M Naumov, P Basu, S Deng, A Kalaiah… - arXiv preprint arXiv …, 2018 - arxiv.org
… The application of deep learning techniques resulted in remarkable improvement of
machine learning models. In this paper we provide detailed characterizations of deep learning

Scrooge: A cost-effective deep learning inference system

Y Hu, R Ghosh, R Govindan - Proceedings of the ACM Symposium on …, 2021 - dl.acm.org
… In the paper, we presented a cost-efficient deep learning inference system named Scrooge,
which packs DL workload efficiently to maximize inference throughput while satisfying DL …

Integer quantization for deep learning inference: Principles and empirical evaluation

H Wu, P Judd, X Zhang, M Isaev… - arXiv preprint arXiv …, 2020 - arxiv.org
… the computational performance of deep learning applications. It is … Once trained, neural
networks can be deployed for inference … quantization for neural network inference, where trained …