Deep reinforcement learning based resource management for DNN inference in industrial IoT

W Zhang, D Yang, H Peng, W Wu… - IEEE Transactions …, 2021 - ieeexplore.ieee.org
… However, our work considers the DNN inference services, in which inference accuracy is a
… management and provide high-accuracy inference services for intelligent IIoT applications. …

Accuracy-guaranteed collaborative DNN inference in industrial IoT via deep reinforcement learning

W Wu, P Yang, W Zhang, C Zhou… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
… -intensive deep neural network (DNN) inference services, … In this article, we investigate the
collaborative DNN inference … Specifically, sampling rate adaption, inference task offloading…

Deep reinforcement learning based resource management for DNN inference in IIoT

W Zhang, D Yang, H Peng, W Wu… - … 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org
… allocation for deep neural network (DNN) inference in the … DNN inference tasks, a resource
management problem is formulated with the objective of maximizing the average inference

Fa3c: Fpga-accelerated deep reinforcement learning

H Cho, P Oh, J Park, W Jung, J Lee - Proceedings of the Twenty-Fourth …, 2019 - dl.acm.org
Deep RL platform, called FA3C. Traditionally, FPGA-based DNN accelerators have
mainly focused on inference … Our platform targets both inference and training using single-precision …

DNN Inference Acceleration for Smart Devices in Industry 5.0 By Decentralized Deep Reinforcement Learning

C Dong, M Shafiq, MM Al Dabel… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
… algorithm locally and get an inference scheme based on local … In this paper, we investigate
DNN inference acceleration in … First, we formulate task offloading of DNN inference among …

Learning to infer: RL-based search for DNN primitive selection on Heterogeneous Embedded Systems

M De Prado, N Pazos, L Benini - 2019 Design, Automation & …, 2019 - ieeexplore.ieee.org
… CNNs’ inference latency may become a bottleneck for Deep Learning adoption by … -DNN,
a fully automatic search based on Reinforcement Learning which, combined with an inference

Autoscale: Energy efficiency optimization for stochastic edge inference using reinforcement learning

YG Kim, CJ Wu - 2020 53rd Annual IEEE/ACM international …, 2020 - ieeexplore.ieee.org
… Therefore, AutoScale leverages a reinforcement learning … to maximize the DNN inference
energy efficiency while … an in-depth characterization of DNN inference execution on mobile and …

Stochastic cumulative DNN inference with RL-aided adaptive IoT device-edge collaboration

K Qu, W Zhuang, W Wu, M Li, X Shen… - IEEE Internet of …, 2023 - ieeexplore.ieee.org
… We focus on deep neural network (DNN) based classification … level and delay performance
of DNN inference via device-edge … DNN inference in industrial IoT via deep reinforcement

Mixed precision quantization for reram-based dnn inference accelerators

S Huang, A Ankit, P Silveira, R Antunes… - Proceedings of the 26th …, 2021 - dl.acm.org
… We also propose an automated quantization flow powered by deep reinforcement learning
to search for the best quantization configuration in the large design space. Our evaluation …

Deep reinforcement learning: Framework, applications, and embedded implementations

H Li, T Wei, A Ren, Q Zhu… - 2017 IEEE/ACM …, 2017 - ieeexplore.ieee.org
… In order to mitigate the potential oscillation in the DNN inference results, we adopt the duplicate
Q method from [15], which maintains two Q value estimates for each state-action pair and …