R Yang, Y Li, H He, W Zhang - 2022 International Joint …, 2022 - ieeexplore.ieee.org
The collaborative inference approach splits the Deep Neural Networks (DNNs) model into two parts. It runs collaboratively on the end device and cloud server to minimize inference …
Deep neural networks (DNNs) have been widely used in many intelligent applications such as object recognition and automatic driving due to their superior performance in conducting …
F Dong, H Wang, D Shen, Z Huang… - IEEE Transactions …, 2022 - ieeexplore.ieee.org
Edge intelligence, as a prospective paradigm for accelerating DNN inference, is mostly implemented by model partitioning which inevitably incurs the large transmission overhead …
F Xue, W Fang, W Xu, Q Wang, X Ma… - 2020 IEEE 22nd …, 2020 - ieeexplore.ieee.org
Deep Neural Networks (DNN) have been widely used in a large number of application scenarios. However, DNN models are generally both computation-intensive and memory …
H Qi, F Ren, L Wang, P Jiang, S Wan… - ACM Transactions on …, 2024 - dl.acm.org
Edge intelligence has emerged as a promising paradigm to accelerate DNN inference by model partitioning, which is particularly useful for intelligent scenarios that demand high …
Y Chen, T Luo, W Fang, NN Xiong - ACM Transactions on Internet …, 2024 - dl.acm.org
Deep learning technology has grown significantly in new application scenarios such as smart cities and driverless vehicles, but its deployment needs to consume a lot of resources …
Deep Neural Networks (DNNs) are widely used to analyze the abundance of data collected by massive Internet-of-Thing (IoT) devices. The traditional approaches usually send the data …
Y Duan, J Wu - 2021 IEEE/ACM 29th International Symposium …, 2021 - ieeexplore.ieee.org
The quality of service (QoS) of intelligent applications on mobile devices heavily depends on the inference speed of Deep Neural Network (DNN) models. Cooperative DNN inference …
JI Chang, JJ Kuo, CH Lin, WT Chen… - 2019 IEEE Global …, 2019 - ieeexplore.ieee.org
Recently, the notions of partitioning the Deep Neural Network (DNN) model over the multi- level computing units and making a fast inference with the early-inference technique have …