Distributed artificial intelligence empowered by end-edge-cloud computing: A survey

S Duan, D Wang, J Ren, F Lyu, Y Zhang… - … Surveys & Tutorials, 2022 - ieeexplore.ieee.org
As the computing paradigm shifts from cloud computing to end-edge-cloud computing, it
also supports artificial intelligence evolving from a centralized manner to a distributed one …

Band: coordinated multi-dnn inference on heterogeneous mobile processors

JS Jeong, J Lee, D Kim, C Jeon, C Jeong… - Proceedings of the 20th …, 2022 - dl.acm.org
The rapid development of deep learning algorithms, as well as innovative hardware
advancements, encourages multi-DNN workloads such as augmented reality applications …

Flexible high-resolution object detection on edge devices with tunable latency

S Jiang, Z Lin, Y Li, Y Shu, Y Liu - Proceedings of the 27th Annual …, 2021 - dl.acm.org
Object detection is a fundamental building block of video analytics applications. While
Neural Networks (NNs)-based object detection models have shown excellent accuracy on …

Melon: Breaking the memory wall for resource-efficient on-device machine learning

Q Wang, M Xu, C Jin, X Dong, J Yuan, X Jin… - Proceedings of the 20th …, 2022 - dl.acm.org
On-device learning is a promising technique for emerging privacy-preserving machine
learning paradigms. However, through quantitative experiments, we find that commodity …

[PDF][PDF] CoDL: efficient CPU-GPU co-execution for deep learning inference on mobile devices.

F Jia, D Zhang, T Cao, S Jiang, Y Liu, J Ren, Y Zhang - MobiSys, 2022 - chrisplus.me
Concurrent inference execution on heterogeneous processors is critical to improve the
performance of increasingly heavy deep learning (DL) models. However, available …

Mandheling: Mixed-precision on-device dnn training with dsp offloading

D Xu, M Xu, Q Wang, S Wang, Y Ma, K Huang… - Proceedings of the 28th …, 2022 - dl.acm.org
This paper proposes Mandheling, the first system that enables highly resource-efficient on-
device training by orchestrating mixed-precision training with on-chip Digital Signal …

A comprehensive benchmark of deep learning libraries on mobile devices

Q Zhang, X Li, X Che, X Ma, A Zhou, M Xu… - Proceedings of the …, 2022 - dl.acm.org
Deploying deep learning (DL) on mobile devices has been a notable trend in recent years.
To support fast inference of on-device DL, DL libraries play a critical role as algorithms and …

A comprehensive deep learning library benchmark and optimal library selection

Q Zhang, X Che, Y Chen, X Ma, M Xu… - IEEE Transactions …, 2023 - ieeexplore.ieee.org
Deploying deep learning (DL) on mobile devices has been a notable trend in recent years.
To support fast inference of on-device DL, DL libraries play a critical role as algorithms and …

Deepperform: An efficient approach for performance testing of resource-constrained neural networks

S Chen, M Haque, C Liu, W Yang - Proceedings of the 37th IEEE/ACM …, 2022 - dl.acm.org
Today, an increasing number of Adaptive Deep Neural Networks (AdNNs) are being used
on resource-constrained embedded devices. We observe that, similar to traditional software …

Blastnet: Exploiting duo-blocks for cross-processor real-time dnn inference

N Ling, X Huang, Z Zhao, N Guan, Z Yan… - Proceedings of the 20th …, 2022 - dl.acm.org
In recent years, Deep Neural Network (DNN) has been increasingly adopted by a wide
range of time-critical applications running on edge platforms with heterogeneous …