Mandheling: Mixed-precision on-device dnn training with dsp offloading

D Xu, M Xu, Q Wang, S Wang, Y Ma, K Huang… - Proceedings of the 28th …, 2022 - dl.acm.org
This paper proposes Mandheling, the first system that enables highly resource-efficient on-
device training by orchestrating mixed-precision training with on-chip Digital Signal …

Llmcad: Fast and scalable on-device large language model inference

D Xu, W Yin, X Jin, Y Zhang, S Wei, M Xu… - arXiv preprint arXiv …, 2023 - arxiv.org
Generative tasks, such as text generation and question answering, hold a crucial position in
the realm of mobile applications. Due to their sensitivity to privacy concerns, there is a …

Mergesfl: Split federated learning with feature merging and batch size regulation

Y Liao, Y Xu, H Xu, L Wang, Z Yao… - 2024 IEEE 40th …, 2024 - ieeexplore.ieee.org
Recently, federated learning (FL) has emerged as a popular technique for edge AI to mine
valuable knowledge in edge computing (EC) systems. To boost the performance of AI …

Blastnet: Exploiting duo-blocks for cross-processor real-time dnn inference

N Ling, X Huang, Z Zhao, N Guan, Z Yan… - Proceedings of the 20th …, 2022 - dl.acm.org
In recent years, Deep Neural Network (DNN) has been increasingly adopted by a wide
range of time-critical applications running on edge platforms with heterogeneous …

Graft: Efficient inference serving for hybrid deep learning with SLO guarantees via DNN re-alignment

J Wu, L Wang, Q Jin, F Liu - IEEE Transactions on Parallel and …, 2023 - ieeexplore.ieee.org
Deep neural networks (DNNs) have been widely adopted for various mobile inference tasks,
yet their ever-increasing computational demands are hindering their deployment on …

Edge Video Analytics: A Survey on Applications, Systems and Enabling Techniques

R Xu, S Razavi, R Zheng - IEEE Communications Surveys & …, 2023 - ieeexplore.ieee.org
Video, as a key driver in the global explosion of digital information, can create tremendous
benefits for human society. Governments and enterprises are deploying innumerable …

Adaptivenet: Post-deployment neural architecture adaptation for diverse edge environments

H Wen, Y Li, Z Zhang, S Jiang, X Ye, Y Ouyang… - Proceedings of the 29th …, 2023 - dl.acm.org
Deep learning models are increasingly deployed to edge devices for real-time applications.
To ensure stable service quality across diverse edge environments, it is highly desirable to …

Elms: Elasticized large language models on mobile devices

W Yin, R Yi, D Xu, G Huang, M Xu, X Liu - arXiv preprint arXiv:2409.09071, 2024 - arxiv.org
On-device Large Language Models (LLMs) are revolutionizing mobile AI, enabling
applications such as UI automation while addressing privacy concerns. Currently, the …

EdgeTuner: Fast scheduling algorithm tuning for dynamic edge-cloud workloads and resources

R Han, S Wen, CH Liu, Y Yuan… - IEEE INFOCOM 2022 …, 2022 - ieeexplore.ieee.org
Edge-cloud jobs are rapidly prevailing in many application domains, posing the challenge of
using both resource-strenuous edge devices and elastic cloud resources. Efficient resource …

Parallelsfl: A novel split federated learning framework tackling heterogeneity issues

Y Liao, Y Xu, H Xu, Z Yao, L Huang… - Proceedings of the 30th …, 2024 - dl.acm.org
Mobile devices contribute more than half of the world's web traffic, providing massive and
diverse data for powering various federated learning (FL) applications. In order to avoid the …