A Contemporary Overview: Trends and Applications of Large Language Models on Mobile Devices

L Liu, H An, P Chen, L Ye - arXiv preprint arXiv:2412.03772, 2024 - arxiv.org
With the rapid development of large language models (LLMs), which possess powerful
natural language processing and generation capabilities, LLMs are poised to provide more …

EdgeLLM: Fast On-device LLM Inference with Speculative Decoding

D Xu, W Yin, H Zhang, X Jin, Y Zhang… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
Generative tasks, such as text generation and question answering, are essential for mobile
applications. Given their inherent privacy sensitivity, executing them on devices is …

PieBridge: Fast and Parameter-Efficient On-Device Training via Proxy Networks

W Yin, D Xu, G Huang, Y Zhang, S Wei, M Xu… - Proceedings of the 22nd …, 2024 - dl.acm.org
On-device training Neural Networks (NNs) has been a crucial catalyst towards privacy-
preserving and personalized mobile intelligence. Recently, a novel training paradigm …

FlexQuant: Elastic Quantization Framework for Locally Hosted LLM on Edge Devices

Y Chai, M Kwen, D Brooks, GY Wei - arXiv preprint arXiv:2501.07139, 2025 - arxiv.org
Deploying LLMs on edge devices presents serious technical challenges. Memory elasticity
is crucial for edge devices with unified memory, where memory is shared and fluctuates …