A survey on deep neural network pruning: Taxonomy, comparison, analysis, and recommendations

H Cheng, M Zhang, JQ Shi - IEEE Transactions on Pattern …, 2024 - ieeexplore.ieee.org
Modern deep neural networks, particularly recent large language models, come with
massive model sizes that require significant computational and storage resources. To …

Mobile edge intelligence for large language models: A contemporary survey

G Qu, Q Chen, W Wei, Z Lin, X Chen… - … Surveys & Tutorials, 2025 - ieeexplore.ieee.org
On-device large language models (LLMs), referring to running LLMs on edge devices, have
raised considerable interest since they are more cost-effective, latency-efficient, and privacy …

Llm-based edge intelligence: A comprehensive survey on architectures, applications, security and trustworthiness

O Friha, MA Ferrag, B Kantarci… - IEEE Open Journal …, 2024 - ieeexplore.ieee.org
The integration of Large Language Models (LLMs) and Edge Intelligence (EI) introduces a
groundbreaking paradigm for intelligent edge devices. With their capacity for human-like …

Small language models: Survey, measurements, and insights

Z Lu, X Li, D Cai, R Yi, F Liu, X Zhang, ND Lane… - arXiv preprint arXiv …, 2024 - arxiv.org
Small language models (SLMs), despite their widespread adoption in modern smart
devices, have received significantly less academic attention compared to their large …

Leveraging large language models for integrated satellite-aerial-terrestrial networks: recent advances and future directions

S Javaid, RA Khalil, N Saeed, B He… - IEEE Open Journal of …, 2024 - ieeexplore.ieee.org
Integrated satellite, aerial, and terrestrial networks (ISATNs) represent a sophisticated
convergence of diverse communication technologies to ensure seamless connectivity …

Heterogeneous contrastive learning for foundation models and beyond

L Zheng, B Jing, Z Li, H Tong, J He - Proceedings of the 30th ACM …, 2024 - dl.acm.org
In the era of big data and Artificial Intelligence, an emerging paradigm is to utilize contrastive
self-supervised learning to model large-scale heterogeneous data. Many existing foundation …

On-device language models: A comprehensive review

J Xu, Z Li, W Chen, Q Wang, X Gao, Q Cai… - arXiv preprint arXiv …, 2024 - arxiv.org
The advent of large language models (LLMs) revolutionized natural language processing
applications, and running LLMs on edge devices has become increasingly attractive for …

Melting point: Mobile evaluation of language transformers

S Laskaridis, K Katevas, L Minto… - Proceedings of the 30th …, 2024 - dl.acm.org
Transformers have recently revolutionized the machine learning (ML) landscape, gradually
making their way into everyday tasks and equipping our computers with" sparks of …

Empowering 1000 tokens/second on-device llm prefilling with mllm-npu

D Xu, H Zhang, L Yang, R Liu, G Huang, M Xu… - arXiv preprint arXiv …, 2024 - arxiv.org
On-device large language models (LLMs) are catalyzing novel mobile applications such as
UI task automation and personalized email auto-reply, without giving away users' private …

Fwdllm: Efficient federated finetuning of large language models with perturbed inferences

M Xu, D Cai, Y Wu, X Li, S Wang - … of the 2024 USENIX Conference on …, 2024 - dl.acm.org
Large Language Models (LLMs) are transforming the landscape of mobile intelligence.
Federated Learning (FL), a method to preserve user data privacy, is often employed in fine …