Tiny machine learning: progress and futures [feature]

J Lin, L Zhu, WM Chen, WC Wang… - IEEE Circuits and …, 2023 - ieeexplore.ieee.org
Tiny machine learning (TinyML) is a new frontier of machine learning. By squeezing deep
learning models into billions of IoT devices and microcontrollers (MCUs), we expand the …

Efficient memory management for large language model serving with pagedattention

W Kwon, Z Li, S Zhuang, Y Sheng, L Zheng… - Proceedings of the 29th …, 2023 - dl.acm.org
High throughput serving of large language models (LLMs) requires batching sufficiently
many requests at a time. However, existing systems struggle because the key-value cache …

On-device training under 256kb memory

J Lin, L Zhu, WM Chen, WC Wang… - Advances in Neural …, 2022 - proceedings.neurips.cc
On-device training enables the model to adapt to new data collected from the sensors by
fine-tuning a pre-trained model. Users can benefit from customized AI models without having …

A survey on scheduling techniques in computing and network convergence

S Tang, Y Yu, H Wang, G Wang, W Chen… - … Surveys & Tutorials, 2023 - ieeexplore.ieee.org
The computing demand for massive applications has led to the ubiquitous deployment of
computing power. This trend results in the urgent need for higher-level computing resource …

Pockengine: Sparse and efficient fine-tuning in a pocket

L Zhu, L Hu, J Lin, WM Chen, WC Wang… - Proceedings of the 56th …, 2023 - dl.acm.org
On-device learning and efficient fine-tuning enable continuous and privacy-preserving
customization (eg, locally fine-tuning large language models on personalized data) …

Revisiting edge ai: Opportunities and challenges

T Meuser, L Lovén, M Bhuyan, SG Patil… - IEEE Internet …, 2024 - ieeexplore.ieee.org
Edge artificial intelligence (AI) is an innovative computing paradigm that aims to shift the
training and inference of machine learning models to the edge of the network. This paradigm …

[PDF][PDF] Tinytrain: Deep neural network training at the extreme edge

YD Kwon, R Li, SI Venieris… - arXiv preprint arXiv …, 2023 - theyoungkwon.github.io
On-device training is essential for user personalisation and privacy. With the pervasiveness
of IoT devices and microcontroller units (MCU), this task becomes more challenging due to …

MODeL: memory optimizations for deep learning

B Steiner, M Elhoushi, J Kahn… - … on Machine Learning, 2023 - proceedings.mlr.press
The size of deep neural networks has grown exponentially in recent years. Unfortunately,
hardware devices have not kept pace with the rapidly increasing memory requirements. To …

On-device training: A first overview on existing systems

S Zhu, T Voigt, F Rahimian, J Ko - ACM Transactions on Sensor …, 2024 - dl.acm.org
The recent breakthroughs in machine learning (ML) and deep learning (DL) have catalyzed
the design and development of various intelligent systems over wide application domains …

Pluto and Charon: A time and memory efficient collaborative edge AI framework for personal LLMs fine-tuning

B Ouyang, S Ye, L Zeng, T Qian, J Li… - Proceedings of the 53rd …, 2024 - dl.acm.org
Large language models (LLMs) have unlocked a plethora of powerful applications at the
network edge, such as intelligent personal assistants. Data privacy and security concerns …