A survey of resource-efficient LLM and multimodal foundation models M Xu, W Yin, D Cai, R Yi, D Xu, Q Wang, B Wu, Y Zhao, C Yang, S Wang, ... arXiv preprint arXiv:2401.08092, 2024 | 63 | 2024 |
LLMCad: Fast and scalable on-device large language model inference D Xu, W Yin, X Jin, Y Zhang, S Wei, M Xu, X Liu arXiv preprint arXiv:2309.04255, 2023 | 37 | 2023 |
LLM as a system service on mobile devices W Yin, M Xu, Y Li, X Liu arXiv preprint arXiv:2403.11805, 2024 | 24 | 2024 |
ELMS: Elasticized Large Language Models On Mobile Devices W Yin, R Yi, D Xu, G Huang, M Xu, X Liu arXiv preprint arXiv:2409.09071, 2024 | 1 | 2024 |
PieBridge: Fast and Parameter-Efficient On-Device Training via Proxy Networks W Yin, D Xu, G Huang, Y Zhang, S Wei, M Xu, X Liu Proceedings of the 22nd ACM Conference on Embedded Networked Sensor Systems …, 2024 | | 2024 |