J Xu, R Zhang, C Guo, W Hu, Z Liu, F Wu… - arXiv preprint arXiv …, 2024 - arxiv.org
Large Language Models (LLMs) are widely used across various domains, processing
millions of daily requests. This surge in demand poses significant challenges in optimizing …