J Wang, S Zhang, QC He, Y Chen - arXiv preprint arXiv:2501.02573, 2025 - arxiv.org
The machine learning and data science community has made significant while dispersive
progress in accelerating transformer-based large language models (LLMs), and one …