关注
Borui Wan
Borui Wan
在 connect.hku.hk 的电子邮件经过验证
标题
引用次数
引用次数
年份
Adaptive message quantization and parallelization for distributed full-graph gnn training
B Wan, J Zhao, C Wu
Proceedings of Machine Learning and Systems 5, 2023
222023
LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization
J Zhao, B Wan, Y Peng, H Lin, C Wu
arXiv preprint arXiv:2403.01136, 2024
92024
ByteCheckpoint: A Unified Checkpointing System for Large Foundation Model Development
B Wan, M Han, Y Sheng, Y Peng, H Lin, M Zhang, Z Lai, Y Menghan, ...
arXiv preprint arXiv:2407.20143, 2024
3*2024
POSTER: LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization
J Zhao, B Wan, C Wu, Y Peng, H Lin
Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and …, 2024
12024
QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices
J Zhao, B Wan, Y Peng, H Lin, Y Zhu, C Wu
2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2024
2024
系统目前无法执行此操作,请稍后再试。
文章 1–5