G-meta: Distributed meta learning in gpu clusters for large-scale recommender systems Y Xiao, S Zhao, Z Zhou, Z Huan, L Ju, X Zhang, L Wang, J Zhou Proceedings of the 32nd ACM International Conference on Information and …, 2023 | 7 | 2023 |
An adaptive placement and parallelism framework for accelerating rlhf training Y Xiao, Z Zhou, F Mao, W Wu, S Zhao, L Ju, L Liang, X Zhang, J Zhou arXiv preprint arXiv:2312.11819, 2023 | 6 | 2023 |
Rethinking memory and communication cost for efficient large language model training C Wu, H Zhang, L Ju, J Huang, Y Xiao, Z Huan, S Li, F Meng, L Liang, ... | 3 | 2024 |
AntDT: A Self-Adaptive Distributed Training Framework for Leader and Straggler Nodes Y Xiao, L Ju, Z Zhou, S Li, Z Huan, D Zhang, R Jiang, L Wang, X Zhang, ... 2024 IEEE 40th International Conference on Data Engineering (ICDE), 2024 | 1 | 2024 |
An effective and efficient time-aware entity alignment framework via Two-aspect three-view label propagation L Cai, X Mao, Y Xiao, C Wu, M Lan Proceedings of the Thirty-Second International Joint Conference on …, 2023 | 1 | 2023 |
AntBatchInfer: Elastic Batch Inference in the Kubernetes Cluster S Li, Y Xiao, F Meng, L Ju, L Liang, L Wang, J Zhou arXiv preprint arXiv:2404.09686, 2024 | | 2024 |
Rethinking Memory and Communication Costs for Efficient Data Parallel Training of Large Language Models H Zhang, JU Lin, C Wu, J Huang, Y Xiao, Z Zhou, Z Huan, S Li, F Meng, ... The Thirty-eighth Annual Conference on Neural Information Processing Systems | | |