DGCL: An efficient communication library for distributed GNN training Z Cai, X Yan, Y Wu, K Ma, J Cheng, F Yu Proceedings of the Sixteenth European Conference on Computer Systems, 130-144, 2021 | 84 | 2021 |
Seastar: vertex-centric programming for graph neural networks Y Wu, K Ma, Z Cai, T Jin, B Li, C Zheng, J Cheng, F Yu Proceedings of the Sixteenth European Conference on Computer Systems, 359-375, 2021 | 54 | 2021 |
Elastic deep learning in multi-tenant GPU clusters Y Wu, K Ma, X Yan, Z Liu, Z Cai, Y Huang, J Cheng, H Yuan, F Yu IEEE Transactions on Parallel and Distributed Systems 33 (1), 144-158, 2021 | 47 | 2021 |
Tensoropt: Exploring the tradeoffs in distributed dnn training with auto-parallelism Z Cai, X Yan, K Ma, Y Wu, Y Huang, J Cheng, T Su, F Yu IEEE Transactions on Parallel and Distributed Systems 33 (8), 1967-1981, 2021 | 30 | 2021 |
FEC: Efficient Deep Recommendation Model Training with Flexible Embedding Communication K Ma, X Yan, Z Cai, Y Huang, Y Wu, J Cheng Proceedings of the ACM on Management of Data 1 (2), 1-21, 2023 | 1 | 2023 |
PPS: Fair and efficient black-box scheduling for multi-tenant GPU clusters K Ma, Z Cai, X Yan, Y Zhang, Z Liu, Y Feng, C Li, W Lin, J Cheng Parallel Computing 120, 103082, 2024 | | 2024 |
DGCL Z Cai, X Yan, Y Wu, K Ma, J Cheng, F Yu Proceedings of the Sixteenth European Conference on Computer Systems, 2021 | | 2021 |