Legion: Automatically Pushing the Envelope of {Multi-GPU} System for {Billion-Scale}{GNN} Training J Sun, L Su, Z Shi, W Shen, Z Wang, L Wang, J Zhang, Y Li, W Yu, J Zhou, ... 2023 USENIX Annual Technical Conference (USENIX ATC 23), 165-179, 2023 | 17 | 2023 |
Staleness-Reduction Mini-Batch -Means X Zhu, J Sun, Z He, J Jiang, Z Wang IEEE Transactions on Neural Networks and Learning Systems, 2023 | 6 | 2023 |
Helios: An Efficient Out-of-core GNN Training System on Terabyte-scale Graphs with In-memory Performance J Sun, M Sun, Z Zhang, J Xie, Z Shi, Z Yang, J Zhang, F Wu, Z Wang arXiv preprint arXiv:2310.00837, 2023 | 3 | 2023 |
P4SGD: Programmable Switch Enhanced Model-Parallel Training on Generalized Linear Models on Distributed FPGAs H Huang, Y Li, J Sun, X Zhu, J Zhang, L Luo, J Li, Z Wang IEEE Transactions on Parallel and Distributed Systems 34 (8), 2311-2324, 2023 | 2 | 2023 |
SSiMD: Supporting Six Signed Multiplications in a DSP Block for Low-Precision CNN on FPGAs Q Liu, M Sun, J Sun, L Lu, J Zhao, Z Wang 2023 International Conference on Field Programmable Technology (ICFPT), 161-169, 2023 | | 2023 |