HMS-Net: Hierarchical multi-scale sparsity-invariant network for sparse depth completion Z Huang, J Fan, S Cheng, S Yi, X Wang, H Li IEEE Transactions on Image Processing 29, 3429-3441, 2019 | 175 | 2019 |
tcFFT: A fast half-precision FFT library for NVIDIA tensor cores B Li, S Cheng, J Lin 2021 IEEE International Conference on Cluster Computing (CLUSTER), 1-11, 2021 | 26* | 2021 |
FastFold: Optimizing AlphaFold Training and Inference on GPU Clusters S Cheng, X Zhao, G Lu, J Fang, T Zheng, R Wu, X Zhang, J Peng, Y You Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and …, 2024 | 24* | 2024 |
Hanayo: Harnessing wave-like pipeline parallelism for enhanced large model training efficiency Z Liu, S Cheng, H Zhou, Y You Proceedings of the International Conference for High Performance Computing …, 2023 | 16 | 2023 |
CUBE–Towards an Optimal Scaling of Cosmological N-body Simulations S Cheng, HR Yu, D Inman, Q Liao, Q Wu, J Lin 2020 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet …, 2020 | 12 | 2020 |
DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers X Zhao, S Cheng, Z Zheng, Z Yang, Z Liu, Y You arXiv preprint arXiv:2403.10266, 2024 | 2 | 2024 |
Wallfacer: Guiding transformer model training out of the long-context dark forest with N-body problem Z Liu, S Wang, S Cheng, Z Zhao, Y Bai, X Zhao, J Demmel, Y You arXiv preprint arXiv:2407.00611, 2024 | 1 | 2024 |
HeteGen: Efficient Heterogeneous Parallel Inference for Large Language Models on Resource-Constrained Devices X Zhao, B Jia, H Zhou, Z Liu, S Cheng, Y You Proceedings of Machine Learning and Systems 6, 162-172, 2024 | 1 | 2024 |
AutoChunk: Automated Activation Chunk for Memory-Efficient Deep Learning Inference X Zhao, S Cheng, G Lu, H Zhou, B Jia, Y You The Twelfth International Conference on Learning Representations, 2023 | 1* | 2023 |
ATP: Adaptive Tensor Parallelism for Foundation Models S Cheng, Z Liu, J Du, Y You arXiv preprint arXiv:2301.08658, 2023 | 1 | 2023 |
FTL: A universal framework for training low-bit DNNs via feature transfer K Du, Y Zhang, H Guan, Q Tian, Y Wang, S Cheng, J Lin Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 1 | 2020 |
Videl: A Vision-based AI Diagnoser for Early Leukemia S Cheng, J Wei, M Zhao, Z Jin, J Wang, Y Wang, J Lin Proceedings of the HPC Asia 2019 Workshops, 1-5, 2019 | 1 | 2019 |
Liger: Interleaving Intra- and Inter-Operator Parallelism for Distributed Large Model Inference J Du, J Wei, J Jiang, S Cheng, D Huang, Z Chen, Y Lu Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and …, 2024 | | 2024 |