C-Coll: Introducing error-bounded lossy compression into MPI collectives J Huang, S Di, X Yu, Y Zhai, J Liu, K Raffenetti, H Zhou, K Zhao, Z Chen, ... arXiv preprint arXiv:2304.03890, 2023 | 8 | 2023 |
Anatomy of high-performance GEMM with online fault tolerance on GPUs S Wu, Y Zhai, J Liu, J Huang, Z Jian, B Wong, Z Chen Proceedings of the 37th International Conference on Supercomputing, 360-372, 2023 | 7 | 2023 |
High-performance Effective Scientific Error-bounded Lossy Compression with Auto-tuned Multi-component Interpolation J Liu, S Di, K Zhao, X Liang, S Jin, Z Jian, J Huang, S Wu, Z Chen, ... Proceedings of the ACM on Management of Data 2 (1), 1-27, 2024 | 6 | 2024 |
An Optimized Error-controlled MPI Collective Framework Integrated with Lossy Compression J Huang, S Di, X Yu, Y Zhai, Z Zhang, J Liu, X Lu, K Raffenetti, H Zhou, ... 2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2024 | 4 | 2024 |
gZCCL: Compression-accelerated collective communication framework for GPU clusters J Huang, S Di, X Yu, Y Zhai, J Liu, Y Huang, K Raffenetti, H Zhou, K Zhao, ... Proceedings of the 38th ACM International Conference on Supercomputing, 437-448, 2024 | 2 | 2024 |
CliZ: Optimizing lossy compression for climate datasets with adaptive fine-tuned data prediction Z Jian, S Di, J Liu, K Zhao, X Liang, H Xu, R Underwood, S Wu, J Huang, ... 2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2024 | 2 | 2024 |
Exploring Wavelet Transform Usages for Error-bounded Scientific Data Compression J Huang, J Liu, S Di, Y Zhai, Z Jian, S Wu, K Zhao, Z Chen, Y Guo, ... 2023 IEEE International Conference on Big Data (BigData), 4233-4239, 2023 | 2 | 2023 |
FT-GEMM: A fault tolerant high performance GEMM implementation on x86 CPUs S Wu, Y Zhai, J Huang, Z Jian, Z Chen Proceedings of the 32nd International Symposium on High-Performance Parallel …, 2023 | 2 | 2023 |
TurboFFT: A High-Performance Fast Fourier Transform with Fault Tolerance on GPU S Wu, Y Zhai, J Liu, J Huang, Z Jian, H Dai, S Di, Z Chen, F Cappello arXiv preprint arXiv:2405.02520, 2024 | 1 | 2024 |
POSTER: Optimizing Collective Communications with Error-bounded Lossy Compression for GPU Clusters J Huang, S Di, X Yu, Y Zhai, J Liu, Y Huang, K Raffenetti, H Zhou, K Zhao, ... Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and …, 2024 | 1 | 2024 |
FT-BLAS: A Fault Tolerant High Performance BLAS Implementation on x86 CPUs Y Zhai, E Giem, K Zhao, J Liu, J Huang, BM Wong, CR Shelton, Z Chen IEEE Transactions on Parallel and Distributed Systems, 2023 | 1 | 2023 |
Accelerating MPI collectives with process-in-process-based multi-object techniques J Huang, K Ouyang, Y Zhai, J Liu, M Si, K Raffenetti, H Zhou, A Hori, ... Proceedings of the 32nd International Symposium on High-Performance Parallel …, 2023 | 1 | 2023 |
Accelerating fault-tolerant BLAS on x86 CPUs Y Zhai, E Giem, K Zhao, J Liu, J Huang, B Wong, C Shelton, Z Chen July, 2022 | 1 | 2022 |
FT K-Means: A High-Performance K-Means on GPU with Fault Tolerance S Wu, Y Ding, Y Zhai, J Liu, J Huang, Z Jian, H Dai, S Di, BM Wong, ... arXiv preprint arXiv:2408.01391, 2024 | | 2024 |
A Survey on Error-Bounded Lossy Compression for Scientific Datasets S Di, J Liu, K Zhao, X Liang, R Underwood, Z Zhang, M Shah, Y Huang, ... arXiv preprint arXiv:2404.02840, 2024 | | 2024 |
PiP-MColl: Process-in-Process-based Multi-object MPI Collectives J Huang, K Ouyang, Y Zhai, J Liu, M Si, K Raffenetti, H Zhou, A Hori, ... 2023 IEEE International Conference on Cluster Computing (CLUSTER), 354-364, 2023 | | 2023 |
Accelerating Collective Communications with Lossy Compression on GPU J Huang, S Di, X Yu, Y Guo | | |