关注
Ruibo FAN
Ruibo FAN
The Hong Kong University of Science and Technology (Guangzhou)
在 connect.hkust-gz.edu.cn 的电子邮件经过验证
标题
引用次数
引用次数
年份
Dissecting the Runtime Performance of the Training, Fine-tuning, and Inference of Large Language Models
L Zhang, X Liu, Z Li, X Pan, P Dong, R Fan, R Guo, X Wang, Q Luo, S Shi, ...
arXiv preprint arXiv:2311.03687, 2023
32023
Fast Sparse GPU Kernels for Accelerated Training of Graph Neural Networks
R Fan, W Wang, X Chu
2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2023
32023
Benchmarking and Dissecting the Nvidia Hopper GPU Architecture
W Luo, R Fan, Z Li, D Du, Q Wang, X Chu
arXiv preprint arXiv:2402.13499, 2024
22024
DTC-SpMM: Bridging the Gap in Accelerating General Sparse Matrix Multiplication with Tensor Cores
R Fan, W Wang, X Chu
Proceedings of the 29th ACM International Conference on Architectural …, 2024
2024
系统目前无法执行此操作,请稍后再试。
文章 1–4