关注
Jingwei Sun
标题
引用次数
引用次数
年份
Automated performance modeling of HPC applications using machine learning
J Sun, G Sun, S Zhan, J Zhang, Y Chen
IEEE Transactions on Computers 69 (5), 749-763, 2020
332020
Adasam: Boosting sharpness-aware minimization with adaptive learning rate and momentum for training deep neural networks
H Sun, L Shen, Q Zhong, L Ding, S Chen, J Sun, J Li, G Sun, D Tao
Neural Networks 169, 506-519, 2024
232024
Lossy compression of communication traces using recurrent neural networks
J Sun, T Yan, H Sun, H Lin, G Sun
IEEE Transactions on Parallel and Distributed Systems 33 (11), 3106-3116, 2021
92021
Performance analysis of graph neural network frameworks
J Wu, J Sun, H Sun, G Sun
2021 IEEE International Symposium on Performance Analysis of Systems and …, 2021
92021
Latency-aware automatic CNN channel pruning with GPU runtime analysis
J Liu, J Sun, Z Xu, G Sun
BenchCouncil Transactions on Benchmarks, Standards and Evaluations 1 (1), 100009, 2021
82021
SPLZ: An efficient algorithm for single source shortest path problem using compression method
J Sun, G Sun
GeoInformatica 20, 1-18, 2016
82016
An efficient channel-level pruning for CNNs without fine-tuning
Z Xu, J Sun, Y Liu, G Sun
2021 International Joint Conference on Neural Networks (IJCNN), 1-8, 2021
62021
Automated performance modeling based on runtime feature detection and machine learning
J Sun, S Zhan, G Sun, Y Chen
2017 IEEE International Symposium on Parallel and Distributed Processing …, 2017
62017
An active learning method for empirical modeling in performance tuning
J Zhang, J Sun, W Zhou, G Sun
2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2020
52020
Using small-scale history data to predict large-scale performance of hpc application
W Zhou, J Zhang, J Sun, G Sun
2020 IEEE International Parallel and Distributed Processing Symposium …, 2020
42020
Constructing skeleton for parallel applications with machine learning methods
Z Zhang, J Sun, J Zhang, Y Qin, G Sun
Workshop Proceedings of the 48th International Conference on Parallel …, 2019
42019
GPU Occupancy Prediction of Deep Learning Models Using Graph Neural Network
H Mei, H Qu, J Sun, Y Gao, H Lin, G Sun
2023 IEEE International Conference on Cluster Computing (CLUSTER), 318-329, 2023
12023
FedLALR: Client-Specific Adaptive Learning Rates Achieve Linear Speedup for Non-IID Data
H Sun, L Shen, S Chen, J Sun, J Li, G Sun, D Tao
arXiv preprint arXiv:2309.09719, 2023
12023
Accelerating GNN inference by soft channel pruning
W Zhang, J Sun, G Sun
2022 IEEE 13th International Symposium on Parallel Architectures, Algorithms …, 2022
12022
Performance characterization and optimization of pruning patterns for sparse DNN inference
Y Liu, J Sun, J Liu, G Sun
BenchCouncil Transactions on Benchmarks, Standards and Evaluations 2 (4), 100090, 2022
12022
Structured Pruning for Large Language Models Using Coupled Components Elimination and Minor Fine-tuning
H Zhang, XS XiaolongShi, J Sun, G Sun
Findings of the Association for Computational Linguistics: NAACL 2024, 1-12, 2024
2024
AG-SpTRSV: An Automatic Framework to Optimize Sparse Triangular Solve on GPUs
Z Hu, J Sun, Z Li, G Sun
ACM Transactions on Architecture and Code Optimization, 2024
2024
Physically plausible and conservative solutions to Navier-Stokes equations using Physics-Informed CNNs
J Li, L Zhou, J Sun, G Sun
2024
Automated HPC Workload Generation Combining Statistical Modeling and Autoregressive Analysis
Z Zhou, J Sun, G Sun
International Symposium on Benchmarking, Measuring and Optimization, 153-170, 2023
2023
EC-SpMM: Efficient Compilation of SpMM Kernel on GPUs
J Lin, H Zhang, X Shi, J Sun, X Yu, J Yao, G Sun
Proceedings of the 52nd International Conference on Parallel Processing, 21-30, 2023
2023
系统目前无法执行此操作,请稍后再试。
文章 1–20