Cognn: efficient scheduling for concurrent gnn training on gpus

Q Sun, Y Liu, H Yang, R Zhang, M Dun… - … Conference for High …, 2022 - ieeexplore.ieee.org
Graph neural networks (GNNs) suffer from low GPU utilization due to frequent memory
accesses. Existing concurrent training mechanisms cannot be directly adapted to GNNs …

Sgap: towards efficient sparse tensor algebra compilation for GPU

G Zhang, Y Zhao, Y Tao, Z Yu, G Dai, S Huang… - CCF Transactions on …, 2023 - Springer
Sparse compiler is a promising solution for sparse tensor algebra optimization. In compiler
implementation, reduction in sparse-dense hybrid algebra plays a key role in performance …

Algorithm/hardware co-optimization for sparsity-aware SpMM acceleration of GNNs

Y Gao, L Gong, C Wang, T Wang, X Li… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
In recent years, graph neural networks (GNNs) have achieved impressive performance in
various application fields by extracting information from graph-structured data. It contains …