Ge-spmm: General-purpose sparse matrix-matrix multiplication on gpus for graph neural networks

G Huang, G Dai, Y Wang, H Yang - … Conference for High …, 2020 - ieeexplore.ieee.org
The acceleration of Graph Neural Networks (GNNs) requires efficient and framework-
compatible Sparse-Dense Matrix-Matrix Multiplication (SpMM). From the compatibility …

Coarsening the granularity: Towards structurally sparse lottery tickets

T Chen, X Chen, X Ma, Y Wang… - … conference on machine …, 2022 - proceedings.mlr.press
The lottery ticket hypothesis (LTH) has shown that dense models contain highly sparse
subnetworks (ie, winning tickets) that can be trained in isolation to match full accuracy …

TileSpGEMM: A tiled algorithm for parallel sparse general matrix-matrix multiplication on GPUs

Y Niu, Z Lu, H Ji, S Song, Z Jin, W Liu - Proceedings of the 27th ACM …, 2022 - dl.acm.org
Sparse general matrix-matrix multiplication (SpGEMM) is one of the most fundamental
building blocks in sparse linear solvers, graph processing frameworks and machine learning …

Smash: Co-designing software compression and hardware-accelerated indexing for efficient sparse matrix operations

K Kanellopoulos, N Vijaykumar, C Giannoula… - Proceedings of the …, 2019 - dl.acm.org
Important workloads, such as machine learning and graph analytics applications, heavily
involve sparse linear algebra operations. These operations use sparse matrix compression …

Design principles for sparse matrix multiplication on the gpu

C Yang, A Buluç, JD Owens - European Conference on Parallel …, 2018 - Springer
We implement two novel algorithms for sparse-matrix dense-matrix multiplication (SpMM) on
the GPU. Our algorithms expect the sparse input in the popular compressed-sparse-row …

Accel-gcn: High-performance gpu accelerator design for graph convolution networks

X Xie, H Peng, A Hasan, S Huang… - 2023 IEEE/ACM …, 2023 - ieeexplore.ieee.org
Graph Convolutional Networks (GCNs) are pivotal in extracting latent information from graph
data across various domains, yet their acceleration on mainstream GPUs is challenged by …

[HTML][HTML] Adaptive single-layer aggregation framework for energy-efficient and privacy-preserving load forecasting in heterogeneous Federated smart grids

HU Manzoor, A Jafri, A Zoha - Internet of Things, 2024 - Elsevier
Federated Learning (FL) enhances predictive accuracy in load forecasting by integrating
data from distributed load networks while ensuring data privacy. However, the …

Hypergraf: Hyperdimensional graph-based reasoning acceleration on fpga

H Chen, A Zakeri, F Wen, HE Barkam… - … Conference on Field …, 2023 - ieeexplore.ieee.org
The latest hardware accelerators proposed for graph applications primarily focus on graph
neural networks (GNNs) and graph mining. High-level graph reasoning tasks, such as graph …

[PDF][PDF] Cogdl: An extensive toolkit for deep learning on graphs

Y Cen, Z Hou, Y Wang, Q Chen, Y Luo… - arXiv preprint arXiv …, 2021 - ask.qcloudimg.com
Graph representation learning aims to learn low-dimensional node embeddings for graphs.
It is used in several real-world applications such as social network analysis and large-scale …

Efficient tiled sparse matrix multiplication through matrix signatures

SE Kurt, A Sukumaran-Rajam… - … Conference for High …, 2020 - ieeexplore.ieee.org
Tiling is a key technique to reduce data movement in matrix computations. While tiling is well
understood and widely used for dense matrix/tensor computations, effective tiling of sparse …