TileSpGEMM: A tiled algorithm for parallel sparse general matrix-matrix multiplication on GPUs

J Gao, W Ji, F Chang, S Han, B Wei, Z Liu… - ACM Computing …, 2023 - dl.acm.org

General Sparse Matrix-Matrix Multiplication (SpGEMM) has attracted much attention from
researchers in graph analyzing, scientific computing, and deep learning. Many optimization …

被引用次数：74 相关文章所有 5 个版本

[PDF] arxiv.org

Dissecting tensor cores via microbenchmarks: Latency, throughput and numeric behaviors

W Sun, A Li, T Geng, S Stuijk… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

Tensor Cores have been an important unit to accelerate Fused Matrix Multiplication
Accumulation (MMA) in all NVIDIA GPUs since Volta Architecture. To program Tensor Cores …

被引用次数：46 相关文章所有 6 个版本

[PDF] arxiv.org

AccelTran: A sparsity-aware accelerator for dynamic inference with transformers

S Tuli, NK Jha - IEEE Transactions on Computer-Aided Design …, 2023 - ieeexplore.ieee.org

Self-attention-based transformer models have achieved tremendous success in the domain
of natural language processing. Despite their efficacy, accelerating the transformer is …

被引用次数：28 相关文章所有 5 个版本

[PDF] acm.org

TANGO: re-thinking quantization for graph neural network training on GPUs

S Chen, D Zheng, C Ding, C Huan, Y Ji… - Proceedings of the …, 2023 - dl.acm.org

Graph learning is becoming increasingly popular due to its superior performance in tackling
many grand challenges. While quantization is widely used to accelerate Graph Neural …

被引用次数：3 相关文章所有 10 个版本

[PDF] ssslab.cn

DASP: Specific Dense Matrix Multiply-Accumulate Units Accelerated General Sparse Matrix-Vector Multiplication

Y Lu, W Liu - Proceedings of the International Conference for High …, 2023 - dl.acm.org

Sparse matrix-vector multiplication (SpMV) plays a key role in computational science and
engineering, graph processing, and machine learning applications. Much work on SpMV …

被引用次数：10 相关文章所有 4 个版本

Accelerating Graph Neural Networks on Real Processing-In-Memory Systems

C Giannoula, P Yang, I Fernandez Vega… - arXiv e …, 2024 - ui.adsabs.harvard.edu

Abstract Graph Neural Networks (GNNs) are emerging ML models to analyze graph-
structure data. Graph Neural Network (GNN) execution involves both compute-intensive and …

被引用次数：7 相关文章

[PDF] google.com

HASpGEMM: Heterogeneity-Aware Sparse General Matrix-Matrix Multiplication on Modern Asymmetric Multicore Processors

H Cheng, W Li, Y Lu, W Liu - … of the 52nd International Conference on …, 2023 - dl.acm.org

Sparse general matrix-matrix multiplication (SpGEMM) is an important kernel in
computational science and engineering, and has been widely studied on homogeneous …

被引用次数：8 相关文章所有 2 个版本

Efficient execution of SpGEMM on long vector architectures

V Le Fèvre, M Casas - … of the 32nd International Symposium on High …, 2023 - dl.acm.org

The Sparse GEneral Matrix-Matrix multiplication (SpGEMM) C= A x B is a fundamental
routine extensively used in domains like machine learning or graph analytics. Despite its …

被引用次数：6 相关文章

[PDF] github.io

On Efficient Large Sparse Matrix Chain Multiplication

C Lin, W Luo, Y Fang, C Ma, X Liu, Y Ma - … of the ACM on Management of …, 2024 - dl.acm.org

Sparse matrices are often used to model the interactions among different objects and they
are prevalent in many areas including e-commerce, social network, and biology. As one of …

被引用次数：2 相关文章

HARP: Hardware-Based Pseudo-Tiling for Sparse Matrix Multiplication Accelerator

J Kim, M Jang, H Nam, S Kim - Proceedings of the 56th Annual IEEE …, 2023 - dl.acm.org

General sparse matrix-matrix multiplication (SpGEMM) is a memory-bound workload, due to
the compression format used. To minimize data movements for input matrices, outer product …

被引用次数：1 相关文章所有 4 个版本

高级搜索

QQ 群