Adaptive sparse tiling for sparse matrix multiplication

C Hong, A Sukumaran-Rajam, I Nisa, K Singh… - Proceedings of the 24th …, 2019 - dl.acm.org
Tiling is a key technique for data locality optimization and is widely used in high-
performance implementations of dense matrix-matrix multiplication for multicore/manycore …

TileSpGEMM: A tiled algorithm for parallel sparse general matrix-matrix multiplication on GPUs

Y Niu, Z Lu, H Ji, S Song, Z Jin, W Liu - Proceedings of the 27th ACM …, 2022 - dl.acm.org
Sparse general matrix-matrix multiplication (SpGEMM) is one of the most fundamental
building blocks in sparse linear solvers, graph processing frameworks and machine learning …

GraphBLAST: A high-performance linear algebra-based graph framework on the GPU

C Yang, A Buluç, JD Owens - ACM Transactions on Mathematical …, 2022 - dl.acm.org
High-performance implementations of graph algorithms are challenging to implement on
new parallel hardware such as GPUs because of three challenges:(1) the difficulty of coming …

Design principles for sparse matrix multiplication on the gpu

C Yang, A Buluç, JD Owens - European Conference on Parallel …, 2018 - Springer
We implement two novel algorithms for sparse-matrix dense-matrix multiplication (SpMM) on
the GPU. Our algorithms expect the sparse input in the popular compressed-sparse-row …

Fast linear algebra-based triangle counting with kokkoskernels

MM Wolf, M Deveci, JW Berry… - 2017 IEEE High …, 2017 - ieeexplore.ieee.org
Triangle counting serves as a key building block for a set of important graph algorithms in
network science. In this paper, we address the IEEE HPEC Static Graph Challenge problem …

Scaling up network centrality computations–A brief overview

A van der Grinten, E Angriman… - it-Information …, 2020 - degruyter.com
Network science methodology is increasingly applied to a large variety of real-world
phenomena, often leading to big network data sets. Thus, networks (or graphs) with millions …

Combinatorial BLAS 2.0: Scaling combinatorial algorithms on distributed-memory systems

A Azad, O Selvitopi, MT Hussain… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Combinatorial algorithms such as those that arise in graph analysis, modeling of discrete
systems, bioinformatics, and chemistry, are often hard to parallelize. The Combinatorial …

Performance optimization, modeling and analysis of sparse matrix-matrix products on multi-core and many-core processors

Y Nagasaka, S Matsuoka, A Azad, A Buluç - Parallel Computing, 2019 - Elsevier
Sparse matrix-matrix multiplication (SpGEMM) is a computational primitive that is widely
used in areas ranging from traditional numerical applications to recent big data analysis and …

FastSV: A distributed-memory connected component algorithm with fast convergence

Y Zhang, A Azad, Z Hu - Proceedings of the 2020 SIAM Conference on …, 2020 - SIAM
This paper presents a new distributed-memory algorithm called FastSV for finding
connected components in an undirected graph. Our algorithm simplifies the classic Shiloach …

LAGraph: A community effort to collect graph algorithms built on top of the GraphBLAS

T Mattson, TA Davis, M Kumar, A Buluc… - 2019 IEEE …, 2019 - ieeexplore.ieee.org
In 2013, we released a position paper to launch a community effort to define a common set
of building blocks for constructing graph algorithms in the language of linear algebra. This …