作者
Aydin Buluç, Jeremy T Fineman, Matteo Frigo, John R Gilbert, Charles E Leiserson
发表日期
2009/8/11
图书
Proceedings of the twenty-first annual symposium on Parallelism in algorithms and architectures
页码范围
233-244
简介
This paper introduces a storage format for sparse matrices, called compressed sparse blocks (CSB), which allows both Ax and A,x to be computed efficiently in parallel, where A is an n×n sparse matrix with nnzen nonzeros and x is a dense n-vector. Our algorithms use Θ(nnz) work (serial running time) and Θ(√nlgn) span (critical-path length), yielding a parallelism of Θ(nnz/√nlgn), which is amply high for virtually any large matrix. The storage requirement for CSB is the same as that for the more-standard compressed-sparse-rows (CSR) format, for which computing Ax in parallel is easy but A,x is difficult. Benchmark results indicate that on one processor, the CSB algorithms for Ax and A,x run just as fast as the CSR algorithm for Ax, but the CSB algorithms also scale up linearly with processors until limited by off-chip memory bandwidth.
引用总数
2008200920102011201220132014201520162017201820192020202120222023202433181118151926443354434751575823
学术搜索中的文章
A Buluç, JT Fineman, M Frigo, JR Gilbert, CE Leiserson - Proceedings of the twenty-first annual symposium on …, 2009