CSR5: An efficient storage format for cross-platform sparse matrix-vector multiplication

W Liu, B Vinter - Proceedings of the 29th ACM on International …, 2015 - dl.acm.org
Sparse matrix-vector multiplication (SpMV) is a fundamental building block for numerous
applications. In this paper, we propose CSR5 (Compressed Sparse Row 5), a new storage …

Data reorganization in memory using 3D-stacked DRAM

B Akin, F Franchetti, JC Hoe - ACM SIGARCH Computer Architecture …, 2015 - dl.acm.org
In this paper we focus on common data reorganization operations such as shuffle,
pack/unpack, swap, transpose, and layout transformations. Although these operations …

Evaluation criteria for sparse matrix storage formats

D Langr, P Tvrdik - IEEE Transactions on parallel and …, 2015 - ieeexplore.ieee.org
When authors present new storage formats for sparse matrices, they usually focus mainly on
a single evaluation criterion, which is the performance of sparse matrix-vector multiplication …

Speculative segmented sum for sparse matrix-vector multiplication on heterogeneous processors

W Liu, B Vinter - Parallel Computing, 2015 - Elsevier
Sparse matrix-vector multiplication (SpMV) is a central building block for scientific software
and graph applications. Recently, heterogeneous processors composed of different types of …

Sparse tensor algebra as a parallel programming model

E Solomonik, T Hoefler - arXiv preprint arXiv:1512.00066, 2015 - arxiv.org
Dense and sparse tensors allow the representation of most bulk data structures in
computational science applications. We show that sparse tensor algebra can also be used …

Fast matrix-vector multiplications for large-scale logistic regression on shared-memory systems

MC Lee, WL Chiang, CJ Lin - 2015 IEEE International …, 2015 - ieeexplore.ieee.org
Shared-memory systems such as regular desktops now possess enough memory to store
large data. However, the training process for data classification can still be slow if we do not …

Locality-aware parallel sparse matrix-vector and matrix-transpose-vector multiplication on many-core processors

MO Karsavuran, K Akbudak… - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
Sparse matrix-vector and matrix-transpose-vector multiplication (SpMM TV) repeatedly
performed as z← AT x and y← A z (or y A w) for the same sparse matrix A is a kernel …

[PDF][PDF] Parallel and scalable sparse basic linear algebra subprograms

W Liu - 2015 - nbi.ku.dk
Sparse basic linear algebra subprograms (BLAS) are fundamental building blocks for
numerous scientific computations and graph applications. Compared with Dense BLAS …

Alinea: An advanced linear algebra library for massively parallel computations on graphics processing units

F Magoules, AKC Ahamed - The International Journal of …, 2015 - journals.sagepub.com
Direct and iterative methods are often used to solve linear systems in engineering. The
matrices involved can be large, which leads to heavy computations on the central …

GPU accelerated sparse matrix‐vector multiplication and sparse matrix‐transpose vector multiplication

Y Tao, Y Deng, S Mu, Z Zhang, M Zhu… - Concurrency and …, 2015 - Wiley Online Library
Many high performance computing applications require computing both sparse matrix‐
vector product (SMVP) and sparse matrix‐transpose vector product (SMTVP) for better …