Sparse convolutional neural networks

B Liu, M Wang, H Foroosh… - Proceedings of the …, 2015 - openaccess.thecvf.com
Deep neural networks have achieved remarkable performance in both image classification
and object detection problems, at the cost of a large number of parameters and …

Implementing sparse matrix-vector multiplication on throughput-oriented processors

N Bell, M Garland - Proceedings of the conference on high performance …, 2009 - dl.acm.org
Sparse matrix-vector multiplication (SpMV) is of singular importance in sparse linear
algebra. In contrast to the uniform regularity of dense linear algebra, sparse operations …

[HTML][HTML] The landscape of parallel computing research: A view from berkeley

K Asanovic, R Bodik, B Catanzaro, J Gebis… - 2006 - escholarship.org
The recent switch to parallel microprocessors is a milestone in the history of computing.
Industry has laid out a roadmap for multicore designs that preserves the programming …

Auto-tuning a high-level language targeted to GPU codes

S Grauer-Gray, L Xu, R Searles… - 2012 innovative …, 2012 - ieeexplore.ieee.org
Determining the best set of optimizations to apply to a kernel to be executed on the graphics
processing unit (GPU) is a challenging problem. There are large sets of possible …

[PDF][PDF] Efficient sparse matrix-vector multiplication on CUDA

N Bell, M Garland - 2008 - twiki.di.uniroma1.it
The massive parallelism of graphics processing units (GPUs) offers tremendous
performance in many high-performance computing applications. While dense linear algebra …

Optimization of sparse matrix-vector multiplication on emerging multicore platforms

S Williams, L Oliker, R Vuduc, J Shalf, K Yelick… - Proceedings of the …, 2007 - dl.acm.org
We are witnessing a dramatic change in computer architecture due to the multicore
paradigm shift, as every electronic device from cell phones to supercomputers confronts …

Parallel sparse matrix-vector and matrix-transpose-vector multiplication using compressed sparse blocks

A Buluç, JT Fineman, M Frigo, JR Gilbert… - Proceedings of the …, 2009 - dl.acm.org
This paper introduces a storage format for sparse matrices, called compressed sparse
blocks (CSB), which allows both Ax and A, x to be computed efficiently in parallel, where A is …

SPIRAL: Code generation for DSP transforms

M Puschel, JMF Moura, JR Johnson… - Proceedings of the …, 2005 - ieeexplore.ieee.org
Fast changing, increasingly complex, and diverse computing platforms pose central
problems in scientific computing: How to achieve, with reasonable effort, portable optimal …

A recursive algebraic coloring technique for hardware-efficient symmetric sparse matrix-vector multiplication

C Alappat, A Basermann, AR Bishop… - ACM Transactions on …, 2020 - dl.acm.org
The symmetric sparse matrix-vector multiplication (SymmSpMV) is an important building
block for many numerical linear algebra kernel operations or graph traversal applications …

Model-driven autotuning of sparse matrix-vector multiply on GPUs

JW Choi, A Singh, RW Vuduc - ACM sigplan notices, 2010 - dl.acm.org
We present a performance model-driven framework for automated performance tuning
(autotuning) of sparse matrix-vector multiply (SpMV) on systems accelerated by graphics …