[图书][B] Efficient processing of deep neural networks

V Sze, YH Chen, TJ Yang, JS Emer - 2020 - Springer
This book provides a structured treatment of the key principles and techniques for enabling
efficient processing of deep neural networks (DNNs). DNNs are currently widely used for …

Challenges and advances in parallel sparse matrix-matrix multiplication

A Buluc, JR Gilbert - 2008 37th International Conference on …, 2008 - ieeexplore.ieee.org
We identify the challenges that are special to parallel sparse matrix-matrix multiplication
(PSpGEMM). We show that sparse algorithms are not as scalable as their dense …

Gemini: A {Computation-Centric} distributed graph processing system

X Zhu, W Chen, W Zheng, X Ma - 12th USENIX Symposium on Operating …, 2016 - usenix.org
Traditionally distributed graph processing systems have largely focused on scalability
through the optimizations of inter-node communication and load balance. However, they …

The tensor algebra compiler

F Kjolstad, S Kamil, S Chou, D Lugato… - Proceedings of the …, 2017 - dl.acm.org
Tensor algebra is a powerful tool with applications in machine learning, data analytics,
engineering and the physical sciences. Tensors are often sparse and compound operations …

Outerspace: An outer product based sparse matrix multiplication accelerator

S Pal, J Beaumont, DH Park… - … Symposium on High …, 2018 - ieeexplore.ieee.org
Sparse matrices are widely used in graph and data analytics, machine learning, engineering
and scientific applications. This paper describes and analyzes OuterSPACE, an accelerator …

Graphmat: High performance graph analytics made productive

N Sundaram, NR Satish, MMA Patwary… - arXiv preprint arXiv …, 2015 - arxiv.org
Given the growing importance of large-scale graph analytics, there is a need to improve the
performance of graph analysis frameworks without compromising on productivity. GraphMat …

A systematic survey of general sparse matrix-matrix multiplication

J Gao, W Ji, F Chang, S Han, B Wei, Z Liu… - ACM Computing …, 2023 - dl.acm.org
General Sparse Matrix-Matrix Multiplication (SpGEMM) has attracted much attention from
researchers in graph analyzing, scientific computing, and deep learning. Many optimization …

Faster cnns with direct sparse convolutions and guided pruning

J Park, S Li, W Wen, PTP Tang, H Li, Y Chen… - arXiv preprint arXiv …, 2016 - arxiv.org
Phenomenally successful in practical inference problems, convolutional neural networks
(CNN) are widely deployed in mobile devices, data centers, and even supercomputers. The …

Algorithm 1000: SuiteSparse: GraphBLAS: Graph algorithms in the language of sparse linear algebra

TA Davis - ACM Transactions on Mathematical Software (TOMS), 2019 - dl.acm.org
SuiteSparse: GraphBLAS is a full implementation of the GraphBLAS standard, which defines
a set of sparse matrix operations on an extended algebra of semirings using an almost …

Sparsetir: Composable abstractions for sparse compilation in deep learning

Z Ye, R Lai, J Shao, T Chen, L Ceze - Proceedings of the 28th ACM …, 2023 - dl.acm.org
Sparse tensors are rapidly becoming critical components of modern deep learning
workloads. However, developing high-performance sparse operators can be difficult and …