Adaptive runtime tuning of parallel sparse matrix-vector multiplication on distributed memory systems

S Lee, R Eigenmann - Proceedings of the 22nd annual international …, 2008 - dl.acm.org
Sparse matrix-vector (SpMV) multiplication is a widely used kernel in scientific applications.
In these applications, the SpMV multiplication is usually deeply nested within multiple loops …

Efficient representation scheme for multidimensional array operations

CY Lin, JS Liu, YC Chung - IEEE Transactions on Computers, 2002 - ieeexplore.ieee.org
Array operations are used in a large number of important scientific codes. To implement
these array operations efficiently, many methods have been proposed in the literature, most …

Efficient data compression methods for multidimensional sparse array operations based on the EKMR scheme

CY Lin, YC Chung, JS Liu - IEEE Transactions on Computers, 2003 - ieeexplore.ieee.org
We have proposed the extended Karnaugh map representation (EKMR) scheme for
multidimensional array representation. We propose two data compression schemes, EKMR …

A novel multi–graphics processing unit parallel optimization framework for the sparse matrix‐vector multiplication

J Gao, Y Wang, J Wang - Concurrency and Computation …, 2017 - Wiley Online Library
The sparse matrix‐vector multiplication (SpMV) is of great importance in scientific
computations. Graphics processing unit (GPU)‐accelerated SpMVs for large‐sized problems …

Compiler optimizations for vector instructions

MK Gschwind, JS Ji, RI McIntosh, SJ Munroe… - US Patent …, 2017 - Google Patents
An optimizing compiler includes a vector optimization mechanism that optimizes vector
instructions by eliminating one or more vector element reverse operations. The compiler can …

Layered vector architecture compatibility for cross-system portability

MK Gschwind, RI McIntosh - US Patent 9,996,326, 2018 - Google Patents
An application that includes intrinsics defined in one architecture is to execute without
change on a different architecture. Program code that depends on vector element ordering is …

Efficient data parallel algorithms for multidimensional array operations based on the EKMR scheme for distributed memory multicomputers

CY Lin, YC Chung, JS Liu - IEEE Transactions on Parallel and …, 2003 - ieeexplore.ieee.org
Array operations are useful in a large number of important scientific codes, such as
molecular dynamics, finite element methods, climate modeling, atmosphere and ocean …

SpComp: A Sparsity Structure-Specific Compilation of Matrix Operations

B Basak, UP Khedker, S Biswas - arXiv preprint arXiv:2307.06109, 2023 - arxiv.org
Sparse matrix operations involve a large number of zero operands which makes most of the
operations redundant. The amount of redundancy magnifies when a matrix operation …

Compiler optimizations for vector operations that are reformatting-resistant

MK Gschwind, WJ Schmidt - US Patent 9,880,821, 2018 - Google Patents
An optimizing compiler includes a vector optimization mechanism that optimizes vector
operations that are reformatting-resistant, such as source instructions that do not have a …