AmgX: A library for GPU accelerated algebraic multigrid and preconditioned iterative methods

M Naumov, M Arsaev, P Castonguay, J Cohen… - SIAM Journal on …, 2015 - SIAM
The solution of large sparse linear systems arises in many applications, such as
computational fluid dynamics and oil reservoir simulation. In realistic cases the matrices are …

Dark memory and accelerator-rich system optimization in the dark silicon era

A Pedram, S Richardson, M Horowitz… - IEEE Design & …, 2016 - ieeexplore.ieee.org
Unlike traditional dark silicon works that attack the computing logic, this article puts a focus
on the memory part, which dissipates most of the energy for memory-bound CPU …

[PDF][PDF] Vivace: a practical gauss-seidel method for stable soft body dynamics.

M Fratarcangeli, V Tibaldo, F Pellacini - ACM Trans. Graph., 2016 - academia.edu
The solution of large sparse systems of linear constraints is at the base of most interactive
solvers for physically-based animation of soft body dynamics. We focus on applications with …

High-performance parallel graph coloring with strong guarantees on work, depth, and quality

M Besta, A Carigiet, K Janda… - … Conference for High …, 2020 - ieeexplore.ieee.org
We develop the first parallel graph coloring heuristics with strong theoretical guarantees on
work and depth and coloring quality. The key idea is to design a relaxation of the vertex …

Parallel graph coloring for manycore architectures

M Deveci, EG Boman, KD Devine… - 2016 IEEE …, 2016 - ieeexplore.ieee.org
Graph algorithms are challenging to parallelize on manycore architectures due to complex
data dependencies and irregular memory access. We consider the well studied problem of …

GPU-based butterfly counting

Y Xia, F Zhang, Q Xu, M Zhang, Z Yao, L Lu, X Du… - The VLDB Journal, 2024 - Springer
When dealing with large bipartite graphs, butterfly counting is a crucial and time-consuming
operation. Graphics processing units (GPUs) are widely used parallel heterogeneous …

Preconditioned krylov solvers on GPUs

H Anzt, M Gates, J Dongarra, M Kreutzer, G Wellein… - Parallel Computing, 2017 - Elsevier
In this paper, we study the effect of enhancing GPU-accelerated Krylov solvers with
preconditioners. We consider the BiCGSTAB, CGS, QMR, and IDR (s) Krylov solvers. For a …

Efficient parallel implementations of sparse triangular solves for GPU architectures

R Li, C Zhang - Proceedings of the 2020 SIAM Conference on Parallel …, 2020 - SIAM
The sparse triangular matrix solve (SpTrSV) is an important computation kernel that is
demanded by a variety of numerical methods such as the Gauss-Seidel iterations. However …

Increasing the parallelism of graph coloring via shortcutting

G Alabandi, E Powers, M Burtscher - Proceedings of the 25th ACM …, 2020 - dl.acm.org
Graph coloring is an assignment of colors to the vertices of a graph such that no two
adjacent vertices get the same color. It is a key building block in many applications. Finding …

Hybrid parallel iterative sparse linear solver framework for reservoir geomechanical and flow simulation

L Gasparini, JRP Rodrigues, DA Augusto… - Journal of …, 2021 - Elsevier
We discuss new developments of a hybrid parallel iterative sparse linear solver framework
focused on petroleum reservoir flow and geomechanical simulation. It runs efficiently on …