A systematic survey of general sparse matrix-matrix multiplication

J Gao, W Ji, F Chang, S Han, B Wei, Z Liu… - ACM Computing …, 2023 - dl.acm.org
General Sparse Matrix-Matrix Multiplication (SpGEMM) has attracted much attention from
researchers in graph analyzing, scientific computing, and deep learning. Many optimization …

Productivity, performance, and portability for computational fluid dynamics applications

IZ Reguly, GR Mudalige - Computers & Fluids, 2020 - Elsevier
Hardware trends over the last decade show increasing complexity and heterogeneity in high
performance computing architectures, which presents developers of CFD applications with …

GraphBLAST: A high-performance linear algebra-based graph framework on the GPU

C Yang, A Buluç, JD Owens - ACM Transactions on Mathematical …, 2022 - dl.acm.org
High-performance implementations of graph algorithms are challenging to implement on
new parallel hardware such as GPUs because of three challenges:(1) the difficulty of coming …

Development of a novel simulator for modelling underground hydrogen and gas mixture storage

Z Cai, K Zhang, C Guo - International Journal of Hydrogen Energy, 2022 - Elsevier
Underground hydrogen storage can store grid-scale energy for balancing both short-term
and long-term inter-seasonal supply and demand. However, there is no numerical simulator …

[PDF][PDF] PyAMG: Algebraic multigrid solvers in python

N Bell, LN Olson, J Schroder - Journal of Open Source Software, 2022 - joss.theoj.org
The overarching goals of PyAMG include both readability and performance. This includes
readable implementations of popular variations of AMG (see the Methods section), the ability …

A GPU-based multilevel additive schwarz preconditioner for cloth and deformable body simulation

B Wu, Z Wang, H Wang - ACM Transactions on Graphics (TOG), 2022 - dl.acm.org
In this paper, we wish to push the limit of real-time cloth and deformable body simulation to a
higher level with 50K to 500K vertices, based on the development of a novel GPU-based …

Part-scale thermal process modeling for laser powder bed fusion with matrix-free method and GPU computing

F Dugast, P Apostolou, A Fernandez, W Dong… - Additive …, 2021 - Elsevier
This paper presents an efficient GPU-based part-scale thermal process simulator for laser
powder bed fusion (L-PBF) additive manufacturing (AM). To take full advantage of modern …

High-performance parallel graph coloring with strong guarantees on work, depth, and quality

M Besta, A Carigiet, K Janda… - … Conference for High …, 2020 - ieeexplore.ieee.org
We develop the first parallel graph coloring heuristics with strong theoretical guarantees on
work and depth and coloring quality. The key idea is to design a relaxation of the vertex …

Chemomechanical simulation of soap film flow on spherical bubbles

W Huang, J Iseringhausen, T Kneiphof, Z Qu… - ACM Transactions on …, 2020 - dl.acm.org
Soap bubbles are widely appreciated for their fragile nature and their colorful appearance.
The natural sciences and, in extension, computer graphics, have comprehensively studied …

Eulerian incompressible smoothed particle hydrodynamics on multiple GPUs

J O'connor, JM Domínguez, BD Rogers, SJ Lind… - Computer Physics …, 2022 - Elsevier
Recent advances in the development of Eulerian incompressible smoothed particle
hydrodynamics (EISPH), such as high-order convergence and natural coupling with …