BestSF: a sparse meta-format for optimizing SpMV on GPU

A Benatia, W Ji, Y Wang, F Shi - ACM Transactions on Architecture and …, 2018 - dl.acm.org
The Sparse Matrix-Vector Multiplication (SpMV) kernel dominates the computing cost in
numerous scientific applications. Many implementations based on different sparse formats …

ParILUT-a parallel threshold ILU for GPUs

H Anzt, T Ribizel, G Flegar, E Chow… - 2019 IEEE …, 2019 - ieeexplore.ieee.org
In this paper, we present the first algorithm for computing threshold ILU factorizations on
GPU architectures. The proposed ParILUT-GPU algorithm is based on interleaving parallel …

MAGMA: Enabling exascale performance with accelerated BLAS and LAPACK for diverse GPU architectures

A Abdelfattah, N Beams, R Carson… - … Journal of High …, 2024 - journals.sagepub.com
MAGMA (Matrix Algebra for GPU and Multicore Architectures) is a pivotal open-source
library in the landscape of GPU-enabled dense and sparse linear algebra computations …

A hybrid CPU/GPU approach for the parallel algebraic recursive multilevel solver pARMS

A Jamal, M Baboulin, A Khabou… - … on Symbolic and …, 2016 - ieeexplore.ieee.org
We illustrate how the distributed parallel Algebraic Recursive Multilevel Solver based on
MPI can be adapted for heterogeneous CPU/GPU architectures. The tasks performed on the …

Xvpfloat: RISC-V ISA Extension for Variable Extended Precision Floating Point Computation

E Guthmuller, C Fuguet, A Bocco… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
A key concern in the field of scientific computation is the convergence of numerical solvers
when applied to large problems. The numerical workarounds used to improve convergence …

[HTML][HTML] 熔盐堆堆芯流体力学计算的GPU 并行方法研究

胡传伟, 鄂彦志, 邹杨, 徐洪杰 - 核技术, 2017 - xml-data.org
使用计算流体力学(Computational Fluid Dynamics, CFD) 数值方法对熔盐堆堆芯的流动和
热传导等相关物理问题进行模拟求解, 需要大量的计算时间. 利用图形处理器(Graphics …

A parallel iterative solver for large sparse linear systems enhanced with randomization and GPU accelerator, and its resilience to soft errors

A Jamal - 2017 - theses.hal.science
In this PhD thesis, we address three challenges faced by linear algebra solvers in the
perspective of future exascale systems: accelerating convergence using innovative …

[PDF][PDF] GPU Acceleration at Scale with OpenPower platforms in Code Saturne

S Antao, C Moulinec, Y Fournier, R Sawko, M Zimon… - 2018 - sc18.supercomputing.org
Samuel Antao1, Charles Moulinec2, Yvan Fournier3, Robert Sawko1, Malgorzata Zimon1,
Christopher Thompson1, Alex Skillen2, Juan U Page 1 GPU Acceleration at Scale with …

[PDF][PDF] Implementing Finite Differ ence Schemes on Graphic Processing Units

P Lippmann - 2022 - repository.tudelft.nl
The continued development of improved algorithms and architecture for numerical
simulations is at the core of increased computational performance and, therefore, the ability …

Extreme-scale Algorithms and Solver Resilience

J Dongarra - 2016 - osti.gov
A widening gap exists between the peak performance of high-performance computers and
the performance achieved by complex applications running on these platforms. Over the …