Amgt: Algebraic multigrid solver on tensor cores

Y Lu, L Zeng, T Wang, X Fu, W Li… - … Conference for High …, 2024 - ieeexplore.ieee.org
Algebraic multigrid (AMG) methods are particularly efficient to solve a wide range of sparse
linear systems, due to their good flexibility and adaptability. Even though modern parallel …

Mille-feuille: A tile-grained mixed precision single-kernel conjugate gradient solver on gpus

D Yang, Y Zhao, Y Niu, W Jia, E Shao… - … Conference for High …, 2024 - ieeexplore.ieee.org
Conjugate gradient (CG) and biconjugate gradient stabilized (BiCGSTAB) are effective
methods used for solving sparse linear systems. We in this paper propose Mille-feuille, a …

FP16 Acceleration in Structured Multigrid Preconditioner for Real-World Applications

Y Zong, P Yu, H Huang, W Xue - … of the 53rd International Conference on …, 2024 - dl.acm.org
Half-precision hardware support is now almost ubiquitous. In contrast to its active use in AI,
half-precision is less commonly employed in scientific and engineering computing. The …

Ginkgo-A math library designed to accelerate Exascale Computing Project science applications

T Cojean, P Nayak, T Ribizel… - … Journal of High …, 2024 - journals.sagepub.com
Large-scale simulations require efficient computation across the entire computing hierarchy.
A challenge of the Exascale Computing Project (ECP) was to reconcile highly …

Mixed-precision numerics in scientific applications: survey and perspectives

A Kashi, H Lu, W Brewer, D Rogers… - arXiv preprint arXiv …, 2024 - arxiv.org
The explosive demand for artificial intelligence (AI) workloads has led to a significant
increase in silicon area dedicated to lower-precision computations on recent high …

High-Performance, Scalable Geometric Multigrid via Fine-Grain Data Blocking for GPUs

O Antepara, S Williams, H Johansen… - SC24-W: Workshops of …, 2024 - ieeexplore.ieee.org
We present a performance study of geometric multigrid (GMG) on NVIDIA, AMD, and Intel
GPU-accelerated supercomputers. The approach employs fine-grain data blocking in …