Unified Communication Optimization Strategies for Sparse Triangular Solver on CPU and GPU Clusters

Y Liu, N Ding, P Sao, S Williams, XS Li - Proceedings of the International …, 2023 - dl.acm.org
This paper presents a unified communication optimization framework for sparse triangular
solve (SpTRSV) algorithms on CPU and GPU clusters. The framework builds upon a 3D …

Newly released capabilities in the distributed-memory SuperLU sparse direct solver

XS Li, P Lin, Y Liu, P Sao - ACM Transactions on Mathematical Software, 2023 - dl.acm.org
We present the new features available in the recent release of SuperLU_DIST, Version 8.1.
1. SuperLU_DIST is a distributed-memory parallel sparse direct solver. The new features …

Evaluating the potential of disaggregated memory systems for HPC applications

N Ding, P Maris, HA Nam, T Groves… - Concurrency and …, 2024 - Wiley Online Library
Disaggregated memory is a promising approach that addresses the limitations of traditional
memory architectures by enabling memory to be decoupled from compute nodes and …

Efficient block algorithms for parallel sparse triangular solve

Z Lu, Y Niu, W Liu - Proceedings of the 49th International Conference on …, 2020 - dl.acm.org
The sparse triangular solve (SpTRSV) kernel is an important building block for a number of
linear algebra routines such as sparse direct and iterative solvers. The major challenge of …

Parallel optimization and application of unstructured sparse triangular solver on new generation of sunway architecture

J Li, L Li, Q Wang, W Xue, J Liang, J Shi - Parallel Computing, 2024 - Elsevier
Large-scale sparse linear equation solver plays an important role in both numerical
simulation and artificial intelligence, and sparse triangular equation solver is a key step in …

On the effectiveness of random walks for modeling epidemics on networks

S Kim, J Breen, E Dudkina, F Poloni, E Crisostomi - Plos one, 2023 - journals.plos.org
Random walks on graphs are often used to analyse and predict epidemic spreads and to
investigate possible control actions to mitigate them. In this study, we first show that models …

A message-driven, multi-GPU parallel sparse triangular solver

N Ding, Y Liu, S Williams, XS Li - SIAM Conference on Applied and …, 2021 - SIAM
Sparse triangular solve is used in conjunction with Sparse LU for solving sparse linear
systems, either as a direct solver or as a preconditioner. As GPUs have become a first-class …

UNR: Unified Notifiable RMA Library for HPC

G Feng, J Xie, D Dong, Y Lu - SC24: International Conference …, 2024 - ieeexplore.ieee.org
Remote Memory Access (RMA) enables direct access to remote memory to achieve high
performance for HPC applications. However, most modern parallel programming models …

On the use of Markov chains for epidemic modeling on networks

S Kim, J Breen, E Dudkina, F Poloni… - arXiv preprint arXiv …, 2022 - arxiv.org
We discuss various models for epidemics on networks that rely on Markov chains. Random
walks on graphs are often used to predict epidemic spread and to investigate possible …

Methodology for Evaluating the Potential of Disaggregated Memory Systems

N Ding, S Williams, HA Nam, T Groves… - 2022 IEEE/ACM …, 2022 - ieeexplore.ieee.org
Tightly-coupled HPC systems have rigid memory allocation and can result in expensive
memory resource underutilization. As novel memory and network technologies mature …