Pangulu: A scalable regular two-dimensional block-cyclic sparse direct solver on distributed heterogeneous systems

X Fu, B Zhang, T Wang, W Li, Y Lu, E Yi… - Proceedings of the …, 2023 - dl.acm.org
Sparse direct solvers play a vital role in large-scale high performance computing in science
and engineering. Existing distributed sparse direct methods employ multifrontal/supernodal …

Performance-Driven Analog Layout Automation: Current Status and Future Directions

P Xu, J Li, TY Ho, B Yu, K Zhu - 2024 29th Asia and South …, 2024 - ieeexplore.ieee.org
Optimizing circuit performance presents a pivotal challenge in the realm of automatic analog
physical design. The intricacy of analog performance arises from its sensitivity to layout …

HASpGEMM: Heterogeneity-Aware Sparse General Matrix-Matrix Multiplication on Modern Asymmetric Multicore Processors

H Cheng, W Li, Y Lu, W Liu - … of the 52nd International Conference on …, 2023 - dl.acm.org
Sparse general matrix-matrix multiplication (SpGEMM) is an important kernel in
computational science and engineering, and has been widely studied on homogeneous …

Mille-feuille: A tile-grained mixed precision single-kernel conjugate gradient solver on gpus

D Yang, Y Zhao, Y Niu, W Jia, E Shao… - … Conference for High …, 2024 - ieeexplore.ieee.org
Conjugate gradient (CG) and biconjugate gradient stabilized (BiCGSTAB) are effective
methods used for solving sparse linear systems. We in this paper propose Mille-feuille, a …

Tilesptrsv: a tiled algorithm for parallel sparse triangular solve on gpus

Z Lu, W Liu - CCF Transactions on High Performance Computing, 2023 - Springer
Sparse triangular solve (SpTRSV) is one of the most important level-2 kernels in sparse
basic linear algebra subprograms (BLAS). Compared to another level-2 sparse BLAS kernel …

Machine Learning and GPU Accelerated Sparse Linear Solvers for Transistor-Level Circuit Simulation: A Perspective Survey

Z Jin, W Li, Y Bai, T Wang, Y Lu… - 2024 29th Asia and …, 2024 - ieeexplore.ieee.org
Sparse linear solvers play a crucial role in transistor-level circuit simulation, especially for
large-scale post-layout circuit simulation when considering complex parasitic effects. As …

[PDF][PDF] Csp: Comprehensively-sparsified preconditioner for efficient nonlinear circuit simulation

Y Zhao, X Yang, Y Bai, L Zeng, D Niu, W Liu, Z Jin - ICCAD, 2024 - ssslab.cn
Solving sparse linear systems dominates the simulation time for nonlinear integrated
circuits. Developing an effective preconditioner is crucial for accelerating the iterative solver …

Accelerating Large-Scale Sparse LU Factorization for RF Circuit Simulation

G Feng, H Wang, Z Guo, M Li, T Zhao, Z Jin… - … Conference on Parallel …, 2024 - Springer
Sparse LU factorization is the indispensable building block of the circuit simulation, and
dominates the simulation time, especially when dealing with large-scale circuits. Radio …

Towards faster and robust solution for dynamic LR and QR factorization

F Zhuang, H He, A Ye, L Zou - Scientific Reports, 2024 - nature.com
Dynamic LR and QR factorization are fundamental problems that exist widely in the control
field. However, the existing solutions under noises are lack of convergence speed and anti …

Efficient Hardware Accelerator Based on Medium Granularity Dataflow for SpTRSV

Q Chen, X Yang, S Lu - IEEE Transactions on Very Large Scale …, 2024 - ieeexplore.ieee.org
Sparse triangular solve (SpTRSV) is widely used in various domains. Numerous studies
have been conducted using CPUs, GPUs, and specific hardware accelerators, where …