SC23 Proceedings

The International Conference for High Performance Computing, Networking, Storage, and Analysis

Workshops Archive

symPACK: A GPU-Capable Fan-Out Sparse Cholesky Solver


Workshop: The 6th Annual Parallel Applications Workshop, Alternatives to MPI+X (PAW-ATM)

Authors: Julian Bellavita (Lawrence Berkeley National Laboratory (LBNL)); Mathias Jacquelin (Cerebras Systems); and Esmond Ng, Dan Bonachea, Johnny Corbino, and Paul H. Hargrove (Lawrence Berkeley National Laboratory (LBNL))


Abstract: Sparse symmetric positive definite systems of equations are ubiquitous in scientific workloads and applications. Parallel sparse Cholesky factorization is the method of choice for solving such linear systems. Therefore, the development of parallel sparse Cholesky codes that can efficiently run on today’s large-scale heterogeneous distributed-memory platforms is of vital importance. Modern supercomputers offer nodes that contain a mix of CPUs and GPUs. To fully utilize the computing power of these nodes, scientific codes must be adapted to offload expensive computations to GPUs.

We present symPACK, a GPU-capable parallel sparse Cholesky solver that uses one-sided communication primitives and remote procedure calls provided by the UPC++ library. We also utilize the UPC++ "memory kinds" feature to enable efficient communication of GPU-resident data. We show that on a number of large problems, symPACK outperforms comparable state-of-the-art GPU-capable Cholesky factorization codes by up to 14x on the NERSC Perlmutter supercomputer.





Back to The 6th Annual Parallel Applications Workshop, Alternatives to MPI+X (PAW-ATM) Archive Listing



Back to Full Workshop Archive Listing