Stellar mergers with HPX-Kokkos and SYCL: methods of using an asynchronous many-task runtime system with SYCL

G Daiß, P Diehl, H Kaiser, D Pflüger - Proceedings of the 2023 …, 2023 - dl.acm.org
Ranging from NVIDIA GPUs to AMD GPUs and Intel GPUs: Given the heterogeneity of
available accelerator cards within current supercomputers, portability is a key aspect for …

From task-based GPU work aggregation to stellar mergers: Turning fine-grained CPU tasks into portable GPU kernels

G Daiß, P Diehl, D Marcello… - 2022 IEEE/ACM …, 2022 - ieeexplore.ieee.org
Meeting both scalability and performance portability requirements is a challenge for any
HPC application, especially for adaptively refined ones. In Octo-Tiger, an astrophysics …

Simulating stellar merger using HPX/Kokkos on A64FX on Supercomputer Fugaku

P Diehl, G Daiß, K Huck, D Marcello, S Shiber… - The Journal of …, 2024 - Springer
The increasing availability of machines relying on non-GPU architectures, such as ARM
A64FX in high-performance computing, provides a set of interesting challenges to …

Asynchronous-Many-Task Systems: Challenges and Opportunities--Scaling an AMR Astrophysics Code on Exascale machines using Kokkos and HPX

G Daiß, P Diehl, J Yan, JK Holmen, R Gayatri… - arXiv preprint arXiv …, 2024 - arxiv.org
Dynamic and adaptive mesh refinement is pivotal in high-resolution, multi-physics, multi-
model simulations, necessitating precise physics resolution in localized areas across …

From merging frameworks to merging stars: experiences using HPX, Kokkos and SIMD Types

G Daiß, SY Singanaboina, P Diehl… - 2022 IEEE/ACM 7th …, 2022 - ieeexplore.ieee.org
Octo-Tiger, a large-scale 3D AMR code for the merger of stars, uses a combination of HPX,
Kokkos and explicit SIMD types, aiming to achieve performance-portability for a broad range …

Broad performance measurement support for asynchronous multi-tasking with APEX

KA Huck - 2022 IEEE/ACM 7th International Workshop on …, 2022 - ieeexplore.ieee.org
APEX (Autonomic Performance Environment for eXascale) is a performance measurement
library for distributed, asynchronous multitasking runtime systems. It provides support for …

Evaluating HPX and Kokkos on RISC-V using an astrophysics application Octo-Tiger

P Diehl, G Daiss, S Brandt, A Kheirkhahan… - Proceedings of the SC' …, 2023 - dl.acm.org
In recent years, computers based on the RISC-V architecture have raised broad interest in
the high-performance computing (HPC) community. As the RISC-V community develops the …

Preparing for HPC on RISC-V: Examining Vectorization and Distributed Performance of an Astrophysics Application with HPX and Kokkos

P Diehl, P Syskakis, G Daiß, SR Brandt… - SC24-W: Workshops …, 2024 - ieeexplore.ieee.org
In recent years, interest in RISC-V computing architectures has moved from academic to
mainstream, especially in the field of High Performance Computing where energy limitations …

RAPIDS2 SciDAC Institute: Rutgers Final Report

M Parashar - 2021 - osti.gov
The Rutgers team transitioned to the University of Utah in 2021 and is continuing to be part
of and to contribute to the RAPIDS2 SciDAC Institute. This final report is for the contributions …