[HTML][HTML] Scalability of an Eulerian-Lagrangian large-eddy simulation solver with hybrid MPI/OpenMP parallelisation

P Ouro, B Fraga, U Lopez-Novoa, T Stoesser - Computers & fluids, 2019 - Elsevier
Eulerian-Lagrangian approaches capable of accurately reproducing complex fluid flows are
becoming more and more popular due to the increasing availability and capacity of High …

OpenMP and MPI implementations of an elasto-viscoplastic fast Fourier transform-based micromechanical solver for fast crystal plasticity modeling

A Eghtesad, TJ Barrett, K Germaschewski… - … in Engineering Software, 2018 - Elsevier
We explore several parallel implementations of an elasto-viscoplastic fast Fourier transform
(EVPFFT) model using Message Passing Interface (MPI), OpenMP, and a hybrid of MPI and …

Empirical investigation: performance and power‐consumption based dual‐level model for exascale computing systems

MU Ashraf, FA Eassa, A Ahmad, A Algarni - IET Software, 2020 - Wiley Online Library
Exascale computing systems (ECS) are anticipated to perform at Exaflop speed (1018
operations per second) using power consumption< 20 MW. This ultrascale performance …

OpenMP and CUDA simulations of Sella Zerbino Dam break on unstructured grids

G Petaccia, F Leporati, E Torti - Computational Geosciences, 2016 - Springer
This paper presents two 2D dam break parallelized models based on shallow water
equations (SWE) written in conservative form. The models were implemented exploiting …

GPU optimization for high-quality kinetic fluid simulation

Y Chen, W Li, R Fan, X Liu - IEEE Transactions on Visualization …, 2021 - ieeexplore.ieee.org
Fluid simulations are often performed using the incompressible Navier-Stokes equations
(INSE), leading to sparse linear systems which are difficult to solve efficiently in parallel …

Spatiotemporal parallelization of an analytical heat conduction model for additive manufacturing via a hybrid OpenMP+ MPI approach

B Stump, A Plotkowski - Computational Materials Science, 2020 - Elsevier
The ability to do thermal simulations for entire additive manufacturing builds is a key
computational problem facing the additive manufacturing community; however, complex …

Flexible, scalable mesh and data management using PETSc DMPlex

M Lange, MG Knepley, GJ Gorman - arXiv preprint arXiv:1505.04633, 2015 - arxiv.org
Designing a scientific software stack to meet the needs of the next-generation of mesh-
based simulation demands, not only scalable and efficient mesh and data management on a …

GPU‐based polynomial finite element matrix assembly for simplex meshes

JS Mueller‐Roemer, A Stork - Computer Graphics Forum, 2018 - Wiley Online Library
In this paper, we present a matrix assembly technique for arbitrary polynomial order finite
element simulations on simplex meshes for graphics processing units (GPU). Compared to …

A gpu‐adapted structure for unstructured grids

R Zayer, M Steinberger, HP Seidel - Computer Graphics Forum, 2017 - Wiley Online Library
A key advantage of working with structured grids (eg, images) is the ability to directly tap into
the powerful machinery of linear algebra. This is not much so for unstructured grids where …

Reverse-mode algorithmic differentiation of an OpenMP-parallel compressible flow solver

J Hückelheim, P Hovland, MM Strout… - … Journal of High …, 2019 - journals.sagepub.com
Reverse-mode algorithmic differentiation (AD) is an established method for obtaining adjoint
derivatives of computer simulation applications. In computational fluid dynamics (CFD) …