A neural network-inspired matrix formulation of chemical kinetics for acceleration on GPUs

S Barwey, V Raman - Energies, 2021 - mdpi.com
High-fidelity simulations of turbulent flames are computationally expensive when using detailed chemical kinetics. For practical fuels and flow configurations, chemical kinetics can account for the vast majority of the computational time due to the highly non-linear nature of multi-step chemistry mechanisms and the inherent stiffness of combustion chemistry. While reducing this cost has been a key focus area in combustion modeling, the recent growth in graphics processing units (GPUs) that offer very fast arithmetic processing, combined with the development of highly optimized libraries for artificial neural networks (ANNs) used in machine learning, provides a unique pathway for acceleration. The goal of this paper is to recast Arrhenius kinetics as a neural network using matrix-based formulations. Unlike ANNs that rely on data, this formulation does not require training and exactly represents the chemistry mechanism. More specifically, connections between the exact matrix equations for kinetics and traditional ANN layers are used to enable the use of GPU-optimized linear algebra libraries without the need for modeling. Regarding GPU performance, speedup and saturation behaviors are assessed for several chemical mechanisms of varying complexity. The performance analysis is based on trends in absolute compute times and throughput for the various arithmetic operations encountered during the source term computation. The goals are ultimately to provide insights into how the source term calculations scale with the complexity of the reaction mechanism, which types of reactions benefit most from the GPU formulation, and how to exploit the matrix-based formulation to provide optimal speedup for large mechanisms by using sparsity properties. Overall, the GPU performance for the species source term evaluations reveals many informative trends with regard to the effect of cell number on device saturation and speedup. Most importantly, it is shown that the matrix-based method enables highly efficient GPU performance across the board, achieving near-peak performance in saturated regimes.
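To make the idea of recasting Arrhenius kinetics as matrix operations concrete, the following is a minimal NumPy sketch. It is not taken from the paper. It assumes irreversible elementary reactions only (no reverse rates, third-body or pressure-dependent terms), and the function names `rate_of_progress` and `molar_source_terms` as well as the two-reaction toy mechanism are hypothetical. Under these assumptions, taking logarithms turns the law of mass action into an affine map followed by an exponential, which has the same shape as a dense neural network layer with fixed weights, and the species source terms follow from one more matrix product batched over all cells.

```python
import numpy as np

R_GAS = 8.314462618  # universal gas constant, J/(mol K)


def rate_of_progress(T, C, log_A, beta, Ea, nu_react):
    """Rates of progress for irreversible elementary reactions, batched over cells.

    T        : (n_cells,) temperature in K
    C        : (n_cells, n_species) molar concentrations
    log_A    : (n_reactions,) log of the pre-exponential factors
    beta     : (n_reactions,) temperature exponents
    Ea       : (n_reactions,) activation energies in J/mol
    nu_react : (n_reactions, n_species) reactant stoichiometric coefficients
    """
    # "Input layer" per cell: [log T, -1/(R T), 1], shape (n_cells, 3).
    x = np.stack([np.log(T), -1.0 / (R_GAS * T), np.ones_like(T)], axis=1)
    # Fixed (untrained) weights built from the mechanism, shape (3, n_reactions).
    W = np.stack([beta, Ea, log_A], axis=0)
    # log k = beta * log T - Ea / (R T) + log A, as a single matrix product.
    log_k = x @ W
    # Guard against log(0); an absent reactant then yields a rate of ~0.
    log_C = np.log(np.maximum(C, 1e-300))
    # log q = log k + sum_s nu'_{r,s} * log C_s, again a matrix product.
    log_q = log_k + log_C @ nu_react.T
    # The exponential plays the role of the activation function.
    return np.exp(log_q)


def molar_source_terms(q, nu_react, nu_prod):
    """Molar production rates: omega = q @ (nu'' - nu'), shape (n_cells, n_species)."""
    return q @ (nu_prod - nu_react)


# Hypothetical toy mechanism with species [A, B, C]:
#   R1: A + B -> C      R2: C -> A + B
nu_react = np.array([[1.0, 1.0, 0.0],
                     [0.0, 0.0, 1.0]])
nu_prod = np.array([[0.0, 0.0, 1.0],
                    [1.0, 1.0, 0.0]])
log_A = np.log(np.array([10.0, 3.0]))
beta = np.array([0.0, 0.0])
Ea = np.array([0.0, 0.0])

T = np.array([1000.0])
C = np.array([[1.0, 2.0, 0.5]])
q = rate_of_progress(T, C, log_A, beta, Ea, nu_react)
omega = molar_source_terms(q, nu_react, nu_prod)
```

Because every step is a dense matrix product or an elementwise function over the cell dimension, the same code maps directly onto GPU linear algebra libraries by swapping the array backend. Storing the stoichiometric matrices in a sparse format would be a natural next step for large mechanisms, though that is not shown here.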