An efficient CUDA implementation of the tree-based Barnes Hut n-body algorithm

M Burtscher, K Pingali - GPU Computing Gems Emerald Edition, 2011 - Elsevier
This chapter describes the first CUDA implementation of the classical
Barnes Hut n-body algorithm that runs entirely on the GPU. The Barnes Hut force-calculation …

A sparse octree gravitational N-body code that runs entirely on the GPU processor

J Bédorf, E Gaburov, SP Zwart - Journal of Computational Physics, 2012 - Elsevier
We present the implementation and performance of a new gravitational N-body tree-code
that is specifically designed for the graphics processing unit (GPU). All parts of the tree …

Implementation and performance of FDPS: a framework for developing parallel particle simulation codes

M Iwasawa, A Tanikawa, N Hosono… - Publications of the …, 2016 - academic.oup.com
We present the basic idea, implementation, measured performance, and performance model
of FDPS (Framework for Developing Particle Simulators). FDPS is an application …

42 TFlops hierarchical N-body simulations on GPUs with applications in both astrophysics and turbulence

T Hamada, T Narumi, R Yokota, K Yasuoka… - Proceedings of the …, 2009 - dl.acm.org
As an entry for the 2009 Gordon Bell price/performance prize, we present the results of two
different hierarchical N-body simulations on a cluster of 256 graphics processing units …

Petascale turbulence simulation using a highly parallel fast multipole method on GPUs

R Yokota, LA Barba, T Narumi, K Yasuoka - Computer Physics …, 2013 - Elsevier
This paper reports large-scale direct numerical simulations of homogeneous-isotropic fluid
turbulence, achieving sustained performance of 1.08 petaflop/s on GPU hardware using …

190 TFlops astrophysical N-body simulation on a cluster of GPUs

T Hamada, K Nitadori - SC'10: Proceedings of the 2010 ACM …, 2010 - ieeexplore.ieee.org
We present the results of a hierarchical N-body simulation on DEGIMA, a cluster of PCs with
576 graphic processing units (GPUs) and using an InfiniBand interconnect. DEGIMA stands …

Optimized parallelization of boundary integral Poisson-Boltzmann solvers

X Yang, E Sliheet, R Iriye, D Reynolds… - Computer Physics …, 2024 - Elsevier
The Poisson-Boltzmann (PB) model governs the electrostatics of solvated
biomolecules, i.e., potential, field, energy, and force. These quantities can provide useful …

The Size Scale of Star Clusters

JP Madrid, JR Hurley, AC Sippel - The Astrophysical Journal, 2012 - iopscience.iop.org
Direct N-body simulations of star clusters in a realistic Milky-Way-like potential are carried
out using the code NBODY6. Based on these simulations, a new relationship between scale …

A pilgrimage to gravity on GPUs

J Bédorf, S Portegies Zwart - The European Physical Journal Special …, 2012 - Springer
In this short review we present the developments over the last 5 decades that have led to the
use of Graphics Processing Units (GPUs) for astrophysical simulations. Since the …

Simulate the aerodynamic olfactory effects of gas-sensitive UAVs: A numerical model and its parallel implementation

B Luo, QH Meng, JY Wang, SG Ma - Advances in Engineering Software, 2016 - Elsevier
The attempts of robot implementations and simulation framework designs for the solution to
three dimensional (3D) robot active olfaction problems are reviewed. A numerical model is …