A survey of methods for analyzing and improving GPU energy efficiency

S Mittal, JS Vetter - ACM Computing Surveys (CSUR), 2014 - dl.acm.org
Recent years have witnessed phenomenal growth in the computational capabilities and
applications of GPUs. However, this trend has also led to a dramatic increase in their power …

Exploring weak scalability for FEM calculations on a GPU-enhanced cluster

D Göddeke, R Strzodka, J Mohd-Yusof, P McCormick… - Parallel Computing, 2007 - Elsevier
The first part of this paper surveys co-processor approaches for commodity based clusters in
general, not only with respect to raw performance, but also in view of their system integration …

Exposing fine-grained parallelism in algebraic multigrid methods

N Bell, S Dalton, LN Olson - SIAM Journal on Scientific Computing, 2012 - SIAM
Algebraic multigrid methods for large, sparse linear systems are a necessity in many
computational simulations, yet parallel algorithms for such solvers are generally …

Assembly of finite element methods on graphics processors

C Cecka, AJ Lew, E Darve - International journal for numerical …, 2011 - Wiley Online Library
Recently, graphics processing units (GPUs) have had great success in accelerating many
numerical computations. We present their application to computations on unstructured …

[PDF][PDF] A Parallel Multigrid Poisson Solver for Fluids Simulation on Large Grids.

A McAdams, E Sifakis, J Teran - Symposium on Computer …, 2010 - graphics.cs.wisc.edu
We present a highly efficient numerical solver for the Poisson equation on irregular
voxelized domains supporting an arbitrary mix of Neumann and Dirichlet boundary …

CFD-based analysis and two-level aerodynamic optimization on graphics processing units

IC Kampolis, XS Trompoukis, VG Asouti… - Computer Methods in …, 2010 - Elsevier
This paper presents the porting of 2D and 3D Navier–Stokes equations solvers for
unstructured grids, from the CPU to the graphics processing unit (GPU; NVIDIA's Ge-Force …

[图书][B] Introduction to high performance scientific computing

V Eijkhout - 2010 - books.google.com
Page 1 - ºf Eijkhou Page 2 Introduction to High Performance Scientific Computing Victor Eijkhout
with Edmond Chow, Robert van de Geijn 2nd edition, revision 2015 Page 3 Introduction to …

GPU-accelerated sparse matrix-matrix multiplication by iterative row merging

F Gremse, A Hofter, LO Schwen, F Kiessling… - SIAM Journal on …, 2015 - SIAM
We present an algorithm for general sparse matrix-matrix multiplication (SpGEMM) on many-
core architectures, such as GPUs. SpGEMM is implemented by iterative row merging, similar …

Методы ускорения газодинамических расчетов на неструктурированных сетках

КН Волков, ЮН Дерюгин, ВН Емельянов… - 2014 - elibrary.ru
Развиваются методы ускорения сходимости итерационного процесса, основанные на
использовании геометрических и алгебраических многосеточных технологий …

Unsteady CFD computations using vertex‐centered finite volumes for unstructured grids on graphics processing units

VG Asouti, XS Trompoukis, IC Kampolis… - … Methods in Fluids, 2011 - Wiley Online Library
This paper presents a Navier–Stokes solver for steady and unsteady turbulent flows on
unstructured/hybrid grids, with triangular and quadrilateral elements, which was …