作者
Simon Scheidegger, Dmitry Mikushin, Felix Kubler, Olaf Schenk
发表日期
2018/5/21
研讨会论文
2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS)
页码范围
610-619
出版商
IEEE
简介
We propose a massively parallelized and optimized framework to solve high-dimensional dynamic stochastic economic models on modern GPU-and KNL-based clusters. First, we introduce a novel approach for adaptive sparse grid index compression alongside a surplus matrix reordering, which significantly reduces the global memory throughput of the compute kernels and maps randomly accessed data onto cache or fast shared memory. Second, we fully vectorize the compute kernels for AVX, AVX2 and AVX512 CPUs, respectively. Third, we develop a hybrid cluster oriented work-preempting scheduler based on TBB, which evenly distributes the time iteration workload onto available CPU cores and accelerators. Numerical experiments on Cray XC40 KNL "Grand Tave" and on Cray XC50 "Piz Daint" systems at the Swiss National Supercomputer Centre (CSCS) show that our framework scales nicely to at least 4 …
引用总数
2019202020212022202324511
学术搜索中的文章
S Scheidegger, D Mikushin, F Kubler, O Schenk - 2018 IEEE International Parallel and Distributed …, 2018