A parallel pattern for iterative stencil+ reduce

M Aldinucci, M Danelutto, M Drocco, P Kilpatrick… - The Journal of …, 2018 - Springer
We advocate the Loop-of-stencil-reduce pattern as a means of simplifying the
implementation of data-parallel programs on heterogeneous multi-core platforms. Loop-of …

High-level programming of stencil computations on multi-GPU systems using the SkelCL library

M Steuwer, M Haidl, S Breuer… - Parallel Processing …, 2014 - World Scientific
The implementation of stencil computations on modern, massively parallel systems with
GPUs and other accelerators currently relies on manually-tuned coding using low-level …

Harnessing parallelism in multi/many-cores with streams and parallel patterns

M Torquati - 2019 - tesidottorato.depositolegale.it
Multi-core computing systems are becoming increasingly parallel and heterogeneous.
Parallelism exploitation is today the primary instrument for improving application …

Design of an adaptive GPU sharing and scheduling scheme in container-based cluster

Q Chen, J Oh, S Kim, Y Kim - Cluster Computing, 2020 - Springer
Container based virtualization is an innovative technology that accelerates software
development by providing portability and maintainability of applications. Recently, a growing …

Auto-tuning Hybrid CPU-GPU Execution of Algorithmic Skeletons in SkePU

T Öhberg - 2018 - diva-portal.org
The trend in computer architectures has for several years been heterogeneous systems
consisting of a regular CPU and at least one additional, specialized processing unit, such as …

The loop-of-stencil-reduce paradigm

M Aldinucci, M Danelutto, M Drocco… - 2015 IEEE Trustcom …, 2015 - ieeexplore.ieee.org
In this paper we advocate the Loop-of-stencil-reduce pattern as a way to simplify the parallel
programming of heterogeneous platforms (multicore+ GPUs). Loop-of-Stencil-reduce is …

[PDF][PDF] Towards an efficient parallel skeleton for generic iterative stencil computations in distributed gpus

M de Castro… - Supercomputing …, 2021 - sc21.supercomputing.org
Iterative stencil computations, or Iterative Stencil Loops (ISLs), are a class of computations
that update the elements of an array using the values of the previous iteration for some …

[PDF][PDF] Area exam: General-purpose performance portable programming models for productive exascale computing

A Johnson - University of Oregon, Eugene, OR, USA. Area Exam …, 2020 - cs.uoregon.edu
Modern supercomputer architectures have grown increasingly complex and diverse since
the end of Moore's law in the mid-2000s, and are far more difficult to program than their …

EPSILOD: efficient parallel skeleton for generic iterative stencil computations in distributed GPUs

M de Castro, I Santamaria-Valenzuela, Y Torres… - The Journal of …, 2023 - Springer
Iterative stencil computations are widely used in numerical simulations. They present a high
degree of parallelism, high locality and mostly-coalesced memory access patterns …

Toward an adaptive fair GPU sharing scheme in container-based clusters

J Oh, S Kim, Y Kim - 2018 IEEE 3rd International Workshops on …, 2018 - ieeexplore.ieee.org
Virtualization is an innovative technology that accelerates software development by
providing portability and maintainability of applications. However, it often leads …