F Broquedis, T Gautier, V Danjean - … on OpenMP, IWOMP 2012, Rome, Italy …, 2012 - Springer
To efficiently exploit high performance computing platforms, applications currently have to express more and more finer-grain parallelism. The OpenMP standard allows programmers …
Cluster-based architectures are increasingly being adopted to design embedded many-cores. These platforms can deliver very high peak performance within a contained power …
Multiprocessor systems-on-chip (MPSoC) are now considered first-class citizens both in the embedded systems and in the high-performance computing arenas, in the form of …
A Podobas, M Brorsson… - … and Computation: Practice …, 2015 - Wiley Online Library
Programmers today face a bewildering array of parallel programming models and tools, making it difficult to choose an appropriate one for each application. An increasingly popular …
In this work we present a highly efficient implementation of OpenMP tasks. It is based on a runtime infrastructure architected for data locality, a crucial prerequisite for exploiting the …
Y Zou, S Rajopadhye - IEEE Transactions on Parallel and …, 2017 - ieeexplore.ieee.org
Energy is now critical in all aspects of computing. We address a class of programs that includes so-called “stencil computations.” We address energy optimization of such …
In this work we propose a novel technique to reduce the overheads related to nested parallel loops in OpenMP programs. In particular we show that in many cases it is possible …
Programmers today face a bewildering array of parallel programming models and tools, making it difficult to choose an appropriate one for each application. The present study …
Modern designs for embedded systems are increasingly embracing cluster-based architectures, where small sets of cores communicate through tightly-coupled shared …