In this paper, we present the Habanero-Java (HJ) language developed at Rice University as an extension to the original Java-based definition of the X10 language. HJ includes a …
V Sarkar, W Harrod, AE Snavely - Journal of Physics: Conference …, 2009 - iopscience.iop.org
Computer systems anticipated in the 2015–2020 timeframe are referred to as Extreme Scale because they will be built using massive multi-core processors with hundreds of cores per chip …
Extreme Scale processors containing hundreds or even thousands of cores will challenge current operating system (OS) practices. Many of the fundamental assumptions that underlie …
JA Stratton, V Grover, J Marathe, B Aarts… - Proceedings of the 8th …, 2010 - dl.acm.org
In this paper we describe techniques for compiling fine-grained SPMD-threaded programs, expressed in programming models such as OpenCL or CUDA, to multicore execution …
A Susungi, C Tadonki - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
While compilers generally support parallel programming languages and APIs, their internal program representations are mostly designed from the standpoint of sequential programs …
R Han, J Lee, J Sim, H Kim - ACM Transactions on Architecture and …, 2022 - dl.acm.org
As CUDA becomes the de facto programming language among data parallel applications such as high-performance computing or machine learning applications, running CUDA on …
The growing complexity of multi-core architectures has motivated a wide range of software mechanisms to improve the orchestration of parallel executions. Task parallelism has …
Asynchronous task-based programming models are gaining popularity to address the programmability and performance challenges in high performance computing. One of the …
Lazy scheduling is a runtime scheduler for task-parallel codes that effectively coarsens parallelism on load conditions in order to significantly reduce its overheads compared to …