OpenMP to GPGPU: a compiler framework for automatic translation and optimization

S Lee, SJ Min, R Eigenmann - ACM SIGPLAN Notices, 2009 - dl.acm.org
GPGPUs have recently emerged as powerful vehicles for general-purpose high-performance
computing. Although a new Compute Unified Device Architecture (CUDA) …

MIMD Programs Execution Support on SIMD Machines: A Holistic Survey

D Mustafa, R Alkhasawneh, F Obeidat… - IEEE Access, 2024 - ieeexplore.ieee.org
The Single Instruction Multiple Data (SIMD) architecture, supported by various high-performance
computing platforms, efficiently utilizes data-level parallelism. The SIMD model …

Integrated design of AES (Advanced Encryption Standard) encrypter and decrypter

CC Lu, SY Tseng - Proceedings IEEE International Conference …, 2002 - ieeexplore.ieee.org
This paper proposed a method of integrating the AES encrypter and the AES decrypter into a
full functional AES crypto-engine. This method can make it a very low-complexity …

Compiler and runtime support for enabling generalized reduction computations on heterogeneous parallel configurations

VT Ravi, W Ma, D Chiu, G Agrawal - Proceedings of the 24th ACM …, 2010 - dl.acm.org
A trend that has materialized, and has given rise to much attention, is of the increasingly
heterogeneous computing platforms. Presently, it has become very common for a desktop or …

GRace: a low-overhead mechanism for detecting data races in GPU programs

M Zheng, VT Ravi, F Qin, G Agrawal - ACM SIGPLAN Notices, 2011 - dl.acm.org
In recent years, GPUs have emerged as an extremely cost-effective means for achieving
high performance. Many application developers, including those with no prior parallel …

A virtual memory based runtime to support multi-tenancy in clusters with GPUs

M Becchi, K Sajjapongse, I Graves, A Procter… - Proceedings of the 21st …, 2012 - dl.acm.org
Graphics Processing Units (GPUs) are increasingly becoming part of HPC clusters.
Nevertheless, cloud computing services and resource management frameworks targeting …

Recent trends in software and hardware for GPGPU computing: a comprehensive survey

B Neelima, PS Raghavendra - 2010 5th International …, 2010 - ieeexplore.ieee.org
With the growth of Graphics Processor (GPU) programmability and processing power,
graphics hardware has become a compelling platform for computationally demanding tasks …

High-throughput analysis of large microscopy image datasets on CPU-GPU cluster platforms

G Teodoro, T Pan, TM Kurc, J Kong… - 2013 IEEE 27th …, 2013 - ieeexplore.ieee.org
Analysis of large pathology image datasets offers significant opportunities for the
investigation of disease morphology, but the resource requirements of analysis pipelines …

GMRace: Detecting data races in GPU programs via a low-overhead scheme

M Zheng, VT Ravi, F Qin… - IEEE Transactions on …, 2013 - ieeexplore.ieee.org
In recent years, GPUs have emerged as an extremely cost-effective means for achieving
high performance. While languages like CUDA and OpenCL have eased GPU programming …

Using shared memory to accelerate MapReduce on graphics processing units

F Ji, X Ma - 2011 IEEE International Parallel & Distributed …, 2011 - ieeexplore.ieee.org
Modern General Purpose Graphics Processing Units (GPGPUs) provide high degrees of
parallelism in computation and memory access, making them suitable for data parallel …