Many-core compiler fuzzing

C Lidbury, A Lascu, N Chong, AF Donaldson - ACM SIGPLAN Notices, 2015 - dl.acm.org
We address the compiler correctness problem for many-core systems through novel
applications of fuzz testing to OpenCL compilers. Focusing on two methods from prior work …

A compiler for throughput optimization of graph algorithms on GPUs

S Pai, K Pingali - Proceedings of the 2016 ACM SIGPLAN International …, 2016 - dl.acm.org
Writing high-performance GPU implementations of graph algorithms can be challenging. In
this paper, we argue that three optimizations called throughput optimizations are key to high …

Sound and partially-complete static analysis of data-races in gpu programs

D Liew, T Cogumbreiro, J Lange - Proceedings of the ACM on …, 2024 - dl.acm.org
GPUs are progressively being integrated into modern society, playing a pivotal role in
Artificial Intelligence and High-Performance Computing. Programmers need a deep …

The design and implementation of a verification technique for GPU kernels

A Betts, N Chong, AF Donaldson, J Ketema… - ACM Transactions on …, 2015 - dl.acm.org
We present a technique for the formal verification of GPU kernels, addressing two classes of
correctness properties: data races and barrier divergence. Our approach is founded on a …

CURD: a dynamic CUDA race detector

Y Peng, V Grover, J Devietti - ACM SIGPLAN Notices, 2018 - dl.acm.org
As GPUs have become an integral part of nearly every pro-cessor, GPU programming has
become increasingly popular. GPU programming requires a combination of extreme levels …

Fast and precise symbolic analysis of concurrency bugs in device drivers (t)

P Deligiannis, AF Donaldson… - 2015 30th IEEE/ACM …, 2015 - ieeexplore.ieee.org
Concurrency errors, such as data races, make device drivers notoriously hard to develop
and debug without automated tool support. We present Whoop, a new automated approach …

Barracuda: Binary-level analysis of runtime races in cuda programs

A Eizenberg, Y Peng, T Pigli, W Mansky… - Proceedings of the 38th …, 2017 - dl.acm.org
GPU programming models enable and encourage massively parallel programming with
over a million threads, requiring extreme parallelism to achieve good performance. Massive …

Memory access protocols: certified data-race freedom for GPU kernels

T Cogumbreiro, J Lange, D Liew, H Zicarelli - Formal Methods in System …, 2024 - Springer
GPUs offer parallelism as a commodity, but they are difficult to program correctly. Static
analyzers that guarantee data-race freedom (DRF) are essential to help programmers …

IGUARD: In-GPU advanced race detection

AK Kamath, A Basu - Proceedings of the ACM SIGOPS 28th Symposium …, 2021 - dl.acm.org
Newer use cases of GPU (Graphics Processing Unit) computing, eg, graph analytics, look
less like traditional bulk-synchronous GPU programs. To cater to the needs of emerging …

Verification of producer-consumer synchronization in GPU programs

R Sharma, M Bauer, A Aiken - ACM SIGPLAN Notices, 2015 - dl.acm.org
Previous efforts to formally verify code written for GPUs have focused solely on kernels
written within the traditional data-parallel GPU programming model. No previous work has …