Integrating profile-driven parallelism detection and machine-learning-based mapping

Z Wang, G Tournavitis, B Franke… - ACM Transactions on …, 2014 - dl.acm.org
Compiler-based auto-parallelization is a much-studied area but has yet to find widespread
application. This is largely due to the poor identification and exploitation of application …

T4: Compiling sequential code for effective speculative parallelization in hardware

VA Ying, MC Jeffrey, D Sanchez - 2020 ACM/IEEE 47th Annual …, 2020 - ieeexplore.ieee.org
Multicores are now ubiquitous, but programmers still write sequential code. Speculative
parallelization is an enticing approach to parallelize code while retaining the ease of …

Perspective: A sensible approach to speculative automatic parallelization

S Apostolakis, Z Xu, G Chan, S Campanoni… - Proceedings of the …, 2020 - dl.acm.org
The promise of automatic parallelization, freeing programmers from the error-prone and time-
consuming process of making efficient use of parallel processing resources, remains …

Aggressive pipelining of irregular applications on reconfigurable hardware

Z Li, L Liu, Y Deng, S Yin, Y Wang, S Wei - Proceedings of the 44th …, 2017 - dl.acm.org
CPU-FPGA heterogeneous platforms offer a promising solution for high-performance and
energy-efficient computing systems by providing specialized accelerators with post-silicon …

Exploiting commutativity to reduce the cost of updates to shared data in cache-coherent systems

G Zhang, W Horn, D Sanchez - … of the 48th International Symposium on …, 2015 - dl.acm.org
We present Coup, a technique to lower the cost of updates to shared data in cache-coherent
systems. Coup exploits the insight that many update operations, such as additions and …

Automatically exploiting cross-invocation parallelism using runtime information

J Huang, TB Jablin, SR Beard… - Proceedings of the …, 2013 - ieeexplore.ieee.org
Automatic parallelization is a promising approach to producing scalable multi-threaded
programs for multicore architectures. Many existing automatic techniques only parallelize …

Practically correct,{Just-in-Time} shell script parallelization

K Kallas, T Mustafa, J Bielak, D Karnikis… - … USENIX Symposium on …, 2022 - usenix.org
Recent shell-script parallelization systems enjoy mostly automated parallel speedups by
compiling scripts ahead-of-time. Unfortunately, such static parallelization is hampered by the …

SCAF: a speculation-aware collaborative dependence analysis framework

S Apostolakis, Z Xu, Z Tan, G Chan… - Proceedings of the 41st …, 2020 - dl.acm.org
Program analysis determines the potential dataflow and control flow relationships among
instructions so that compiler optimizations can respect these relationships to transform code …

PROMPT: A Fast and Extensible Memory Profiling Framework

Z Xu, Y Chon, Y Su, Z Tan, S Apostolakis… - Proceedings of the …, 2024 - dl.acm.org
Memory profiling captures programs' dynamic memory behavior, assisting programmers in
debugging, tuning, and enabling advanced compiler optimizations like speculation-based …

Discovery and exploitation of general reductions: A constraint based approach

P Ginsbach, MFP O'Boyle - 2017 IEEE/ACM International …, 2017 - ieeexplore.ieee.org
Discovering and exploiting scalar reductions in programs has been studied for many years.
The discovery of more complex reduction operations has, however, received less attention …