Pencil: A platform-neutral compute intermediate language for accelerator programming R Baghdadi, U Beaugnon, A Cohen, T Grosser, M Kruse, C Reddy, ... 2015 International Conference on Parallel Architecture and Compilation (PACT …, 2015 | 162 | 2015 |
High-performance generalized tensor operations: A compiler-oriented approach R Gareev, T Grosser, M Kruse ACM Transactions on Architecture and Code Optimization (TACO) 15 (3), 1-27, 2018 | 42 | 2018 |
Autotuning polybench benchmarks with llvm clang/polly loop optimization pragmas using bayesian optimization X Wu, M Kruse, P Balaprakash, H Finkel, P Hovland, V Taylor, M Hall Concurrency and Computation: Practice and Experience 34 (20), e6683, 2022 | 33 | 2022 |
Reduction drawing: Language constructs and polyhedral compilation for reductions on gpu C Reddy, M Kruse, A Cohen Proceedings of the 2016 International Conference on Parallel Architectures …, 2016 | 32 | 2016 |
Lattice QCD estimate of the decay rate D Becirevic, M Kruse, F Sanfilippo arXiv preprint arXiv:1411.6426, 2014 | 23 | 2014 |
Qiral: A high level language for lattice qcd code generation D Barthou, G Grosdidier, M Kruse, O Pene, C Tadonki arXiv preprint arXiv:1208.4035, 2012 | 18 | 2012 |
A polyhedral compilation framework for loops with dynamic data-dependent bounds J Zhao, M Kruse, A Cohen Proceedings of the 27th International Conference on Compiler Construction, 14-24, 2018 | 17 | 2018 |
DeLICM: scalar dependence removal at zero memory cost M Kruse, T Grosser Proceedings of the 2018 International Symposium on Code Generation and …, 2018 | 15 | 2018 |
Outcomes of openMP hackathon: openMP application experiences with the offloading model (part II) B Chapman, B Pham, C Yang, C Daley, C Bertoni, D Kulkarni, ... OpenMP: Enabling Massive Node-Level Parallelism: 17th International Workshop …, 2021 | 11 | 2021 |
Autotuning search space for loop transformations M Kruse, H Finkel, X Wu 2020 IEEE/ACM 6th Workshop on the LLVM Compiler Infrastructure in HPC (LLVM …, 2020 | 11 | 2020 |
User-directed loop-transformations in Clang M Kruse, H Finkel 2018 IEEE/ACM 5th Workshop on the LLVM Compiler Infrastructure in HPC (LLVM …, 2018 | 9 | 2018 |
A proposal for loop-transformation pragmas M Kruse, H Finkel Evolving OpenMP for Evolving Architectures: 14th International Workshop on …, 2018 | 9 | 2018 |
ytopt: Autotuning scientific applications for energy efficiency at large scales X Wu, P Balaprakash, M Kruse, J Koo, B Videau, P Hovland, V Taylor, ... arXiv preprint arXiv:2303.16245, 2023 | 8 | 2023 |
Introducing Molly: distributed memory parallelization with LLVM M Kruse arXiv preprint arXiv:1409.2088, 2014 | 8 | 2014 |
Customized Monte Carlo tree search for LLVM/Polly's composable loop optimization transformations J Koo, P Balaprakash, M Kruse, X Wu, P Hovland, M Hall 2021 International Workshop on Performance Modeling, Benchmarking and …, 2021 | 7 | 2021 |
Lattice QCD estimate of the η c (2S)→ J/ψγ decay rate D Bečirević, M Kruse, F Sanfilippo Journal of High Energy Physics 2015 (5), 1-19, 2015 | 7 | 2015 |
Outcomes of OpenMP Hackathon: OpenMP Application Experiences with the Offloading Mode S Pophale, D Oryspayev, B Chapman, B Pham, C Yang, C Daley, ... Brookhaven National Lab.(BNL), Upton, NY (United States), 2021 | 6 | 2021 |
Loop Transformations using Clang’s abstract syntax tree M Kruse 50th International Conference on Parallel Processing Workshop, 1-7, 2021 | 5 | 2021 |
Design and use of loop-transformation pragmas M Kruse, H Finkel OpenMP: Conquering the Full Hardware Spectrum: 15th International Workshop …, 2019 | 5 | 2019 |
atJIT: A just-in-time autotuning compiler for C++ K Farvardin, H Finkel, M Kruse, J Reppy LLVM Developers Meeting Technical Talk, 2018 | 5 | 2018 |