OP2: An active library framework for solving unstructured mesh-based applications on multi-core and many-core architectures GR Mudalige, MB Giles, I Reguly, C Bertolli, PHJ Kelly 2012 Innovative Parallel Computing (InPar), 1-12, 2012 | 104 | 2012 |
Performance analysis of a hybrid MPI/CUDA implementation of the NAS-LU benchmark SJ Pennycook, SD Hammond, SA Jarvis, GR Mudalige ACM SIGMETRICS Performance Evaluation Review 38 (4), 23-29, 2011 | 99 | 2011 |
Performance analysis and optimization of the OP2 framework on many-core architectures MB Giles, GR Mudalige, Z Sharif, G Markall, PHJ Kelly The Computer Journal 55 (2), 168-180, 2012 | 95* | 2012 |
WARPP: a toolkit for simulating high-performance parallel scientific codes SD Hammond, GR Mudalige, JA Smith, SA Jarvis, JA Herdman, ... Proceedings of the 2nd International Conference on Simulation Tools and …, 2009 | 82 | 2009 |
The OPS domain specific abstraction for multi-block structured grid computations IZ Reguly, GR Mudalige, MB Giles, D Curran, S McIntosh-Smith 2014 Fourth International Workshop on Domain-Specific Languages and High …, 2014 | 80 | 2014 |
Acceleration of a Full-scale Industrial CFD Application with OP2 I Reguly, G Mudalige, C Bertolli, M Giles, A Betts, P Kelly, D Radford IEEE Transactions on Parallel and Distributed Systems, 2015 | 68 | 2015 |
A plug-and-play model for evaluating wavefront computations on parallel architectures GR Mudalige, MK Vernon, SA Jarvis 2008 IEEE International Symposium on Parallel and Distributed Processing, 1-14, 2008 | 50 | 2008 |
Loop Tiling in Large-Scale Stencil Codes at Run-time with OPS IZ Reguly, GR Mudalige, MB Giles IEEE Transactions on Parallel and Distributed Systems 29 (4), 873-886, 2017 | 48 | 2017 |
Designing OP2 for GPU architectures MB Giles, GR Mudalige, B Spencer, C Bertolli, I Reguly Journal of Parallel and Distributed Computing 73 (11), 1451-1460, 2013 | 46* | 2013 |
Productivity, performance, and portability for computational fluid dynamics applications IZ Reguly, GR Mudalige Computers & Fluids 199, 104425, 2020 | 39 | 2020 |
Vectorizing unstructured mesh computations for many-core architectures IZ Reguly, E László, GR Mudalige, MB Giles Proceedings of Programming Models and Applications on Multicores and …, 2014 | 36 | 2014 |
Design and performance of the OP2 library for unstructured mesh applications C Bertolli, A Betts, G Mudalige, M Giles, P Kelly European Conference on Parallel Processing, 191-200, 2011 | 33 | 2011 |
Design and initial performance of a high-level unstructured mesh framework on heterogeneous parallel systems GR Mudalige, MB Giles, J Thiyagalingam, IZ Reguly, C Bertolli, PHJ Kelly, ... Parallel Computing 39 (11), 669-692, 2013 | 31 | 2013 |
On the acceleration of wavefront applications using distributed many-core architectures SJ Pennycook, SD Hammond, GR Mudalige, SA Wright, SA Jarvis The Computer Journal 55 (2), 138-153, 2012 | 31 | 2012 |
Loop chaining: A programming abstraction for balancing locality and parallelism CD Krieger, MM Strout, C Olschanowsky, A Stone, S Guzik, X Gao, ... 2013 IEEE International Symposium on Parallel & Distributed Processing …, 2013 | 29 | 2013 |
Large-scale performance of a DSL-based multi-block structured-mesh application for Direct Numerical Simulation GR Mudalige, IZ Reguly, SP Jammy, CT Jacobs, MB Giles, ND Sandham Journal of Parallel and Distributed Computing 131, 130-146, 2019 | 23 | 2019 |
Performance Analysis of a High-Level Abstractions-Based Hydrocode on Future Computing Systems GR Mudalige, IZ Reguly, MB Giles, AC Mallinson, WP Gaudin, JA ... High Performance Computing Systems. Performance Modeling, Benchmarking, and …, 2015 | 23* | 2015 |
OP2-Clang: A source-to-source translator using Clang/LLVM LibTooling GD Balogh, GR Mudalige, IZ Reguly, SF Antao, C Bertolli 2018 IEEE/ACM 5th Workshop on the LLVM Compiler Infrastructure in HPC (LLVM …, 2018 | 21 | 2018 |
Achieving performance portability for a heat conduction solver mini-application on modern multi-core systems RO Kirk, GR Mudalige, IZ Reguly, SA Wright, MJ Martineau, SA Jarvis 2017 IEEE International Conference on Cluster Computing (CLUSTER), 834-841, 2017 | 21 | 2017 |
Compiler optimizations for industrial unstructured mesh CFD applications on GPUs C Bertolli, A Betts, N Loriant, GR Mudalige, D Radford, DA Ham, MB Giles, ... International Workshop on Languages and Compilers for Parallel Computing …, 2012 | 20 | 2012 |