GPU-STREAM v2.0: Benchmarking the achievable memory bandwidth of many-core processors across diverse parallel programming models T Deakin, J Price, M Martineau, S McIntosh-Smith High Performance Computing: ISC High Performance 2016 International …, 2016 | 116 | 2016 |
High performance in silico virtual drug screening on many-core processors S McIntosh-Smith, J Price, RB Sessions, AA Ibarra The international journal of high performance computing applications 29 (2 …, 2015 | 115 | 2015 |
Evaluating attainable memory bandwidth of parallel programming models via BabelStream T Deakin, J Price, M Martineau, S McIntosh-Smith International Journal of Computational Science and Engineering 17 (3), 247-262, 2018 | 73 | 2018 |
Performance portability across diverse computer architectures T Deakin, S McIntosh-Smith, J Price, A Poenaru, P Atkinson, C Popa, ... 2019 IEEE/ACM International Workshop on Performance, Portability and …, 2019 | 63 | 2019 |
On the performance portability of structured grid codes on many-core computer architectures S McIntosh-Smith, M Boulton, D Curran, J Price Supercomputing: 29th International Conference, ISC 2014, Leipzig, Germany …, 2014 | 58 | 2014 |
A performance analysis of the first generation of HPC‐optimized Arm processors S McIntosh‐Smith, J Price, T Deakin, A Poenaru Concurrency and Computation: Practice and Experience 31 (16), e5110, 2019 | 52 | 2019 |
Oclgrind: An extensible OpenCL device simulator J Price, S McIntosh-Smith Proceedings of the 3rd International Workshop on OpenCL, 1-7, 2015 | 45 | 2015 |
A GPU-accelerated immersive audio-visual framework for interaction with molecular dynamics using consumer depth sensors DR Glowacki, M O'Connor, G Calabró, J Price, P Tew, T Mitchell, J Hyde, ... Faraday Discussions, 2014 | 37 | 2014 |
Pragmatic performance portability with OpenMP 4.x M Martineau, J Price, S McIntosh-Smith, W Gaudin OpenMP: Memory, Devices, and Tasks: 12th International Workshop on OpenMP …, 2016 | 24 | 2016 |
Comparative benchmarking of the first generation of HPC-optimised Arm processors on Isambard S McIntosh-Smith, J Price, T Deakin, A Poenaru Cray user group 5, 2018 | 21 | 2018 |
Exploiting task parallelism with OpenCL: a case study P Jääskeläinen, V Korhonen, M Koskela, J Takala, K Egiazarian, ... Journal of Signal Processing Systems 91, 33-46, 2019 | 19 | 2019 |
Benchmarking the first generation of production quality Arm‐based supercomputers S McIntosh‐Smith, J Price, A Poenaru, T Deakin Concurrency and Computation: Practice and Experience 32 (20), e5569, 2020 | 16 | 2020 |
Exploiting auto-tuning to analyze and improve performance portability on many-core architectures J Price, S McIntosh-Smith High Performance Computing: ISC High Performance 2017 International …, 2017 | 11 | 2017 |
Improving auto-tuning convergence times with dynamically generated predictive performance models J Price, S McIntosh-Smith 2015 IEEE 9th International Symposium on Embedded Multicore/Many-Core …, 2015 | 11 | 2015 |
Evaluation of asynchronous offloading capabilities of accelerator programming models for multiple devices J Hahnfeld, C Terboven, J Price, HJ Pflug, MS Müller Accelerator Programming Using Directives: 4th International Workshop, WACCPD …, 2018 | 8 | 2018 |
Analyzing and improving performance portability of OpenCL applications via auto-tuning J Price, S McIntosh-Smith Proceedings of the 5th International Workshop on OpenCL, 1-4, 2017 | 8 | 2017 |
Sculpting molecular dynamics in real-time using human energy fields DR Glowacki Molecular Aesthetics, 2013 | 8 | 2013 |
Portable methods for measuring cache hierarchy performance T Deakin, J Price, S McIntosh-Smith SC17, 2017 | 7 | 2017 |
Application-based fault tolerance techniques for fully protecting sparse matrix solvers G Pawelczak, S McIntosh-Smith, J Price, M Martineau 2017 IEEE International Conference on Cluster Computing (CLUSTER), 733-740, 2017 | 7 | 2017 |
Scaling results from the first generation of Arm-based supercomputers S McIntosh-Smith, J Price, A Poenaru, T Deakin Proceedings of the Cray User Group, 2019 | 6 | 2019 |