Perfexpert: An easy-to-use performance diagnosis tool for hpc applications M Burtscher, BD Kim, J Diamond, J McCalpin, L Koesterke, J Browne SC'10: Proceedings of the 2010 ACM/IEEE International Conference for High …, 2010 | 136 | 2010 |
An evaluation of the TRIPS computer system M Gebhart, BA Maher, KE Coons, J Diamond, P Gratz, M Marino, ... ACM SIGARCH Computer Architecture News 37 (1), 1-12, 2009 | 92 | 2009 |
A closer look at lightweight graph reordering P Faldu, J Diamond, B Grot 2019 IEEE International Symposium on Workload Characterization (IISWC), 1-13, 2019 | 53 | 2019 |
Domain-Specialized Cache Management for Graph Analytics P Faldu, BG Grot, J Diamond International Symposium on High Performance Computer Architecture (HPCA) 26, 2020 | 52 | 2020 |
Arbitrary modulus indexing JR Diamond, DS Fussell, SW Keckler 2014 47th Annual IEEE/ACM International Symposium on Microarchitecture, 140-152, 2014 | 33 | 2014 |
Reciprocity and the concept of the Brewster wavenumber A Lakhtakia, JR Diamond International journal of infrared and millimeter waves 12, 1167-1174, 1991 | 23 | 1991 |
High performance dense linear algebra on a spatially distributed processor JR Diamond, B Robatmili, SW Keckler, R van de Geijn, K Goto, D Burger Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of …, 2008 | 15 | 2008 |
Multicore optimization for ranger J Diamond, BD Kim, M Burtscher, S Keckler, K Pingali, J Browne 2009 TeraGrid Conference, 2009 | 12 | 2009 |
Large-scale fast Fourier transform on a heterogeneous multi-core system Y Li, JR Diamond, X Wang, H Lin, Y Yang, Z Han The International Journal of High Performance Computing Applications 26 (2 …, 2012 | 7 | 2012 |
A performance model for fast Fourier transform Y Li, L Zhao, H Lin, AC Chow, JR Diamond 2009 IEEE International Symposium on Parallel & Distributed Processing, 1-11, 2009 | 7 | 2009 |
POSTER: Domain-specialized cache management for graph analytics P Faldu, J Diamond, B Grot 2019 28th International Conference on Parallel Architectures and Compilation …, 2019 | 6 | 2019 |
Making sense of performance counter measurements on supercomputing applications J Diamond, JD McCalpin, M Burtscher, BD Kim, SW Keckler, JC Browne Technical Report TR-10-25, 2010 | 6 | 2010 |
Method and system for dynamic cache partitioning using address remapping P Koka, HD Schwetman Jr, M Zulfiqar, J Diamond US Patent 9,489,309, 2016 | 5 | 2016 |
An evaluation of the TRIPS computer system (extended technical report) M Gebhart, BA Maher, KE Coons, J Diamond, P Gratz, M Marino, ... | 4 | 2008 |
Designing on-chip memory systems for throughput architectures JR Diamond | 1 | 2015 |
Evaluation and optimization of multicore performance bottlenecks in supercomputing applications SWKJCB J. Diamond, M. Burtscher, J. D. McCalpin, B. D. Kim Performance Analysis of Systems and Software (ISPASS), 2011 IEEE …, 2011 | | 2011 |
ICES REPORT 10-04 M Burtscher, BD Kim, J Diamond, J McCalpin, L Koesterke, J Browne | | 2010 |
Knowledge Bases and Contributors J Diamond, BD Kim, S Keckler, K Pingali, J McCalpin, L Koesterke, ... | | |
Hoefler, Torsten 369 Huang, Michael 110 TM Aamodt, R Afoakwa, G Agrawal, M Ahmad, J Ahn, JH Ahn, AM Aji, ... | | |
Increasing Throughput Performance with Arbitrary Modulus Indexing JR Diamond, DS Fussell, SW Keckler | | |