A practical automatic polyhedral parallelizer and locality optimizer U Bondhugula, A Hartono, J Ramanujam, P Sadayappan Proceedings of the 29th ACM SIGPLAN Conference on Programming Language …, 2008 | 1511 | 2008 |
NWChem: Past, present, and future E Apra, EJ Bylaska, WA De Jong, N Govind, K Kowalski, TP Straatsma, ... The Journal of chemical physics 152 (18), 2020 | 557 | 2020 |
Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems J Lin, Q Lu, X Ding, Z Zhang, X Zhang, P Sadayappan 2008 IEEE 14th International Symposium on High Performance Computer …, 2008 | 510 | 2008 |
Automatic transformations for communication-minimized parallelization and locality optimization in the polyhedral model U Bondhugula, M Baskaran, S Krishnamoorthy, J Ramanujam, A Rountev, ... Compiler Construction: 17th International Conference, CC 2008, Held as Part …, 2008 | 445 | 2008 |
Scalable work stealing J Dinan, DB Larkins, P Sadayappan, S Krishnamoorthy, J Nieplocha Proceedings of the Conference on High Performance Computing Networking …, 2009 | 406 | 2009 |
High-performance code generation for stencil computations on GPU architectures J Holewinski, LN Pouchet, P Sadayappan Proceedings of the 26th ACM international conference on Supercomputing, 311-320, 2012 | 364 | 2012 |
Automatic C-to-CUDA code generation for affine programs MM Baskaran, J Ramanujam, P Sadayappan Compiler Construction: 19th International Conference, CC 2010, Held as Part …, 2010 | 337 | 2010 |
Effective automatic parallelization of stencil computations S Krishnamoorthy, M Baskaran, U Bondhugula, J Ramanujam, A Rountev, ... ACM SIGPLAN Notices 42 (6), 235-244, 2007 | 313 | 2007 |
A compiler framework for optimization of affine loop nests for GPGPUs MM Baskaran, U Bondhugula, S Krishnamoorthy, J Ramanujam, ... Proceedings of the 22nd annual international conference on Supercomputing …, 2008 | 298 | 2008 |
Distributed job scheduling on computational grids using multiple simultaneous requests V Subramani, R Kettimuthu, S Srinivasan, P Sadayappan Proceedings 11th IEEE International Symposium on High Performance …, 2002 | 271 | 2002 |
Compile-time techniques for data distribution in distributed memory machines J Ramanujam, P Sadayappan IEEE Transactions on parallel and distributed systems 2 (4), 472-482, 1991 | 258 | 1991 |
Scalable I/O forwarding framework for high-performance computing systems N Ali, P Carns, K Iskra, D Kimpe, S Lang, R Latham, R Ross, L Ward, ... 2009 IEEE International Conference on Cluster Computing and Workshops, 1-10, 2009 | 256 | 2009 |
Synthesis of high-performance parallel programs for a class of ab initio quantum chemistry models G Baumgartner, A Auer, DE Bernholdt, A Bibireata, V Choppella, ... Proceedings of the IEEE 93 (2), 276-292, 2005 | 255 | 2005 |
Characterization of backfilling strategies for parallel job scheduling S Srinivasan, R Kettimuthu, V Subramani, P Sadayappan Proceedings. International Conference on Parallel Processing Workshop, 514-519, 2002 | 251 | 2002 |
UTS: An unbalanced tree search benchmark S Olivier, J Huan, J Liu, J Prins, J Dinan, P Sadayappan, CW Tseng Languages and Compilers for Parallel Computing: 19th International Workshop …, 2007 | 248 | 2007 |
Polyhedral-based data reuse optimization for configurable computing LN Pouchet, P Zhang, P Sadayappan, J Cong Proceedings of the ACM/SIGDA international symposium on Field programmable …, 2013 | 225 | 2013 |
Tiling multidimensional iteration spaces for multicomputers J Ramanujam, P Sadayappan Journal of Parallel and Distributed Computing 16 (2), 108-120, 1992 | 225 | 1992 |
Task allocation onto a hypercube by recursive mincut bipartitioning F Ercal, J Ramanujam, P Sadayappan Proceedings of the third conference on Hypercube concurrent computers and …, 1988 | 224 | 1988 |
NWChem E Apra, EJ Bylaska, WA de Jong, N Govind, K Kowalski, TP Straatsma, ... American Institute of Physics, 2020 | 215 | 2020 |
Compiling array expressions for efficient execution on distributed-memory machines SKS Gupta, SD Kaushik, CH Huang, P Sadayappan Journal of Parallel and Distributed Computing 32 (2), 155-172, 1996 | 201 | 1996 |