Adaptive sparse tiling for sparse matrix multiplication C Hong, A Sukumaran-Rajam, I Nisa, K Singh, P Sadayappan Proceedings of the 24th Symposium on Principles and Practice of Parallel …, 2019 | 166 | 2019 |
Register optimizations for stencils on GPUs PS Rawat, F Rastello, A Sukumaran-Rajam, LN Pouchet, A Rountev, ... Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of …, 2018 | 65 | 2018 |
Analytical characterization and design space exploration for optimization of CNNs R Li, Y Xu, A Sukumaran-Rajam, A Rountev, P Sadayappan Proceedings of the 26th ACM International Conference on Architectural …, 2021 | 61 | 2021 |
A code generator for high-performance tensor contractions on GPUs J Kim, A Sukumaran-Rajam, V Thumma, S Krishnamoorthy, A Panyala, ... 2019 IEEE/ACM International Symposium on Code Generation and Optimization …, 2019 | 60 | 2019 |
Domain-specific optimization and generation of high-performance GPU code for stencil computations PS Rawat, M Vaidya, A Sukumaran-Rajam, M Ravishankar, V Grover, ... Proceedings of the IEEE 106 (11), 1902-1920, 2018 | 55 | 2018 |
Load-balanced sparse mttkrp on gpus I Nisa, J Li, A Sukumaran-Rajam, R Vuduc, P Sadayappan 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2019 | 53 | 2019 |
Efficient sparse-matrix multi-vector product on gpus C Hong, A Sukumaran-Rajam, B Bandyopadhyay, J Kim, SE Kurt, I Nisa, ... Proceedings of the 27th International Symposium on High-Performance Parallel …, 2018 | 52 | 2018 |
Effective machine learning based format selection and performance modeling for SpMV on GPUs I Nisa, C Siegel, AS Rajam, A Vishnu, P Sadayappan 2018 IEEE International Parallel and Distributed Processing Symposium …, 2018 | 43 | 2018 |
MultiGraph: Efficient graph processing on GPUs C Hong, A Sukumaran-Rajam, J Kim, P Sadayappan 2017 26th International Conference on Parallel Architectures and Compilation …, 2017 | 43 | 2017 |
Sampled dense matrix multiplication for high-performance machine learning I Nisa, A Sukumaran-Rajam, SE Kurt, C Hong, P Sadayappan 2018 IEEE 25th International Conference on High Performance Computing (HiPC …, 2018 | 42 | 2018 |
An efficient mixed-mode representation of sparse tensors I Nisa, J Li, A Sukumaran-Rajam, PS Rawat, S Krishnamoorthy, ... Proceedings of the International Conference for High Performance Computing …, 2019 | 41 | 2019 |
On optimizing complex stencils on GPUs PS Rawat, M Vaidya, A Sukumaran-Rajam, A Rountev, LN Pouchet, ... 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2019 | 38 | 2019 |
Israt Nisa, Shivani Sabhlok, Ümit V. Çatalyürek, Srinivasan Parthasarathy, and P. Sadayappan. 2018. Efficient Sparse-Matrix Multi-Vector Product on GPUs C Hong, A Sukumaran-Rajam, B Bandyopadhyay, J Kim, SE Kurt Proceedings of the 27th International Symposium on High-Performance Parallel …, 2018 | 37 | 2018 |
The polyhedral model of nonlinear loops A Sukumaran-Rajam, P Clauss ACM Transactions on Architecture and Code Optimization (TACO) 12 (4), 1-27, 2015 | 37 | 2015 |
Analytical cache modeling and tilesize optimization for tensor contractions R Li, A Sukumaran-Rajam, R Veras, TM Low, F Rastello, A Rountev, ... Proceedings of the International Conference for High Performance Computing …, 2019 | 34 | 2019 |
Efficient tiled sparse matrix multiplication through matrix signatures SE Kurt, A Sukumaran-Rajam, F Rastello, P Sadayyapan SC20: International Conference for High Performance Computing, Networking …, 2020 | 32 | 2020 |
Parallel ccd++ on gpu for matrix factorization I Nisa, A Sukumaran-Rajam, R Kunchum, P Sadayappan Proceedings of the General Purpose GPUs, 73-83, 2017 | 25 | 2017 |
Optimizing tensor contractions in ccsd (t) for efficient execution on gpus J Kim, A Sukumaran-Rajam, C Hong, A Panyala, RK Srivastava, ... Proceedings of the 2018 International Conference on Supercomputing, 96-106, 2018 | 24 | 2018 |
On improving performance of sparse matrix-matrix multiplication on gpus R Kunchum, A Chaudhry, A Sukumaran-Rajam, Q Niu, I Nisa, ... Proceedings of the International Conference on Supercomputing, 1-11, 2017 | 23 | 2017 |
APOLLO: Automatic speculative polyhedral loop optimizer JMM Caamaño, A Sukumaran-Rajam, A Baloian, M Selva, P Clauss IMPACT 2017-7th International Workshop on Polyhedral Compilation Techniques, 8, 2017 | 22 | 2017 |