iMODS: internal coordinates normal mode analysis server JR López-Blanco, JI Aliaga, ES Quintana-Ortí, P Chacón Nucleic acids research 42 (W1), W271-W276, 2014 | 551 | 2014 |
rCUDA: Reducing the number of GPU-based accelerators in high performance clusters J Duato, AJ Pena, F Silla, R Mayo, ES Quintana-Ortí 2010 International Conference on High Performance Computing & Simulation …, 2010 | 402 | 2010 |
Solving stable generalized Lyapunov equations with the matrix sign function P Benner, ES Quintana-Ortí Numerical Algorithms 20, 75-100, 1999 | 222 | 1999 |
An extension of the StarSs programming model for platforms with multiple GPUs E Ayguadé, RM Badia, FD Igual, J Labarta, R Mayo, ES Quintana-Ortí Euro-Par 2009 Parallel Processing: 15th International Euro-Par Conference …, 2009 | 219 | 2009 |
The science of deriving dense linear algebra algorithms P Bientinesi, JA Gunnels, ME Myers, ES Quintana-Ortí, RA Van de Geijn ACM Transactions on Mathematical Software (TOMS) 31 (1), 1-26, 2005 | 213 | 2005 |
Programming matrix algorithms-by-blocks for thread-level parallelism G Quintana-Ortí, ES Quintana-Ortí, RA Van de Geijn, FG Van Zee, E Chan ACM Transactions on Mathematical Software (TOMS) 36 (3), 1-26, 2009 | 194 | 2009 |
SuperMatrix out-of-order scheduling of matrix operations for SMP and multi-core architectures E Chan, ES Quintana-Orti, G Quintana-Orti, R Van De Geijn Proceedings of the nineteenth annual ACM symposium on Parallel algorithms …, 2007 | 181 | 2007 |
Analytical modeling is enough for high-performance BLIS TM Low, FD Igual, TM Smith, ES Quintana-Orti ACM Transactions on Mathematical Software (TOMS) 43 (2), 1-18, 2016 | 172 | 2016 |
Solving dense linear systems on platforms with multiple hardware accelerators G Quintana-Ortí, FD Igual, ES Quintana-Ortí, RA Van de Geijn Proceedings of the 14th ACM SIGPLAN symposium on Principles and practice of …, 2009 | 153 | 2009 |
SuperMatrix: a multithreaded runtime scheduling system for algorithms-by-blocks E Chan, FG Van Zee, P Bientinesi, ES Quintana-Orti, G Quintana-Orti, ... Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of …, 2008 | 145 | 2008 |
A complete and efficient CUDA-sharing solution for HPC clusters AJ Pena, C Reaño, F Silla, R Mayo, ES Quintana-Ortí, J Duato Parallel Computing 40 (10), 574-588, 2014 | 134 | 2014 |
The libflame library for dense matrix computations FG Van Zee, E Chan, RA Van de Geijn, ES Quintana-Orti, G Quintana-Orti Computing in science & engineering 11 (6), 56-63, 2009 | 134 | 2009 |
Evaluation and tuning of the level 3 CUBLAS for graphics processors S Barrachina, M Castillo, FD Igual, R Mayo, ES Quintana-Orti 2008 IEEE International Symposium on Parallel and Distributed Processing, 1-8, 2008 | 133 | 2008 |
A note on parallel matrix inversion ES Quintana, G Quintana, X Sun, R Van De Geijn SIAM Journal on Scientific Computing 22 (5), 1762-1771, 2001 | 122 | 2001 |
Solving dense linear systems on graphics processors S Barrachina, M Castillo, FD Igual, R Mayo, ES Quintana-Ortí European Conference on Parallel Processing, 739-748, 2008 | 121 | 2008 |
Representing linear algebra algorithms in code: the FLAME application program interfaces P Bientinesi, ES Quintana-Ortí, RA Van de Geijn ACM Transactions on Mathematical Software (TOMS) 31 (1), 27-59, 2005 | 121 | 2005 |
Enabling CUDA acceleration within virtual machines using rCUDA J Duato, AJ Pena, F Silla, JC Fernandez, R Mayo, ES Quintana-Orti 2011 18th International Conference on High Performance Computing, 1-10, 2011 | 110 | 2011 |
Parallelizing dense and banded linear algebra libraries using SMPSs RM Badia, JR Herrero, J Labarta, JM Pérez, ES Quintana-Ortí, ... Concurrency and Computation: Practice and Experience 21 (18), 2438-2456, 2009 | 104 | 2009 |
A proposal to extend the OpenMP tasking model for heterogeneous architectures E Ayguade, RM Badia, D Cabrera, A Duran, M Gonzalez, F Igual, ... Evolving OpenMP in an Age of Extreme Parallelism: 5th International Workshop …, 2009 | 102 | 2009 |
Extending OpenMP to survive the heterogeneous multi-core era E Ayguadé, RM Badia, P Bellens, D Cabrera, A Duran, R Ferrer, ... International Journal of Parallel Programming 38, 440-459, 2010 | 99 | 2010 |