Compiling python to a hybrid execution environment R Garg, JN Amaral Proceedings of the 3rd Workshop on General-Purpose Computation on Graphics …, 2010 | 47 | 2010 |
McLab: an extensible compiler toolkit for MATLAB and related languages A Casey, J Li, J Doherty, M Chevalier-Boisvert, T Aslam, A Dubrau, ... Proceedings of the Third C* Conference on Computer Science and Software …, 2010 | 22 | 2010 |
A portable and high-performance general matrix-multiply (GEMM) library for GPUs and single-chip CPU/GPU systems R Garg, L Hendren Parallel, Distributed and Network-Based Processing (PDP), 2014 22nd …, 2014 | 20 | 2014 |
Multidimensional blocking in UPC C Barton, C Caşcaval, G Almasi, R Garg, JN Amaral, M Farreras International Workshop on Languages and Compilers for Parallel Computing, 47-62, 2007 | 13 | 2007 |
Velociraptor: An embedded compiler toolkit for numerical programs targeting CPUs and GPUs R Garg, L Hendren Proceedings of the 23rd international conference on Parallel architectures …, 2014 | 12 | 2014 |
Exploring the floating point performance of modern ARM processors R Garg http://www.anandtech.com/show/6971/exploring-the-floating-point-performance …, 2013 | 7 | 2013 |
Just-in-time shape inference for array-based languages R Garg, L Hendren Proceedings of ACM SIGPLAN International Workshop on Libraries, Languages …, 2014 | 6 | 2014 |
A compiler toolkit for array-based languages targeting CPU/GPU hybrid systems R Garg, L Hendren Technical Report 2012-3, Sable Research Group, Computer Science Department …, 2012 | 3 | 2012 |
A Toolkit for Building Dynamic Compilers for Array-based Languages Targeting CPUs and GPUs R Garg McGill University, Montréal, 2015 | 2 | 2015 |
A new compilation path: From python/numpy to opencl X Li, R Garg, JN Amaral PyHPC: Python for High Performance and Scientific Computing, 2011 | 2 | 2011 |
Velociraptor: a compiler toolkit for array-based languages targeting CPUs and GPUs R Garg, S Jagdale, L Hendren Proceedings of the 2nd ACM SIGPLAN International Workshop on Libraries …, 2015 | 1 | 2015 |
A portable and high-performance matrix operations library for CPUs, GPUs and beyond R Garg, L Hendren http://www.sable.mcgill.ca/publications/techreports/2013-1/sable-tr-2013-1.pdf, 2013 | 1 | 2013 |
Velociraptor: A compiler toolkit for numerical programs targeting CPUs and GPUs R Garg, L Hendren Technical Report SABLE-TR-2013-5, Sable Research Group, School of Computer …, 2013 | 1 | 2013 |
Floating point peak performance of Kaveri and other recent AMD and Intel chips R Garg http://www.anandtech.com/show/7711/floating-point-peak-performance-of-kaveri …, 2014 | | 2014 |
AMD Kaveri review: A8-7600 and A10-7850K tested I Cutress, R Garg http://www.anandtech.com/show/7677/amd-kaveri-review-a8-7600-a10-7850k/6, 2014 | | 2014 |
A look at Altera's OpenCL SDK for FPGAs R Garg http://www.anandtech.com/show/7334/a-look-at-alteras-opencl-sdk-for-fpgas, 2013 | | 2013 |
Nvidia's GeForce GTX Titan Review, Part 2: Titan's performance unveiled R Smith, R Garg http://www.anandtech.com/show/6774/nvidias-geforce-gtx-titan-part-2-titans …, 2013 | | 2013 |
unPython: Converting Python Numerical Programs into C R Garg, N Amaral Python in Science conference (SciPy 2008), 2008 | | 2008 |