Performance in GPU architectures: Potentials and distances A Lashgar, A Baniasadi 9th Annual Workshop on Duplicating, Deconstructing, and Debunking (WDDD 2011 …, 2011 | 28 | 2011 |
Dynamic Warp Resizing: Analysis and Benefits in High-Performance SIMT A Lashgar, A Baniasadi, A Khonsari 30th International IEEE Conference on Computer Design, ICCD 2012, 502-503, 2012 | 18 | 2012 |
IPMACC: Open source OpenACC to CUDA/OpenCL translator A Lashgar, A Majidi, A Baniasadi arXiv preprint arXiv:1412.1127, 2014 | 16 | 2014 |
Employing software-managed caches in OpenACC: Opportunities and benefits A Lashgar, A Baniasadi ACM Transactions on Modeling and Performance Evaluation of Computing Systems …, 2016 | 14 | 2016 |
OpenACC cache directive: Opportunities and optimizations A Lashgar, A Baniasadi 2016 Third Workshop on Accelerator Programming Using Directives (WACCPD), 46-56, 2016 | 12 | 2016 |
Inter-Warp Instruction Temporal Locality in Deep-Multithreaded GPUs A Lashgar, A Baniasadi, A Khonsari 26th International Conference on Architecture of Computing Systems, ARCS …, 2013 | 12 | 2013 |
HARP: Harnessing Inactive Threads in Many-Core Processors A Lashgar, A Khonsari, A Baniasadi ACM Transactions on Embedded Computing Systems 13 (3s), Article 114, 2014 | 9 | 2014 |
Warp size impact in GPUs: large or small? A Lashgar, A Baniasadi, A Khonsari Proceedings of the 6th Workshop on General Purpose Processor Using Graphics …, 2013 | 9 | 2013 |
Loop perforation in OpenACC A Lashgar, E Atoofian, A Baniasadi 2018 IEEE Intl Conf on Parallel & Distributed Processing with Applications …, 2018 | 7 | 2018 |
Understanding outstanding memory request handling resources in GPGPUs A Lashgar, E Salehi, A Baniasadi Proceedings of the Sixth International Symposium on Highly Efficient …, 2015 | 7 | 2015 |
Investigating Warp Size Impact in GPUs A Lashgar, A Baniasadi, A Khonsari arXiv preprint arXiv:1205.4967, 2012 | 7 | 2012 |
A case study in reverse engineering GPGPUs: Outstanding memory handling resources A Lashgar, E Salehi, A Baniasadi ACM SIGARCH Computer Architecture News 43 (4), 15-21, 2016 | 6 | 2016 |
A case against small data types in GPGPUs A Lashgar, A Baniasadi 2014 IEEE 25th International Conference on Application-Specific Systems …, 2014 | 5 | 2014 |
IPMACC: Translating OpenACC API to OpenCL A Lashgar, A Majidi, A Baniasadi In poster session of the 3rd International Workshop on OpenCL (IWOCL), 2015 | 4 | 2015 |
Towards green GPUs: Warp size impact analysis A Lashgar, A Baniasadi, A Khonsari 2013 International Green Computing Conference Proceedings, 1-6, 2013 | 3 | 2013 |
Efficient implementation of OpenACC cache directive on NVIDIA GPUs A Lashgar, A Baniasadi International Journal of High Performance Computing and Networking 13 (1), 35-53, 2019 | 2 | 2019 |
Dynamic Warp Resizing in High-Performance SIMT A Lashgar, A Baniasadi, A Khonsari arXiv preprint arXiv:1208.2374, 2012 | 2 | 2012 |
TELEPORT: Hardware/software alternative to CUDA shared memory programming A Lashgar, E Atoofian, A Baniasadi Microprocessors and Microsystems 63, 169-181, 2018 | 1 | 2018 |
Rethinking prefetching in GPGPUs: Exploiting unique opportunities A Lashgar, A Baniasadi 2015 IEEE 17th International Conference on High Performance Computing and …, 2015 | 1 | 2015 |
Addressing software-managed cache development effort in GPGPUs A Lashgar | | 2017 |