CuSha: vertex-centric graph processing on GPUs F Khorasani, K Vora, R Gupta, LN Bhuyan Proceedings of the 23rd international symposium on High-performance parallel …, 2014 | 325 | 2014 |
Scalable SIMD-Efficient Graph Processing on GPUs F Khorasani, R Gupta, LN Bhuyan Proceedings of the International Conference on Parallel Architectures and …, 2015 | 168 | 2015 |
RegMutex: Inter-Warp GPU Register Time-Sharing F Khorasani, HA Esfeden, A Farmahini-Farahani, N Jayasena, V Sarkar International Symposium on Computer Architecture (ISCA), 2018 | 52 | 2018 |
Stadium Hashing: Scalable and Flexible Hashing on GPUs F Khorasani, ME Belviranli, R Gupta, LN Bhuyan Proceedings of the International Conference on Parallel Architectures and …, 2015 | 46 | 2015 |
Efficient Warp Execution in Presence of Divergence with Collaborative Context Collection F Khorasani, R Gupta, LN Bhuyan Proceedings of the 48th International Symposium on Microarchitecture, 204-215, 2015 | 45 | 2015 |
CORF: Coalescing operand register file for GPUs H Asghari Esfeden, F Khorasani, H Jeon, D Wong, N Abu-Ghazaleh Proceedings of the Twenty-Fourth International Conference on Architectural …, 2019 | 42 | 2019 |
In-Register Parameter Caching for Dynamic Neural Nets with Virtual Persistent Processor Specialization F Khorasani, HA Esfeden, N Abu-Ghazaleh, V Sarkar 2018 51st Annual IEEE/ACM International Symposium on Microarchitecture …, 2018 | 36 | 2018 |
CuMAS: Data Transfer Aware Multi-Application Scheduling for Shared GPUs ME Belviranli, F Khorasani, LN Bhuyan, R Gupta Proceedings of the International Conference on Supercomputing (ICS), 2016 | 34 | 2016 |
Eliminating Intra-warp Load Imbalance in Irregular Nested Patterns via Collaborative Task Engagement F Khorasani, B Rowe, R Gupta, LN Bhuyan Proceedings of the International Parallel and Distributed Processing …, 2016 | 19 | 2016 |
Dyna: Toward a Self-optimizing Declarative Language for Machine Learning Applications T Vieira, M Francis-Landau, NW Filardo, F Khorasani, J Eisner Proceedings of the 1st ACM SIGPLAN International Workshop on Machine …, 2017 | 11 | 2017 |
Parmat: A parallel generator for large r-mat graphs F Khorasani, K Vora, R Gupta Proceedings of the 24th International Conference on Parallel Architectures …, 2015 | 7 | 2015 |
High Performance Vertex-Centric Graph Analytics on GPUs F Khorasani University of California, Riverside, 2016 | 4 | 2016 |
Enabling Work-Efficiency for High Performance Vertex-Centric Graph Analytics on GPUs F Khorasani, K Vora, R Gupta, LN Bhuyan Proceedings of the Seventh Workshop on Irregular Applications: Architectures …, 2017 | 3 | 2017 |
Compiler-assisted inter-simd-group register sharing F Khorasani, A Farmahini-Farahani, NS Jayasena US Patent App. 15/935,399, 2018 | 2* | 2018 |
High Performance and Scalable Graph Computation on GPUs F Khorasani Sustainable Interdependent Networks, 67-75, 2018 | 2 | 2018 |
LightPlay: Efficient Replay with GPUs M Feng, F Khorasani, R Gupta, LN Bhuyan International Workshop on Languages and Compilers for Parallel Computing …, 2014 | | 2014 |