Mystic: Predictive scheduling for gpu based cloud servers using machine learning Y Ukidave, X Li, D Kaeli 2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2016 | 90 | 2016 |
Fast Fourier Transform (FFT) on GPUs Y Ukidave, G Schirner, D Kaeli Numerical Computations with GPUs, 339-361, 2014 | 52* | 2014 |
Griffin: Hardware-software support for efficient page migration in multi-gpu systems T Baruah, Y Sun, AT Dinçer, SA Mojumder, JL Abellán, Y Ukidave, ... 2020 IEEE International Symposium on High Performance Computer Architecture …, 2020 | 47 | 2020 |
Nupar: A benchmark suite for modern gpu architectures Y Ukidave, FN Paravecino, L Yu, C Kalra, A Momeni, Z Chen, N Materise, ... Proceedings of the 6th ACM/SPEC International Conference on Performance …, 2015 | 45 | 2015 |
Performance of the NVIDIA Jetson TK1 in HPC Y Ukidave, D Kaeli, U Gupta, K Keville 2015 IEEE International Conference on Cluster Computing, 533-534, 2015 | 38 | 2015 |
Gnnmark: A benchmark suite to characterize graph neural network training on gpus T Baruah, K Shivdikar, S Dong, Y Sun, SA Mojumder, K Jung, JL Abellán, ... 2021 IEEE International Symposium on Performance Analysis of Systems and …, 2021 | 36 | 2021 |
Runtime support for adaptive spatial partitioning and inter-kernel communication on gpus Y Ukidave, C Kalra, D Kaeli, P Mistry, D Schaa 2014 IEEE 26th International Symposium on Computer Architecture and High …, 2014 | 32 | 2014 |
GPU-accelerated HMM for speech recognition L Yu, Y Ukidave, D Kaeli 2014 43rd International Conference on Parallel Processing Workshops, 395-402, 2014 | 26 | 2014 |
Valar: A benchmark suite to study the dynamic behavior of heterogeneous systems P Mistry, Y Ukidave, D Schaa, D Kaeli Proceedings of the 6th Workshop on General Purpose Processor Using Graphics …, 2013 | 25 | 2013 |
Exploring the heterogeneous design space for both performance and reliability R Ubal, D Schaa, P Mistry, X Gong, Y Ukidave, Z Chen, G Schirner, ... Proceedings of the 51st Annual Design Automation Conference, 1-6, 2014 | 20 | 2014 |
Valkyrie: Leveraging inter-tlb locality to enhance gpu performance T Baruah, Y Sun, SA Mojumder, JL Abellán, Y Ukidave, A Joshi, N Rubin, ... Proceedings of the ACM International Conference on Parallel Architectures …, 2020 | 16 | 2020 |
Exploring the features of OpenCL 2.0 S Mukherjee, X Gong, L Yu, C McCardwell, Y Ukidave, T Dao, ... Proceedings of the 3rd International Workshop on OpenCL, 1-5, 2015 | 16 | 2015 |
Quantifying the Energy Efficiency of FFT on Heterogeneous Platforms Y Ukidave, A Ziabari, P Mistry, G Schirner, D Kaeli 2013 IEEE International Symposium on Performance Analysis of Systems and …, 2013 | 16 | 2013 |
Analyzing power efficiency of optimization techniques and algorithm design methods for applications on heterogeneous platforms Y Ukidave, AK Ziabari, P Mistry, G Schirner, D Kaeli The International journal of high performance computing applications 28 (3 …, 2014 | 12 | 2014 |
Exploring the efficiency of the opencl pipe semantic on an FPGA A Momeni, H Tabkhi, Y Ukidave, G Schirner, D Kaeli ACM SIGARCH Computer Architecture News 43 (4), 52-57, 2016 | 10 | 2016 |
Performance evaluation and optimization mechanisms for inter-operable graphics and computation on gpus Y Ukidave, X Gong, D Kaeli Proceedings of Workshop on General Purpose Processing Using GPUs, 37-45, 2014 | 7 | 2014 |
Analyzing optimization techniques for power efficiency on heterogeneous platforms Y Ukidave, DR Kaeli 2013 IEEE International Symposium on Parallel & Distributed Processing …, 2013 | 7 | 2013 |
Architectural and Runtime Enhancements for Dynamically Controlled Multi-Level Concurrency on GPUs Y Ukidave Northeastern University, 2015 | 5 | 2015 |
A framework for profiling and performance monitoring of heterogeneous applications P Mistry, Y Ukidave, D Schaa, D Kaeli Programmability Issues for Heterogeneous Multicores (MULTIPROG-2013), 2013 | 4 | 2013 |
Feedback guided split workgroup dispatch for gpus YS Ukidave, J Kalamatianos, BM Beckmann US Patent App. 15/965,231, 2019 | 3 | 2019 |