FPDeep: Acceleration and load balancing of CNN training on FPGA clusters T Geng, T Wang, A Sanaullah, C Yang, R Xu, R Patel, M Herbordt 2018 IEEE 26th Annual International Symposium on Field-Programmable Custom …, 2018 | 104 | 2018 |
The future of FPGA acceleration in datacenters and the cloud C Bobda, JM Mbongue, P Chow, M Ewais, N Tarafdar, JC Vega, K Eguro, ... ACM Transactions on Reconfigurable Technology and Systems (TRETS) 15 (3), 1-42, 2022 | 96 | 2022 |
A framework for acceleration of CNN training on deeply-pipelined FPGA clusters with work and weight load balancing T Geng, T Wang, A Sanaullah, C Yang, R Patel, M Herbordt 2018 28th International Conference on Field Programmable Logic and …, 2018 | 63 | 2018 |
Fully integrated FPGA molecular dynamics simulations C Yang, T Geng, T Wang, R Patel, Q Xiong, A Sanaullah, C Wu, J Sheng, ... Proceedings of the International Conference for High Performance Computing …, 2019 | 58 | 2019 |
Real-time data analysis for medical diagnosis using FPGA-accelerated neural networks A Sanaullah, C Yang, Y Alexeev, K Yoshii, MC Herbordt BMC bioinformatics 19, 19-31, 2018 | 54 | 2018 |
HPC on FPGA clouds: 3D FFTs and implications for molecular dynamics J Sheng, C Yang, A Sanaullah, M Papamichael, A Caulfield, MC Herbordt 2017 27th International Conference on Field Programmable Logic and …, 2017 | 52 | 2017 |
FPGA HPC using OpenCL: Case study in 3D FFT A Sanaullah, MC Herbordt Proceedings of the 9th International Symposium on Highly-Efficient …, 2018 | 43 | 2018 |
An empirically guided optimization framework for FPGA OpenCL A Sanaullah, R Patel, M Herbordt 2018 International Conference on Field-Programmable Technology (FPT), 46-53, 2018 | 41 | 2018 |
OpenCL for HPC with FPGAs: Case study in molecular electrostatics C Yang, J Sheng, R Patel, A Sanaullah, V Sachdeva, MC Herbordt 2017 IEEE High Performance Extreme Computing Conference (HPEC), 1-8, 2017 | 36 | 2017 |
Design and implementation of a low cost Solar Panel emulator A Sanaullah, HA Khan 2015 IEEE 42nd Photovoltaic Specialist Conference (PVSC), 1-5, 2015 | 30 | 2015 |
FPGA-Accelerated Particle-Grid Mapping A Sanaullah, A Khoshparvar, MC Herbordt 2016 IEEE 24th Annual International Symposium on Field-Programmable Custom …, 2016 | 26 | 2016 |
Survey and future trends for FPGA cloud architectures H Shahzad, A Sanaullah, M Herbordt 2021 IEEE High Performance Extreme Computing Conference (HPEC), 1-10, 2021 | 24 | 2021 |
Unlocking performance-programmability by penetrating the Intel FPGA OpenCL toolflow A Sanaullah, MC Herbordt 2018 IEEE High Performance Extreme Computing Conference (HPEC), 1-8, 2018 | 21 | 2018 |
Relational memory: Native in-memory accesses on rows and columns S Roozkhosh, D Hoornaert, JH Mun, TI Papon, A Sanaullah, U Drepper, ... arXiv preprint arXiv:2109.14349, 2021 | 13 | 2021 |
Benchmarking heterogeneous HPC systems including reconfigurable fabrics: Community aspirations for ideal comparisons P Jamieson, A Sanaullah, M Herbordt 2018 IEEE High Performance Extreme Computing Conference (HPEC), 1-6, 2018 | 13 | 2018 |
Application aware tuning of reconfigurable multi-layer perceptron architectures A Sanaullah, C Yang, Y Alexeev, K Yoshii, MC Herbordt 2018 IEEE High Performance Extreme Computing Conference (HPEC), 1-9, 2018 | 9 | 2018 |
SimBSP: Enabling RTL Simulation for Intel FPGA OpenCL Kernels A Sanaullah, C Yang, D Crawley, M Herbordt Proc. Heterogeneous High Performance Reconfigurable Computing, 2018 | 9 | 2018 |
Accelerated Particle-Grid Mapping A Sanaullah, K Lewis, M Herbordt Proceedings of the ACM/IEEE International Conference for High Performance …, 2016 | 8 | 2016 |
Reinforcement learning strategies for compiler optimization in high level synthesis H Shahzad, A Sanaullah, S Arora, R Munafo, X Yao, U Drepper, ... 2022 IEEE/ACM Eighth Workshop on the LLVM Compiler Infrastructure in HPC …, 2022 | 7 | 2022 |
OpenCL for HPC/FPGAs: Case Study with 3D FFT A Sanaullah, M Herbordt 9th International Symposium on Highly-Efficient Accelerators and …, 2017 | 7 | 2017 |