Gandiva: Introspective cluster scheduling for deep learning W Xiao, R Bhardwaj, R Ramjee, M Sivathanu, N Kwatra, Z Han, P Patel, ... 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2018 | 508 | 2018 |
The demikernel datapath os architecture for microsecond-scale datacenter systems I Zhang, A Raybuck, P Patel, K Olynyk, J Nelson, OSN Leija, A Martinez, ... Proceedings of the ACM SIGOPS 28th Symposium on Operating Systems Principles …, 2021 | 88 | 2021 |
A server-based approach for predictable GPU access control H Kim, P Patel, S Wang, RR Rajkumar 2017 IEEE 23rd International Conference on Embedded and Real-Time Computing …, 2017 | 36 | 2017 |
Splitwise: Efficient generative llm inference using phase splitting P Patel, E Choukse, C Zhang, A Shah, Í Goiri, S Maleki, R Bianchini Power 400 (700W), 1.75, 2023 | 30 | 2023 |
The virtual block interface: A flexible alternative to the conventional virtual memory framework N Hajinazar, P Patel, M Patel, K Kanellopoulos, S Ghose, ... 2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture …, 2020 | 30 | 2020 |
Analytical enhancements and practical insights for MPCP with self-suspensions P Patel, I Baek, H Kim, R Rajkumar 2018 IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS …, 2018 | 30 | 2018 |
SoundWatch: Exploring smartwatch-based deep learning approaches to support sound awareness for deaf and hard of hearing users D Jain, H Ngo, P Patel, S Goodman, L Findlater, J Froehlich Proceedings of the 22nd International ACM SIGACCESS Conference on Computers …, 2020 | 28 | 2020 |
A server-based approach for predictable GPU access with improved analysis H Kim, P Patel, S Wang, RR Rajkumar Journal of Systems Architecture 88, 97-109, 2018 | 22 | 2018 |
Towards improved power management in cloud gpus P Patel, Z Gong, S Rizvi, E Choukse, P Misra, T Anderson, A Sriraman IEEE Computer Architecture Letters 22 (2), 141-144, 2023 | 12 | 2023 |
Timershield: Protecting High-Priority Tasks from Low-Priority Timer Interference P Patel, M Vanga, BB Brandenburg 2017 IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS …, 2017 | 12 | 2017 |
Srifty: Swift and thrifty distributed neural network training on the cloud L Luo, P West, P Patel, A Krishnamurthy, L Ceze Proceedings of Machine Learning and Systems 4, 833-847, 2022 | 7 | 2022 |
Characterizing Power Management Opportunities for LLMs in the Cloud P Patel, E Choukse, C Zhang, Í Goiri, B Warrier, N Mahalingam, ... Proceedings of the 29th ACM International Conference on Architectural …, 2024 | 4 | 2024 |
Polca: Power oversubscription in llm cloud providers P Patel, E Choukse, C Zhang, Í Goiri, B Warrier, N Mahalingam, ... arXiv preprint arXiv:2308.12908, 2023 | 4 | 2023 |
Hybrid Computing for Interactive Datacenter Applications P Patel, K Lim, K Jhunjhunwalla, A Martinez, M Demoulin, J Nelson, ... arXiv preprint arXiv:2304.04488, 2023 | 4 | 2023 |
SoundWatch: deep learning for sound accessibility on smartwatches D Jain, H Ngo, P Patel, S Goodman, K Nguyen, R Grossman-Kahn, ... Communications of the ACM 65 (6), 100-108, 2022 | 3 | 2022 |
The magazine archive includes every article published in Communications of the ACM for over the past 50 years. UN Umesh, MQ Huynh, L Jessup Communications of the ACM 48 (6), 82-87, 2005 | 3 | 2005 |
An agile pathway towards carbon-aware clouds P Patel, T Gregersen, T Anderson Proceedings of the 2nd Workshop on Sustainable Computer Systems, 1-8, 2023 | | 2023 |
Predictable GPU Arbitration for Fixed-Priority Real-Time Systems P Patel Birla Institute of Technology and Science, Pilani, 2017 | | 2017 |
File Systems are not Enough: Rethinking the Storage API for Microsecond-Scale Cloud Applications A Martinez, K Lim, P Patel, I Zhang, D Ports, J Nelson, T Anderson | | |
Designing Equitable Data Center Scheduling Systems S Rangarajan, X Chen, P Patel, J Wang, A Sriraman | | |