Heteroflow: An accelerator programming model with decoupled data placement for software-defined fpgas S Xiang, YH Lai, Y Zhou, H Chen, N Zhang, D Pal, Z Zhang Proceedings of the 2022 ACM/SIGDA International Symposium on Field …, 2022 | 26 | 2022 |
RapidLayout: Fast Hard Block Placement of FPGA-optimized Systolic Arrays Using Evolutionary Algorithm N Zhang, X Chen, N Kapre ACM Transactions on Reconfigurable Technology and Systems (TRETS) 15 (4), 1-23, 2022 | 13 | 2022 |
Codedvtr: Codebook-based sparse voxel transformer with geometric guidance T Zhao, N Zhang, X Ning, H Wang, L Yi, Y Wang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 10 | 2022 |
Understanding the potential of fpga-based spatial acceleration for large language model inference H Chen, J Zhang, Y Du, S Xiang, Z Yue, N Zhang, Y Cai, Z Zhang ACM Transactions on Reconfigurable Technology and Systems, 2024 | 9 | 2024 |
Serving Multi-DNN Workloads on FPGAs: a Coordinated Architecture, Scheduling, and Mapping Perspective S Zeng, G Dai, N Zhang, X Yang, H Zhang, Z Zhu, H Yang, Y Wang IEEE Transactions on Computers, 2022 | 6 | 2022 |
Accelerator design with decoupled hardware customizations: benefits and challenges D Pal, YH Lai, S Xiang, N Zhang, H Chen, J Casas, P Cocchini, Z Yang, ... Proceedings of the 59th ACM/IEEE Design Automation Conference, 1351-1354, 2022 | 5 | 2022 |
aw_nas: A modularized and extensible nas framework X Ning, C Tang, W Li, S Yang, T Zhao, N Zhang, T Lu, S Liang, H Yang, ... arXiv preprint arXiv:2012.10388, 2020 | 4 | 2020 |
Allo: A Programming Model for Composable Accelerator Design H Chen, N Zhang, S Xiang, Z Zeng, M Dai, Z Zhang Proceedings of the ACM on Programming Languages 8 (PLDI), 593-620, 2024 | 3 | 2024 |
A comprehensive evaluation of fpga-based spatial acceleration of llms H Chen, J Zhang, Y Du, S Xiang, Z Yue, N Zhang, Y Cai, Z Zhang Proceedings of the 2024 ACM/SIGDA International Symposium on Field …, 2024 | 1 | 2024 |
Formal Verification of Source-to-Source Transformations for HLS LN Pouchet, E Tucker, N Zhang, H Chen, D Pal, G Rodríguez, Z Zhang Proceedings of the 2024 ACM/SIGDA International Symposium on Field …, 2024 | 1 | 2024 |
Supporting a Virtual Vector Instruction Set on a Commercial Compute-in-SRAM Accelerator C Golden, D Ilan, C Huang, N Zhang, Z Zhang, C Batten IEEE Computer Architecture Letters, 2023 | | 2023 |
RapidLayout: Fast Hard Block Placement of FPGA-optimized Systolic Arrays Using Evolutionary Algorithm N Zhang, X Chen, N Kapre 30th International Conference on Field Programmable Logic and Applications (FPL), 2020 | | 2020 |