PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning W Niu, X Ma, S Lin, S Wang, X Qian, X Lin, Y Wang, B Ren ASPLOS '20: Proceedings of the Twenty-Fifth International Conference on …, 2020 | 291 | 2020 |
PCONV: The missing but desirable sparsity in DNN weight pruning for real-time execution on mobile devices X Ma, FM Guo, W Niu, X Lin, J Tang, K Ma, B Ren, Y Wang Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 5117-5124, 2020 | 202 | 2020 |
SPViT: Enabling faster vision transformers via latency-aware soft token pruning Z Kong, P Dong, X Ma, X Meng, W Niu, M Sun, X Shen, G Yuan, B Ren, ... European Conference on Computer Vision, 620-640, 2022 | 174 | 2022 |
DNNFusion: Accelerating deep neural networks execution with advanced operator fusion W Niu, J Guan, Y Wang, G Agrawal, B Ren Proceedings of the 42nd ACM SIGPLAN International Conference on Programming …, 2021 | 142 | 2021 |
YOLObile: Real-time object detection on mobile devices via compression-compilation co-design Y Cai, H Li, G Yuan, W Niu, Y Li, X Tang, B Ren, Y Wang Proceedings of the AAAI Conference on Artificial Intelligence 35 (2), 955-963, 2021 | 113 | 2021 |
MEST: Accurate and fast memory-economic sparse training framework on the edge G Yuan, X Ma, W Niu, Z Li, Z Kong, N Liu, Y Gong, Z Zhan, C He, Q Jin, ... Advances in Neural Information Processing Systems 34, 20838-20850, 2021 | 93 | 2021 |
SIMD Parallelization of Applications that Traverse Irregular Data Structures B Ren, G Agrawal, JR Larus, T Mytkowicz, T Poutanen, W Schulte 2013 International Symposium on Code Generation and Optimization, 1-10, 2013 | 78 | 2013 |
RTMobile: Beyond real-time mobile acceleration of RNNs for speech recognition P Dong, S Wang, W Niu, C Zhang, S Lin, Z Li, Y Gong, B Ren, X Lin, ... 2020 57th ACM/IEEE Design Automation Conference (DAC), 1-6, 2020 | 64 | 2020 |
SparCL: Sparse continual learning on the edge Z Wang, Z Zhan, Y Gong, G Yuan, W Niu, T Jian, B Ren, S Ioannidis, ... Advances in Neural Information Processing Systems 35, 20366-20380, 2022 | 58 | 2022 |
Achieving on-mobile real-time super-resolution with neural architecture and pruning search Z Zhan, Y Gong, P Zhao, G Yuan, W Niu, Y Wu, T Zhang, M Jayaweera, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 53 | 2021 |
MemXCT: Memory-centric X-ray CT reconstruction with massive parallelization M Hidayetoğlu, T Biçer, SG De Gonzalo, B Ren, D Gürsoy, R Kettimuthu, ... Proceedings of the International Conference for High Performance Computing …, 2019 | 43 | 2019 |
Real-time data analysis and autonomous steering of synchrotron light source experiments T Bicer, D Gursoy, R Kettimuthu, IT Foster, B Ren, V De Andrade, ... 2017 IEEE 13th International Conference on e-Science (e-Science), 59-68, 2017 | 41 | 2017 |
Efficient and Simplified Parallel Graph Processing over CPU and MIC L Chen, X Huo, B Ren, S Jain, G Agrawal International Parallel & Distributed Processing Symposium 2015, 2015 | 38 | 2015 |
ATMem: Adaptive data placement in graph applications on heterogeneous memories Y Chen, IB Peng, Z Peng, X Liu, B Ren Proceedings of the 18th ACM/IEEE International Symposium on Code Generation …, 2020 | 37 | 2020 |
Efficient execution of recursive programs on commodity vector hardware B Ren, Y Jo, S Krishnamoorthy, K Agrawal, M Kulkarni Proceedings of the 36th ACM SIGPLAN Conference on Programming Language …, 2015 | 34 | 2015 |
26ms inference time for ResNet-50: Towards real-time execution of all DNNs on smartphone W Niu, X Ma, Y Wang, B Ren arXiv preprint arXiv:1905.00571, 2019 | 33 | 2019 |
MicroSpec: Speculation-centric fine-grained parallelization for FSM computations J Qiu, Z Zhao, B Ren Proceedings of the 2016 International Conference on Parallel Architectures …, 2016 | 33 | 2016 |
Petascale XCT: 3D image reconstruction with hierarchical communications on multi-GPU nodes M Hidayetoğlu, T Bicer, SG De Gonzalo, B Ren, V De Andrade, D Gursoy, ... SC20: International Conference for High Performance Computing, Networking …, 2020 | 31 | 2020 |
NPAS: A compiler-aware framework of unified network pruning and architecture search for beyond real-time mobile acceleration Z Li, G Yuan, W Niu, P Zhao, Y Li, Y Cai, X Shen, Z Zhan, Z Kong, Q Jin, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 29 | 2021 |
COMET: A domain-specific compilation of high-performance computational chemistry E Mutlu, R Tian, B Ren, S Krishnamoorthy, R Gioiosa, J Pienaar, G Kestor International Workshop on Languages and Compilers for Parallel Computing, 87-103, 2020 | 29 | 2020 |