Accommodating transformer onto fpga: Coupling the balanced model compression and fpga-implementation optimization P Qi, Y Song, H Peng, S Huang, Q Zhuge, EHM Sha Proceedings of the 2021 on Great Lakes Symposium on VLSI (GLSVLSI), 163-168, 2021 | 52 | 2021 |
Accelerating framework of transformer by hardware design and model compression co-optimization P Qi, EHM Sha, Q Zhuge, H Peng, S Huang, Z Kong, Y Song, B Li 2021 IEEE/ACM International Conference On Computer Aided Design (ICCAD), 1-9, 2021 | 49 | 2021 |
Dancing along Battery: Enabling Transformer with Run-time Reconfigurability on Mobile Devices Y Song, W Jiang, B Li, P Qi, Q Zhuge, EHM Sha, S Dasgupta, Y Shi, ... Proceedings of the 58th Annual Design Automation Conference (DAC) 2021, 2021 | 22 | 2021 |
Loop interchange and tiling for multi-dimensional loops to minimize write operations on NVMs R Xu, EHM Sha, Q Zhuge, Y Song, H Wang Journal of Systems Architecture 135, 102799, 2023 | 6 | 2023 |
Optimizing data placement for hybrid sram+ racetrack memory spm in embedded systems R Xu, EHM Sha, Q Zhuge, Y Song, H Wang, L Shi IEEE Transactions on Computer-Aided Design of Integrated Circuits and …, 2022 | 6 | 2022 |
BSC: Block-based Stochastic Computing to Enable Accurate and Efficient Tinyml Y Song, EHM Sha, Q Zhuge, R Xu, Y Zhang, B Li, L Yang 2022 27th Asia and South Pacific Design Automation Conference (ASP-DAC), 314-319, 2022 | 5 | 2022 |
Optimizing Efficiency of Machine Learning Based Hard Disk Failure Prediction by Two-Layer Classification-Based Feature Selection H Wang, Q Zhuge, EHM Sha, R Xu, Y Song Applied Sciences 13 (13), 7544, 2023 | 3 | 2023 |
Optimal loop tiling for minimizing write operations on nvms with complete memory latency hiding R Xu, EHM Sha, Q Zhuge, Y Song, J Lin 2022 27th Asia and South Pacific Design Automation Conference (ASP-DAC), 389-394, 2022 | 3 | 2022 |
Hardware-aware Neural Architecture Search for Stochastic Computing-based Neural Networks on Tiny Devices Y Song, EHM Sha, Q Zhuge, R Xu, X Xu, B Li, L Yang Journal of Systems Architecture 135, 102810, 2023 | 2 | 2023 |
Mera: Memory Reduction and Acceleration for Quantum Circuit Simulation via Redundancy Exploration Y Song, EHM Sha, L Xu, Q Zhuge, Z Shao arXiv preprint arXiv:2411.15332, 2024 | | 2024 |
MuDP: Multi-Granularity Data Placement for Uniform Loops on Spm-Dram Architectures to Minimize Latency Y Du, EHM Sha, Y Song, Y Guo, L Xu, Q Zhuge Frontiers of Computer Science (FCS) 19 (195107), 2024 | | 2024 |
Parallel Block-based Stochastic Computing with Adapted Quantization Y ZHANG, Q ZHUGE, EHM SHA, Y SONG Journal of East China Normal University (Natural Science) 2024 (2), 76, 2024 | | 2024 |
QuanPath: Achieving One-Step Communication for Distributed Quantum Circuit Simulation Y Song, EHM Sha, Q Zhuge, W Xiao, Q Dai, L Xu Quantum Information Processing 23 (1), 1, 2023 | | 2023 |
Efficient Algorithm for Full-state Quantum Circuit Simulation with DD Compression while maintaining Accuracy Y Song, EHM Sha, Q Zhuge, R Xu, H Wang Quantum Information Processing 22 (11), 413, 2023 | | 2023 |
RR-SC: Run-Time Reconfigurable Framework for Stochastic Computing-Based Neural Networks on Edge Devices YSEHMSQZRXH Wang Journal of Computer Research and Development 61 (4), 840-855, 2023 | | 2023 |
Pseudo-Log: Restore Global Data Facing Power Failures with Minimum NVM Write E Sha, Y Liao, Q Zhuge, R Xu, Y Song, J Liu 2022 IEEE 24th Int Conf on High Performance Computing & Communications; 8th …, 2022 | | 2022 |
Efficient Checkpoint under Unstable Power Supplies on NVM based Devices J Liu, E Sha, Q Zhuge, R Xu, Y Song 2022 IEEE 24th Int Conf on High Performance Computing & Communications; 8th …, 2022 | | 2022 |