Conformer: Convolution-augmented transformer for speech recognition A Gulati, J Qin, CC Chiu, N Parmar, Y Zhang, J Yu, W Han, S Wang, ... arXiv preprint arXiv:2005.08100, 2020 | 2918 | 2020 |
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023 | 1204 | 2023 |
An attention-based spatiotemporal LSTM network for next POI recommendation L Huang, Y Ma, S Wang, Y Liu IEEE Transactions on Services Computing 14 (6), 1585-1597, 2019 | 169 | 2019 |
BigSSL: Exploring the frontier of large-scale semi-supervised learning for automatic speech recognition Y Zhang, DS Park, W Han, J Qin, A Gulati, J Shor, A Jansen, Y Xu, ... IEEE Journal of Selected Topics in Signal Processing 16 (6), 1519-1532, 2022 | 168 | 2022 |
BFloat16: The secret to high performance on Cloud TPUs S Wang, P Kanwar Google Cloud Blog 4 (1), 2019 | 134 | 2019 |
Making memristive neural network accelerators reliable B Feinberg, S Wang, E Ipek 2018 IEEE International Symposium on High Performance Computer Architecture …, 2018 | 134 | 2018 |
GSPMD: general and scalable parallelization for ML computation graphs Y Xu, HJ Lee, D Chen, B Hechtman, Y Huang, R Joshi, M Krikun, ... arXiv preprint arXiv:2105.04663, 2021 | 107 | 2021 |
Enabling scientific computing on memristive accelerators B Feinberg, UKR Vengalam, N Whitehair, S Wang, E Ipek 2018 ACM/IEEE 45th Annual International Symposium on Computer Architecture …, 2018 | 100 | 2018 |
Overlap communication with dependent computation via decomposition in large deep learning models S Wang, J Wei, A Sabne, A Davis, B Ilbeyi, B Hechtman, D Chen, ... Proceedings of the 28th ACM International Conference on Architectural …, 2022 | 41 | 2022 |
Scale MLPerf-0.6 models on Google TPU-v3 pods S Kumar, V Bitorff, D Chen, C Chou, B Hechtman, HJ Lee, N Kumar, ... arXiv preprint arXiv:1909.09756, 2019 | 40 | 2019 |
GPUGuard: Mitigating contention based side and covert channel attacks on GPUs Q Xu, H Naghibijouybari, S Wang, N Abu-Ghazaleh, M Annavaram Proceedings of the ACM International Conference on Supercomputing, 497-509, 2019 | 40 | 2019 |
Reducing data movement energy via online data clustering and encoding S Wang, E Ipek International Symposium on Microarchitecture (MICRO), 2016 | 37 | 2016 |
Automatic cross-replica sharding of weight update in data-parallel training Y Xu, HJ Lee, D Chen, H Choi, B Hechtman, S Wang arXiv preprint arXiv:2004.13336, 2020 | 31 | 2020 |
Development and characterization of a novel air-breathing micro direct methanol fuel cell stack for portable applications X Liu, B Zhang, Y Zhang, H He, J Li, S Wang, Z Yuan, H Deng Journal of Micromechanics and Microengineering 20, 2010 | 20 | 2010 |
Effect of design and operating parameters on dynamic response of a micro direct methanol fuel cell Y Zhang, H He, Z Yuan, S Wang, X Liu International Journal of Hydrogen Energy 36 (3), 2230-2236, 2011 | 19 | 2011 |
Exploring the limits of Concurrency in ML Training on Google TPUs S Kumar, Y Wang, C Young, J Bradbury, N Kumar, D Chen, A Swing Proceedings of Machine Learning and Systems 3, 81-92, 2021 | 18 | 2021 |
Content aware refresh: Exploiting the asymmetry of DRAM retention errors to reduce the refresh frequency of less vulnerable data S Wang, MN Bojnordi, X Guo, E Ipek IEEE Transactions on Computers 68 (3), 362-374, 2018 | 18 | 2018 |
Learning to fuse A Abdolrashidi, Q Xu, S Wang, S Roy, Y Zhou NeurIPS ML for Systems Workshop, 2019 | 8 | 2019 |
Enabling energy efficient Hybrid Memory Cube systems with erasure codes S Wang, Y Song, MN Bojnordi, E Ipek International Symposium on Low Power Electronics and Design (ISLPED), 2015 | 8 | 2015 |
Silicon-based micro direct methanol fuel cell with an N-inputs-N-outputs anode flow pattern Y Zhang, L Wang, Z Yuan, S Wang, J Li, X Liu Chinese Science Bulletin 56 (8), 826-829, 2011 | 8 | 2011 |