A highly-efficient error detection technique for general matrix multiplication using tiled processing on SIMD architecture CS Mummidi, S Bal, BF Goldstein, S Srinivasan, S Kundu 2022 IEEE 40th International Conference on Computer Design (ICCD), 529-536, 2022 | 3 | 2022 |
A lightweight error-resiliency mechanism for deep neural networks BF Goldstein, VC Ferreira, S Srinivasan, D Das, AS Nery, S Kundu, ... 2021 22nd International Symposium on Quality Electronic Design (ISQED), 311-316, 2021 | 15 | 2021 |
A runtime support mechanism for fast mode switching of a self-morphing core for power efficiency S Srinivasan, N Kurella, I Koren, R Rodrigues, S Kundu Proceedings of the 23rd international conference on Parallel architectures …, 2014 | 3 | 2014 |
A study of BFLOAT16 for deep learning training K Dhiraj, M Dheevatsa, M Naveen, D Dipankar, B Kunal, A Sasikanth, ... arXiv preprint arXiv:1905.12322, 2019 | 7 | 2019 |
A study of BFLOAT16 for deep learning training D Kalamkar, D Mudigere, N Mellempudi, D Das, K Banerjee, S Avancha, ... arXiv preprint arXiv:1905.12322, 2019 | 323 | 2019 |
A study of BFLOAT16 for deep learning training (2019) D Kalamkar, D Mudigere, N Mellempudi, D Das, K Banerjee, S Avancha, ... arXiv preprint arXiv:1905.12322, 1905 | 14 | 1905 |
A Study of BFLOAT16 for Deep Learning Training. arXiv e-prints, art D Kalamkar, D Mudigere, N Mellempudi, D Das, K Banerjee, S Avancha, ... arXiv preprint arXiv:1905.12322 3, 2019 | 2 | 2019 |
A study on polymorphing superscalar processor dynamically to improve power efficiency S Srinivasan, R Rodrigues, A Annamalai, I Koren, S Kundu 2013 IEEE Computer Society Annual Symposium on VLSI (ISVLSI), 46-51, 2013 | 7 | 2013 |
A wavelet-based spatio-temporal heat dissipation model for reordering of program phases to produce temperature extremes in a chip S Srinivasan, KP Ganeshpure, S Kundu IEEE Transactions on Computer-Aided Design of Integrated Circuits and …, 2012 | 7 | 2012 |
Astra-sim: Enabling sw/hw co-design exploration for distributed dl training platforms S Rashidi, S Sridharan, S Srinivasan, T Krishna 2020 IEEE International Symposium on Performance Analysis of Systems and …, 2020 | 50 | 2020 |
Astra-sim2. 0: Modeling hierarchical networks and disaggregated systems for large-model training at scale W Won, T Heo, S Rashidi, S Sridharan, S Srinivasan, T Krishna 2023 IEEE International Symposium on Performance Analysis of Systems and …, 2023 | 22 | 2023 |
Cross-level protection of circuits against faults and malicious attacks V Tomashevich, S Srinivasan, F Foerg, I Polian 2012 IEEE 18th International On-Line Testing Symposium (IOLTS), 150-155, 2012 | 17 | 2012 |
Dynamic Processor Reconfiguration for Power, Performance and Reliability Management S Srinivasan | | 2016 |
Dynamic reconfiguration vs. dvfs: A comparative study on power efficiency of processors S Srinivasan, N Kurella, I Koren, S Kundu 2016 29th International Conference on VLSI Design and 2016 15th …, 2016 | 6 | 2016 |
Efficient communication acceleration for next-gen scale-up deep learning training platforms S Rashidi, S Sridharan, S Srinivasan, M Denton, T Krishna arXiv preprint, 2020 | 1 | 2020 |
Enabling compute-communication overlap in distributed deep learning training platforms S Rashidi, M Denton, S Sridharan, S Srinivasan, A Suresh, J Nie, ... 2021 ACM/IEEE 48th Annual International Symposium on Computer Architecture …, 2021 | 37 | 2021 |
Enabling compute-communication overlap in distributed training platforms S Rashidi, S Sridharan, S Srinivasan, M Denton, A Suresh, J Nie, ... Procedings of 48th International Symposium on Computer Architecture (ISCA), 2021 | | 2021 |
Exploring heterogeneity within a core for improved power efficiency S Srinivasan, N Kurella, I Koren, S Kundu IEEE Transactions on Parallel and Distributed Systems 27 (4), 1057-1069, 2015 | 18 | 2015 |
Exploring multi-dimensional hierarchical network topologies for efficient distributed training of trillion parameter dl models W Won, S Rashidi, S Srinivasan, T Krishna arXiv preprint arXiv:2109.11762, 2021 | 3 | 2021 |
Extending sparse tensor accelerators to support multiple compression formats E Qin, G Jeong, W Won, SC Kao, H Kwon, S Srinivasan, D Das, GE Moon, ... 2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2021 | 16 | 2021 |