LogAider: A tool for mining potential correlations of HPC log events S Di, R Gupta, M Snir, E Pershey, F Cappello 2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid …, 2017 | 66 | 2017 |
Characterizing and understanding HPC job failures over the 2K-day life of IBM BlueGene/Q system S Di, H Guo, E Pershey, M Snir, F Cappello 2019 49th Annual IEEE/IFIP International Conference on Dependable Systems …, 2019 | 34 | 2019 |
Exploring properties and correlations of fatal events in a large-scale HPC system S Di, H Guo, R Gupta, ER Pershey, M Snir, F Cappello IEEE Transactions on Parallel and Distributed Systems 30 (2), 361-374, 2018 | 30 | 2018 |
Theta and Mira at Argonne National Laboratory MR Fahey, Y Alexeev, B Allcock, BS Allen, R Balakrishnan, A Benali, ... Contemporary High Performance Computing, 31-61, 2019 | 6 | 2019 |
An Approach for Efficient Processing of Machine Operational Data B Lenard, E Pershey, Z Nault, A Rasin International Conference on Database and Expert Systems Applications, 129-146, 2023 | 2 | 2023 |
Argonne Leadership Computing Facility (2020 Operational Assessment Report) S Ramprakash, HS Som, M Fahey, E Shemon, D Martin, K Riley, ... Argonne National Lab. (ANL), Argonne, IL (United States), 2020 | | 2020 |