Understanding GPU errors on large-scale HPC systems and the implications for system design and operation D Tiwari, S Gupta, J Rogers, D Maxwell, P Rech, S Vazhkudai, D Oliveira, ... 2015 IEEE 21st International Symposium on High Performance Computer …, 2015 | 187 | 2015 |
Evaluation and mitigation of radiation-induced soft errors in graphics processing units DAG de Oliveira, LL Pilla, T Santini, P Rech IEEE Transactions on Computers 65 (3), 791-804, 2016 | 103 | 2016 |
Experimental and analytical study of xeon phi reliability D Oliveira, L Pilla, N DeBardeleben, S Blanchard, H Quinn, I Koren, ... Proceedings of the International Conference for High Performance Computing …, 2017 | 61 | 2017 |
Modern GPUs radiation sensitivity evaluation and mitigation through duplication with comparison DAG Oliveira, P Rech, HM Quinn, TD Fairbanks, L Monroe, SE Michalak, ... IEEE Transactions on Nuclear Science 61 (6), 3115-3122, 2014 | 46 | 2014 |
Reliability evaluation of mixed-precision architectures FF dos Santos, C Lunardi, D Oliveira, F Libano, P Rech 2019 IEEE International Symposium on High Performance Computer Architecture …, 2019 | 42 | 2019 |
Code-dependent and architecture-dependent reliability behaviors V Fratin, D Oliveira, C Lunardi, F Santos, G Rodrigues, P Rech 2018 48th Annual IEEE/IFIP International Conference on Dependable Systems …, 2018 | 42 | 2018 |
Radiation-induced error criticality in modern HPC parallel accelerators DAG De Oliveira, LL Pilla, M Hanzich, V Fratin, F Fernandes, C Lunardi, ... 2017 IEEE International Symposium on High Performance Computer Architecture …, 2017 | 39 | 2017 |
GPGPUs ECC efficiency and efficacy DAG Oliveira, P Rech, LL Pilla, POA Navaux, L Carro 2014 IEEE International Symposium on Defect and Fault Tolerance in VLSI and …, 2014 | 32 | 2014 |
Time-to-solution and energy-to-solution: a comparison between arm and xeon EL Padoin, DAG de Oliveira, P Velho, POA Navaux 2012 Third Workshop on Applications for Multi-Core Architecture, 48-53, 2012 | 29 | 2012 |
Carol-fi: an efficient fault-injection tool for vulnerability evaluation of modern hpc parallel accelerators D Oliveira, V Frattin, P Navaux, I Koren, P Rech Proceedings of the Computing Frontiers Conference, 295-298, 2017 | 28 | 2017 |
Evaluating performance and energy on arm-based clusters for high performance computing EL Padoin, DAG de Oliveira, P Velho, POA Navaux 2012 41st International Conference on Parallel Processing Workshops, 165-172, 2012 | 24 | 2012 |
High-energy versus thermal neutron contribution to processor and memory error rates D Oliveira, FF dos Santos, GP Dávila, C Cazzaniga, C Frost, ... IEEE Transactions on Nuclear Science 67 (6), 1161-1168, 2020 | 19 | 2020 |
Scalability and Energy Efficiency of HPC cluster with ARM MPSoC EL Padoin, DAG de Oliveira, P Velho, POA Navaux, B Videau, ... Proc. of 11th Workshop on Parallel and Distributed Processing, 2013 | 16 | 2013 |
Qufi: a quantum fault injector to measure the reliability of qubits and quantum circuits D Oliveira, E Giusto, E Dri, N Casciola, B Baheri, Q Guan, B Montrucchio, ... 2022 52nd Annual IEEE/IFIP International Conference on Dependable Systems …, 2022 | 11 | 2022 |
Radiation sensitivity of high performance computing applications on kepler-based GPGPUs DAG Oliveira, CB Lunardi, LL Pilla, P Rech, POA Navaux, L Carro Dependable Systems and Networks (DSN), 2014 44th Annual IEEE/IFIP …, 2014 | 11 | 2014 |
A systematic methodology to compute the quantum vulnerability factors for quantum circuits D Oliveiray, E Giusto, B Baheriz, Q Guanz, B Montrucchio, P Rechx IEEE Transactions on Dependable and Secure Computing, 2023 | 9 | 2023 |
Thermal neutrons: a possible threat for supercomputers and safety critical applications D Oliveira, S Blanchard, N Debardeleben, FF Dos Santos, GP Dávila, ... 2020 IEEE European Test Symposium (ETS), 1-6, 2020 | 9 | 2020 |
Thermal neutrons: a possible threat for supercomputer reliability D Oliveira, S Blanchard, N DeBardeleben, F Fernandes dos Santos, ... The Journal of Supercomputing 77, 1612-1634, 2021 | 8 | 2021 |
Increasing the efficiency and efficacy of selective-hardening for parallel applications D Oliveira, P Navaux, P Rech 2019 IEEE International Symposium on Defect and Fault Tolerance in VLSI and …, 2019 | 7 | 2019 |
Experimental and analytical analysis of sorting algorithms error criticality for HPC and large servers applications C Lunardi, H Quinn, L Monroe, D Oliveira, P Navaux, P Rech IEEE Transactions on Nuclear Science 64 (8), 2169-2178, 2017 | 7 | 2017 |