Micro-Architectural features as soft-error markers in embedded safety-critical systems: preliminary study

D Kasap, A Carpegna, A Savino… - 2023 IEEE European …, 2023 - ieeexplore.ieee.org
Radiation-induced soft errors are one of the most challenging issues in Safety Critical Real-
Time Embedded System (SACRES) reliability, usually handled using different flavors of …

Characterization of the impact of soft errors on iterative methods

BO Mutlu, G Kestor, J Manzano, O Unsal… - 2018 IEEE 25th …, 2018 - ieeexplore.ieee.org
Soft errors caused by transient bit flips have the potential to significantly impact an
application's behavior. This has motivated the design of an array of techniques to detect …

Towards end-to-end SDC detection for HPC applications equipped with lossy compression

S Li, S Di, K Zhao, X Liang, Z Chen… - … Conference on Cluster …, 2020 - ieeexplore.ieee.org
Data reduction techniques have been widely demanded and used by large-scale high
performance computing (HPC) applications because of vast volumes of data to be produced …

Ground-truth prediction to accelerate soft-error impact analysis for iterative methods

BO Mutlu, G Kestor, A Cristal, O Unsal… - 2019 IEEE 26th …, 2019 - ieeexplore.ieee.org
Understanding the impact of soft errors on applications can be expensive. Often, it requires
an extensive error injection campaign involving numerous runs of the full application in the …

Error resilience of three GMRES implementations under fault injection

JA Moríñigo, A Bustos, R Mayo-García - The Journal of Supercomputing, 2022 - Springer
The resilience behavior of three GMRES prototyped implementations (with Incomplete LU,
Flexible and randomized-SVD-based preconditioners) has been analyzed with a soft …

Detection and correction of silent errors in the conjugate gradient algorithm

G Meurant - Numerical Algorithms, 2023 - Springer
We propose a new way to detect and correct silent errors in the conjugate gradient
algorithm. The detection criterion is simple, cheap to implement, and can be used at each …

Work-in-Progress: Accuracy-Area Efficient Online Fault Detection for Robust Neural Network Software-Embedded Microcontrollers

J Chang, S Oh, D Park - 2022 International Conference on …, 2022 - ieeexplore.ieee.org
Detecting transient faults in safety-critical neural network (NN) applications operated on
embedded systems has become a concern, but it is challenging to achieve high accuracy …

FPDetect: Efficient Reasoning About Stencil Programs Using Selective Direct Evaluation

A Das, S Krishnamoorthy, I Briggs… - ACM Transactions on …, 2020 - dl.acm.org
We present FPDetect, a low-overhead approach for detecting logical errors and soft errors
affecting stencil computations without generating false positives. We develop an offline …

An extensive study on iterative solver resilience: characterization, detection and prediction

BO Mutlu - 2019 - upcommons.upc.edu
Soft errors caused by transient bit flips have the potential to significantly impact an
application's behavior. This has motivated the design of an array of techniques to detect …

[BOOK][B] Designing Efficient and Resilient Lossy Compressors for Large-Scale Scientific Computing

S Li - 2020 - search.proquest.com
Extremely large scale scientific simulation applications have been very important in many
scientific domains including cosmology, climate, fluid dynamics, chemistry and so on. It has …