Error injection-based study of soft error propagation in AMD Bulldozer microprocessor module

C Constantinescu, M Butler… - IEEE/IFIP International …, 2012 - ieeexplore.ieee.org
Single-event upsets (SEU) and single-event transients (SET) may lead to crashes or even
silent data corruption (SDC) in microprocessors. Error detection and recovery features are …

Detecting performance degradation in cloud systems using LSTM autoencoders

S Chouliaras, S Sotiriadis - International Conference on Advanced …, 2021 - Springer
Cloud computing technology is on the rise as it provides an easy to scale environment for
Internet users in terms of computational resources. At the same time, cloud providers …

Error detector placement for soft computing applications

A Thomas, K Pattabiraman - ACM Transactions on Embedded …, 2016 - dl.acm.org
The scaling of Silicon devices has exacerbated the unreliability of modern computer
systems, and power constraints have necessitated the involvement of software in hardware …

One bit is (not) enough: An empirical study of the impact of single and multiple bit-flip errors

B Sangchoolie, K Pattabiraman… - 2017 47th annual IEEE …, 2017 - ieeexplore.ieee.org
Recent studies have shown that technology and voltage scaling are expected to increase
the likelihood that particle-induced soft errors manifest as multiple-bit errors. This raises …

Accelerated online error detection in many-core microprocessor architectures

M Kaliorakis, M Psarakis, N Foutris… - 2014 IEEE 32nd VLSI …, 2014 - ieeexplore.ieee.org
Forthcoming many-core processors are expected to be highly unreliable due to their high
design complexity and aggressive manufacturing technology scaling. Online functional …

nZDC: A compiler technique for near zero silent data corruption

M Didehban, A Shrivastava - Proceedings of the 53rd Annual Design …, 2016 - dl.acm.org
Exponentially growing rate of soft errors makes reliability a major concern in modern
processor design. Since software-oriented approaches offer flexible protection even in off …

Selective duplication and selective comparison for data flow error detection

VB Thati, J Vankeirsbilck, J Boydens… - 2019 4th International …, 2019 - ieeexplore.ieee.org
Embedded systems' hardware can be impacted by soft errors, which can cause data flow
errors in the systems' software. In this paper, we present a novel software-based approach to …

Characterization of error-tolerant applications when protecting control data

DD Thaker, D Franklin, J Oliver… - 2006 IEEE …, 2006 - ieeexplore.ieee.org
Soft errors have become a significant concern and recent studies have measured the"
architectural vulnerability factor" of systems to such errors, or conversely, the potential that a …

Sampler: Pmu-based sampling to detect memory errors latent in production software

S Silvestro, H Liu, T Zhang, C Jung… - 2018 51st Annual …, 2018 - ieeexplore.ieee.org
Deployed software is still faced with numerous in-production memory errors. They can
significantly affect system reliability and security, causing application crashes, erratic …

Modeling input-dependent error propagation in programs

G Li, K Pattabiraman - 2018 48th Annual IEEE/IFIP …, 2018 - ieeexplore.ieee.org
Transient hardware faults are increasing in computer systems due to shrinking feature sizes.
Traditional methods to mitigate such faults are through hardware duplication, which incurs …