Survey on Redundancy Based-Fault tolerance methods for Processors and Hardware accelerators-Trends in Quantum Computing, Heterogeneous Systems and …

S Venkatesha, R Parthasarathi - ACM Computing Surveys, 2024 - dl.acm.org
Rapid progress in the CMOS technology for the past 25 years has increased the
vulnerability of processors towards faults. Subsequently, focus of computer architects shifted …

Regional soft error vulnerability and error propagation analysis for GPGPU applications

I Öz, ÖF Karadaş - The Journal of Supercomputing, 2022 - Springer
The wide use of GPUs for general-purpose computations as well as graphics programs
makes soft errors a critical concern. Evaluating the soft error vulnerability of GPGPU …

Efficient thread‐to‐core mapping alternatives for application‐level redundant multithreading

S Arslan, O Ünsal - Concurrency and Computation: Practice …, 2023 - Wiley Online Library
Redundant multithreading (RMT) is an effective thread‐level replication method to improve
the reliability requirements of applications. Although it significantly improves the robustness …

Transient fault tolerance on multicore processor in amp mode

J Li, Y Wang - … Conference on Dependable Systems and Their …, 2021 - ieeexplore.ieee.org
Multicore processors are expected to play a key role in the future of safety critical embedded
systems. Besides of performance, energy consumption and cost, reliability of multicore …

GCFI: A High Accurate Compiler-based Fault Injection for Transient Hardware Faults

HAH Ahmad, Y Sedaghat - 2022 CPSSI 4th International …, 2022 - ieeexplore.ieee.org
Recently, with increasing system complexity and advanced technology scaling, there is a
severe need for accurate fault injection (FI) techniques in the reliability evaluation of safety …

TCC: GPGPU Architecture for Instruction Decoder and Control Flow Error Detection

KK Raghunandana, YP KR… - … Symposium on Design …, 2024 - ieeexplore.ieee.org
The devices fabricated with the latest sub-nanometer technology node have a higher
probability of parametric and wear-out failures, operational faults, and manufacturing …

Systems and Debugging Supports for Hardware Designs

J Ma - 2024 - deepblue.lib.umich.edu
The development and deployment of hardware and software have traditionally been quite
distinct. Software benefits from an agile development cycle, aided by a wide array of …

Enhancing River Monitoring Embedded System using Time Redundancy Fault Tolerance to Resolve Transient Sensor Fault

MHH Ichsan, MAJ Habibie… - Journal of Information …, 2023 - jitecs.ub.ac.id
Reading data from sensors in the context of the Internet of Things (IoT) is one of the main
parameters of a high-reliability system. Data reading by sensors is prone to errors due to …