Survey on Redundancy Based-Fault tolerance methods for Processors and Hardware accelerators-Trends in Quantum Computing, Heterogeneous Systems and …

S Venkatesha, R Parthasarathi - ACM Computing Surveys, 2024 - dl.acm.org
Rapid progress in the CMOS technology for the past 25 years has increased the
vulnerability of processors towards faults. Subsequently, focus of computer architects shifted …

Silent data corruptions: Microarchitectural perspectives

G Papadimitriou, D Gizopoulos - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Today more than ever before, academia, manufacturers, and hyperscalers acknowledge the
major challenge of silent data corruptions (SDCs) and aim on solutions to minimize its …

Silent data errors: Sources, detection, and modeling

A Singh, S Chakravarty, G Papadimitriou… - 2023 IEEE 41st VLSI …, 2023 - ieeexplore.ieee.org
Chip manufacturers and hyperscalers are becoming increasingly aware of the problem
posed by Silent Data Errors (SDE) and are taking steps to address it. Major computing …

Gem5-marvel: Microarchitecture-level resilience analysis of heterogeneous soc architectures

O Chatzopoulos, G Papadimitriou… - … Symposium on High …, 2024 - ieeexplore.ieee.org
In this paper, we present gem5-MARVEL, the first consolidated microarchitecture-level fault
injection infrastructure for heterogeneous System-on-Chip architectures comprising CPUs of …

Silent data corruptions: The stealthy saboteurs of digital integrity

G Papadimitriou, D Gizopoulos… - 2023 IEEE 29th …, 2023 - ieeexplore.ieee.org
Silent Data Corruptions (SDCs) pose a significant threat to the integrity of digital systems.
These stealthy saboteurs silently corrupt data, remaining undetected by traditional error …

Characterizing and Improving Resilience of Accelerators to Memory Errors in Autonomous Robots

D Shah, ZY Xue, K Pattabiraman… - ACM Transactions on …, 2023 - dl.acm.org
Motion planning is a computationally intensive and well-studied problem in autonomous
robots. However, motion planning hardware accelerators (MPA) must be soft-error resilient …

Estimating the failures and silent errors rates of cpus across isas and microarchitectures

D Gizopoulos, G Papadimitriou… - … IEEE International Test …, 2023 - ieeexplore.ieee.org
Silent data corruptions (SDCs) pose a significant challenge to the reliable operation of
modern microprocessors. As the need for enhanced performance and reliability continues to …

EUROPULS: NEUROmorphic energy-efficient secure accelerators based on Phase change materials aUgmented siLicon photonicS

F Pavanello, C Marchand, I O'Connor… - 2023 IEEE European …, 2023 - ieeexplore.ieee.org
This special session paper introduces the Horizon Europe NEUROPULS project, which
targets the development of secure and energy-efficient RISC-V interfaced neuromorphic …

Characterizing Soft-Error Resiliency in Arm's Ethos-U55 Embedded Machine Learning Accelerator

A Tyagi, R Jeyapaul, C Zhou… - … Analysis of Systems …, 2024 - ieeexplore.ieee.org
As Neural Processing Units (NPU) or accelerators are increasingly deployed in a variety of
applications including safety critical applications such as autonomous vehicle, and medical …

FastFlip: Compositional Error Injection Analysis

K Joshi, R Singh, T Bassetto, S Adve, D Marinov… - arXiv preprint arXiv …, 2024 - arxiv.org
Instruction-level error injection analyses aim to find instructions where errors often lead to
unacceptable outcomes like Silent Data Corruptions (SDCs). These analyses require …