H Cho, CY Cher, T Shepherd, S Mitra - Proceedings of the 52Nd Annual …, 2015 - dl.acm.org
The effects of soft errors in processor cores have been widely studied. However, little has been published about soft errors in uncore components, such as memory subsystem and I/O …
X Xu, HH Huang - 10th Workshop on Hot Topics in System Dependability …, 2014 - usenix.org
Hardware errors are no longer the exceptions in modern cloud data centers. Although virtualization provides software failure isolation across different virtual machines (VM), the …
The continuous scaling of electronic components has led to the development of high- performance microprocessors that are suitable even for safety-critical applications where …
S Kundu, O Khan - 2016 29th International Conference on VLSI …, 2016 - ieeexplore.ieee.org
With increasing density of power, traditional frequency scaling of processors came to an end. The power wall forced the industry to seek performance from parallel processing …
A Shye, J Blomstedt, T Moseley… - … on Dependable and …, 2008 - ieeexplore.ieee.org
Transient faults are emerging as a critical concern in the reliability of general-purpose microprocessors. As architectural trends point toward multicore designs, there is substantial …
The premise behind this thesis is the observation that Operating Systems (OS), being the foundation behind operations of computing systems, are complex entities and also subject to …
Fault tolerance is a key obstacle to next generation extreme-scale systems. As systems scale, the Mean Time To Interrupt (MTTI) decreases proportionally. As a result, extreme …
W Gu, Z Kalbarczyk, RK Iyer - DSN, 2004 - researchgate.net
The goals of this study are:(i) to compare Linux kernel (2.4. 22) behavior under a broad range of errors on two target processors—the Intel Pentium 4 (P4) running RedHat Linux 9.0 …
As microservice and cloud computing operations increasingly adopt automation, the importance of models for fostering resilient and efficient adaptive architectures becomes …