F Rosa, L Ost, R Reis, S Davidmann… - 2017 15th IEEE …, 2017 - ieeexplore.ieee.org
Reliability is rapidly emerging as a major design metric in both embedded and high performance computing (HPC) domains. Such systems are integrating modern multicore …
N Soundararajan, A Sivasubramaniam… - ACM SIGMETRICS …, 2010 - dl.acm.org
Multicores have become the platform of choice across all market segments. Cost-effective protection against soft errors is important in these environments, due to the need to move to …
Software engineers are using different compilers and parallel programming models (eg, Pthreads, OpenMP) to take the best performance offered by multicore systems. Both …
FR Da Rosa, R Reis, L Ost - 2018 IEEE 9th Latin American …, 2018 - ieeexplore.ieee.org
Increasing chip power densities allied to the continuous technology shrink are making emerging multiprocessor embedded systems more vulnerable to radiation-induced transient …
Software reliability is an essential design metric in emerging large-scale multiprocessor embedded systems. Designers should identify soft error susceptibility of multiple …
This paper presents an in-depth characterization of the resiliency of more than 5 million HPC application runs completed during the first 518 production days of Blue Waters, a 13.1 …
Novel computing architectures offer the possibility to execute float point operations with different precisions. The execution of reduced precision operations, when acceptable for …
S Kundu, O Khan - 2016 29th International Conference on VLSI …, 2016 - ieeexplore.ieee.org
With increasing density of power, traditional frequency scaling of processors came to an end. The power wall forced the industry to seek performance from parallel processing …
This book describes the benefits and drawbacks inherent in the use of virtual platforms (VPs) to perform fast and early soft error assessment of multicore systems. The authors show that …