This paper presents a novel method to enhance the reliability of image classification models during deployment in the face of transient hardware errors. By utilizing enriched text …
J Jia, Y Liu, G Zhang, Y Gao, D Qian - Frontiers of Computer Science, 2023 - Springer
With the scaling up of high-performance computing systems in recent years, their reliability has been descending continuously. Therefore, system resilience has been regarded as one …
Progress in high-performance computing (HPC) systems has led to complex applications that stress the I/O subsystem by creating vast amounts of data. Lossy compression reduces …
R Patton, P Date, S Kulkarni… - 2022 IEEE/ACM …, 2022 - ieeexplore.ieee.org
Neuromorphic computing technology continues to make strides in the development of new algorithms, devices, and materials. In addition, applications have begun to emerge where …
High performance computing systems are required to solve grand challenges in many scientific disciplines. These systems assemble many components to be powerful enough for …
Modern systems at scale are increasingly susceptible to transient hardware errors at current technology sizes from natural phenomena such as high-energy particle strikes (also called …
Due to improvements in high-performance computing (HPC) systems, researchers have created powerful applications capable of solving previously intractable problems. While …