DFT exchange: sharing perspectives on the workhorse of quantum chemistry and materials science

AM Teale, T Helgaker, A Savin, C Adamo… - Physical chemistry …, 2022 - pubs.rsc.org
In this paper, the history, present status, and future of density-functional theory (DFT) is
informally reviewed and discussed by 70 workers in the field, including molecular scientists …

[HTML][HTML] Toward exascale resilience: 2014 update

F Cappello, G Al, W Gropp, S Kale, B Kramer… - … and Innovations: an …, 2014 - dl.acm.org
Resilience is a major roadblock for HPC executions on future exascale systems. These
systems will typically gather millions of CPU cores running up to a billion threads …

Fast and accurate online video object segmentation via tracking parts

J Cheng, YH Tsai, WC Hung… - Proceedings of the …, 2018 - openaccess.thecvf.com
Online video object segmentation is a challenging task as it entails to process the image
sequence timely and accurately. To segment a target object through the video, numerous …

Supervision-by-registration: An unsupervised approach to improve the precision of facial landmark detectors

X Dong, SI Yu, X Weng, SE Wei… - Proceedings of the …, 2018 - openaccess.thecvf.com
In this paper, we present supervision-by-registration, an unsupervised approach to improve
the precision of facial landmark detectors on both images and video. Our key observation is …

FT-CNN: Algorithm-based fault tolerance for convolutional neural networks

K Zhao, S Di, S Li, X Liang, Y Zhai… - … on Parallel and …, 2020 - ieeexplore.ieee.org
Convolutional neural networks (CNNs) are becoming more and more important for solving
challenging and critical problems in many fields. CNN inference applications have been …

BinFI an efficient fault injector for safety-critical machine learning systems

Z Chen, G Li, K Pattabiraman… - Proceedings of the …, 2019 - dl.acm.org
As machine learning (ML) becomes pervasive in high performance computing, ML has
found its way into safety-critical domains (eg, autonomous vehicles). Thus the reliability of …

On the diversity of cluster workloads and its impact on research results

G Amvrosiadis, JW Park, GR Ganger… - 2018 USENIX Annual …, 2018 - usenix.org
Six years ago, Google released an invaluable set of scheduler logs which has already been
used in more than 450 publications. We find that the scarcity of other data sources, however …

waLBerla: A block-structured high-performance framework for multiphysics simulations

M Bauer, S Eibl, C Godenschwager, N Kohl… - … & Mathematics with …, 2021 - Elsevier
Programming current supercomputers efficiently is a challenging task. Multiple levels of
parallelism on the core, on the compute node, and between nodes need to be exploited to …

Understanding GPU errors on large-scale HPC systems and the implications for system design and operation

D Tiwari, S Gupta, J Rogers, D Maxwell… - 2015 IEEE 21st …, 2015 - ieeexplore.ieee.org
Increase in graphics hardware performance and improvements in programmability has
enabled GPUs to evolve from a graphics-specific accelerator to a general-purpose …

A low-cost fault corrector for deep neural networks through range restriction

Z Chen, G Li, K Pattabiraman - 2021 51st Annual IEEE/IFIP …, 2021 - ieeexplore.ieee.org
The adoption of deep neural networks (DNNs) in safety-critical domains has engendered
serious reliability concerns. A prominent example is hardware transient faults that are …