Priority research directions for in situ data management: Enabling scientific discovery from diverse data sources

T Peterka, D Bard, JC Bennett… - … Journal of High …, 2020 - journals.sagepub.com
In January 2019, the US Department of Energy, Office of Science program in Advanced
Scientific Computing Research, convened a workshop to identify priority research directions …

Performance and power efficient massive parallel computational model for HPC heterogeneous exascale systems

MU Ashraf, FA Eassa, AA Albeshri, A Algarni - IEEE Access, 2018 - ieeexplore.ieee.org
The emerging high-performance computing Exascale supercomputing system, which is
anticipated to be available in 2020, will unravel many scientific mysteries. This extraordinary …

Resilience design patterns: A structured approach to resilience at extreme scale

S Hukerikar, C Engelmann - arXiv preprint arXiv:1708.07422, 2017 - arxiv.org
Reliability is a serious concern for future extreme-scale high-performance computing (HPC)
systems. While the HPC community has developed various resilience solutions, the solution …

MPI performance engineering with the MPI tool interface: the integration of MVAPICH and TAU

S Ramesh, A Mahéo, S Shende, AD Malony… - Proceedings of the 24th …, 2017 - dl.acm.org
MPI implementations are becoming increasingly complex and highly tunable, and thus
scalability limitations can come from numerous sources. The MPI Tools Interface (MPI_T) …

Systemwide power management with Argo

D Ellsworth, T Patki, S Perarnau, S Seo… - 2016 IEEE …, 2016 - ieeexplore.ieee.org
The Argo project is a DOE initiative for designing a modular operating system/runtime for the
next generation of supercomputers. A key focus area in this project is power management …

Resilience design patterns-a structured approach to resilience at extreme scale (version 1.0)

S Hukerikar, C Engelmann - arXiv preprint arXiv:1611.02717, 2016 - arxiv.org
In this document, we develop a structured approach to the management of HPC resilience
based on the concept of resilience-based design patterns. A design pattern is a general …

[PDF][PDF] Toward exascale computing systems: An energy efficient massive parallel computational model

MU Ashraf, FA Eassa, AA Albeshri… - International Journal of …, 2018 - academia.edu
The emerging Exascale supercomputing system expected till 2020 will unravel many
scientific mysteries. This extreme computing system will achieve a thousand-fold increase in …

[PDF][PDF] Improving Performance In Hpc System Under Power Consumptions Limitations

MU Ashraf, A Arshad, R Aslam - International Journal of Advanced …, 2019 - researchgate.net
Today's High-Performance Computing (HPC) systems require significant usage of"
supercomputers" and extensiveparallel processing approaches for solving complicated …

In situ workflows at exascale: system software to the rescue

M Dreher, S Perarnau, T Peterka, K Iskra… - Proceedings of the In …, 2017 - dl.acm.org
Implementing an in situ workflow involves several challenges related to data placement, task
scheduling, efficient communications, scalability, and reliability. Most of the current …

Architectural principles and experimentation of distributed high performance virtual clusters

AJ Younge - 2016 - search.proquest.com
With the advent of virtualization and Infrastructure-as-a-Service (IaaS), the broader scientific
computing community is considering the use of clouds for their scientific computing needs …