Massively parallel first-principles simulation of electron dynamics in materials

EW Draeger, X Andrade, JA Gunnels, A Bhatele… - Journal of Parallel and …, 2017 - Elsevier
We present a highly scalable, parallel implementation of first-principles electron dynamics
coupled with molecular dynamics (MD). By using optimized kernels, network topology aware …

Combing the communication hairball: Visualizing parallel execution traces using logical time

KE Isaacs, PT Bremer, I Jusufi… - IEEE transactions on …, 2014 - ieeexplore.ieee.org
With the continuous rise in complexity of modern supercomputers, optimizing the
performance of large-scale parallel programs is becoming increasingly challenging …

Identifying the culprits behind network congestion

A Bhatele, AR Titus, JJ Thiagarajan… - 2015 IEEE …, 2015 - ieeexplore.ieee.org
Network congestion is one of the primary causes of performance degradation, performance
variability and poor scaling in communication-heavy parallel applications. However, the …

Fmi: Fast and cheap message passing for serverless functions

M Copik, R Böhringer, A Calotoiu… - Proceedings of the 37th …, 2023 - dl.acm.org
Serverless functions provide elastic scaling and a fine-grained billing model, making
Function-as-a-Service (FaaS) an attractive programming model. However, for distributed …

Predicting application performance using supervised learning on communication features

N Jain, A Bhatele, MP Robson, T Gamblin… - Proceedings of the …, 2013 - dl.acm.org
Task mapping on torus networks has traditionally focused on either reducing the maximum
dilation or average number of hops per byte for messages in an application. These metrics …

Quantum dynamics simulation of electrons in materials on high-performance computers

A Schleife, EW Draeger, VM Anisimov… - … in Science & …, 2014 - ieeexplore.ieee.org
Advancement in high-performance computing allows us to calculate properties of
increasingly complex materials with unprecedented accuracy. At the same time, to take full …

Reducing communication in algebraic multigrid with multi-step node aware communication

A Bienz, WD Gropp, LN Olson - The International Journal of …, 2020 - journals.sagepub.com
Algebraic multigrid (AMG) is often viewed as a scalable O (n) solver for sparse linear
systems. Yet, AMG lacks parallel scalability due to increasingly large costs associated with …

Evaluation of an interference-free node allocation policy on fat-tree clusters

SD Pollard, N Jain, S Herbein… - … Conference for High …, 2018 - ieeexplore.ieee.org
Interference between jobs competing for network bandwidth on a fat-tree cluster can cause
significant variability and degradation in performance. These performance issues can be …

Performance optimality or reproducibility: that is the question

T Patki, JJ Thiagarajan, A Ayala, TZ Islam - Proceedings of the …, 2019 - dl.acm.org
The era of extremely heterogeneous supercomputing brings with itself the devil of increased
performance variation and reduced reproducibility. There is a lack of understanding in the …

Fast and high quality topology-aware task mapping

M Deveci, K Kaya, B Uçar… - 2015 IEEE International …, 2015 - ieeexplore.ieee.org
Considering the large number of processors and the size of the interconnection networks on
exactable-capable supercomputers, mapping concurrently executable and communicating …