M Perdacher, C Plant, C Böhm - 2020 IEEE International …, 2020 - ieeexplore.ieee.org
The LU decomposition is an essential element used in many linear algebra applications. Furthermore, it is used in LINPACK to benchmark the performance of modern multi-core …
Hiding communication behind useful computation is an important performance programming technique but remains an inscrutable programming exercise even for the expert. We present …
Charm++ is a parallel programming system that evolved over the past 20 years to become a well-established system for programming parallel science and engineering applications, in …
T Nguyen, SB Baden - 2015 IEEE International Conference on …, 2015 - ieeexplore.ieee.org
Lookahead is a well-known technique for masking communication in matrix factorization, but at the cost of complicating application software. We present a new approach, based on …
АА Пазников, МГ Курносов… - Проблемы …, 2015 - cyberleninka.ru
В работе рассматривается задача отображения параллельных MPI-программ на иерархические кластерные вычислительные системы (ВС). Требуется по заданному …
The decomposition of a dense matrix into lower and upper triangular matrices is an important linear algebra kernel that used in scientific and engineering applications. To …
Current implementations of process groups (subcommunicators) have non-scalable (O (group size)) memory footprints and even worse time complexities for setting up …
KR Islam, S Tabassum, T Adhikary… - 2016 5th International …, 2016 - ieeexplore.ieee.org
Energy scarcity at homes is becoming a critical issue due to exponential growth of energy consumption by numerous smart home appliances. Renewable energy sources help to …
Today's microprocessors consist of multiple cores each of which can perform multiple additions, multiplications, or other operations simultaneously in one clock cycle. In shared …