Parallel i/o evaluation techniques and emerging hpc workloads: A perspective

S Neuwirth, AK Paul - 2021 IEEE International Conference on …, 2021 - ieeexplore.ieee.org
Emerging workloads such as artificial intelligence, big data analytics and complex multi-step
workflows alongside future exascale applications are anticipated future HPC workloads …

Performance prediction of parallel applications: a systematic literature review

J Flores-Contreras, HA Duran-Limon… - The Journal of …, 2021 - Springer
Different techniques for estimating the execution time of parallel applications have been
studied for the last 25 years. These approaches have proposed different methods for …

Fine-grained powercap allocation for power-constrained systems based on multi-objective machine learning

M Hao, W Zhang, Y Wang, G Lu, F Wang… - … on Parallel and …, 2020 - ieeexplore.ieee.org
Power capping is an important solution to keep the system within a fixed power constraint.
However, for the over-provisioned and power-constrained systems, especially the future …

Bootstrapping in-situ workflow auto-tuning via combining performance models of component applications

T Shu, Y Guo, J Wozniak, X Ding, I Foster… - Proceedings of the …, 2021 - dl.acm.org
In an in-situ workflow, multiple components such as simulation and analysis applications are
coupled with streaming data transfers. The multiplicity of possible configurations …

Predicting Software Performance with Divide-and-Learn

J Gong, T Chen - Proceedings of the 31st ACM Joint European Software …, 2023 - dl.acm.org
Predicting the performance of highly configurable software systems is the foundation for
performance testing and quality assurance. To that end, recent work has been relying on …

[HTML][HTML] AI-driven performance modeling for AI inference workloads

M Sponner, B Waschneck, A Kumar - Electronics, 2022 - mdpi.com
Deep Learning (DL) is moving towards deploying workloads not only in cloud datacenters,
but also to the local devices. Although these are mostly limited to inference tasks, it still …

Scheduling workflow tasks with unknown task execution time by combining machine-learning and greedy-optimization

Y Yang, H Shen, H Tian - IEEE Transactions on Services …, 2024 - ieeexplore.ieee.org
Workflow tasks are time-sensitive and their task completion utility, ie, value of task
completion, is inversely proportional to their completion time. Existing solutions to the NP …

G-slide: A gpu-based sub-linear deep learning engine via lsh sparsification

Z Pan, F Zhang, H Li, C Zhang, X Du… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Deep learning has been one of the trendiest research topics. However, as data quantities
rise exponentially, training large neural networks can become prohibitively expensive with …

Performance-detective: automatic deduction of cheap and accurate performance models

L Schmid, M Copik, A Calotoiu, D Werle… - Proceedings of the 36th …, 2022 - dl.acm.org
The many configuration options of modern applications make it difficult for users to select a
performance-optimal configuration. Performance models help users in understanding …

Performance prediction of parallel applications using artificial neuronal network and graph representation

S Chokri, S Baroud, S Belhaous… - Concurrency and …, 2022 - Wiley Online Library
Modeling and predicting the performance of HPC applications is exceedingly important for
several purposes. These incorporate scheduling and managing tasks, understanding …