Using meta-heuristics and machine learning for software optimization of parallel computing systems: a systematic literature review

S Memeti, S Pllana, A Binotto, J Kołodziej, I Brandic - Computing, 2019 - Springer
While modern parallel computing systems offer high performance, utilizing these powerful
computing resources to the highest possible extent demands advanced knowledge of …

A profile-based ai-assisted dynamic scheduling approach for heterogeneous architectures

T Geng, M Amaris, S Zuckerman, A Goldman… - International Journal of …, 2022 - Springer
While heterogeneous architectures are increasing popular with High Performance
Computing systems, their effectiveness depends on how efficient the scheduler is at …

Optimization of heterogeneous systems with AI planning heuristics and machine learning: a performance and energy aware approach

S Memeti, S Pllana - Computing, 2021 - Springer
Heterogeneous computing systems provide high performance and energy efficiency.
However, to optimally utilize such systems, solutions that distribute the work across host …

: Cost Based Hardware Optimization for Asymmetric Multicore Processors

JKV Sreelatha, S Balachandran… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
Heterogeneous Multiprocessors (HMPs) are popular due to their energy efficiency over
Symmetric Multicore Processors (SMPs). Asymmetric Multicore Processors (AMPs) are a …

Auto-tuning methodology for configuration and application parameters of hybrid CPU+ GPU parallel systems based on expert knowledge

P Czarnul, P Rościszewski - 2019 International Conference on …, 2019 - ieeexplore.ieee.org
Auto-tuning of configuration and application parameters allows to achieve significant
performance gains in many contemporary compute-intensive applications. Feasible search …

Cloud-backed mobile cognition: Power-efficient deep learning in the autonomous vehicle era

A Vega, A Buyuktosunoglu, D Callegaro, M Levorato… - Computing, 2022 - Springer
Low-power embedded technology offers a roadmap for enabling deep learning (DL)
applications in mobile scenarios, like future autonomous vehicles. However, the lack of …

Pattern learning based parallel ant colony optimization

X Jin, W Zheng, S Mo, Y Qu, X Jin… - … on Parallel and …, 2017 - ieeexplore.ieee.org
Ant colony optimization (ACO) can be used to solve complex optimization problems in
engineering, economic management and military strategy. Most of these are NP hard …

[PDF][PDF] Automatic methods for distribution of data-parallel programs on multi-device heterogeneous platforms

K Moreń - 2024 - core.ac.uk
This thesis deals with the problem of finding effective methods for programming and
distributing data-parallel applications for heterogeneous multiprocessor systems. These …

[图书][B] Comprehensive Parallel Performance Profiling, Characterisation, and Optimisation

R Neill - 2020 - search.proquest.com
Modern hardware systems consist of highly-complex multi-processor hardware architectures
that are often supported by specialised accelerator devices. This has resulted in an …

Programming and Optimization of Big-Data Applications on Heterogeneous Computing Systems

S Memeti - 2018 - diva-portal.org
Furthermore, we propose programming abstractions, a source-to-source compiler, and a run-
time system for heterogeneous stream computing. Given a source code annotated with …