Supporting utilities for heterogeneous embedded image processing platforms (STHEM): An overview

A Sadek, A Muddukrishna, L Kalms, A Djupdal… - … Architectures, Tools, and …, 2018 - Springer
The TULIPP project aims to simplify development of embedded vision applications with low-
power and real-time requirements by providing a complete image processing system …

Traveler: Navigating task parallel traces for performance analysis

SA Sakin, A Bigelow, R Tohid… - … on Visualization and …, 2022 - ieeexplore.ieee.org
Understanding the behavior of software in execution is a key step in identifying and fixing
performance issues. This is especially important in high performance computing contexts …

A visual performance analysis framework for task‐based parallel applications running on hybrid clusters

V Garcia Pinto, L Mello Schnorr… - Concurrency and …, 2018 - Wiley Online Library
Summary Programming paradigms in High‐Performance Computing have been shifting
toward task‐based models that are capable of adapting readily to heterogeneous and …

Cloud-based federated boosting for mobile crowdsensing

Z Wang, Y Yang, Y Liu, X Liu, BB Gupta… - arXiv preprint arXiv …, 2020 - arxiv.org
The application of federated extreme gradient boosting to mobile crowdsensing apps brings
several benefits, in particular high performance on efficiency and classification. However, it …

TULIPP: Towards ubiquitous low-power image processing platforms

T Kalb, L Kalms, D Göhringer, C Pons… - 2016 International …, 2016 - ieeexplore.ieee.org
Many industrial domains rely on vision-based applications which require to comply with
severe performance and embedded requirements. Tulipp will develop a reference platform …

Visual performance analysis of memory behavior in a task-based runtime on hybrid platforms

LL Nesi, S Thibault, L Stanisic… - 2019 19th IEEE/ACM …, 2019 - ieeexplore.ieee.org
Programming parallel applications for heterogeneous HPC platforms is much more
straightforward when using the task-based programming paradigm. The simplicity exists …

Measuring Thread Timing to Assess the Feasibility of Early-bird Message Delivery

WP Marts, MGF Dosanjh, W Schonbein… - Proceedings of the …, 2023 - dl.acm.org
Early-bird communication is a communication/computation overlap technique that combines
fine-grained communication with partitioned communication to improve application run-time …

Analysis and optimization of task granularity on the Java virtual machine

A Rosà, E Rosales, W Binder - ACM Transactions on Programming …, 2019 - dl.acm.org
Task granularity, ie, the amount of work performed by parallel tasks, is a key performance
attribute of parallel applications. On the one hand, fine-grained tasks (ie, small tasks carrying …

Visualizing Correctness Issues in OpenMP Programs

F Jin, A Tao, L Yu, V Sarkar - International Workshop on OpenMP, 2024 - Springer
Past work on OpenMP program visualization has mainly centered on performance analysis.
This paper explores how the visualization of computation graphs assists programmers in …

Providing in-depth performance analysis for heterogeneous task-based applications with starvz

VG Pinto, LL Nesi, MC Miletto… - 2021 IEEE International …, 2021 - ieeexplore.ieee.org
Task-based parallelism has adequately addressed the coding complexity required to fully
exploit the processing power offered by omnipresent hybrid CPU/GPU supercomputers …