Efficiency and productivity for decision making on low-power heterogeneous CPU+GPU SoCs

DA Constantinescu, A Navarro, F Corbera… - The Journal of …, 2021 - Springer
Markov decision processes provide a formal framework for a computer to make decisions
autonomously and intelligently when the effects of its actions are not deterministic. This …

Evaluating ARM and RISC-V Architectures for High-Performance Computing with Docker and Kubernetes

V Dakić, L Mršić, Z Kunić, G Đambić - Electronics, 2024 - mdpi.com
This paper thoroughly assesses the ARM and RISC-V architectures in the context of
high-performance computing (HPC). It includes an analysis of Docker and Kubernetes …

Balancing graph processing workloads using work stealing on heterogeneous CPU-FPGA systems

M Agostini, F O'Brien, T Abdelrahman - Proceedings of the 49th …, 2020 - dl.acm.org
We propose, implement and evaluate a work stealing based scheduler, called HWS, for
graph processing on heterogeneous CPU-FPGA systems that tightly couple the CPU and …

Lightweight asynchronous scheduling in heterogeneous reconfigurable systems

A Rodríguez, A Navarro, K Nikov… - Journal of Systems …, 2022 - Elsevier
The trend for heterogeneous embedded systems is the integration of accelerators and
general-purpose CPU cores on the same die. In these integrated architectures, like the Zynq …

FPGA-embedded optimization algorithm to maximize the acetate productivity in a dark fermentation process

J de Jesús Colín-Robles, I Torres-Zúñiga… - Journal of Process …, 2024 - Elsevier
This paper presents an optimization strategy to online maximize the acetate productivity rate
in a dark fermentation (DF) process. The Golden Section Search algorithm is used to …

Cooperative software-hardware acceleration of K-means on a tightly coupled CPU-FPGA system

TS Abdelrahman - ACM Transactions on Architecture and Code …, 2020 - dl.acm.org
We consider software-hardware acceleration of K-means clustering on the Intel
Xeon+FPGA platform. We design a pipelined accelerator for K-means and combine it with CPU …

A prefetch-aware scheduling for FPGA-based multi-task graph systems

R Ramezani - The Journal of Supercomputing, 2020 - Springer
In partially run-time reconfigurable (PRR) FPGAs, hardware tasks should be configured
before their execution. The configuration delay imposed by the reconfiguration process …

CF-DAML: Distributed automated machine learning based on collaborative filtering

P Liu, F Pan, X Zhou, S Li, L Jin - Applied Intelligence, 2022 - Springer
The search for a good machine learning (ML) model takes a long time and requires the
considerations of many alternatives, including data preprocessing, algorithm selection, and …

CPU vs. GPU: Performance comparison of OpenCL Applications on a Heterogeneous Architecture

MN Nadir, MS Rathore, A Hayat, JA Mansoor - Journal of Computing & …, 2024 - jcbi.org
The objective of researchers and developers has always been to attain superior
performance for their computing applications. In this regard, the use of Graphic Processing …

Automatic methods for distribution of data-parallel programs on multi-device heterogeneous platforms

K Moreń - 2024 - core.ac.uk
This thesis deals with the problem of finding effective methods for programming and
distributing data-parallel applications for heterogeneous multiprocessor systems. These …