cuHinesBatch: Solving multiple Hines systems on GPUs human brain project

P Valero-Lara, I Martínez-Perez, AJ Pena… - Procedia Computer …, 2017 - Elsevier
The simulation of the behavior of the Human Brain is one of the most important challenges
today in computing. The main problem consists of finding efficient ways to manipulate and …

Static graphs for coding productivity in openacc

L Toledo, P Valero-Lara, J Vetter… - 2021 IEEE 28th …, 2021 - ieeexplore.ieee.org
The main contribution of this work is to increase the coding productivity for GPU
programming by using the concept of Static Graphs. To do so, we have combined the new …

Many-task computing on many-core architectures

P Valero-Lara, P Nookala, FL Pelayo, J Jansson… - … Computing: Practice and …, 2016 - scpe.org
Abstract Many-Task Computing (MTC) is a common scenario for multiple parallel systems,
such as cluster, grids, cloud and supercomputers, but it is not so popular in shared memory …

Tasking in accelerators: performance evaluation

L Toledo, AJ Peña, S Catalán… - 2019 20th International …, 2019 - ieeexplore.ieee.org
In this work, we analyze the implications and results of implementing dynamic parallelism,
concurrent kernels and CUDA Graphs to solve task-oriented problems. As a benchmark we …

Simulating the behavior of the Human Brain on GPUs

P Valero-Lara, I Martínez-Pérez… - Oil & Gas Science …, 2018 - ogst.ifpenergiesnouvelles.fr
The simulation of the behavior of the Human Brain is one of the most important challenges in
computing today. The main problem consists of finding efficient ways to manipulate and …

Full-overlapped concurrent kernels

P Valero-Lara, FL Pelayo - ARCS 2015-The 28th International …, 2015 - ieeexplore.ieee.org
This work focuses on executing multiple kernels in the same GPU device simultaneously.
We have done a comparison among three of the most well known strategies to reach that …

Towards Enhancing Coding Productivity for GPU Programming Using Static Graphs

L Toledo, P Valero-Lara, JS Vetter, AJ Peña - Electronics, 2022 - mdpi.com
The main contribution of this work is to increase the coding productivity of GPU programming
by using the concept of Static Graphs. GPU capabilities have been increasing significantly in …

CPU-GPU computing: Overview, optimization, and applications

X Fei, K Li, W Yang, K Li - … and Applications in Next-Generation High …, 2016 - igi-global.com
Heterogeneous and hybrid computing has been heavily studied in the field of parallel and
distributed computing in recent years. It can work on a single computer, or in a group of …

Extreme Fine-Grained Parallelism on Modern Many-Core Architectures

P Nookala - 2022 - search.proquest.com
Processors with 100s of threads of execution and GPUs with 1000s of cores are among the
state-of-the-art in high-end computing systems. This transition to many-core computing has …

[PDF][PDF] Towards Enhancing Coding Productivity for GPU Programming Using Static Graphs. Electronics 2022, 11, 1307

L Toledo, P Valero-Lara, JS Vetter, AJ Peña - 2022 - academia.edu
The main contribution of this work is to increase the coding productivity of GPU programming
by using the concept of Static Graphs. GPU capabilities have been increasing significantly in …