Performance and energy effects on task-based parallelized applications: User-directed versus manual vectorization

H Caminal, D Caballero, JM Cebrián, R Ferrer… - The Journal of …, 2018 - Springer
Heterogeneity, parallelization and vectorization are key techniques to improve the
performance and energy efficiency of modern computing systems. However, programming …

Analyzing the impact of programming models for efficient communication overlap in high-speed networks

G Utrera, M Gil, X Martorell - 2014 International Conference on …, 2014 - ieeexplore.ieee.org
Exascale applications for civil engineering, simulations and other fields related with current
research make intensive use of large sparse matrices. A characteristic of these matrices is …

SIMD@ OpenMP: a programming model approach to leverage SIMD features

DL Caballero de Gea - 2015 - upcommons.upc.edu
SIMD instruction sets are a key feature in current general purpose and high performance
architectures. SIMD instructions apply in parallel the same operation to a group of data …

Programming models and scheduling techniques for heterogeneous architectures

J Planas Carbonell - 2015 - upcommons.upc.edu
There is a clear trend nowadays to use heterogeneous high-performance computers, as
they offer considerably greater computing power than homogeneous CPU systems …

Evaluating the performance impact of communication imbalance in sparse matrix-vector multiplication

G Utrera, M Gil, X Martorell - 2015 23rd Euromicro International …, 2015 - ieeexplore.ieee.org
HPC applications make intensive use of large sparse matrices with the matrix-vector product
representing a significant fraction of the total run-time. These matrices are characterized by …

[PDF][PDF] Exploiting multi-level parallelism in streaming applications for heterogeneous

A Balevic - Journal of Parallel Programming, 1991 - scholarlypublications …
Exploiting Multi-Le el Parallelism in Streaming Applications for Heterogeneous Platforms ith
GPUs Page 1 Exploiting Multi-Le el Parallelism in Streaming Applications for …

Implementació en HDL d'un arbre binari de cerca auto-balancejat

À Mercadé Ibáñez - 2017 - upcommons.upc.edu
Amb la proliferació de les arquitectures multi-core i many-core, s' han emprat molts esforços
en l'especificació i la implementació de nous models de programació, que facilitessin als …

[图书][B] Enhancing the scalability of many-core systems towards utilizing fine-grain parallelism in task-based programming models

T Dallou - 2017 - search.proquest.com
In the past few years, it has been foreseeable that Moore's law is coming to an end. This law,
based on the observation that the number of transistors in an integrated chip doubles every …

Υλοποίηση και αξιολόγηση μετροπρογραμμάτων με κάρτες γραφικών

ΒΓ Πρίσκας - 2012 - dspace.lib.ntua.gr
O Παράλληλος προγραμματισμός είναι γνωστός από παλαιότερα. Πίσω στις δεκαετίες του'50
και του'60 χρησιμοποιήθηκε από τους ειδικους στα πλαίσια των εφαρμογών για …