[图书][B] FPGAs for software programmers

D Koch, F Hannig, D Ziener - 2016 - Springer
Dirk Koch · Frank Hannig Daniel Ziener Editors Page 1 Dirk Koch · Frank Hannig Daniel Ziener
Editors FPGAs for Software Programmers Page 2 FPGAs for Software Programmers Page 3 …

The LEAP FPGA operating system

K Fleming, M Adler - FPGAs for software programmers, 2016 - Springer
FPGAs offer attractive power and performance for many applications, especially relative to
traditional sequential architectures. In spite of these advantages, FPGAs have been …

Lessons learned on MPI+ threads communication

R Zambre… - … Conference for High …, 2022 - ieeexplore.ieee.org
Hybrid MPI+ threads programming is gaining prominence, but, in practice, applications
perform slower with it compared to the MPI everywhere model. The most critical challenge to …

[图书][B] High Performance Parallel Runtimes: Design and Implementation

M Klemm, J Cownie - 2021 - books.google.com
This book focuses on the theoretical and practical aspects of parallel programming systems
for today's high performance multi-core processors and discusses the efficient …

Parallel isotope differential modeling for instationary 13C fluxomics at the genome scale

Z Zhang, Z Liu, Y Meng, Z Chen, J Han, Y Wei… - Biotechnology for …, 2020 - Springer
Background A precise map of the metabolic fluxome, the closest surrogate to the
physiological phenotype, is becoming progressively more important in the metabolic …

Logically parallel communication for fast mpi+ threads applications

R Zambre, D Sahasrabudhe, H Zhou… - … on Parallel and …, 2021 - ieeexplore.ieee.org
Supercomputing applications are increasingly adopting the MPI+ threads programming
model over the traditional “MPI everywhere” approach to better handle the disproportionate …

Improving the Scalability of GPU Synchronization Primitives

P Dalmia, R Mahapatra, J Intan… - … on Parallel and …, 2022 - ieeexplore.ieee.org
General-purpose GPU applications increasingly use synchronization to enforce ordering
between many threads accessing shared data. Accordingly, recently there has been a push …

Strategy for data-flow synchronizations in stencil parallel computations on multi-/manycore systems

L Szustak - The Journal of Supercomputing, 2018 - Springer
In this paper, an innovative strategy for the data-flow synchronization in shared-memory
systems is proposed. This strategy assumes to synchronize only interdependent threads …

High-level synthesis

JMP Cardoso, M Weinhardt - FPGAs for Software Programmers, 2016 - Springer
The compilation of high-level languages, such as software programming languages, to
FPGAs is of paramount importance for the mainstream adoption of FPGAs. An efficient …

Optimizing barrier synchronization on ARMv8 many-core architectures

W Gao, J Fang, C Huang, C Xu… - 2021 IEEE International …, 2021 - ieeexplore.ieee.org
Synchronization operations are commonly seen in OpenMP programs where a parallel
construct often works with an explicit or implicit barrier operation. While OpenMP …