Acceleration with long vector architectures: Implementation and evaluation of the FFT kernel on NEC SX‐Aurora and RISC‐V vector extension

P Vizcaino, F Mantovani, R Ferrer… - Concurrency and …, 2023 - Wiley Online Library
Novel architectures leveraging long and variable vector lengths like the NEC SX‐Aurora or
the vector extension of RISCV are appearing as promising solutions on the supercomputing …

A portable coding strategy to exploit vectorization on combustion simulations

F Banchelli, G Oyarzun, M Garcia-Gasulla… - Computers & …, 2024 - Elsevier
The complexity of combustion simulations demands the latest high-performance computing
tools to accelerate its time-to-solution results. A current trend on HPC systems is the …

Exploiting Vector Code Semantics for Efficient Data Cache Prefetching

F Martínez Palau, M Torrents, A Armejach… - Proceedings of the 38th …, 2024 - dl.acm.org
Emerging workloads from domains like high performance computing, data analytics or deep
learning consume large amounts of memory bandwidth. To mitigate this problem, computing …

Experiments on speeding up the recursive fast Fourier transform by using AVX-512 SIMD instructions

G Sansone, M Cococcioni - International Conference on Applications in …, 2022 - Springer
Abstract The Fast Fourier Transform is probably one of the most studied algorithms of all
time. New techniques regarding hardware and software are often applied and tested on it …