Test-driving RISC-V Vector hardware for HPC

JKL Lee, M Jamieson, N Brown, R Jesus - International Conference on …, 2023 - Springer
Abstract Whilst the RISC-V Vector extension (RVV) has been ratified, at the time of writing
both hardware implementations and open source software support are still limited for …

Performance left on the table: an evaluation of compiler autovectorization for RISC-V

N Adit, A Sampson - IEEE Micro, 2022 - ieeexplore.ieee.org
Next-generation length-agnostic vector instruction set architecture (ISA) designs, the RISC-V
vector extension, and ARM's scalable vector extension enable software portability across …

Unlimited vector extension with data streaming support

JM Domingos, N Neves, N Roma… - 2021 ACM/IEEE 48th …, 2021 - ieeexplore.ieee.org
Unlimited vector extension (UVE) is a novel instruction set architecture extension that takes
streaming and SIMD processing together into the modern computing scenario. It aims to …

Exploring source-to-source compiler transformation of OpenMP SIMD constructs for Intel AVX and Arm SVE vector architectures

P Flynn, X Yi, Y Yan - … of the Thirteenth International Workshop on …, 2022 - dl.acm.org
Over the past decade, SIMD (single instruction multiple data) or vector architectures have
made significant advances, now existing across a wide range of devices from commodity …

Compiling for vector extensions with stream-based specialization

N Neves, JM Domingos, N Roma, P Tomas… - IEEE Micro, 2022 - ieeexplore.ieee.org
To overcome the current performance wall, data streaming and data-flow computing
paradigms have been gradually making their way into the general-purpose domain …

Communications signal processing using RISC-V vector extension

V Razilov, E Matúš, G Fettweis - 2022 International Wireless …, 2022 - ieeexplore.ieee.org
Flexible and scalable solutions will be needed for future communications processing
systems. RISC-V processors enhanced with vector processing capabilities as specified by …

A Study of Performance Programming of CPU, GPU accelerated Computers and SIMD Architecture

X Yi - arXiv preprint arXiv:2409.10661, 2024 - arxiv.org
Parallel computing is a standard approach to achieving high-performance computing (HPC).
Three commonly used methods to implement parallel computing include: 1) applying …

Vectorization cost modeling for NEON, AVX and SVE

A Pohl, B Cosenza, B Juurlink - Performance Evaluation, 2020 - Elsevier
Compiler optimization passes employ cost models to determine if a code transformation will
yield performance improvements. When this assessment is inaccurate, compilers apply …

Accelerating level 2 blas based on arm sve

X Wan, N Gu, J Su - 2021 4th international conference on …, 2021 - ieeexplore.ieee.org
Scalable Vector Extension (SVE) is a special vector instruction set architecture recently
released by ARM, which is a vector extension for A64 instruction set of ARMv8-A …

[PDF][PDF] Vector length agnostic SIMD parallelism on modern processor architectures with the focus on Arm's SVE

B Brank - 2023 - elekpub.bib.uni-wuppertal.de
High-Performance Computing (HPC) has seen a substantial increase in computing power
over the recent decade. In June 2008, the first petascale system was introduced, which …