Tools for reduced precision computation: a survey

S Cherubin, G Agosta - ACM Computing Surveys (CSUR), 2020 - dl.acm.org
The use of reduced precision to improve performance metrics such as computation latency
and power consumption is a common practice in the embedded systems field. This practice …

[HTML][HTML] Clava: C/C++ source-to-source compilation using LARA

J Bispo, JMP Cardoso - SoftwareX, 2020 - Elsevier
This article presents Clava, a Clang-based source-to-source compiler, that accepts scripts
written in LARA, a JavaScript-based DSL with special constructs for code queries, analysis …

A holistic approach to automatic mixed-precision code generation and tuning for affine programs

J Xu, G Song, B Zhou, F Li, J Hao, J Zhao - Proceedings of the 29th ACM …, 2024 - dl.acm.org
Reducing floating-point (FP) precision is used to trade the quality degradation of a numerical
program's output for performance, but this optimization coincides with type casting, whose …

Embedded operating system optimization through floating to fixed point compiler transformation

D Cattaneo, A Di Bello, S Cherubin… - 2018 21st Euromicro …, 2018 - ieeexplore.ieee.org
Architectures targeted at embedded systems often have limited floating point computation
capabilities, and in many cases do not provide any hardware support. In this work, we …

The ANTAREX domain specific language for high performance computing

C Silvano, G Agosta, A Bartolini, AR Beccari… - Microprocessors and …, 2019 - Elsevier
The ANTAREX project relies on a Domain Specific Language (DSL) based on Aspect
Oriented Programming (AOP) concepts to allow applications to enforce extra functional …

SeTHet-Sending Tuned numbers over DMA onto Heterogeneous clusters: an automated precision tuning story

G Magnani, D Cattaneo, L Denisov… - Proceedings of the 21st …, 2024 - dl.acm.org
Energy and performance optimization of embedded hardware and software is of critical
importance to achieve the overall system goals. In this work, we study the optimization of …

Antarex: A dsl-based approach to adaptively optimizing and enforcing extra-functional properties in high performance computing

C Silvano, G Agosta, A Bartolini… - 2018 21st Euromicro …, 2018 - ieeexplore.ieee.org
The ANTAREX project relies on a Domain Specific Language (DSL) based on Aspect
Oriented Programming (AOP) concepts to allow applications to enforce extra functional …

Interleaved Execution of Approximated CUDA Kernels in Iterative Applications

G Freytag, CA Künas, P Rech… - 2024 32nd Euromicro …, 2024 - ieeexplore.ieee.org
Fine-tuning the floating-point precision of arithmetic operations in applications can be
extremely challenging and time-consuming, especially in iterative applications where the …

Everything you Need to Know About Reduced Mixed Precision Computation in Numerical Programs

DB Khalifa, M Martel - 2023 - hal.science
In the recent decade, precision tuning becomes one of the key techniques to obtain
significant gains in performance and energy efficiency. This process consists of substituting …

FBTuner: A Feedback-Directed Approach for Safe Mixed-Precision Tuning

X Li, G Gopalakrishnan - 2024 IEEE 24th International …, 2024 - ieeexplore.ieee.org
Porting high-performance computing (HPC) applications to lower or mixed-precision formats
offers potential benefits, such as reduced computation and power consumption. However …