Graph neural networks with configuration cross-attention for tensor compilers

D Khizbullin, ER de Andrade, TH Nguyen… - arXiv preprint arXiv …, 2024 - arxiv.org
With the recent popularity of neural networks comes the need for efficient serving of
inference workloads. A neural network inference workload can be represented as a …

Research and Design of Neural Processing Architectures Optimized for Embedded Applications

B Wu - 2024 - core.ac.uk
Deploying neural networks on edge devices and bringing them into our daily lives is
attracting more and more attention. However, its expensive computational cost makes many …

MASTERARBEIT/MASTER'S THESIS

A Wolff - 2022 - phaidra.univie.ac.at
In the past few years, there has been an immense increase in the volume of collected data
worldwide. Dealing with the continuously growing amount of data requires two strategies …