Approximate Computing Survey, Part I: Terminology and Software & Hardware Approximation Techniques

V Leon, MA Hanif, G Armeniakos, X Jiao… - arXiv preprint arXiv …, 2023 - arxiv.org
The rapid growth of demanding applications in domains applying multimedia processing
and machine learning has marked a new era for edge and cloud computing. These …

MERCI: efficient embedding reduction on commodity hardware via sub-query memoization

Y Lee, SH Seo, H Choi, HU Sul, S Kim, JW Lee… - Proceedings of the 26th …, 2021 - dl.acm.org
Deep neural networks (DNNs) with embedding layers are widely adopted to capture
complex relationships among entities within a dataset. Embedding layers aggregate multiple …

Temporal approximate function memoization

G Tziantzioulis, N Hardavellas, S Campanoni - IEEE Micro, 2018 - ieeexplore.ieee.org
Improving the performance of applications is a core target of computer systems research and
has led to the creation of various techniques. Among them is function memoization, an …

TD-NUCA: runtime driven management of NUCA caches in task dataflow programming models

P Caheny, L Alvarez, M Casas… - … Conference for High …, 2022 - ieeexplore.ieee.org
In high performance processors, the design of on-chip memory hierarchies is crucial for
performance and energy efficiency. Current processors rely on large shared Non-Uniform …

Approximate Computing Survey, Part II: Application-Specific & Architectural Approximation Techniques and Applications

V Leon, MA Hanif, G Armeniakos, X Jiao… - arXiv preprint arXiv …, 2023 - arxiv.org
The challenging deployment of compute-intensive applications from domains such Artificial
Intelligence (AI) and Digital Signal Processing (DSP), forces the community of computing …

AxMemo: Hardware-compiler co-design for approximate code memoization

Z Liu, A Yazdanbakhsh, DK Wang… - Proceedings of the 46th …, 2019 - dl.acm.org
Historically, continuous improvements in general-purpose processors have fueled the
economic success and growth of the IT industry. However, the diminishing benefits from …

Runtime-assisted cache coherence deactivation in task parallel programs

P Caheny, L Alvarez, M Valero… - … Conference for High …, 2018 - ieeexplore.ieee.org
With increasing core counts, the scalability of directory-based cache coherence has become
a challenging problem. To reduce the area and power needs of the directory, recent …

Approximate function memoization

P Arundhati, SK Jena, SK Pani - Concurrency and Computation …, 2022 - Wiley Online Library
Function memoization is an optimization technique that reduces a function call overhead
when the same input appears again. A table that stores the previous result is searched and …

Application of approximate matrix multiplication to neural networks and distributed SLAM

B Plancher, CD Brumar, I Brumar… - 2019 IEEE High …, 2019 - ieeexplore.ieee.org
Computational efficiency is a critical constraint for a variety of cutting-edge real-time
applications. In this work, we identify an opportunity to speed up the end-to-end runtime of …

A Comprehensive Survey of Benchmarks for Automated Improvement of Software's Non-Functional Properties

A Blot, J Petke - arXiv preprint arXiv:2212.08540, 2022 - arxiv.org
Performance is a key quality of modern software. Although recent years have seen a spike in
research on automated improvement of software's execution time, energy, memory …