Benchmarking TPU, GPU, and CPU platforms for deep learning

YE Wang, GY Wei, D Brooks - arXiv preprint arXiv:1907.10701, 2019 - arxiv.org
Training deep learning models is compute-intensive and there is an industry-wide trend
towards hardware specialization to improve performance. To systematically benchmark …

Diannao: A small-footprint high-throughput accelerator for ubiquitous machine-learning

T Chen, Z Du, N Sun, J Wang, C Wu, Y Chen… - ACM SIGARCH …, 2014 - dl.acm.org
Machine-Learning tasks are becoming pervasive in a broad range of domains, and in a
broad range of systems (from embedded systems to data centers). At the same time, a small …

Neural acceleration for general-purpose approximate programs

H Esmaeilzadeh, A Sampson, L Ceze… - 2012 45th annual …, 2012 - ieeexplore.ieee.org
This paper describes a learning-based approach to the acceleration of approximate
programs. We describe the Parrot transformation, a program transformation that selects and …

AxBench: A multiplatform benchmark suite for approximate computing

A Yazdanbakhsh, D Mahajan… - IEEE Design & …, 2016 - ieeexplore.ieee.org
Approximate computing is claimed to be a powerful knob for alleviating the peak power and
energy-efficiency issues. However, providing a consistent benchmark suit with diverse …

DianNao family: energy-efficient hardware accelerators for machine learning

Y Chen, T Chen, Z Xu, N Sun, O Temam - Communications of the ACM, 2016 - dl.acm.org
Machine Learning (ML) tasks are becoming pervasive in a broad range of applications, and
in a broad range of systems (from embedded systems to data centers). As computer …

Fathom: Reference workloads for modern deep learning methods

R Adolf, S Rama, B Reagen, GY Wei… - 2016 IEEE …, 2016 - ieeexplore.ieee.org
Deep learning has been popularized by its recent successes on challenging artificial
intelligence problems. One of the reasons for its dominance is also an ongoing challenge …

Stochastic configuration machines for industrial artificial intelligence

D Wang, MJ Felicetti - arXiv preprint arXiv:2308.13570, 2023 - arxiv.org
Real-time predictive modelling with desired accuracy is highly expected in industrial artificial
intelligence (IAI), where neural networks play a key role. Neural networks in IAI require …

General-purpose code acceleration with limited-precision analog computation

R St. Amant, A Yazdanbakhsh, J Park… - ACM SIGARCH …, 2014 - dl.acm.org
As improvements in per-transistor speed and energy efficiency diminish, radical departures
from conventional approaches are becoming critical to improving the performance and …

SNNAP: Approximate computing on programmable SoCs via neural acceleration

T Moreau, M Wyse, J Nelson… - 2015 IEEE 21st …, 2015 - ieeexplore.ieee.org
Many applications that can take advantage of accelerators are amenable to approximate
execution. Past work has shown that neural acceleration is a viable way to accelerate …

[PDF][PDF] Accept: A programmer-guided compiler framework for practical approximate computing

A Sampson, A Baixo, B Ransford… - … Technical Report UW …, 2015 - eecs.umich.edu
Approximate computing trades off accuracy for better performance and energy efficiency. It
offers promising optimization opportunities for a wide variety of modern applications, from …