Exploring the performance bound of cambricon accelerator in end-to-end inference scenario

Y Wang, C Li, C Zeng - International Symposium on Benchmarking …, 2019 - Springer
Deep learning algorithms have become pervasive in a broad range of industrial application
scenarios. DianNao/Cambricon family is a set of energy-efficient hardware accelerators for …

Diannao: A small-footprint high-throughput accelerator for ubiquitous machine-learning

T Chen, Z Du, N Sun, J Wang, C Wu, Y Chen… - ACM SIGARCH …, 2014 - dl.acm.org
Machine-Learning tasks are becoming pervasive in a broad range of domains, and in a
broad range of systems (from embedded systems to data centers). At the same time, a small …

Optimizing memory-access patterns for deep learning accelerators

H Zheng, S Oh, H Wang, P Briggs, J Gai, A Jain… - arXiv preprint arXiv …, 2020 - arxiv.org
Deep learning (DL) workloads are moving towards accelerators for faster processing and
lower cost. Modern DL accelerators are good at handling the large-scale multiply …

DianNao family: energy-efficient hardware accelerators for machine learning

Y Chen, T Chen, Z Xu, N Sun, O Temam - Communications of the ACM, 2016 - dl.acm.org
Machine Learning (ML) tasks are becoming pervasive in a broad range of applications, and
in a broad range of systems (from embedded systems to data centers). As computer …

SimPyler: A Compiler-Based Simulation Framework for Machine Learning Accelerators

Y Braatz, DS Rieber, T Soliman… - 2023 IEEE 34th …, 2023 - ieeexplore.ieee.org
Co-optimization of hardware and software in modern deep neural network (DNN) systems
can be performed using design space exploration (DSE) tools. Leveraging estimation …

Neurometer: An integrated power, area, and timing modeling framework for machine learning accelerators industry track paper

T Tang, S Li, L Nai, N Jouppi… - 2021 IEEE International …, 2021 - ieeexplore.ieee.org
As Machine Learning (ML) becomes pervasive in the era of artificial intelligence, ML specific
tools and frameworks are required for architectural research. This paper introduces …

DLFusion: An auto-tuning compiler for layer fusion on deep neural network accelerator

Z Liu, J Leng, Q Chen, C Li, W Zheng… - 2020 IEEE Intl Conf …, 2020 - ieeexplore.ieee.org
Many hardware vendors have introduced specialized deep neural networks (DNN)
accelerators owing to their superior performance and efficiency. As such, how to generate …

A small-footprint accelerator for large-scale neural networks

T Chen, S Zhang, S Liu, Z Du, T Luo, Y Gao… - ACM Transactions on …, 2015 - dl.acm.org
Machine-learning tasks are becoming pervasive in a broad range of domains, and in a
broad range of systems (from embedded systems to data centers). At the same time, a small …

Survey and benchmarking of machine learning accelerators

A Reuther, P Michaleas, M Jones… - 2019 IEEE high …, 2019 - ieeexplore.ieee.org
Advances in multicore processors and accelerators have opened the flood gates to greater
exploration and application of machine learning techniques to a variety of applications …

A Fresh Perspective on DNN Accelerators by Performing Holistic Analysis Across Paradigms

T Glint, CK Jha, M Awasthi, J Mekie - arXiv preprint arXiv:2208.05294, 2022 - arxiv.org
Traditional computers with von Neumann architecture are unable to meet the latency and
scalability challenges of Deep Neural Network (DNN) workloads. Various DNN accelerators …