Mlperf training benchmark

P Mattson, C Cheng, G Diamos… - Proceedings of …, 2020 - proceedings.mlsys.org
Abstract Machine learning is experiencing an explosion of software and hardware solutions,
and needs industry-standard performance benchmarks to drive design and enable …

MLPerf: An industry standard benchmark suite for machine learning performance

P Mattson, VJ Reddi, C Cheng, C Coleman… - IEEE Micro, 2020 - ieeexplore.ieee.org
In this article, we describe the design choices behind MLPerf, a machine learning
performance benchmark that has become an industry standard. The first two rounds of the …

Mlperf inference benchmark

VJ Reddi, C Cheng, D Kanter, P Mattson… - 2020 ACM/IEEE 47th …, 2020 - ieeexplore.ieee.org
Machine-learning (ML) hardware and software system demand is burgeoning. Driven by ML
applications, the number of different ML inference systems has exploded. Over 100 …

Flashlight: Enabling innovation in tools for machine learning

JD Kahn, V Pratap, T Likhomanenko… - International …, 2022 - proceedings.mlr.press
As the computational requirements for machine learning systems and the size and
complexity of machine learning frameworks increases, essential framework innovation has …

Project adam: Building an efficient and scalable deep learning training system

T Chilimbi, Y Suzue, J Apacible… - 11th USENIX symposium …, 2014 - usenix.org
Large deep neural network models have recently demonstrated state-of-the-art accuracy on
hard visual recognition tasks. Unfortunately such models are extremely time consuming to …

Accounting for variance in machine learning benchmarks

X Bouthillier, P Delaunay, M Bronzi… - Proceedings of …, 2021 - proceedings.mlsys.org
Strong empirical evidence that one machine-learning algorithm A outperforms another one
B, ideally calls for multiple trials optimizing the learning pipeline over sources of variation …

Machine learning systems are stuck in a rut

P Barham, M Isard - Proceedings of the Workshop on Hot Topics in …, 2019 - dl.acm.org
In this paper we argue that systems for numerical computing are stuck in a local basin of
performance and programmability. Systems researchers are doing an excellent job …

{TVM}: An automated {End-to-End} optimizing compiler for deep learning

T Chen, T Moreau, Z Jiang, L Zheng, E Yan… - … USENIX Symposium on …, 2018 - usenix.org
There is an increasing need to bring machine learning to a wide diversity of hardware
devices. Current frameworks rely on vendor-specific operator libraries and optimize for a …

Eflops: Algorithm and system co-design for a high performance distributed training platform

J Dong, Z Cao, T Zhang, J Ye, S Wang… - … Symposium on High …, 2020 - ieeexplore.ieee.org
Deep neural networks (DNNs) have gained tremendous attractions as compelling solutions
for applications such as image classification, object detection, speech recognition, and so …

Easy over hard: A case study on deep learning

W Fu, T Menzies - Proceedings of the 2017 11th joint meeting on …, 2017 - dl.acm.org
While deep learning is an exciting new technique, the benefits of this method need to be
assessed with respect to its computational cost. This is particularly important for deep …