S Chetlur, C Woolley, P Vandermersch… - arXiv preprint arXiv …, 2014 - arxiv.org
We present a library of efficient implementations of deep learning primitives. Deep learning
workloads are computationally intensive, and optimizing their kernels is difficult and time …