A Shekhovtsov, V Yanush, B Flach - arXiv preprint arXiv:2006.03143, 2020 - arxiv.org
In neural networks with binary activations and/or binary weights, training by gradient
descent is complicated because the model has a piecewise constant response. We consider …