Snowflake: Scaling GNNs to high-dimensional continuous control via parameter freezing C Blake, V Kurin, M Igl, S Whiteson Advances in Neural Information Processing Systems 34, 23983-23992, 2021 | 10 | 2021 |
Training and inference of large language models using 8-bit floating point SP Perez, Y Zhang, J Briggs, C Blake, J Levy-Kramer, P Balanca, ... arXiv preprint arXiv:2309.17224, 2023 | 7 | 2023 |
Sparq attention: Bandwidth-efficient llm inference L Ribar, I Chelombiev, L Hudlass-Galley, C Blake, C Luschi, D Orr arXiv preprint arXiv:2312.04985, 2023 | 6 | 2023 |
Unit scaling: Out-of-the-box low-precision training C Blake, D Orr, C Luschi International Conference on Machine Learning, 2548-2576, 2023 | 3 | 2023 |
The Winnability of Klondike Solitaire and Many Other Patience Games C Blake, IP Gent arXiv preprint arXiv:1906.12314, 2019 | 2* | 2019 |
u-μP: The Unit-Scaled Maximal Update Parametrization C Blake, C Eichenberg, J Dean, L Balles, LY Prince, B Deiseroth, ... 2nd Workshop on Advancing Neural Network Training: Computational Efficiency …, 0 | | |