Modern Deep Neural Networks (DNNs) require significant memory to store weights, activations, and other intermediate tensors during training. Hence, many models do not fit …
In this paper, we introduce an algorithm for data quantization based on the principles of Kashin representation. This approach hinges on decomposing any given vector, matrix, or …
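The abstract is truncated here, but the core idea it names, representing a tensor in an overdetermined (frame) basis so that the resulting coefficients are tightly bounded and therefore easy to quantize, can be sketched briefly. The NumPy sketch below is an illustration of that general Kashin-style decomposition only: the function name, the frame construction, the clipping schedule, and all constants are assumptions for exposition, not the authors' exact algorithm.

```python
import numpy as np

def kashin_coefficients(x, frame, num_iters=10, eta=0.9):
    """Compute bounded frame coefficients a with frame @ a + residual == x.

    `frame` is an n x N matrix (N > n) whose columns form a tight frame
    (frame @ frame.T = (N / n) * I).  Each step expands the current residual
    in the frame, clips the coefficients at level M, accumulates the clipped
    part, and subtracts its synthesis from the residual.  The clipping level
    and its geometric decay are illustrative choices, not tuned values.
    """
    n, N = frame.shape
    a = np.zeros(N)
    residual = x.astype(float).copy()
    M = np.linalg.norm(x) / np.sqrt(N)   # level ~ ||x||_2 / sqrt(N) (assumed constant = 1)
    for _ in range(num_iters):
        # Frame analysis of the residual, scaled so that frame @ coeffs == residual.
        coeffs = frame.T @ residual * (n / N)
        clipped = np.clip(coeffs, -M, M)
        a += clipped
        residual = residual - frame @ clipped
        M *= eta                          # shrink the clipping level each pass
    return a, residual

# Usage sketch: frame built from two orthonormal bases (identity + random rotation).
rng = np.random.default_rng(0)
n = 64
Q, _ = np.linalg.qr(rng.standard_normal((n, n)))
frame = np.hstack([np.eye(n), Q])         # n x 2n tight frame
x = rng.standard_normal(n)
a, res = kashin_coefficients(x, frame)
x_hat = frame @ a + res                   # reconstructs x exactly by construction
```

Because the accumulated coefficients `a` are clipped at a level on the order of the vector's Euclidean norm divided by the square root of the frame size, they occupy a narrow dynamic range and can be quantized on a uniform grid with controlled error, which is the property the abstract's decomposition aims to exploit.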