Dive into deep learning

A Zhang, ZC Lipton, M Li, AJ Smola - arXiv preprint arXiv:2106.11342, 2021 - arxiv.org
This open-source book represents our attempt to make deep learning approachable,
teaching readers the concepts, the context, and the code. The entire book is drafted in …

Blink: Fast and generic collectives for distributed ml

G Wang, S Venkataraman… - Proceedings of …, 2020 - proceedings.mlsys.org
Model parameter synchronization across GPUs introduces high overheads for data-parallel training at scale. Existing parameter synchronization protocols cannot effectively …
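As an illustration of the collective these systems target, here is a minimal sketch of the gradient all-reduce step in data-parallel training, written against PyTorch's torch.distributed purely as a stand-in; it is not Blink's own API, which provides its own collective primitives.

```python
# Minimal sketch of the per-iteration gradient all-reduce that data-parallel
# training performs; collective libraries such as Blink aim to make this step
# fast. torch.distributed is used here only as a generic stand-in.
import torch
import torch.distributed as dist

def allreduce_gradients(model: torch.nn.Module) -> None:
    """Average gradients across all workers after the local backward pass."""
    world_size = dist.get_world_size()
    for param in model.parameters():
        if param.grad is not None:
            dist.all_reduce(param.grad, op=dist.ReduceOp.SUM)  # sum across GPUs
            param.grad /= world_size                           # then average
```

In this sketch, each worker would call allreduce_gradients(model) after loss.backward() and before optimizer.step(), assuming the default process group has already been initialized with dist.init_process_group.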

Parameter hub: a rack-scale parameter server for distributed deep neural network training

L Luo, J Nelson, L Ceze, A Phanishayee… - Proceedings of the …, 2018 - dl.acm.org
Distributed deep neural network (DDNN) training constitutes an increasingly important
workload that frequently runs in the cloud. Larger DNN models and faster compute engines …
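To make the workload concrete, below is a toy single-process sketch of the push/pull parameter-server pattern that rack-scale designs such as Parameter Hub accelerate; the class and method names are illustrative assumptions, not PHub's interface.

```python
# Toy sketch of the parameter-server push/pull pattern underlying DDNN
# training. All names here are illustrative; this is not PHub's API.
import numpy as np

class ParameterServer:
    def __init__(self, shapes, lr=0.1):
        # One shared parameter tensor per key, initialized to zero.
        self.params = {k: np.zeros(s, dtype=np.float32) for k, s in shapes.items()}
        self.lr = lr

    def push(self, grads):
        # Workers push gradients; the server applies an SGD update.
        for k, g in grads.items():
            self.params[k] -= self.lr * g

    def pull(self):
        # Workers pull the latest parameters before the next iteration.
        return {k: v.copy() for k, v in self.params.items()}

# Usage: two workers push gradients for one shared weight matrix.
server = ParameterServer({"w": (2, 2)})
for worker_grad in [np.ones((2, 2)), 2 * np.ones((2, 2))]:
    server.push({"w": worker_grad.astype(np.float32)})
latest = server.pull()
```

In a real deployment the push and pull calls cross the network, which is why rack-scale placement and communication scheduling matter.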

Communication algorithm-architecture co-design for distributed deep learning

J Huang, P Majumder, S Kim, A Muzahid… - 2021 ACM/IEEE 48th …, 2021 - ieeexplore.ieee.org
Large-scale distributed deep learning training has enabled the development of more complex deep neural network models that learn from larger datasets for sophisticated tasks. In …

[CITATION][C] Efficient Interconnection Network Design for Heterogeneous Architectures

J Huang - 2020

[CITATION][C] Tree-based allreduce communication on mxnet

C Yang (Amazon Web Services) - Tech. Rep., 2018
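For context, a minimal single-process simulation of a tree-based allreduce is sketched below: values are reduced up a binary tree to the root, then the result is broadcast back down. It only illustrates the communication pattern and is not MXNet's kvstore implementation.

```python
# Single-process illustration of a tree-based allreduce over a binary tree:
# a reduction phase toward the root, then a broadcast phase back to the
# leaves. Not MXNet's actual implementation, just the pattern.
def tree_allreduce(values):
    n = len(values)

    def reduce_up(node):
        # Sum this node's value with the reduced values of its children.
        total = values[node]
        for child in (2 * node + 1, 2 * node + 2):
            if child < n:
                total += reduce_up(child)
        return total

    def broadcast_down(node, result, out):
        # Propagate the root's result to every node in the tree.
        out[node] = result
        for child in (2 * node + 1, 2 * node + 2):
            if child < n:
                broadcast_down(child, result, out)

    total = reduce_up(0)             # reduction phase
    out = [None] * n
    broadcast_down(0, total, out)    # broadcast phase
    return out

print(tree_allreduce([1.0, 2.0, 3.0, 4.0]))  # every rank ends with 10.0
```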