Gradientflow: Optimizing network performance for large-scale distributed dnn training

P Sun, Y Wen, R Han, W Feng… - IEEE Transactions on Big …, 2019 - ieeexplore.ieee.org
It is important to scale out deep neural network (DNN) training for reducing model training
time. The high communication overhead is one of the major performance bottlenecks for …

GradientFlow: Optimizing Network Performance for Large-Scale Distributed DNN Training

P Sun, Y Wen, R Han, W Feng… - IEEE Transactions on Big …, 2022 - store.computer.org
It is important to scale out deep neural network (DNN) training for reducing model training
time. The high communication overhead is one of the major performance bottlenecks for …

GradientFlow: Optimizing Network Performance for Large-Scale Distributed DNN Training

P Sun, Y Wen, R Han, W Feng, S Yan - IEEE Transactions on Big …, 2022 - computer.org
It is important to scale out deep neural network (DNN) training for reducing model training
time. The high communication overhead is one of the major performance bottlenecks for …