S Shi, X Zhou, S Song, X Wang, Z Zhu… - Proceedings of …, 2021 - proceedings.mlsys.org
Distributed training techniques have been widely deployed for training large-scale deep
models on dense-GPU clusters. However, on public cloud clusters, due to the moderate …