A Dutta, N Chaki, RK De - arXiv preprint arXiv:2410.14312, 2024 - arxiv.org
DNN training is extremely time-consuming, necessitating efficient multi-accelerator
parallelization, where a single iteration of training is split over the accelerators. Current …