S Rajasekaran, M Ghobadi, A Akella - arXiv e-prints, 2023 - ui.adsabs.harvard.edu
We present CASSINI, a network-aware job scheduler for machine learning (ML) clusters.
CASSINI introduces a novel geometric abstraction to consider the communication pattern of …