Efficient online scheduling for coflow-aware machine learning clusters

W Li, S Chen, K Li, H Qi, R Xu… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Distributed machine learning (DML) is an increasingly important workload. In a DML job,
each communication phase can comprise a coflow, and there are dependencies among its …

Scheduling mix-coflows in datacenter networks

R Xu, W Li, K Li, X Zhou, H Qi - IEEE Transactions on Network …, 2020 - ieeexplore.ieee.org
Data-parallel applications generate a mix of coflows with and without deadlines. Deadline
coflows are mission-critical and must be completed within deadlines, while the non-deadline …

Coflow scheduling with performance guarantees for data center applications

A Hasnain, H Karl - … Symposium on Cluster, Cloud and Internet …, 2020 - ieeexplore.ieee.org
Data-parallel applications run on cluster of servers in a datacenter and their communication
triggers correlated resource demand on multiple links that can be abstracted as coflow. They …

Selective coflow completion for time-sensitive distributed applications with poco

S Luo, P Fan, H Xing, H Yu - … of the 49th International Conference on …, 2020 - dl.acm.org
Recently, the abstraction of coflow is introduced to capture the collective data transmission
patterns among modern distributed data-parallel application. During processing, coflows …