作者
Giovanni Neglia, Gianmarco Calbi, Don Towsley, Gayane Vardoyan
发表日期
2019/4/29
研讨会论文
IEEE INFOCOM 2019-IEEE Conference on Computer Communications
页码范围
2350-2358
出版商
IEEE
简介
Many learning problems are formulated as minimization of some loss function on a training set of examples. Distributed gradient methods on a cluster are often used for this purpose. In this paper, we study how the variability of task execution times at cluster nodes affects the system throughput. In particular, a simple but accurate model allows us to quantity how the time to solve the minimization problem depends on the network of information exchanges among the nodes. Interestingly, we show that, even when communication overhead may be neglected, the clique is not necessarily the most effective topology, as commonly assumed in previous works.
引用总数
20192020202120222023202425719175
学术搜索中的文章
G Neglia, G Calbi, D Towsley, G Vardoyan - IEEE INFOCOM 2019-IEEE Conference on Computer …, 2019