作者
Wenjia Zheng, Michael Tynes, Henry Gorelick, Ying Mao, Long Cheng, Yantian Hou
发表日期
2019
研讨会论文
ACM the 48th International Conference on Parallel Processing (ICPP '19)
简介
An increasing number of companies are using data analytics to improve their products, services, and business processes. However, learning knowledge effectively from massive data sets always involves nontrivial computational resources. Most businesses thus choose to migrate their hardware needs to a remote cluster computing service (e.g., AWS) or to an in-house cluster facility which is often run at its resource capacity. In such scenarios, where jobs compete for available resources utilizing resources effectively to achieve high-performance data analytics becomes desirable. Although cluster resource management is a fruitful research area having made many advances (e.g., YARN, Kubernetes), few projects have investigated how further optimizations can be made specifically for training multiple machine learning (ML) / deep learning (DL) models. In this work, we introduce FlowCon, a system which is able to …
引用总数
201920202021202220232024210131043
学术搜索中的文章
W Zheng, M Tynes, H Gorelick, Y Mao, L Cheng, Y Hou - Proceedings of the 48th International Conference on …, 2019