作者
Long Cheng, Ying Wang, Qingzhi Liu, Dick HJ Epema, Cheng Liu, Ying Mao, John Murphy
发表日期
2021/1/20
期刊
IEEE Transactions on Parallel and Distributed Systems
卷号
32
期号
6
页码范围
1494-1510
出版商
IEEE
简介
Large data centers are currently the mainstream infrastructures for big data processing. As one of the most fundamental tasks in these environments, the efficient execution of distributed data operators (e.g., join and aggregation) are still challenging current data systems, and one of the key performance issues is network communication time. State-of-the-art methods trying to improve that problem focus on either application-layer data locality optimization to reduce network traffic or on network-layer data flow optimization to increase bandwidth utilization. However, the techniques in the two layers are totally independent from each other, and performance gains from a joint optimization perspective have not yet been explored. In this article, we propose a novel approach called NEAL (NEtwork-Aware Locality scheduling) to bridge this gap, and consequently to further reduce communication time for distributed big data …
引用总数
学术搜索中的文章
L Cheng, Y Wang, Q Liu, DHJ Epema, C Liu, Y Mao… - IEEE Transactions on Parallel and Distributed Systems, 2021