作者
Ingo Müller, Renato Marroquín, Dimitrios Koutsoukos, Mike Wawrzoniak, Sabir Akhadov, Gustavo Alonso
发表日期
2020/6/15
图书
Proceedings of the 16th International Workshop on Data Management on New Hardware
页码范围
1-10
简介
Getting the best performance from the ever-increasing number of hardware platforms has been a recurring challenge for data processing systems. In recent years, the advent of data science with its increasingly numerous and complex types of analytics has made this challenge even more difficult. In practice, system designers are overwhelmed by the number of combinations and typically implement a single analytics type on one platform, leading to repeated implementation effort---and a plethora of semi-compatible tools for data scientists.
In this paper, we propose the "Collection Virtual Machine" (or CVM)---an extensible compiler framework designed to keep the specialization process of data analytics systems tractable. It can capture at the same time the essence of a large span of low-level, hardware-specific implementation techniques as well as high-level operations of different types of analyses. At its core lies a …
引用总数
2020202120222023202411621
学术搜索中的文章
I Müller, R Marroquín, D Koutsoukos, M Wawrzoniak… - Proceedings of the 16th International Workshop on …, 2020