作者
Fabrizio Marozzo, Domenico Talia, Paolo Trunfio
发表日期
2016/7/7
期刊
IEEE Transactions on Services Computing
卷号
11
期号
3
页码范围
480-492
出版商
IEEE
简介
The extraction of useful information from data is often a complex process that can be conveniently modeled as a data analysis workflow. When very large data sets must be analyzed and/or complex data mining algorithms must be executed, data analysis workflows may take very long times to complete their execution. Therefore, efficient systems are required for the scalable execution of data analysis workflows, by exploiting the computing services of the Cloud platforms where data is increasingly being stored. The objective of the paper is to demonstrate how Cloud software technologies can be integrated to implement an effective environment for designing and executing scalable data analysis workflows. We describe the design and implementation of the Data Mining Cloud Framework (DMCF), a data analysis system that integrates a visual workflow language and a parallel runtime with the Software-as-a-Service …
引用总数
201620172018201920202021202220232024137101412351310
学术搜索中的文章
F Marozzo, D Talia, P Trunfio - IEEE Transactions on Services Computing, 2016