PARADISE: Big data analytics using the DBMS tightly integrated with the distributed file system

Authors

Jun-Sung Kim, Kyu-Young Whang, Hyuk-Yoon Kwon, Il-Yeol Song

Publication date

2016/5

Journal

World Wide Web

Pages

299-322

Publisher

Springer US

Description

There has been a lot of research on MapReduce for big data analytics. This new class of systems sacrifices DBMS functionality such as query languages, schemas, or indexes in order to maximize scalability and parallelism. However, as high functionality of the DBMS is considered important for big data analytics as well, there have been a lot of efforts to support DBMS functionality in MapReduce. HadoopDB is the only work that directly utilizes the DBMS for big data analytics in the MapReduce framework, taking advantage of both the DBMS and MapReduce. However, HadoopDB does not support sharability for the entire data since it stores the data into multiple nodes in a shared-nothing manner—i.e., it partitions a job into multiple tasks where each task is assigned to a fragment of data. Due to this limitation, HadoopDB cannot effectively process queries that require internode communication. That is …

Total citations

Cited by 7

[Bar chart: citations per year, 2015 to 2019; extracted bar values 2, 2, 2, 1]

Scholar articles

PARADISE: Big data analytics using the DBMS tightly integrated with the distributed file system

JS Kim, KY Whang, HY Kwon, IY Song - World Wide Web, 2016
