Apache Tez: A unifying framework for modeling and building data processing applications

B Saha, H Shah, S Seth, G Vijayaraghavan… - Proceedings of the …, 2015 - dl.acm.org
The broad success of Hadoop has led to a fast-evolving and diverse ecosystem of
application engines that are building upon the YARN resource management layer. The open …

Big data 2.0 processing systems: Taxonomy and open challenges

F Bajaber, R Elshawi, O Batarfi, A Altalhi… - Journal of Grid …, 2016 - Springer
Data is a key resource in the modern world. Big data has become a popular term which is
used to describe the exponential growth and availability of data. In practice, the growing …

[BOOK][B] Big data 2.0 processing systems

S Sakr - 2016 - Springer
We live in an age of so-called Big Data. The radical expansion and integration of
computation, networking, digital devices, and data storage has provided a robust platform for …

The impact of columnar file formats on SQL‐on‐Hadoop engine performance: A study on ORC and Parquet

T Ivanov, M Pergolesi - Concurrency and Computation …, 2020 - Wiley Online Library
Columnar file formats provide an efficient way to store data to be queried by SQL‐on‐
Hadoop engines. Related works consider the performance of processing engine and file …

Efficient OLAP query processing across cuboids in distributed data warehousing environment

S Roy, S Raj, T Chakraborty, A Chakrabarty… - Expert Systems with …, 2024 - Elsevier
This research work introduces a novel approach to enhance the performance of distributed
data warehouses. Distribution of data has been done across multiple data center for the …

Evaluating SQL-on-Hadoop for big data warehousing on not-so-good hardware

MY Santos, C Costa, J Galvão, C Andrade… - Proceedings of the 21st …, 2017 - dl.acm.org
Big Data is currently conceptualized as data whose volume, variety or velocity impose
significant difficulties in traditional techniques and technologies. Big Data Warehousing is …

Big SQL systems: an experimental evaluation

V Aluko, S Sakr - Cluster Computing, 2019 - Springer
Recently, Big Data systems have been gaining increasing popularity on handling
the massive amounts of data that are continuously generated in our digital world. While the …

On the inequality of the 3V's of Big Data Architectural Paradigms: A case for heterogeneity

T Ivanov, N Korfiatis, RV Zicari - arXiv preprint arXiv:1311.0805, 2013 - arxiv.org
The well-known 3V architectural paradigm for Big Data introduced by Laney (2011),
provides a simplified framework for defining the architecture of a big data platform to be …

Compiler-Assisted Static Checkpoint Insertion.

J Long, WK Fuchs, JA Abraham - FTCS, 1992 - ieeexplore.ieee.org
This paper describes a compiler-assisted approach for static checkpoint insertion. Instead of
fixing the checkpoint location before program execution, a compiler enhanced polling …

Roles of Customer Databases and Database Marketing in Customer Relationship Management

PC Mandal - International Journal of E-Business Research (IJEBR), 2022 - igi-global.com
Development in information technology helps companies to build customer
databases, perform database marketing, and do relationship management. The study …