Survey of real-time processing systems for big data

X Liu, N Iftikhar, X Xie - Proceedings of the 18th International Database …, 2014 - dl.acm.org
In recent years, real-time processing and analytics systems for big data--in the context of
Business Intelligence (BI)--have received a growing attention. The traditional BI platforms …

From conceptual design to performance optimization of ETL workflows: current state of research and open problems

SMF Ali, R Wrembel - The VLDB Journal, 2017 - Springer
In this paper, we discuss the state of the art and current trends in designing and optimizing
ETL workflows. We explain the existing techniques for:(1) constructing a conceptual and a …

A hybrid ICT-solution for smart meter data analytics

X Liu, PS Nielsen - Energy, 2016 - Elsevier
Smart meters are increasingly used worldwide. Smart meters are the advanced meters
capable of measuring energy consumption at a fine-grained time interval, eg, every 15 min …

Scheduling strategies for efficient ETL execution

A Karagiannis, P Vassiliadis, A Simitsis - Information Systems, 2013 - Elsevier
Extract-transform-load (ETL) workflows model the population of enterprise data warehouses
with information gathered from a large variety of heterogeneous data sources. ETL …

Regression-based online anomaly detection for smart grid data

X Liu, PS Nielsen - arXiv preprint arXiv:1606.05781, 2016 - arxiv.org
With the widely used smart meters in the energy sector, anomaly detection becomes a
crucial mean to study the unusual consumption behaviors of customers, and to discover …

A Fine‐Grained Distribution Approach for ETL Processes in Big Data Environments

M Bala, O Boussaid, Z Alimazighi - Data & Knowledge Engineering, 2017 - Elsevier
Among the so-called “4Vs”(volume, velocity, variety, and veracity) that characterize the
complexity of Big Data, this paper focuses on the issue of “Volume” in order to ensure good …

CloudETL: scalable dimensional ETL for hive

X Liu, C Thomsen, TB Pedersen - Proceedings of the 18th International …, 2014 - dl.acm.org
Extract-Transform-Load (ETL) programs process data into data warehouses (DWs). Rapidly
growing data volumes demand systems that scale out. Recently, much attention has been …

Mapreduce-based dimensional etl made easy

X Liu, C Thomsen, TB Pedersen - Proceedings of the VLDB Endowment, 2012 - dl.acm.org
This paper demonstrates ETLMR, a novel dimensional Extract--Transform--Load (ETL)
programming framework that uses Map-Reduce to achieve scalability. ETLMR has built-in …

P-ETL: Parallel-ETL based on the MapReduce paradigm

M Bala, O Boussaid, Z Alimazighi - 2014 IEEE/ACS 11th …, 2014 - ieeexplore.ieee.org
Big data is an opportunity in the emergence of novel business applications such as “Big
Data Analytics”(BDA). However, these data with non-traditional volumes create a real …

An ETL optimization framework using partitioning and parallelization

X Liu, N Iftikhar - Proceedings of the 30th Annual ACM Symposium on …, 2015 - dl.acm.org
Extract-Transform-Load (ETL) handles large amounts of data and manages workload
through dataflows. ETL dataflows are widely regarded as complex and expensive …