Towards a programmable semantic extract-transform-load framework for semantic data warehouses

RP Deb Nath, K Hose, TB Pedersen - Proceedings of the ACM …, 2015 - dl.acm.org
In order to create better decisions for business analytics, organizations increasingly use
external data, structured, semi-structured and unstructured, in addition to the (mostly …

CITIESData: a smart city data management framework

X Liu, A Heller, PS Nielsen - Knowledge and Information Systems, 2017 - Springer
Smart city data come from heterogeneous sources including various types of the Internet of
Things such as traffic, weather, pollution, noise, and portable devices. They are …

A model-driven framework for ETL process development

Z El Akkaoui, E Zimányi, JN Mazón… - Proceedings of the ACM …, 2011 - dl.acm.org
ETL processes are the backbone component of a data warehouse, since they supply the
data warehouse with the necessary integrated and reconciled data from heterogeneous and …

Performance of the ETL processes in terms of volume and velocity in the cloud: State of the art

PS Diouf, A Boly, S Ndiaye - 2017 4th IEEE international …, 2017 - ieeexplore.ieee.org
The ETL (Extract-Transform-Load) consists of extracting data from various sources,
transforming and loading them into a place called datawarehouse. ETL is a mandatory step …

A Fine‐Grained Distribution Approach for ETL Processes in Big Data Environments

M Bala, O Boussaid, Z Alimazighi - Data & Knowledge Engineering, 2017 - Elsevier
Among the so-called “4Vs”(volume, velocity, variety, and veracity) that characterize the
complexity of Big Data, this paper focuses on the issue of “Volume” in order to ensure good …

CloudETL: scalable dimensional ETL for hive

X Liu, C Thomsen, TB Pedersen - Proceedings of the 18th International …, 2014 - dl.acm.org
Extract-Transform-Load (ETL) programs process data into data warehouses (DWs). Rapidly
growing data volumes demand systems that scale out. Recently, much attention has been …

High-level ETL for semantic data warehouses

RP Deb Nath, O Romero, TB Pedersen… - Semantic …, 2021 - journals.sagepub.com
The popularity of the Semantic Web (SW) encourages organizations to organize and publish
semantic data using the RDF model. This growth poses new requirements to Business …

SLOD-BI: an open data infrastructure for enabling social business intelligence

R Berlanga, L García-Moya, V Nebot… - International Journal of …, 2015 - igi-global.com
The tremendous popularity of web-based social media is attracting the attention of the
industry to take profit from the massive availability of sentiment data, which is considered of …

ETLMR: a highly scalable dimensional ETL framework based on MapReduce

X Liu, C Thomsen, TB Pedersen - … , France, August 29-September 2, 2011 …, 2011 - Springer
Abstract Extract-Transform-Load (ETL) flows periodically populate data warehouses (DWs)
with data from different source systems. An increasing challenge for ETL flows is processing …

Mapreduce-based dimensional etl made easy

X Liu, C Thomsen, TB Pedersen - Proceedings of the VLDB Endowment, 2012 - dl.acm.org
This paper demonstrates ETLMR, a novel dimensional Extract--Transform--Load (ETL)
programming framework that uses Map-Reduce to achieve scalability. ETLMR has built-in …