Data lake management: challenges and opportunities

F Nargesian, E Zhu, RJ Miller, KQ Pu… - Proceedings of the VLDB …, 2019 - dl.acm.org
The ubiquity of data lakes has created fascinating new challenges for data management
research. In this tutorial, we review the state-of-the-art in data management for data lakes …

On data lake architectures and metadata management

P Sawadogo, J Darmont - Journal of Intelligent Information Systems, 2021 - Springer
Over the past two decades, we have witnessed an exponential increase of data production
in the world. So-called big data generally come from transactional systems, and even more …

Data lakes and Optimizing Query

A Katari - Available at SSRN, 2022 - papers.ssrn.com
Data lakes have emerged as a pivotal solution for managing vast amounts of unstructured
and structured data, offering unparalleled scalability and flexibility. This article delves into …

[HTML][HTML] Amplifying domain expertise in clinical data pipelines

P Rahman, A Nandi, C Hebert - JMIR Medical Informatics, 2020 - medinform.jmir.org
Digitization of health records has allowed the health care domain to adopt data-driven
algorithms for decision support. There are multiple people involved in this process: a data …

[图书][B] Relational Data Enrichment by Discovery and Transformation

F Nargesian - 2019 - search.proquest.com
In the context of data preparation for data science, we study the data enrichment problem,
which is the challenge of augmenting a table with relevant data. We consider two …

[图书][B] Amplifying domain expertise in medical data pipelines

P Rahman - 2020 - search.proquest.com
Digitization of medical documents has led to increased availability of data for analysis. This
has induced domains to incorporate data-driven decision-making. However, going from data …