Data lake management: challenges and opportunities

F Nargesian, E Zhu, RJ Miller, KQ Pu… - Proceedings of the VLDB …, 2019 - dl.acm.org
The ubiquity of data lakes has created fascinating new challenges for data management
research. In this tutorial, we review the state-of-the-art in data management for data lakes …

Data lakes: A survey of functions and systems

R Hai, C Koutras, C Quix… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Data lakes are becoming increasingly prevalent for Big Data management and data
analytics. In contrast to traditional 'schema-on-write'approaches such as data warehouses …

An overview about data integration in data lakes

JC Couto, DD Ruiz - 2022 17th Iberian Conference on …, 2022 - ieeexplore.ieee.org
Integrating data in data lakes is essential so we can perform more complex analyses.
However, data lakes are mainly composed of raw data, from structured, semi-structured, and …

VizCommender: Computing text-based similarity in visualization repositories for content-based recommendations

M Oppermann, R Kincaid… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Cloud-based visualization services have made visual analytics accessible to a much wider
audience than ever before. Systems such as Tableau have started to amass increasingly …

Data lakehouse-a novel step in analytics architecture

D Oreščanin, T Hlupić - 2021 44th International Convention on …, 2021 - ieeexplore.ieee.org
Data Lakes, as a modern concept of raw analytical data storage, were presented as a next
step that will take over Data Warehouses. In the course of time after the Data Lakes …

[HTML][HTML] Generative mechanisms of AI implementation: A critical realist perspective on predictive maintenance

A Stohr, P Ollig, R Keller, A Rieger - Information and Organization, 2024 - Elsevier
Artificial intelligence (AI) promises various new opportunities to create and appropriate
business value. However, many organizations–especially those in more traditional …

[HTML][HTML] Data quality for federated medical data lakes

J Eder, VA Shekhovtsov - International Journal of Web Information …, 2021 - emerald.com
Purpose Medical research requires biological material and data collected through biobanks
in reliable processes with quality assurance. Medical studies based on data with unknown …

An approach to extracting topic-guided views from the sources of a data lake

C Diamantini, P Lo Giudice, D Potena, E Storti… - Information Systems …, 2021 - Springer
In the last years, data lakes are emerging as an effective and an efficient support for
information and knowledge extraction from a huge amount of highly heterogeneous and …

Moving beyond set-it-and-forget-it privacy settings on social media

M Mondal, GS Yilmaz, N Hirsch, MT Khan… - Proceedings of the …, 2019 - dl.acm.org
When users post on social media, they protect their privacy by choosing an access control
setting that is rarely revisited. Changes in users' lives and relationships, as well as social …

Managing Personal Identifiable Information in Data Lakes

D Oreščanin, T Hlupić, B Vrdoljak - IEEE access, 2024 - ieeexplore.ieee.org
Privacy is a fundamental human right according to the Universal Declaration of Human
Rights of the United Nations. Adoption of the General Data Protection Regulation (GDPR) in …