Streaming Approach to Schema Profiling

C Forresi, M Francia, E Gallinucci… - European Conference on …, 2023 - Springer
… In this work, the first citizen is the schema of a message in the stream. Streaming … , ie,
collections of key-value pairs, where keys are the schema elements (ie, attributes). It is common to …

Dynamic Stream Clustering for Real-Time Schema Profiling with Dsc+

C Forresi, M Francia, E Gallinucci… - Available at SSRN … - papers.ssrn.com
… key insights about the schema of data in a high-variety context. In this paper, we present
a streaming approach to schema profiling called DSC+. The approach works under the …

Big data stream processing

OC Marcu, P Bouvry - 2024 - hal.science
… The project focuses on establishing a data lake with a unified metadata schema to ensure
… query execution, combining various profiling techniques to optimize code for modern …

Single-column data profiling

H Harmouch - 2020 - publishup.uni-potsdam.de
… for database management and schema reverse engineering. Data profiling has also more
novel … column cardinalities from big data and data streams. We investigate the effect of missing/…

Streaming the web: Reasoning over dynamic data

A Margara, J Urbani, F Van Harmelen, H Bal - Journal of Web Semantics, 2014 - Elsevier
… In this paper, we first report some example application areas that can benefit from stream
reasoning and analyze the requirements they pose. Then, we survey existing approaches in …

Data profiling revisited

F Naumann - ACM SIGMOD Record, 2014 - dl.acm.org
… can be served by data stream management systems [23]. If this is not the case, continuous
… such “soft schema” values is a challenge, and a first step must involve schema profiling as …

A survey on the evolution of stream processing systems

M Fragkoulis, P Carbone, V Kalavri, A Katsifodimos - The VLDB Journal, 2024 - Springer
… model, as implemented by systems of the second generation [16, 37, 164], does not impose
any strict schema or semantics on the input stream elements, other than the presence of a …

Streamcloud: A large scale data streaming system

V Gulisano, R Jimenez-Peris… - 2010 IEEE 30th …, 2010 - ieeexplore.ieee.org
stream and so on. Union is used to merge two or more streams with the same schema into
a single output stream. … this profiling has been made with 30 nodes for all approaches), the …

A network and device aware QoS approach for cloud-based mobile streaming

CF Lai, H Wang, HC Chao… - IEEE transactions on …, 2013 - ieeexplore.ieee.org
… an XML-schema profile and transmit it to the profile agent. The profile agent determines the
… The parameter profile module records the user profile and determines the parameter list …

A survey of distributed data stream processing frameworks

H Isah, T Abughofa, S Mahfuz, D Ajerla… - IEEE …, 2019 - ieeexplore.ieee.org
… Individual records are structured list of attributes and their data types while a schema is a …
title of a hybrid autonomous data analytics framework using a semantic data profiling