Analyzing efficient stream processing on modern hardware

S Zeuch, BD Monte, J Karimov, C Lutz, M Renz… - Proceedings of the …, 2019 - dl.acm.org
Modern Stream Processing Engines (SPEs) process large data volumes under tight latency
constraints. Many SPEs execute processing pipelines using message passing on shared …

SCREEN: Stream data cleaning under speed constraints

S Song, A Zhang, J Wang, PS Yu - Proceedings of the 2015 ACM …, 2015 - dl.acm.org
Stream data are often dirty, for example, owing to unreliable sensor reading, or erroneous
extraction of stock prices. Most stream data cleaning approaches employ a smoothing filter …

Massive scale-out of expensive continuous queries

E Zeitler, T Risch - Proceedings of the VLDB Endowment, 2011 - dl.acm.org
Scalable execution of expensive continuous queries over massive data streams requires
input streams to be split into parallel sub-streams. The query operators are continuously …

[HTML][HTML] Metadata management for scientific databases

P Pinoli, S Ceri, D Martinenghi, L Nanni - Information Systems, 2019 - Elsevier
Most scientific databases consist of datasets (or sources) which in turn include samples (or
files) with an identical structure (or schema). In many cases, samples are associated with …

Transactional stream processing

I Botan, PM Fischer, D Kossmann… - Proceedings of the 15th …, 2012 - dl.acm.org
Many stream processing applications require access to a multitude of streaming as well as
stored data sources. Yet there is no clear semantics for correct continuous query execution …

Stream data cleaning under speed and acceleration constraints

S Song, F Gao, A Zhang, J Wang, PS Yu - ACM Transactions on …, 2021 - dl.acm.org
Stream data are often dirty, for example, owing to unreliable sensor reading or erroneous
extraction of stock prices. Most stream data cleaning approaches employ a smoothing filter …

[图书][B] Data lakes

A Laurent, D Laurent, C Madera - 2020 - books.google.com
The concept of a data lake is less than 10 years old, but they are already hugely
implemented within large companies. Their goal is to efficiently deal with ever-growing …

Efficient approximation and privacy preservation algorithms for real time online evolving data streams

RA Patil, PD Patil - World Wide Web, 2024 - Springer
Because of the processing of continuous unstructured large streams of data, mining real-
time streaming data is a more challenging research issue than mining static data. The …

Semantic stream query optimization exploiting dynamic metadata

L Ding, K Works… - 2011 IEEE 27th …, 2011 - ieeexplore.ieee.org
Data stream management systems (DSMS) processing long-running queries over large
volumes of stream data must typically deliver time-critical responses. We propose the first …

Schema matching and mapping: from usage to evaluation

A Bonifati, Y Velegrakis - … of the 14th International Conference on …, 2011 - dl.acm.org
This tutorial provides an overview of current evaluation techniques for schema matching and
mapping tasks and tools, alongside existing and broadly used evaluation scenarios. The …