Big data stream analysis: a systematic literature review

T Kolajo, O Daramola, A Adebiyi - Journal of Big Data, 2019 - Springer
Recently, big data streams have become ubiquitous due to the fact that a number of
applications generate a huge amount of data at a great velocity. This made it difficult for …

A survey of distributed data stream processing frameworks

H Isah, T Abughofa, S Mahfuz, D Ajerla… - IEEE …, 2019 - ieeexplore.ieee.org
Big data processing systems are evolving to be more stream oriented where each data
record is processed as it arrives by distributed and low-latency computational frameworks on …

Noscope: optimizing neural network queries over video at scale

D Kang, J Emmons, F Abuzaid, P Bailis… - arXiv preprint arXiv …, 2017 - arxiv.org
Recent advances in computer vision-in the form of deep neural networks-have made it
possible to query increasing volumes of video data with high accuracy. However, neural …

The dataflow model: a practical approach to balancing correctness, latency, and cost in massive-scale, unbounded, out-of-order data processing

T Akidau, R Bradshaw, C Chambers… - Proceedings of the …, 2015 - dl.acm.org
Unbounded, unordered, global-scale datasets are increasingly common in day-to-day
business (eg Web logs, mobile usage statistics, and sensor networks). At the same time …

Focus: Querying large video datasets with low latency and low cost

K Hsieh, G Ananthanarayanan, P Bodik… - … USENIX Symposium on …, 2018 - usenix.org
Large volumes of videos are continuously recorded from cameras deployed for traffic control
and surveillance with the goal of answering “after the fact” queries: identify video frames with …

Sonata: Query-driven streaming network telemetry

A Gupta, R Harrison, M Canini, N Feamster… - Proceedings of the …, 2018 - dl.acm.org
Managing and securing networks requires collecting and analyzing network traffic data in
real time. Existing telemetry systems do not allow operators to express the range of queries …

Discretized streams: Fault-tolerant streaming computation at scale

M Zaharia, T Das, H Li, T Hunter, S Shenker… - Proceedings of the …, 2013 - dl.acm.org
Many" big data" applications must act on data in real time. Running these applications at
ever-larger scales requires parallel platforms that automatically handle faults and stragglers …

Awstream: Adaptive wide-area streaming analytics

B Zhang, X Jin, S Ratnasamy, J Wawrzynek… - Proceedings of the 2018 …, 2018 - dl.acm.org
The emerging class of wide-area streaming analytics faces the challenge of scarce and
variable WAN bandwidth. Non-adaptive applications built with TCP or UDP suffer from …

State management in Apache Flink®: consistent stateful distributed stream processing

P Carbone, S Ewen, G Fóra, S Haridi… - Proceedings of the …, 2017 - dl.acm.org
Stream processors are emerging in industry as an apparatus that drives analytical but also
mission critical services handling the core of persistent application logic. Thus, apart from …

Millwheel: Fault-tolerant stream processing at internet scale

T Akidau, A Balikov, K Bekiroğlu, S Chernyak… - Proceedings of the …, 2013 - dl.acm.org
MillWheel is a framework for building low-latency data-processing applications that is widely
used at Google. Users specify a directed computation graph and application code for …