Recent advancements in event processing

M Dayarathna, S Perera - ACM Computing Surveys (CSUR), 2018 - dl.acm.org
Event processing (EP) is a data processing technology that conducts online processing of
event information. In this survey, we summarize the latest cutting-edge work done on EP …

[PDF][PDF] Apache flink: Stream and batch processing in a single engine

P Carbone, A Katsifodimos, S Ewen, V Markl… - The Bulletin of the …, 2015 - diva-portal.org
Apache Flink 1 is an open-source system for processing streaming and batch data. Flink is
built on the philosophy that many classes of data processing applications, including real …

Live video analytics at scale with approximation and {Delay-Tolerance}

H Zhang, G Ananthanarayanan, P Bodik… - … USENIX Symposium on …, 2017 - usenix.org
Video cameras are pervasively deployed for security and smart city scenarios, with millions
of them in large cities worldwide. Achieving the potential of these cameras requires …

Videoedge: Processing camera streams using hierarchical clusters

CC Hung, G Ananthanarayanan… - 2018 IEEE/ACM …, 2018 - ieeexplore.ieee.org
Organizations deploy a hierarchy of clusters-cameras, private clusters, public clouds-for
analyzing live video feeds from their cameras. Video analytics queries have many …

The dataflow model: a practical approach to balancing correctness, latency, and cost in massive-scale, unbounded, out-of-order data processing

T Akidau, R Bradshaw, C Chambers… - Proceedings of the …, 2015 - dl.acm.org
Unbounded, unordered, global-scale datasets are increasingly common in day-to-day
business (eg Web logs, mobile usage statistics, and sensor networks). At the same time …

In-memory big data management and processing: A survey

H Zhang, G Chen, BC Ooi, KL Tan… - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
Growing main memory capacity has fueled the development of in-memory big data
management and processing. By eliminating disk I/O bottleneck, it is now possible to support …

Structured streaming: A declarative api for real-time applications in apache spark

M Armbrust, T Das, J Torres, B Yavuz, S Zhu… - Proceedings of the …, 2018 - dl.acm.org
With the ubiquity of real-time data, organizations need streaming systems that are scalable,
easy to use, and easy to integrate into business applications. Structured Streaming is a new …

InferLine: latency-aware provisioning and scaling for prediction serving pipelines

D Crankshaw, GE Sela, X Mo, C Zumar… - Proceedings of the 11th …, 2020 - dl.acm.org
Serving ML prediction pipelines spanning multiple models and hardware accelerators is a
key challenge in production machine learning. Optimally configuring these pipelines to meet …

Approximate query processing: No silver bullet

S Chaudhuri, B Ding, S Kandula - Proceedings of the 2017 ACM …, 2017 - dl.acm.org
In this paper, we reflect on the state of the art of Approximate Query Processing. Although
much technical progress has been made in this area of research, we are yet to see its impact …

Macrobase: Prioritizing attention in fast data

P Bailis, E Gan, S Madden, D Narayanan… - Proceedings of the …, 2017 - dl.acm.org
As data volumes continue to rise, manual inspection is becoming increasingly untenable. In
response, we present MacroBase, a data analytics engine that prioritizes end-user attention …