Discretized streams: Fault-tolerant streaming computation at scale

M Zaharia, T Das, H Li, T Hunter, S Shenker… - Proceedings of the …, 2013 - dl.acm.org
Many "big data" applications must act on data in real time. Running these applications at
ever-larger scales requires parallel platforms that automatically handle faults and stragglers …

Discretized streams: An efficient and Fault-Tolerant model for stream processing on large clusters

M Zaharia, T Das, H Li, S Shenker, I Stoica - 4th USENIX Workshop on …, 2012 - usenix.org
Many important “big data” applications need to process data arriving in real time. However,
current programming models for distributed stream processing are relatively low-level, often …

Streaming data integration: Challenges and opportunities

N Tatbul - 2010 IEEE 26th International Conference on Data …, 2010 - ieeexplore.ieee.org
In this position paper, we motivate the need for streaming data integration in three main
forms including across multiple streaming data sources, over multiple stream processing …

Aggregation and Degradation in JetStream: Streaming Analytics in the Wide Area

A Rabkin, M Arye, S Sen, VS Pai… - 11th USENIX Symposium …, 2014 - usenix.org
We present JetStream, a system that allows real-time analysis of large, widely-distributed
changing data sets. Traditional approaches to distributed analytics require users to specify …

A survey on IoT big data analytic systems: Current and future

Y Sasaki - IEEE Internet of Things Journal, 2021 - ieeexplore.ieee.org
The Internet of Things (IoT) has become widespread around the world. Since a large
number of diverse devices, such as vehicles, household electrical appliances, smart …

[BOOK] An architecture for fast and general data processing on large clusters

M Zaharia - 2016 - books.google.com
The past few years have seen a major change in computing systems, as growing data
volumes and stalling processor speeds require more and more applications to scale out to …

System and method for data stream processing

M Hsu, Q Chen - US Patent 8,260,803, 2012 - Google Patents
A method and system for processing a data stream are described. The method executes,
until the occurrence of a cut condition, a map function from a set of query processing steps to …

Temporal analytics on big data for web advertising

B Chandramouli, J Goldstein… - 2012 IEEE 28th …, 2012 - ieeexplore.ieee.org
"Big Data" in map-reduce (MR) clusters is often fundamentally temporal in nature, as are
many analytics tasks over such data. For instance, display advertising uses Behavioral …

Continuous analytics over discontinuous streams

S Krishnamurthy, MJ Franklin, J Davis… - Proceedings of the …, 2010 - dl.acm.org
Continuous analytics systems that enable query processing over streams of data have
emerged as key solutions for dealing with massive data volumes and demands for low …

[PDF] In-situ MapReduce for Log Processing

D Logothetis, C Trezzo, KC Webb… - 2011 USENIX Annual …, 2011 - usenix.org
Log analytics are a bedrock component of running many of today's Internet sites. Application
and click logs form the basis for tracking and analyzing customer behaviors and …