[BOOK][B] Data-intensive text processing with MapReduce

J Lin, C Dyer - 2022 - books.google.com
Our world is being revolutionized by data-driven methods: access to large amounts of data
has generated new insights and opened exciting new opportunities in commerce, science …

[PDF][PDF] Nectar: automatic management of data and computation in datacenters

PK Gunda, L Ravindranath, CA Thekkath, Y Yu… - … USENIX Symposium on …, 2010 - usenix.org
Managing data and computation is at the heart of datacenter computing. Manual
management of data can lead to data loss, wasteful consumption of storage, and laborious …

Declarative networking

BT Loo, T Condie, M Garofalakis, DE Gay… - Communications of the …, 2009 - dl.acm.org
Declarative Networking is a programming methodology that enables developers to concisely
specify network protocols and services, which are directly compiled to a dataflow framework …

Dedalus: Datalog in Time and Space

P Alvaro, WR Marczak, N Conway… - Datalog Reloaded: First …, 2011 - Springer
Recent research has explored using Datalog-based languages to express a distributed
system as a set of logical invariants. Two properties of distributed systems proved difficult to …

Boom analytics: exploring data-centric, declarative programming for the cloud

P Alvaro, T Condie, N Conway, K Elmeleegy… - Proceedings of the 5th …, 2010 - dl.acm.org
Building and debugging distributed software remains extremely difficult. We conjecture that
by adopting a data-centric approach to system design and by employing declarative …

The declarative imperative: experiences and conjectures in distributed logic

JM Hellerstein - ACM SIGMOD Record, 2010 - dl.acm.org
The rise of multicore processors and cloud computing is putting enormous pressure on the
software community to find solutions to the difficulty of parallel and distributed programming …

Etalis: Rule-based reasoning in event processing

D Anicic, P Fodor, S Rudolph, R Stühmer… - Reasoning in event …, 2011 - Springer
Complex Event Processing (CEP) is concerned with timely detection of complex
events within multiple streams of atomic occurrences, and has useful applications in areas …

Fast checkpoint recovery algorithms for frequently consistent applications

T Cao, M Vaz Salles, B Sowell, Y Yue… - Proceedings of the …, 2011 - dl.acm.org
Advances in hardware have enabled many long-running applications to execute entirely in
main memory. As a result, these applications have increasingly turned to database …

Ambrosia: Providing performant virtual resiliency for distributed applications

J Goldstein, A Abdelhamid, M Barnett… - Proceedings of the …, 2020 - dl.acm.org
When writing today's distributed programs, which frequently span both devices and cloud
services, programmers are faced with complex decisions and coding tasks around coping …

Osprey: Implementing MapReduce-style fault tolerance in a shared-nothing distributed database

C Yang, C Yen, C Tan… - 2010 IEEE 26th …, 2010 - ieeexplore.ieee.org
In this paper, we describe a scheme for tolerating and recovering from mid-query faults in a
distributed shared nothing database. Rather than aborting and restarting queries, our …