[PDF][PDF] 大数据流式计算: 关键技术及系统实例

孙大为, 张广艳, 郑纬民 - 软件学报, 2014 - jos.org.cn
大数据计算主要有批量计算和流式计算两种形态, 目前, 关于大数据批量计算系统的研究和讨论
相对充分, 而如何构建低延迟, 高吞吐且持续可靠运行的大数据流式计算系统是当前亟待解决的 …

Verdi: a framework for implementing and formally verifying distributed systems

JR Wilcox, D Woos, P Panchekha, Z Tatlock… - Proceedings of the 36th …, 2015 - dl.acm.org
Distributed systems are difficult to implement correctly because they must handle both
concurrency and failures: machines may crash at arbitrary points and networks may reorder …

Why does the cloud stop computing? lessons from hundreds of service outages

HS Gunawi, M Hao, RO Suminto, A Laksono… - Proceedings of the …, 2016 - dl.acm.org
We conducted a cloud outage study (COS) of 32 popular Internet services. We analyzed
1247 headline news and public post-mortem reports that detail 597 unplanned outages that …

What bugs live in the cloud? a study of 3000+ issues in cloud systems

HS Gunawi, M Hao, T Leesatapornwongsa… - Proceedings of the …, 2014 - dl.acm.org
We conduct a comprehensive study of development and deployment issues of six popular
and important cloud systems (Hadoop MapReduce, HDFS, HBase, Cassandra, ZooKeeper …

Brownout: Building more robust cloud applications

C Klein, M Maggio, KE Årzén… - Proceedings of the 36th …, 2014 - dl.acm.org
Self-adaptation is a first class concern for cloud applications, which should be able to
withstand diverse runtime changes. Variations are simultaneously happening both at the …

Retro: Targeted resource management in multi-tenant distributed systems

J Mace, P Bodik, R Fonseca, M Musuvathi - 12th USENIX Symposium …, 2015 - usenix.org
In distributed systems shared by multiple tenants, effective resource management is an
important pre-requisite to providing quality of service guarantees. Many systems deployed …

An empirical study on the correctness of formally verified distributed systems

P Fonseca, K Zhang, X Wang… - Proceedings of the Twelfth …, 2017 - dl.acm.org
Recent advances in formal verification techniques enabled the implementation of distributed
systems with machine-checked proofs. While results are encouraging, the importance of …

Chapar: certified causally consistent distributed key-value stores

M Lesani, CJ Bell, A Chlipala - ACM SIGPLAN Notices, 2016 - dl.acm.org
Today's Internet services are often expected to stay available and render high
responsiveness even in the face of site crashes and network partitions. Theoretical results …

Big data stream computing: technologies and instances

孙大为, 张广艳, 郑纬民 - Journal of Software, 2014 - jos.org.cn
大数据计算主要有批量计算和流式计算两种形态, 目前, 关于大数据批量计算系统的研究和讨论
相对充分, 而如何构建低延迟, 高吞吐且持续可靠运行的大数据流式计算系统是当前亟待解决的 …

Predicting the end-to-end tail latency of containerized microservices in the cloud

J Rahman, P Lama - 2019 IEEE International Conference on …, 2019 - ieeexplore.ieee.org
Large-scale web services are increasingly adopting cloud-native principles of application
design to better utilize the advantages of cloud computing. This involves building an …