Spark versus flink: Understanding performance in big data analytics frameworks

OC Marcu, A Costan, G Antoniu… - 2016 IEEE …, 2016 - ieeexplore.ieee.org
Big Data analytics has recently gained increasing popularity as a tool to process large
amounts of data on-demand. Spark and Flink are two Apache-hosted data analytics …

[图书][B] Big data 2.0 processing systems: a survey

S Sakr - 2016 - Springer
We live in an age of so-called Big Data. The radical expansion and integration of
computation, networking, digital devices, and data storage have provided a robust platform …

Comet: batched stream processing for data intensive distributed computing

B He, M Yang, Z Guo, R Chen, B Su, W Lin… - Proceedings of the 1st …, 2010 - dl.acm.org
Batched stream processing is a new distributed data processing paradigm that models
recurring batch computations on incrementally bulk-appended data streams. The model is …

Towards a comprehensive data analytics framework for smart healthcare services

S Sakr, A Elgammal - Big Data Research, 2016 - Elsevier
With the increasing volumes of information gathered via patient monitoring systems,
physicians have been put on increasing pressure for making sophisticated analytical …

A survey on geographically distributed big-data processing using MapReduce

S Dolev, P Florissi, E Gudes… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
Hadoop and Spark are widely used distributed processing frameworks for large-scale data
processing in an efficient and fault-tolerant manner on private or public clouds. These big …

From a monolithic big data system to a microservices event-driven architecture

R Laigner, M Kalinowski, P Diniz… - 2020 46th Euromicro …, 2020 - ieeexplore.ieee.org
Context: Data-intensive systems, aka big data systems (BDS), are software systems that
handle a large volume of data in the presence of performance quality attributes, such as …

Supporting scalable analytics with latency constraints

B Li, Y Diao, P Shenoy - Proceedings of the VLDB Endowment, 2015 - dl.acm.org
Recently there has been a significant interest in building big data analytics systems that can
handle both" big data" and" fast data". Our work is strongly motivated by recent real-world …

Bigdatabench: a big data benchmark suite from web search engines

W Gao, Y Zhu, Z Jia, C Luo, L Wang, Z Li… - arXiv preprint arXiv …, 2013 - arxiv.org
This paper presents our joint research efforts on big data benchmarking with several
industrial partners. Considering the complexity, diversity, workload churns, and rapid …

What is big data

E Dumbill - An introduction to the big data landscape.[online] …, 2012 - mhsinformatics.org
Big data is data that exceeds the processing capacity of conventional database systems.
The data is too big, moves too fast, or doesn't fit the strictures of your database architectures …

Big data: From beginning to future

I Yaqoob, IAT Hashem, A Gani, S Mokhtar… - International Journal of …, 2016 - Elsevier
Big data is a potential research area receiving considerable attention from academia and IT
communities. In the digital world, the amounts of data generated and stored have expanded …