Understandable big data: a survey

CK Emani, N Cullot, C Nicolle - Computer science review, 2015 - Elsevier
This survey presents the concept of Big Data. Firstly, a definition and the features of Big Data
are given. Secondly, the different steps for Big Data data processing and the main problems …

Reference architecture and classification of technologies, products and services for big data systems

P Pääkkönen, D Pakkala - Big data research, 2015 - Elsevier
Many business cases exploiting big data have been realised in recent years; Twitter,
LinkedIn, and Facebook are examples of companies in the social networking domain. Other …

Towards automatic optimization of MapReduce programs

S Babu - Proceedings of the 1st ACM symposium on Cloud …, 2010 - dl.acm.org
Timely and cost-effective processing of large datasets has become a critical ingredient for
the success of many academic, government, and industrial organizations. The combination …

On complexity and optimization of expensive queries in complex event processing

H Zhang, Y Diao, N Immerman - Proceedings of the 2014 ACM SIGMOD …, 2014 - dl.acm.org
Pattern queries are widely used in complex event processing (CEP) systems. Existing
pattern matching techniques, however, can provide only limited performance for expensive …

Assisting developers of big data analytics applications when deploying on Hadoop clouds

W Shang, ZM Jiang, H Hemmati… - 2013 35th …, 2013 - ieeexplore.ieee.org
Big data analytics is the process of examining large amounts of data (big data) in an effort to
uncover hidden patterns or unknown correlations. Big Data Analytics Applications (BDA …

A simulation approach to evaluating design decisions in MapReduce setups

G Wang, AR Butt, P Pandey… - 2009 IEEE International …, 2009 - ieeexplore.ieee.org
MapReduce has emerged as a model of choice for supporting modern data-intensive
applications. The model is easy-to-use and promising in reducing time-to-solution. It is also …

Examining the stability of logging statements

S Kabinna, CP Bezemer, W Shang, MD Syer… - Empirical Software …, 2018 - Springer
Logging statements (embedded in the source code) produce logs that assist in
understanding system behavior, monitoring choke-points and debugging. Prior work …

Monalytics: online monitoring and analytics for managing large scale data centers

M Kutare, G Eisenhauer, C Wang, K Schwan… - Proceedings of the 7th …, 2010 - dl.acm.org
To effectively manage large-scale data centers and utility clouds, operators must understand
current system and application behaviors. This requires continuous monitoring along with …

BigDebug: Debugging primitives for interactive big data processing in Spark

MA Gulzar, M Interlandi, S Yoo, SD Tetali… - Proceedings of the 38th …, 2016 - dl.acm.org
Developers use cloud computing platforms to process a large quantity of data in parallel
when developing big data analytics. Debugging the massive parallel computations that run …

Studying the characteristics of logging practices in mobile apps: a case study on F-Droid

Y Zeng, J Chen, W Shang, TH Chen - Empirical Software Engineering, 2019 - Springer
Logging is a common practice in software engineering. Prior research has investigated the
characteristics of logging practices in system software (e.g., web servers or databases) as …