Big Data and virtualization for manufacturing cyber-physical systems: A survey of the current status and future outlook

RF Babiceanu, R Seker - Computers in industry, 2016 - Elsevier
The recent advances in sensor and communication technologies can provide the
foundations for linking the physical manufacturing facility and machine world to the cyber …

A general perspective of Big Data: applications, tools, challenges and trends

L Rodríguez-Mazahua, CA Rodríguez-Enríquez… - The Journal of …, 2016 - Springer
Big Data has become a very popular term. It refers to the enormous amount of structured,
semi-structured and unstructured data that are exponentially generated by high …

LDBC Graphalytics: A benchmark for large-scale graph analysis on parallel and distributed platforms

A Iosup, T Hegeman, WL Ngai, S Heldens… - Proceedings of the …, 2016 - research.tudelft.nl
In this paper we introduce LDBC Graphalytics, a new industrial-grade benchmark for graph
analysis platforms. It consists of six deterministic algorithms, standard datasets, synthetic …

Applying combinatorial test data generation to big data applications

N Li, Y Lei, HR Khan, J Liu, Y Guo - Proceedings of the 31st IEEE/ACM …, 2016 - dl.acm.org
Big data applications (eg, Extract, Transform, and Load (ETL) applications) are designed to
handle great volumes of data. However, processing such great volumes of data is time …

Big data benchmark compendium

T Ivanov, T Rabl, M Poess, A Queralt… - … to Big Data to Internet of …, 2016 - Springer
Abstract The field of Big Data and related technologies is rapidly evolving. Consequently,
many benchmarks are emerging, driven by academia and industry alike. As these …

Understanding big data analytics workloads on modern processors

Z Jia, J Zhan, L Wang, C Luo, W Gao… - … on Parallel and …, 2016 - ieeexplore.ieee.org
Big data analytics workloads are very significant ones in modern data centers, and it is more
and more important to characterize their representative workloads and understand their …

Selecting resources for distributed dataflow systems according to runtime targets

L Thamsen, I Verbitskiy, F Schmidt… - 2016 IEEE 35th …, 2016 - ieeexplore.ieee.org
Distributed dataflow systems like Spark or Flink enable users to analyze large datasets.
Users create programs by providing sequential user-defined functions for a set of well …

Feasibility analysis of AsterixDB and Spark streaming with Cassandra for stream-based processing

P Pääkkönen - Journal of Big Data, 2016 - Springer
For getting up-to-date insight into online services, extracted data has to be processed in
near real time. For example, major big data companies (Facebook, LinkedIn, Twitter) …

How data volume affects spark based data analytics on a scale-up server

AJ Awan, M Brorsson, V Vlassov, E Ayguade - Big Data Benchmarks …, 2016 - Springer
Sheer increase in volume of data over the last decade has triggered research in cluster
computing frameworks that enable web enterprises to extract big insights from big data …

Micro-architectural characterization of apache spark on batch and stream processing workloads

AJ Awan, M Brorsson, V Vlassov… - 2016 IEEE International …, 2016 - ieeexplore.ieee.org
While cluster computing frameworks are continuously evolving to provide real-time data
analysis capabilities, Apache Spark has managed to be at the forefront of big data analytics …