On applicability of big data analytics in the closed-loop product lifecycle: Integration of CRISP-DM standard

E Gholamzadeh Nabati, KD Thoben - … of Industries: 13th IFIP WG 5.1 …, 2016 - Springer
The product use data can have an important role in closed-loop product lifecycle
management (CL-PLM), where information feedbacks from the use data can contribute to …

Modeling big data enablers for service operations management

M Nasrollahi, MR Fathi - Big Data and Blockchain for Service Operations …, 2022 - Springer
The purpose of this chapter is to identify big data enablers in Service Operations
Management (SOM) and to analyze the interactions between them. First, we identify forty …

Bijoux: Data generator for evaluating etl process quality

E Nakucçi, V Theodorou, P Jovanovic… - Proceedings of the 17th …, 2014 - dl.acm.org
Obtaining the right set of data for evaluating the fulfillment of different quality standards in the
extract-transform-load (ETL) process design is rather challenging. First, the real data might …

Datamime: Generating Representative Benchmarks by Automatically Synthesizing Datasets

HR Lee, D Sanchez - 2022 55th IEEE/ACM International …, 2022 - ieeexplore.ieee.org
Benchmarks that closely match the behavior of production workloads are crucial to design
and provision computer systems. However, current approaches fall short: First, open-source …

AC: A data generator for evaluation of clustering

W Li, Z Zhou - Authorea Preprints, 2023 - techrxiv.org
Clustering has important applications in many fields. However, there are not enough
benchmark datasets with rich characteristics for the development and evaluation of …

An Approach to Workload Generation for Cloud Benchmarking: a View from Alibaba Trace

J Zhu, B Lu, X Yu, J Xu, T Wo - 2023 IEEE 15th International …, 2023 - ieeexplore.ieee.org
Finding performance bottlenecks through bench-marking is one of the driving forces to
improve the resource provision efficiency of cloud computing. Although existing benchmarks …

SMiPE: estimating the progress of recurring iterative distributed dataflows

J Koch, L Thamsen, F Schmidt… - 2017 18th International …, 2017 - ieeexplore.ieee.org
Distributed dataflow systems such as Apache Spark allow the execution of iterative
programs at large scale on clusters. In production use, programs are often recurring and …

The LDBC Graphalytics Benchmark

A Iosup, A Musaafir, A Uta, AP Pérez… - arXiv preprint arXiv …, 2020 - arxiv.org
In this document, we describe LDBC Graphalytics, an industrial-grade benchmark for graph
analysis platforms. The main goal of Graphalytics is to enable the fair and objective …

[PDF][PDF] Visual analytics of social media for situation awareness

D Thom - 2015 - researchgate.net
“It's after 2001. Where is HAL?” This question was asked 2007 by cognitive science pioneer
Marvin Minsky in a talk about the state of artificial intelligence (AI). He was referring to the …

Performance characterization and optimization of in-memory data analytics on a scale-up server

AJ Awan - 2017 - diva-portal.org
The sheer increase in the volume of data over the last decade has triggered research in
cluster computing frameworks that enable web enterprises to extract big insights from big …