[PDF][PDF] A Generator for Subspace Clusters.

A Beer, NS Schüler, T Seidl - LWDA, 2019 - ceur-ws.org
We introduce a generator for data containing subspace clusters which is accurately tunable
and adjustable to the needs of developers. It is online available and allows to give a …

Akıllı şehirlerde büyük coğrafi veri yönetimi ve analizi: hava kalitesi örneği

AÇ Aydınoğlu, R Bovkır, M Bulut - Geomatik, 2022 - dergipark.org.tr
Bilişim teknolojilerinin gelişmesiyle, veri üretim teknikleri ve toplanan veri hacmi artmıştır.
Akıllı şehir uygulamaları ile sensörler, IoT, internet, giyilebilir teknolojiler gibi farklı veri …

Big Data Analytics in Industry 4.0

MB Ozcan, B Konuk, YM Yesilcimen - Industry 4.0: Technologies …, 2022 - Springer
With the unpredictable development of technology, a wide variety of data is produced in a
very short time from countless sources. Industry 4.0 is a revolution in new technology for the …

A dwarf-based scalable big data benchmarking methodology

W Gao, L Wang, J Zhan, C Luo, D Zheng, Z Jia… - arXiv preprint arXiv …, 2017 - arxiv.org
Different from the traditional benchmarking methodology that creates a new benchmark or
proxy for every possible workload, this paper presents a scalable big data benchmarking …

Towards realistic benchmarking for cloud file systems: Early experiences

Z Ren, W Shi, J Wan - 2014 IEEE International Symposium on …, 2014 - ieeexplore.ieee.org
Over the past few years, cloud file systems such as Google File System (GFS) and Hadoop
Distributed File System (HDFS) have received a lot of research efforts to optimize their …

Pretopology and Topic Modeling for Complex Systems Analysis: Application on Document Classification and Complex Network Analysis

QV Bui - 2018 - theses.hal.science
The work of this thesis presents the development of algorithms for document classification
on the one hand, or complex network analysis on the other hand, based on pretopology, a …

Cobell: Runtime prediction for distributed dataflow jobs in shared clusters

I Verbitskiy, L Thamsen, T Renner… - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
Distributed dataflow systems have been developed to help users analyze and process large
datasets. While they make it easier for users to develop massively-parallel programs, users …

Statistical data generation using sample data

B Fazekas, A Kiss - New Trends in Databases and Information Systems …, 2018 - Springer
Due to the ever increasing data stored in databases, it is important to develop software
which can generate large numbers of test data that reflect the properties of a given sample …

Scalability and performance analysis of BDPS in clouds

Y Li, D Ou, X Zhou, C Jiang, C Cérin - Computing, 2022 - Springer
The increasing demand for big data processing leads to commercial off-the-shelf (COTS)
and cloud-based big data analytics services. Giant cloud service vendors provide …

BDTUne: Hierarchical correlation-based performance analysis and rule-based diagnosis for big data systems

R Ren, Z Jia, L Wang, J Zhan… - 2016 IEEE International …, 2016 - ieeexplore.ieee.org
Although big data systems are in widespread use and there have much research efforts for
improving big data systems performance, efficiently analysing and diagnosing performance …