Benchmarking big data systems: A review

R Han, LK John, J Zhan - IEEE Transactions on Services …, 2017 - ieeexplore.ieee.org
With the fast development of big data systems in recent years, a variety of open-source
benchmarks have been built to evaluate and compare the workloads on these systems, and …

A survey of big data, high performance computing, and machine learning benchmarks

N Ihde, P Marten, A Eleliemy, G Poerwawinata… - … and Benchmarking: 13th …, 2022 - Springer
In recent years, there has been a convergence of Big Data (BD), High Performance
Computing (HPC), and Machine Learning (ML) systems. This convergence is due to the …

[HTML][HTML] An approach to workload generation for modern data centers: A view from Alibaba trace

Y Liang, N Ruan, L Yi, X Su - BenchCouncil Transactions on Benchmarks …, 2024 - Elsevier
Modern data centers provide the foundational infrastructure of cloud computing. Workload
generation, which involves simulating or constructing tasks and transactions to replicate the …

iGen: A realistic request generator for cloud file systems benchmarking

Z Ren, B Xu, W Shi, Y Ren, F Cao… - 2016 IEEE 9th …, 2016 - ieeexplore.ieee.org
Benchmarking is a traditional approach for system performance evaluation and optimization.
Over the past decades, a variety of file systems, eg, GFS, HDFS and Ceph, have been …

AccuracyTrader: Accuracy-aware approximate processing for low tail latency and high result accuracy in cloud online services

R Han, S Huang, F Tang, F Chang… - 2016 45th International …, 2016 - ieeexplore.ieee.org
Modern latency-critical online services such as search engines often process requests by
consulting large input data spanning massive parallel components. Hence the tail latency of …

Benchmarking recommender systems

LA Homann - 2021 - search.proquest.com
Recommender Systeme unterstützen aus einem Angebot von Diensten und Produkten auf
bspw. Verkaufs-oder Unterhaltungsplattformen, diejenigen zu finden, die die persönlichen …

Sarp: Synopsis-based approximate request processing for low latency and small correctness loss in cloud online services

R Han, J Zhan, JVP Luis - International Journal of Parallel Programming, 2016 - Springer
Despite the importance of providing quick responsiveness to user requests for online
services, such request processing is very resource expensive when dealing with large-scale …

Missing Data Recovery in Large-Scale, Sparse Datacenter Traces: An Alibaba Case Study

Y Liang, L Bi, X Su - … Symposium on Cluster, Cloud and Grid …, 2019 - ieeexplore.ieee.org
The trace analysis for datacenter holds a prominent importance for the datacenter
performance optimization. However, due to the error and low execution priority of trace …

[PDF][PDF] Avdanced ECHMM-Based Machine Learning Tools for Complex Applications

We present a novel approach for accurate characterization of workloads. Workloads are
generally described with statistical models and are based on the analysis of resource …

Compressive Sensing based Predictive Online Scheduling with Task Colocation in Cloud Data Center

Y Chan, K Luo, X Chen - 2020 IEEE 26th International …, 2020 - ieeexplore.ieee.org
With the growing size of the cloud data center, the high scheduling efficiency over massive-
scale cloud servers is hard to achieve, particularly when the scheduler requires the full real …