Characterizing and subsetting big data workloads

Z Jia, J Zhan, L Wang, R Han, SA McKee… - 2014 IEEE …, 2014 - ieeexplore.ieee.org
Big data benchmark suites must include a diversity of data and workloads to be useful in
fairly evaluating big data systems and architectures. However, using truly comprehensive …

AIBench: an industry standard internet service AI benchmark suite

W Gao, F Tang, L Wang, J Zhan, C Lan, C Luo… - arXiv preprint arXiv …, 2019 - arxiv.org
Today's Internet Services are undergoing fundamental changes and shifting to an intelligent
computing era where AI is widely employed to augment services. In this context, many …

On big data benchmarking

R Han, X Lu, J Xu - Big Data Benchmarks, Performance Optimization, and …, 2014 - Springer
Big data systems address the challenges of capturing, storing, managing, analyzing, and
visualizing big data. Within this context, developing benchmarks to evaluate and compare …

SparkBench: a spark benchmarking suite characterizing large-scale in-memory data analytics

M Li, J Tan, Y Wang, L Zhang, V Salapura - Cluster Computing, 2017 - Springer
Spark has been increasingly employed by industries for big data analytics recently, due to its
resilience, scalability and efficient in-memory distributed programming model. Meanwhile …

Performance characterization of in-memory data analytics on a modern cloud server

AJ Awan, M Brorsson, V Vlassov… - 2015 IEEE Fifth …, 2015 - ieeexplore.ieee.org
In last decade, data analytics have rapidly progressed from traditional disk-based
processing to modern in-memory processing. However, little effort has been devoted at …

Bigdatabench: A scalable and unified big data and ai benchmark suite

W Gao, J Zhan, L Wang, C Luo, D Zheng… - arXiv preprint arXiv …, 2018 - arxiv.org
Several fundamental changes in technology indicate domain-specific hardware and
software co-design is the only path left. In this context, architecture, system, data …

A survey on data-driven performance tuning for big data analytics platforms

RLC Costa, J Moreira, P Pintor, V dos Santos… - Big Data Research, 2021 - Elsevier
Many research works deal with big data platforms looking forward to data science and
analytics. These are complex and usually distributed environments, composed of several …

Applying combinatorial test data generation to big data applications

N Li, Y Lei, HR Khan, J Liu, Y Guo - Proceedings of the 31st IEEE/ACM …, 2016 - dl.acm.org
Big data applications (eg, Extract, Transform, and Load (ETL) applications) are designed to
handle great volumes of data. However, processing such great volumes of data is time …

Big data benchmark compendium

T Ivanov, T Rabl, M Poess, A Queralt… - … to Big Data to Internet of …, 2016 - Springer
Abstract The field of Big Data and related technologies is rapidly evolving. Consequently,
many benchmarks are emerging, driven by academia and industry alike. As these …

Design and implementation of bandwidth-aware memory placement and migration policies for heterogeneous memory systems

S Yu, S Park, W Baek - Proceedings of the International Conference on …, 2017 - dl.acm.org
Heterogeneous memory systems that comprise memory nodes based on widely-different
device technologies (eg, DRAM and nonvolatile memory (NVM)) are emerging in various …