Workload characterization and optimization of TPC-H queries on Apache Spark

T Chiba, T Onodera - … on Performance Analysis of Systems and …, 2016 - ieeexplore.ieee.org
Besides being an in-memory-oriented computing framework, Spark runs on top of Java
Virtual Machines (JVMs), so JVM parameters must be tuned to improve Spark application …

A many-core architecture for in-memory data processing

SR Agrawal, S Idicula, A Raghavan, E Vlachos… - Proceedings of the 50th …, 2017 - dl.acm.org
For many years, the highest energy cost in processing has been data movement rather than
computation, and energy is the limiting factor in processor design [21]. As the data needed …

A new benchmark harness for systematic and robust evaluation of streaming state stores

E Asyabi, Y Wang, J Liagouris, V Kalavri… - Proceedings of the …, 2022 - dl.acm.org
Modern stream processing systems often rely on embedded key-value stores, like RocksDB,
to manage the state of long-running computations. Evaluating the performance of these …

Performance benchmarking and comparison of NoSQL databases: Redis vs mongodb vs Cassandra using YCSB tool

NB Seghier, O Kazar - 2021 International Conference on …, 2021 - ieeexplore.ieee.org
Big Data is an ensemble of technologies founded on NoSQL databases that enable
scalability of volumes, numbers, and data types. NoSQL databases assert that their …

[PDF][PDF] 数据管理系统评测基准: 从传统数据库到新兴大数据

金澈清, 钱卫宁, 周敏奇, 周傲英 - 计算机学报, 2015 - cjc.ict.ac.cn
摘要大数据时代的到来意味着新技术, 新系统和新产品的出现. 如何客观地比较和评价不同系统
之间的优劣自然成为一个热门研究课题, 这种情形与三十多年前数据库系统蓬勃发展时甚为相似 …

SLA-based scheduling of spark jobs in hybrid cloud computing environments

MT Islam, H Wu, S Karunasekera… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Big data frameworks such as Apache Spark is becoming prominent to perform large-scale
data analytics jobs in various domains. However, due to limited resource availability, the …

PARSEC3. 0: A multicore benchmark suite with network stacks and SPLASH-2X

X Zhan, Y Bao, C Bienia, K Li - ACM SIGARCH Computer Architecture …, 2017 - dl.acm.org
Benchmarks play a very important role in accelerating the development and research of
CMP. As one of them, the PARSEC suite continues to be updated and revised over and over …

[图书][B] Alluxio: A virtual distributed file system

H Li - 2018 - search.proquest.com
The world is entering the data revolution era. Along with the latest advancements of the
Internet, Artificial Intelligence (AI), mobile devices, autonomous driving, and Internet of …

AIBench: an industry standard internet service AI benchmark suite

W Gao, F Tang, L Wang, J Zhan, C Lan, C Luo… - arXiv preprint arXiv …, 2019 - arxiv.org
Today's Internet Services are undergoing fundamental changes and shifting to an intelligent
computing era where AI is widely employed to augment services. In this context, many …

Gapprox: using gallup approach for approximation in big data processing

H Ahmadvand, M Goudarzi, F Foroutan - Journal of Big Data, 2019 - Springer
Abstract As Big Data processing often takes a long time and needs a lot of resources,
sampling and approximate computing techniques may be used to generate a desired …