Big data analytics: a literature review

D Chong, H Shi - Journal of Management Analytics, 2015 - Taylor & Francis
With more and more data generated, it has become a big challenge for traditional
architectures and infrastructures to process large amounts of data within an acceptable time …

{CherryPick}: Adaptively unearthing the best cloud configurations for big data analytics

O Alipourfard, HH Liu, J Chen… - … USENIX Symposium on …, 2017 - usenix.org
Picking the right cloud configuration for recurring big data analytics jobs running in clouds is
hard, because there can be tens of possible VM instance types and even more cluster sizes …

Morpheus: Towards automated {SLOs} for enterprise clusters

SA Jyothi, C Curino, I Menache… - … USENIX symposium on …, 2016 - usenix.org
Modern resource management frameworks for largescale analytics leave unresolved the
problematic tension between high cluster utilization and job's performance predictability …

Dynamic configuration of partitioning in spark applications

A Gounaris, G Kougka, R Tous… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
Spark has become one of the main options for large-scale analytics running on top of shared-
nothing clusters. This work aims to make a deep dive into the parallelism configuration and …

Zoolander:{Efficiently} Meeting Very Strict,{Low-Latency}{SLOs}

C Stewart, A Chakrabarti, R Griffith - 10th International Conference on …, 2013 - usenix.org
Internet services access networked storage many times while processing a request. Just a
few slow storage accesses per request can raise response times a lot, making the whole …

Performance modeling of mapreduce jobs in heterogeneous cloud environments

Z Zhang, L Cherkasova, BT Loo - 2013 IEEE Sixth International …, 2013 - ieeexplore.ieee.org
Many companies start using Hadoop for advanced data analytics over large datasets. While
a traditional Hadoop cluster deployment assumes a homogeneous cluster, many enterprise …

Distributed resource management across process boundaries

L Suresh, P Bodik, I Menache, M Canini… - Proceedings of the 2017 …, 2017 - dl.acm.org
Multi-tenant distributed systems composed of small services, such as Service-oriented
Architectures (SOAs) and Micro-services, raise new challenges in attaining high …

The impact of intention of use on the success of big data adoption via organization readiness factor

A Haddad, AA Ameen, M Mukred - International Journal of …, 2018 - ejournal.lucp.net
Big data is one of the most contemporary issues. It is innovative processing solutions for a
variety of new and existing data to provide real business benefits. Unless it is tied to …

Deep learning research and development platform: Characterizing and scheduling with qos guarantees on gpu clusters

Z Chen, W Quan, M Wen, J Fang, J Yu… - … on Parallel and …, 2019 - ieeexplore.ieee.org
Deep learning (DL) has been widely adopted in various domains of artificial intelligence (AI),
achieving dramatic developments in industry and academia. Besides giant AI companies …

Preemptive {ReduceTask} Scheduling for Fair and Fast Job Completion

Y Wang, J Tan, W Yu, L Zhang, X Meng… - … Conference on Autonomic …, 2013 - usenix.org
Hadoop MapReduce adopts a two-phase (map and reduce) scheme to schedule tasks
among data-intensive applications. However, under this scheme, Hadoop schedulers do not …