Delay scheduling: a simple technique for achieving locality and fairness in cluster scheduling

M Zaharia, D Borthakur, J Sen Sarma… - Proceedings of the 5th …, 2010 - dl.acm.org
As organizations start to use data-intensive cluster computing systems like Hadoop and
Dryad for more applications, there is a growing need to share clusters between users …

A taxonomy and survey of fault-tolerant workflow management systems in cloud and distributed computing environments

D Poola, MA Salehi, K Ramamohanarao… - … for big data and the cloud, 2017 - Elsevier
During the recent years, workflows have emerged as an important abstraction for
collaborative research and managing complex large-scale distributed data analytics …

A data placement strategy in scientific cloud workflows

D Yuan, Y Yang, X Liu, J Chen - Future Generation Computer Systems, 2010 - Elsevier
In scientific cloud workflows, large amounts of application data need to be stored in
distributed data centres. To effectively store these data, a data manager must intelligently …

[PDF][PDF] Job scheduling for multi-user mapreduce clusters

M Zaharia, D Borthakur, JS Sarma… - … , Tech. Rep. UCB …, 2009 - digitalassets.lib.berkeley.edu
Sharing a MapReduce cluster between users is attractive because it enables statistical
multiplexing (lowering costs) and allows users to share a common large data set. However …

Context-aware data and task placement in edge computing environments

M Breitbach, D Schäfer, J Edinger… - 2019 IEEE International …, 2019 - ieeexplore.ieee.org
Computationally intensive tasks of IoT applications can be offloaded to powerful devices in
the edge. Code offloading reduces energy consumption and increases performance …

Data management challenges of data-intensive scientific workflows

E Deelman, A Chervenak - … on Cluster Computing and the Grid …, 2008 - ieeexplore.ieee.org
Scientific workflows play an important role in today's science. Many disciplines rely on
workflow technologies to orchestrate the execution of thousands of computational tasks …

[PDF][PDF] Efficient algorithms for universal portfolios

AT Kalai, S Vempala - Journal of Machine Learning Research, 2002 - jmlr.org
A constant rebalanced portfolio is an investment strategy that keeps the same distribution of
wealth among a set of stocks from day to day. There has been much work on Cover's …

SABA: A security-aware and budget-aware workflow scheduling strategy in clouds

L Zeng, B Veeravalli, X Li - Journal of parallel and Distributed computing, 2015 - Elsevier
High quality of security service is increasingly critical for Cloud workflow applications.
However, existing scheduling strategies for Cloud systems disregard security requirements …

Enhancing reliability of workflow execution using task replication and spot instances

D Poola, K Ramamohanarao, R Buyya - ACM Transactions on …, 2016 - dl.acm.org
Cloud environments offer low-cost computing resources as a subscription-based service.
These resources are elastically scalable and dynamically provisioned. Furthermore, cloud …

A genetic algorithm based data replica placement strategy for scientific applications in clouds

L Cui, J Zhang, L Yue, Y Shi, H Li… - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
Cloud computing is a promising distributed computing platform for big data applications, eg,
scientific applications, since excessive resources can be obtained from cloud services for …