A taxonomy of data grids for distributed data sharing, management, and processing

S Venugopal, R Buyya… - ACM Computing Surveys …, 2006 - dl.acm.org
Data Grids have been adopted as the next generation platform by many scientific
communities that need to share, access, transport, process, and manage large data …

Modeling machine availability in enterprise and wide-area distributed computing environments

D Nurmi, J Brevik, R Wolski - Euro-Par 2005 Parallel Processing: 11th …, 2005 - Springer
In this paper, we consider the problem of modeling machine availability in enterprise-area
and wide-area distributed computing settings. Using availability data gathered from three …

The impact of data replication on job scheduling performance in the data grid

M Tang, BS Lee, X Tang, CK Yeo - Future Generation Computer Systems, 2006 - Elsevier
In the Data Grid environment, the primary goal of data replication is to shorten the data
access time experienced by the job and consequently reduce the job turnaround time. After …

Dynamic replication algorithms for the multi-tier data grid

M Tang, BS Lee, CK Yeo, X Tang - Future generation computer systems, 2005 - Elsevier
Data replication is a common method used to improve the performance of data access in
distributed systems. In this paper, two dynamic replication algorithms, Simple Bottom-Up …

Evaluation of distributed recovery in large-scale storage systems

Q Xin, EL Miller, SJTJE Schwarz - Proceedings. 13th IEEE …, 2004 - ieeexplore.ieee.org
Storage clusters consisting of thousands of disk drives are now being used both for their
large capacity and high throughput. However, their reliability is far worse than that of smaller …

A genetic algorithm based approach for scheduling decomposable data grid applications

S Kim, JB Weissman - International Conference on Parallel …, 2004 - ieeexplore.ieee.org
Data grid technology promises geographically distributed scientists to access and share
physically distributed resources such as compute resource, networks, storage, and most …

Dynamic replication strategies in data grid systems: a survey

U Tos, R Mokadem, A Hameurlain, T Ayav… - The Journal of …, 2015 - Springer
In data grid systems, data replication aims to increase availability, fault tolerance, load
balancing and scalability while reducing bandwidth consumption, and job execution time …

A data and task co-scheduling algorithm for scientific cloud workflows

K Deng, K Ren, M Zhu, J Song - IEEE Transactions on Cloud …, 2015 - ieeexplore.ieee.org
Cloud computing has emerged as a promising computational infrastructure for cost-efficient
workflow execution by provisioning on-demand resources in a pay-as-you-go manner. While …

A deadline and budget constrained scheduling algorithm for eScience applications on data grids

S Venugopal, R Buyya - … Conference on Algorithms and Architectures for …, 2005 - Springer
In this paper, we present an algorithm for scheduling of distributed data intensive Bag-of-
Task applications on Data Grids that have costs associated with requesting, transferring and …

Dynamic data replication in lcg 2008

C Nicholson, DG Cameron, AT Doyle… - Concurrency and …, 2008 - Wiley Online Library
To provide performance access to data from high‐energy physics experiments such as the
Large Hadron Collider (LHC), controlled replication of files among grid sites is required …