Task scheduling in big data platforms: a systematic literature review

M Soualhia, F Khomh, S Tahar - Journal of Systems and Software, 2017 - Elsevier
Abstract Context: Hadoop, Spark, Storm, and Mesos are very well known frameworks in both
research and industrial communities that allow expressing and processing distributed …

Efficient data placement and replication for QoS-aware approximate query evaluation of big data analytics

Q Xia, Z Xu, W Liang, S Yu, S Guo… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
Enterprise users at different geographic locations generate large-volume data that is stored
at different geographic datacenters. These users may also perform big data analytics on the …

MapReduce task scheduling in heterogeneous geo-distributed data centers

X Li, F Chen, R Ruiz, J Zhu - IEEE Transactions on Services …, 2021 - ieeexplore.ieee.org
Different data transmission times, processing times which are difficult to predict and node-
dependent access times make MapReduce task scheduling rather complex. In this article …

A cloud-agnostic framework to enable cost-aware scheduling of applications in a multi-cloud environment

F Jiang, K Ferriter, C Castillo - NOMS 2020-2020 IEEE/IFIP …, 2020 - ieeexplore.ieee.org
We have witnessed a surge in both the big data applications being hosted by an assortment
of cloud vendors, and in the astronomical amount of data they produce and consume on a …

Qos-aware proactive data replication for big data analytics in edge clouds

Q Xia, L Bai, W Liang, Z Xu, L Yao, L Wang - Workshop Proceedings of …, 2019 - dl.acm.org
We are in the era of big data and cloud computing, large quantity of computing resource is
desperately needed to detect invaluable information hidden in the coarse big data through …

[HTML][HTML] A hierarchical hadoop framework to process geo-distributed big data

G Di Modica, O Tomarchio - Big Data and Cognitive Computing, 2022 - mdpi.com
In the past twenty years, we have witnessed an unprecedented production of data worldwide
that has generated a growing demand for computing resources and has stimulated the …

Cluster load based content distribution and speculative execution for geographically distributed cloud environment

C Li, M Song, Q Zhang, Y Luo - Computer Networks, 2021 - Elsevier
The scale of big data has shown an explosive growth, which makes the processing of big
data put forward higher requirements on data centers, and a single data center can no …

JHTD: An Efficient Joint Scheduling Framework Based on Hypergraph for Task Placement and Data Transfer Across Geographically Distributed Data Centers

C Jing, P Dan - IEEE Access, 2022 - ieeexplore.ieee.org
As the explosive growth of the data volume, data center is playing a critical role to store and
process huge amount of data. Traditional single data center can no longer to adapt into …

[PDF][PDF] PIVOT: Cost-aware scheduling of data-intensive applications in a cloud-agnostic system

F Jiang, K Ferriter, C Castillo - … Comput. Inst., Univ. North Carolina Chapel …, 2019 - renci.org
We have witnessed a surge in big data applications being hosted by assorted cloud
vendors, and the astronomical amount of data they produce and consume on a daily basis …

A hierarchical Hadoop framework to handle Big Data in geo-distributed computing environments

O Tomarchio, G Di Modica, M Cavallo… - International Journal of …, 2018 - igi-global.com
Advances in the communication technologies, along with the birth of new communication
paradigms leveraging on the power of the social, has fostered the production of huge …