SwiftAnalytics: Optimizing object storage for big data analytics

L Rupprecht, R Zhang, B Owen… - 2017 IEEE …, 2017 - ieeexplore.ieee.org
storage itself to robustly speed up analytics jobs. We make the following contributions: We
propose SwiftAnalytics, an enhanced object storageanalytics frameworks and object storage

Synchronous parallel processing of big-data analytics services to optimize performance in federated clouds

G Jung, N Gnanasambandam… - 2012 IEEE Fifth …, 2012 - ieeexplore.ieee.org
… To achieve the optimal performance of big-data analytics … in a data center, and how to
apportion given big-data to chosen … employed as a big-data analytics service (see Section V). …

Optimizing cost for geo-distributed storage systems in online social networks

J Zhou, J Fan, J Jia, B Cheng, Z Liu - Journal of computational science, 2018 - Elsevier
optimizing social locality we aim to minimize the cost incurred by storage and communications,
while through optimizing … More importantly, we combine data placement and data

[HTML][HTML] Big data analysis and optimization and platform components

K Hsu - Journal of King Saud University-Science, 2022 - Elsevier
… timeliness of data migration, and not enough innovation in data analysis, we design a
more efficient, convenient and easy-to-use big data management platform. Firstly, the big …

Leveraging adaptive I/O to optimize collective data shuffling patterns for big data analytics

B Nicolae, CHA Costa, C Misale… - … and Distributed …, 2016 - ieeexplore.ieee.org
… This paper focuses on high performance, scalability and memory efficiency for data shuffling
in the context of big data analytics over high end infrastructure. To our best knowledge, we …

Pangea: monolithic distributed storage for data analytics

J Zou, A Iyengar, C Jermaine - arXiv preprint arXiv:1808.06094, 2018 - arxiv.org
… system and different types of data should be handled … the storage manager to simultaneously
manage different data … utilizes that information to optimize page replacement decisions …

The MemSQL Query Optimizer: A modern optimizer for real-time analytics in a distributed database

J Chen, S Jindel, R Walzer, R Sen… - Proceedings of the …, 2016 - dl.acm.org
… for a distributed database designed to optimizedistributed query execution plans with fast
optimization times. We discuss the problem of query rewrite decisions in a distributed database

Trends in big data analytics

K Kambatla, G Kollias, V Kumar, A Grama - … of parallel and distributed …, 2014 - Elsevier
… scope of data analytics problems. We describe commonly used hardware platforms for executing
analytics … The obvious advantage is the immediate sharing of optimizations in the virtual …

Cast: Tiering storage for data analytics in the cloud

Y Cheng, MS Iqbal, A Gupta, AR Butt - … parallel and distributed …, 2015 - dl.acm.org
… on different cloud storage services, and … data placement and storage provisioning plan.
Furthermore, we build Cast++ to enhance Cast’s optimization model by incorporating data reuse …

Towards memory-optimized data shuffling patterns for big data analytics

B Nicolae, C Costa, C Misale… - 2016 16th IEEE/ACM …, 2016 - ieeexplore.ieee.org
… the storage layer. To this end, a new generation of in-memory big data analytics frameworks
… By making heavy use of in-memory data caching, Spark minimizes the interactions with the …