The term 'Big Data' has spread rapidly in the framework of Data Mining and Business Intelligence. This new scenario can be defined by means of those problems that cannot be …
The first edition of this book appeared in 1991 when the technology was new and there were not too many products. In the Preface to the first edition, we had quoted Michael Stonebraker …
A large number of cloud services require users to share private data like electronic health records for data analysis or mining, bringing privacy concerns. Anonymizing data sets via …
Enterprises today acquire vast volumes of data from different sources and leverage this information by means of data analysis to support effective decision-making and provide new …
A Shkapsky, M Yang, M Interlandi, H Chiu… - Proceedings of the …, 2016 - dl.acm.org
There is great interest in exploiting the opportunity provided by cloud computing platforms for large-scale analytics. Among these platforms, Apache Spark is growing in popularity for …
Recently, increasingly large amounts of data are generated from a variety of sources. Existing data processing technologies are not suitable to cope with the huge amounts of …
Context: In recent years, the valuable knowledge that can be retrieved from petabyte-scale datasets, known as Big Data, led to the development of solutions to process information …
F Ahmad, ST Chakradhar, A Raghunathan… - 2014 USENIX Annual …, 2014 - usenix.org
MapReduce clusters are usually multi-tenant (i.e., shared among multiple users and jobs) for improving cost and utilization. The performance of jobs in a multi-tenant MapReduce cluster …
X Sun, Y He, D Wu, JZ Huang - Big Data Mining and Analytics, 2023 - ieeexplore.ieee.org
Distributed computing frameworks are the fundamental component of distributed computing systems. They provide an essential way to support the efficient processing of big data on …