How to use K-means for big data clustering?

R Mussabayev, N Mladenovic, B Jarboui… - Pattern Recognition, 2023 - Elsevier
K-means plays a vital role in data mining and is the simplest and most widely used algorithm
under the Euclidean Minimum Sum-of-Squares Clustering (MSSC) model. However, its …

[HTML][HTML] A survey on platforms for big data analytics

D Singh, CK Reddy - Journal of big data, 2015 - Springer
The primary purpose of this paper is to provide an in-depth analysis of different platforms
available for performing big data analytics. This paper surveys different hardware platforms …

[PDF][PDF] A modified K-means algorithm for big data clustering

SKA Fahad, MM Alam - International Journal of Science, Engineering and …, 2016 - ijcset.net
Amount of data is getting bigger in every moment and this data comes from everywhere;
social media, sensors, search engines, GPS signals, transaction records, satellites, financial …

An improved K‐means algorithm for big data

F Moodi, H Saadatfar - IET Software, 2022 - Wiley Online Library
An improved version of K‐means clustering algorithm that can be applied to big data
through lower processing loads with acceptable precision rates is presented here. In this …

K-means clustering algorithms: A comprehensive review, variants analysis, and advances in the era of big data

AM Ikotun, AE Ezugwu, L Abualigah, B Abuhaija… - Information …, 2023 - Elsevier
Advances in recent techniques for scientific data collection in the era of big data allow for the
systematic accumulation of large quantities of data at various data-capturing sites. Similarly …

[PDF][PDF] Applications of clustering techniques in data mining: a comparative study

M Faizan, MF Zuhairi, S Ismail… - International Journal of …, 2020 - academia.edu
In modern scientific research, data analyses are often used as a popular tool across
computer science, communication science, and biological science. Clustering plays a …

Clustering methods for big data analytics

O Nasraoui, CEB N'Cir - Techniques, Toolboxes and Applications, 2019 - Springer
Data has become the lifeblood of today's knowledge-driven economy and society. Big data
clustering aims to summarize, segment, and group large volumes and varieties of data that …

Big data clustering: a review

AS Shirkhorshidi, S Aghabozorgi, TY Wah… - … Science and Its …, 2014 - Springer
Clustering is an essential data mining and tool for analyzing big data. There are difficulties
for applying clustering techniques to big data duo to new challenges that are raised with big …

Efficient algorithm for big data clustering on single machine

RM Alguliyev, RM Aliguliyev… - CAAI Transactions on …, 2020 - Wiley Online Library
Big data analysis requires the presence of large computing powers, which is not always
feasible. And so, it became necessary to develop new clustering algorithms capable of such …

[HTML][HTML] Balancing effort and benefit of K-means clustering algorithms in Big Data realms

J Pérez-Ortega, NN Almanza-Ortega, D Romero - PLoS One, 2018 - journals.plos.org
In this paper we propose a criterion to balance the processing time and the solution quality
of k-means cluster algorithms when applied to instances where the number n of objects is …