K-means clustering algorithms: A comprehensive review, variants analysis, and advances in the era of big data

AM Ikotun, AE Ezugwu, L Abualigah, B Abuhaija… - Information …, 2023 - Elsevier
Advances in recent techniques for scientific data collection in the era of big data allow for the
systematic accumulation of large quantities of data at various data-capturing sites. Similarly …

The k-means Algorithm: A Comprehensive Survey and Performance Evaluation

M Ahmed, R Seraj, SMS Islam - Electronics, 2020 - mdpi.com
The k-means clustering algorithm is considered one of the most powerful and popular data
mining algorithms in the research community. However, despite its popularity, the algorithm …

The K-means algorithm evolution

J Pérez-Ortega, NN Almanza-Ortega… - Introduction to data …, 2019 - books.google.com
Clustering is one of the main methods for getting insight on the underlying nature and
structure of data. The purpose of clustering is organizing a set of data into clusters, such that …

Automatic clustering algorithms: a systematic review and bibliometric analysis of relevant literature

AE Ezugwu, AK Shukla, MB Agbaje… - Neural Computing and …, 2021 - Springer
Cluster analysis is an essential tool in data mining. Several clustering algorithms have been
proposed and implemented, most of which are able to find good quality clustering results …

How to use K-means for big data clustering?

R Mussabayev, N Mladenovic, B Jarboui… - Pattern Recognition, 2023 - Elsevier
K-means plays a vital role in data mining and is the simplest and most widely used algorithm
under the Euclidean Minimum Sum-of-Squares Clustering (MSSC) model. However, its …

An effective and efficient algorithm for k-means clustering with new formulation

F Nie, Z Li, R Wang, X Li - IEEE Transactions on Knowledge …, 2022 - ieeexplore.ieee.org
K-means is one of the most simple and popular clustering algorithms, which implemented as
a standard clustering method in most of machine learning researches. The goal of K-means …

A survey on platforms for big data analytics

D Singh, CK Reddy - Journal of big data, 2015 - Springer
The primary purpose of this paper is to provide an in-depth analysis of different platforms
available for performing big data analytics. This paper surveys different hardware platforms …

Performances of k-means clustering algorithm with different distance metrics

TM Ghazal - Intelligent Automation & Soft …, 2021 - research.skylineuniversity.ac.ae
Clustering is the process of grouping the data based on their similar properties. Meanwhile,
it is the categorization of a set of data into similar groups (clusters), and the elements in each …

Yinyang k-means: A drop-in replacement of the classic k-means with consistent speedup

Y Ding, Y Zhao, X Shen, M Musuvathi… - … on machine learning, 2015 - proceedings.mlr.press
This paper presents Yinyang K-means, a new algorithm for K-means clustering. By
clustering the centers in the initial stage, and leveraging efficiently maintained lower and …

K-means clustering versus validation measures: a data distribution perspective

H Xiong, J Wu, J Chen - Proceedings of the 12th ACM SIGKDD …, 2006 - dl.acm.org
K-means is a widely used partitional clustering method. While there are considerable
research efforts to characterize the key features of K-means clustering, further investigation …