An Extensive Empirical Comparison of k-means Initialization Algorithms

S Harris, RC De Amorim - IEEE Access, 2022 - ieeexplore.ieee.org
The k-means clustering algorithm, whilst widely popular, is not without its drawbacks. In this
paper, we focus on the sensitivity of k-means to its initial set of centroids. Since the cluster …

[HTML][HTML] An empirical comparison between stochastic and deterministic centroid initialisation for K-means variations

A Vouros, S Langdell, M Croucher, E Vasilaki - Machine Learning, 2021 - Springer
K-Means is one of the most used algorithms for data clustering and the usual clustering
method for benchmarking. Despite its wide application it is well-known that it suffers from a …

A comparative study of efficient initialization methods for the k-means clustering algorithm

M Emre Celebi, HA Kingravi, PA Vela - arXiv e-prints, 2012 - ui.adsabs.harvard.edu
K-means is undoubtedly the most widely used partitional clustering algorithm. Unfortunately,
due to its gradient descent nature, this algorithm is highly sensitive to the initial placement of …

[PDF][PDF] Initializing k-means Clustering.

C Borgelt, O Yarikova - DATA, 2020 - eplus.uni-salzburg.at
The quality of clustering results obtained with the k-means algorithm depends heavily on the
initialization of the cluster centers. Simply sampling centers uniformly at random from the …

MST-Based Cluster Initialization for K-Means

D Reddy, D Mishra, PK Jana - … , CCSIT 2011, Bangalore, India, January 2-4 …, 2011 - Springer
Clustering is an exploratory data analysis tool that has gained enormous attention in the
recent years specifically for gene expression data analysis. The K-means clustering is a …

A comparative study of efficient initialization methods for the k-means clustering algorithm

ME Celebi, HA Kingravi, PA Vela - Expert systems with applications, 2013 - Elsevier
K-means is undoubtedly the most widely used partitional clustering algorithm. Unfortunately,
due to its gradient descent nature, this algorithm is highly sensitive to the initial placement of …

A new initialisation method for k-means algorithm in the clustering problem: data analysis

A Kazemi, G Khodabandehlouie - International Journal of …, 2018 - inderscienceonline.com
Clustering is one of the most important tasks in exploratory data analysis. One of the
simplest and the most widely used clustering algorithm is K-means which was proposed in …

Linear, deterministic, and order-invariant initialization methods for the k-means clustering algorithm

ME Celebi, HA Kingravi - Partitional clustering algorithms, 2015 - Springer
Over the past five decades, k-means has become the clustering algorithm of choice in many
application domains primarily due to its simplicity, time/space efficiency, and invariance to …

Initializing k-means Clustering by Bootstrap and Data Depth

A Torrente, J Romo - Journal of Classification, 2021 - Springer
The k-means algorithm is widely used in various research fields because of its fast
convergence to the cost function minima; however, it frequently gets stuck in local optima as …

An arithmetic-based deterministic centroid initialization method for the k-means clustering algorithm

MM Mayo - 2016 - csuepress.columbusstate.edu
One of the greatest challenges in k-means clustering is positioning the initial cluster centers,
or centroids, as close to optimal as possible, and doing so in an amount of time deemed …