[PDF][PDF] Improving the Accuracy and Efficiency of the k-means Clustering Algorithm

KAA Nazeer, MP Sebastian - Proceedings of the world congress on …, 2009 - iaeng.org
Emergence of modern techniques for scientific data collection has resulted in large scale
accumulation of data pertaining to diverse fields. Conventional database querying methods …

Analysis of k-means and k-medoids algorithm for big data

P Arora, S Varshney - Procedia Computer Science, 2016 - Elsevier
Clustering plays a very vital role in exploring data, creating predictions and to overcome the
anomalies in the data. Clusters that contain collateral, identical characteristics in a dataset …

An efficient clustering algorithm based on the k-nearest neighbors with an indexing ratio

R Qaddoura, H Faris, I Aljarah - International Journal of Machine Learning …, 2020 - Springer
Clustering is a challenging problem that is commonly used for many applications. It aims at
finding the similarity between data points and grouping similar ones into the same cluster. In …

A linear time-complexity k-means algorithm using cluster shifting

MK Pakhira - 2014 international conference on computational …, 2014 - ieeexplore.ieee.org
The k-means algorithm is known to have a time complexity of O (n 2), where n is the input
data size. This quadratic complexity debars the algorithm from being effectively used in large …

K-means properties on six clustering benchmark datasets

P Fränti, S Sieranoja - Applied intelligence, 2018 - Springer
This paper has two contributions. First, we introduce a clustering basic benchmark. Second,
we study the performance of k-means using this benchmark. Specifically, we measure how …

K and starting means for k-means algorithm

A Fahim - Journal of Computational Science, 2021 - Elsevier
The k-means method aims to divide a set of N objects into k clusters, where each cluster is
represented by the mean value of its objects. This algorithm is simple and converges to local …

Effect of distance metrics in determining k-value in k-means clustering using elbow and silhouette method

DM Saputra, D Saputra, LD Oswari - … international conference on …, 2020 - atlantis-press.com
Clustering is one of the main task in datamining. It is useful to group and cluster the data.
There are a few ways to cluster the data such as partitional-based, hierarchical-based and …

An efficient enhanced k-means clustering algorithm

AM Fahim, AM Salem, FA Torkey… - Journal of Zhejiang …, 2006 - Springer
In k-means clustering, we are given a set of n data points in d-dimensional space ℝ d and an
integer k and the problem is to determine a set of k points in ℝ d, called centers, so as to …

A time-efficient pattern reduction algorithm for k-means clustering

MC Chiang, CW Tsai, CS Yang - Information Sciences, 2011 - Elsevier
This paper presents an efficient algorithm, called pattern reduction (PR), for reducing the
computation time of k-means and k-means-based clustering algorithms. The proposed …

[PDF][PDF] A modified k-means algorithm to avoid empty clusters

MK Pakhira - International Journal of Recent Trends in …, 2009 - researchgate.net
The k-means algorithm is one of the most widely used clustering algorithms and has been
applied in many fields of science and technology. One of the major problems of the k-means …