K-means clustering algorithms: A comprehensive review, variants analysis, and advances in the era of big data

AM Ikotun, AE Ezugwu, L Abualigah, B Abuhaija… - Information …, 2023 - Elsevier
Advances in recent techniques for scientific data collection in the era of big data allow for the
systematic accumulation of large quantities of data at various data-capturing sites. Similarly …

The k-means Algorithm: A Comprehensive Survey and Performance Evaluation

M Ahmed, R Seraj, SMS Islam - Electronics, 2020 - mdpi.com
The k-means clustering algorithm is considered one of the most powerful and popular data
mining algorithms in the research community. However, despite its popularity, the algorithm …

GBK-means clustering algorithm: An improvement to the K-means algorithm based on the bargaining game

MJ Rezaee, M Eshkevari, M Saberi… - Knowledge-Based Systems, 2021 - Elsevier
Due to its simplicity, versatility and the diversity of applications to which it can be applied, k-
means is one of the well-known algorithms for clustering data. The foundation of this …

Unsupervised anomaly detection for IoT-based multivariate time series: Existing solutions, performance analysis and future directions

MA Belay, SS Blakseth, A Rasheed, P Salvo Rossi - Sensors, 2023 - mdpi.com
The recent wave of digitalization is characterized by the widespread deployment of sensors
in many different environments, eg, multi-sensor systems represent a critical enabling …

An evolutionary computing-based efficient hybrid task scheduling approach for heterogeneous computing environment

M Sulaiman, Z Halim, M Lebbah, M Waqas… - Journal of Grid …, 2021 - Springer
Task schedule optimization enables to attain high performance in both homogeneous and
heterogeneous computing environments. The primary objective of task scheduling is to …

Efficient algorithm for big data clustering on single machine

RM Alguliyev, RM Aliguliyev… - CAAI Transactions on …, 2020 - Wiley Online Library
Big data analysis requires the presence of large computing powers, which is not always
feasible. And so, it became necessary to develop new clustering algorithms capable of such …

Development of new seed with modified validity measures for k-means clustering

S Manochandar, M Punniyamoorthy… - Computers & Industrial …, 2020 - Elsevier
Conventional k-means clustering is the widely used partitional method, mainly adapted to
machine learning and pattern recognition problems. This algorithm is highly sensitive to …

A novel hybrid multi-verse optimizer with K-means for text documents clustering

AK Abasi, AT Khader, MA Al-Betar, S Naim… - Neural Computing and …, 2020 - Springer
Text clustering has been widely utilized with the aim of partitioning specific document
collection into different subsets using homogeneity/heterogeneity criteria. It has also become …

Parallel batch k-means for Big data clustering

RM Alguliyev, RM Aliguliyev, LV Sukhostat - Computers & Industrial …, 2021 - Elsevier
The application of clustering algorithms is expanding due to the rapid growth of data
volumes. Nevertheless, existing algorithms are not always effective because of high …

Multi-view document clustering based on geometrical similarity measurement

B Diallo, J Hu, T Li, GA Khan, AS Hussein - International Journal of …, 2022 - Springer
Numerous works implemented multi-view clustering algorithms in document clustering. A
challenging problem in document clustering is the similarity metric. Existing multi-view …