K-means clustering algorithms: A comprehensive review, variants analysis, and advances in the era of big data

AM Ikotun, AE Ezugwu, L Abualigah, B Abuhaija… - Information …, 2023 - Elsevier
Advances in recent techniques for scientific data collection in the era of big data allow for the
systematic accumulation of large quantities of data at various data-capturing sites. Similarly …

Improved k-means clustering algorithm for big data based on distributed smartphoneneural engine processor

FH Awad, MM Hamad - Electronics, 2022 - mdpi.com
Clustering is one of the most significant applications in the big data field. However, using the
clustering technique with big data requires an ample amount of processing power and …

Heart Disease Prediction Using Hybrid Machine Learning: A Brief Review

M Ahmed, I Husien - Journal of Robotics and Control (JRC), 2024 - journal.umy.ac.id
Cardiovascular disease is a widespread and potentially fatal condition that requires
proactive preventive measures and efficient screening approaches on a global scale. To …

High-density cluster core-based k-means clustering with an unknown number of clusters

A Kumar, A Kumar, R Mallipeddi, DG Lee - Applied Soft Computing, 2024 - Elsevier
The k-means algorithm, known for its simplicity and adaptability, faces challenges related to
manual cluster number selection and sensitivity to initial centroid placement. This paper …

K-Means and J48 Algorithms to Categorize Student Research Abstracts

LK Choi, KB Rii, HW Park - … Journal of Cyber and IT Service …, 2023 - iiast.iaic-publisher.org
Text mining is a rapidly growing field in computer science that is used to extract meaningful
information from text data. This information can be used for various applications, such as …

Tamil offensive language detection: Supervised versus unsupervised learning approaches

V Balakrishnan, V Govindan… - ACM Transactions on …, 2023 - dl.acm.org
Studies on natural language processing are mainly conducted in English, with very few
exploring languages that are under-resourced, including the Dravidian languages. We …

Optimizing K-means for Big Data: A Comparative Study

R Mussabayev, R Mussabayev - arXiv preprint arXiv:2310.09819, 2023 - arxiv.org
This paper presents a comparative analysis of different optimization techniques for the K-
means algorithm in the context of big data. K-means is a widely used clustering algorithm …

A secure two-party Euclidean distance computation scheme through a covert adversarial model based on Paillier encryption

L Su, H Geng, S Guo, S He - IEEE Access, 2023 - ieeexplore.ieee.org
Existing secure two-party Euclidean distance computation schemes are mostly performed
based on a semi-honest model, which faces bottlenecks in computation efficiency and …

Machine learning for major food crops breeding: Applications, challenges, and ways forward

K N. Govaichelvan, D Pathmanathan… - Agronomy …, 2024 - Wiley Online Library
Increasing the production of the three major food crops (MFCs), maize (Zea mays), rice
(Oryza sativa), and wheat (Triticum aestivum), is essential to fulfilling the food demand for …

Open-Circuit Fault Diagnosis for a Modular Multilevel Converter Based on Hybrid Machine Learning

Y An, X Sun, B Ren, X Zhang - IEEE Access, 2024 - ieeexplore.ieee.org
With the wide application of a modular multilevel converter in various power conversion
fields, submodule open-circuit fault diagnostics have attracted increasing attention, as some …