K-means properties on six clustering benchmark datasets

P Fränti, S Sieranoja - Applied intelligence, 2018 - Springer
This paper has two contributions. First, we introduce a clustering basic benchmark. Second,
we study the performance of k-means using this benchmark. Specifically, we measure how …

Optimal neighborhood multiple kernel clustering with adaptive local kernels

J Liu, X Liu, J Xiong, Q Liao, S Zhou… - … on Knowledge and …, 2020 - ieeexplore.ieee.org
Multiple kernel clustering (MKC) algorithm aims to group data into different categories by
optimally integrating information from a group of pre-specified kernels. Though …

Efficiency of random swap clustering

P Fränti - Journal of big data, 2018 - Springer
Random swap algorithm aims at solving clustering by a sequence of prototype swaps, and
by fine-tuning their exact location by k-means. This randomized search strategy is simple to …

Gaussian mixture model with feature selection: An embedded approach

Y Fu, X Liu, S Sarkar, T Wu - Computers & Industrial Engineering, 2021 - Elsevier
Abstract Gaussian Mixture Model (GMM) is a popular clustering algorithm due to its neat
statistical properties, which enable the “soft” clustering and the determination of the number …

Are cluster validity measures (in) valid?

M Gagolewski, M Bartoszuk, A Cena - Information Sciences, 2021 - Elsevier
Internal cluster validity measures (such as the Calinski–Harabasz, Dunn, or Davies–Bouldin
indices) are frequently used for selecting the appropriate number of partitions a dataset …

Solving the large-scale tsp problem in 1 h: Santa claus challenge 2020

R Mariescu-Istodor, P Fränti - Frontiers in Robotics and AI, 2021 - frontiersin.org
The scalability of traveling salesperson problem (TSP) algorithms for handling large-scale
problem instances has been an open problem for a long time. We arranged a so-called …

A spatial filtering inspired three-way clustering approach with application to outlier detection

B Ali, N Azam, A Shah, JT Yao - International Journal of Approximate …, 2021 - Elsevier
Three-way clustering provides an effective framework for clustering of data in the presence
of uncertain, imprecise and incomplete data. In this article, we used ideas inspired from two …

Non-iterative border-peeling clustering algorithm based on swap strategy

H Tu, S Ding, X Xu, H Hou, C Li, L Ding - Information Sciences, 2024 - Elsevier
Border-Peeling algorithm is a recently proposed density based clustering algorithm. The
method of peeling off border points by continuous iteration and calculating the density …

Bayesian cluster enumeration criterion for unsupervised learning

FK Teklehaymanot, M Muma… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
We derive a new Bayesian Information Criterion (BIC) by formulating the problem of
estimating the number of clusters in an observed dataset as maximization of the posterior …

Smoothing Outlier Scores is All You Need to Improve Outlier Detectors

J Yang, S Rahardja, P Fränti - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
We hypothesize that similar objects should have similar outlier scores. To the best of our
knowledge, all existing outlier detectors calculate the outlier score for each object …