Fast and provably good seedings for k-means

O Bachem, M Lucic, H Hassani… - Advances in neural …, 2016 - proceedings.neurips.cc
Seeding-the task of finding initial cluster centers-is critical in obtaining high-quality
clusterings for k-Means. However, k-means++ seeding, the state of the art algorithm, does …

Approximate k-means++ in sublinear time

O Bachem, M Lucic, SH Hassani… - Proceedings of the AAAI …, 2016 - ojs.aaai.org
The quality of K-Means clustering is extremely sensitive to proper initialization. The classic
remedy is to apply k-means++ to obtain an initial set of centers that is provably competitive …

Multi-swap k-means++

L Beretta, V Cohen-Addad… - Advances in Neural …, 2023 - proceedings.neurips.cc
Abstract The $ k $-means++ algorithm of Arthur and Vassilvitskii (SODA 2007) is often the
practitioners' choice algorithm for optimizing the popular $ k $-means clustering objective …

[PDF][PDF] Genetic algorithm with fast greedy heuristic for clustering and location problems

LA Kazakovtsev, AN Antamoshkin - Informatica, 2014 - informatica.si
Genetic Algorithm with Fast Greedy Heuristic for Clustering and Location Problems 1
Introduction Page 1 Informatica 38 (2014) 229–240 229 Genetic Algorithm with Fast Greedy …

A constant-factor bi-criteria approximation guarantee for k-means++

D Wei - Advances in neural information processing systems, 2016 - proceedings.neurips.cc
This paper studies the $ k $-means++ algorithm for clustering as well as the class of $ D^\ell
$ sampling algorithms to which $ k $-means++ belongs. It is shown that for any constant …

Faster Algorithms for the Constrained k-means Problem

A Bhattacharya, R Jaiswal, A Kumar - Theory of computing systems, 2018 - Springer
The classical center based clustering problems such as k-means/median/center assume that
the optimal clusters satisfy the locality property that the points in the same cluster are close …

A unified framework for clustering constrained data without locality property

H Ding, J Xu - Algorithmica, 2020 - Springer
In this paper, we consider a class of constrained clustering problems of points in R^ d R d,
where d could be rather high. A common feature of these problems is that their optimal …

Do you know what q-means?

JF Doriguello, A Luongo, E Tang - arXiv preprint arXiv:2308.09701, 2023 - arxiv.org
Clustering is one of the most important tools for analysis of large datasets, and perhaps the
most popular clustering algorithm is Lloyd's iteration for $ k $-means. This iteration takes $ N …

Distributed and provably good seedings for k-means in constant rounds

O Bachem, M Lucic, A Krause - International Conference on …, 2017 - proceedings.mlr.press
The k-Means++ algorithm is the state of the art algorithm to solve k-Means clustering
problems as the computed clusterings are O (log k) competitive in expectation. However, its …

Approximate clustering with same-cluster queries

N Ailon, A Bhattacharya, R Jaiswal, A Kumar - arXiv preprint arXiv …, 2017 - arxiv.org
Ashtiani et al. proposed a Semi-Supervised Active Clustering framework (SSAC), where the
learner is allowed to make adaptive queries to a domain expert. The queries are of the kind" …