The k-means Algorithm: A Comprehensive Survey and Performance Evaluation

M Ahmed, R Seraj, SMS Islam - Electronics, 2020 - mdpi.com
The k-means clustering algorithm is considered one of the most powerful and popular data
mining algorithms in the research community. However, despite its popularity, the algorithm …

A tutorial on spectral clustering

U Von Luxburg - Statistics and computing, 2007 - Springer
In recent years, spectral clustering has become one of the most popular modern clustering
algorithms. It is simple to implement, can be solved efficiently by standard linear algebra …

Information theoretic measures for clusterings comparison: is a correction for chance necessary?

NX Vinh, J Epps, J Bailey - Proceedings of the 26th annual international …, 2009 - dl.acm.org
Information theoretic based measures form a fundamental class of similarity measures for
comparing clusterings, beside the class of pair-counting based and set-matching based …

Topology and data

G Carlsson - Bulletin of the American Mathematical Society, 2009 - ams.org
AMS :: Bulletin of the American Mathematical Society Skip to Main Content American
Mathematical Society American Mathematical Society MathSciNet Bookstore Publications …

Stability selection

N Meinshausen, P Bühlmann - Journal of the Royal Statistical …, 2010 - academic.oup.com
Estimation of structure, such as in variable selection, graphical modelling or cluster analysis,
is notoriously difficult, especially for high dimensional data. We introduce stability selection …

What can we learn privately?

SP Kasiviswanathan, HK Lee, K Nissim… - SIAM Journal on …, 2011 - SIAM
Learning problems form an important category of computational tasks that generalizes many
of the computations researchers apply to large real-life data sets. We ask, What concept …

Stability approach to regularization selection (stars) for high dimensional graphical models

H Liu, K Roeder, L Wasserman - Advances in neural …, 2010 - proceedings.neurips.cc
A challenging problem in estimating high-dimensional graphical models is to choose the
regularization parameter in a data-dependent way. The standard techniques include $ K …

Acoustic sequences in non‐human animals: a tutorial review and prospectus

A Kershenbaum, DT Blumstein, MA Roch… - Biological …, 2016 - Wiley Online Library
Animal acoustic communication often takes the form of complex sequences, made up of
multiple distinct acoustic units. Apart from the well‐known example of birdsong, other …

[HTML][HTML] What are the true clusters?

C Hennig - Pattern Recognition Letters, 2015 - Elsevier
Abstract Constructivist philosophy and Hasok Chang's active scientific realism are used to
argue that the idea of “truth” in cluster analysis depends on the context and the clustering …

Clustering stability: an overview

U Von Luxburg - Foundations and Trends® in Machine …, 2010 - nowpublishers.com
A popular method for selecting the number of clusters is based on stability arguments: one
chooses the number of clusters such that the corresponding clustering results are" most …