作者
Gaël Beck, Tarn Duong, Mustapha Lebbah, Hanane Azzag, Christophe Cérin
发表日期
2019/12/1
期刊
Journal of Parallel and Distributed Computing
卷号
134
页码范围
128-139
出版商
Academic Press
简介
Mean Shift clustering, as a generalization of the well-known k-means clustering, computes arbitrarily shaped clusters as defined as the basins of attraction to the local modes created by the density gradient ascent paths. Despite its potential for improved clustering accuracy, the Mean Shift approach is a computationally expensive method for unsupervised learning. We introduce two contributions aiming to provide approximate Mean Shift clustering, based on scalable procedures to compute the density gradient ascent and cluster labeling, with a linear time complexity, as opposed to the quadratic time complexity for the exact clustering. Both propositions are based on Locality Sensitive Hashing (LSH) to approximate nearest neighbors. When implemented on a serial system, these approximate methods can be used for moderate sized datasets. To facilitate the analysis of Big Data, a distributed implementation, written …
引用总数
20202021202220232024581051
学术搜索中的文章
G Beck, T Duong, M Lebbah, H Azzag, C Cérin - Journal of Parallel and Distributed Computing, 2019