A two-stage clustering algorithm based on improved k-means and density peak clustering

N Xiao, X Zhou, X Huang, Z Yang - 2019 IEEE International …, 2019 - ieeexplore.ieee.org
N Xiao, X Zhou, X Huang, Z Yang
2019 IEEE International Conference on Big Knowledge (ICBK), 2019ieeexplore.ieee.org
The density peak clustering algorithm (DPC) has been widely concerned by researchers
since it was proposed. Its advantage lies in its ability to achieve efficient clustering based on
two simple assumptions. In DPC, a key step is to manually select the cluster centers
according to the decision graph. The quality of the decision graph determines the quality of
the selected cluster centers and the quality of the clustering result. The quality of the
decision graph is determined by the parameter dc. Although the authors have proposed an …
The density peak clustering algorithm (DPC) has been widely concerned by researchers since it was proposed. Its advantage lies in its ability to achieve efficient clustering based on two simple assumptions. In DPC, a key step is to manually select the cluster centers according to the decision graph. The quality of the decision graph determines the quality of the selected cluster centers and the quality of the clustering result. The quality of the decision graph is determined by the parameter dc. Although the authors have proposed an empirical parameter selection method, this method does not work well in many real-world datasets. Therefore, in these data sets, the user needs to repeatedly adjust the parameter multiple times to get a good decision graph. Thus, manually selecting cluster centers is not an easy task. In this paper, combined with the clustering idea of K-means and DPC, we propose a two-stage clustering algorithm KDPC that can automatically acquire the cluster centers. In the first stage, KDPC uses an improved K-means algorithm to obtain high quality cluster centers. In the second stage, KDPC clusters the remaining data points according to the clustering idea of DPC. Experiments show that KDPC can achieve good clustering effect in both artificial data sets and real-world data sets. In addition, compared with DPC, KDPC can show better clustering effect in data sets with significant difference in density of clusters.
ieeexplore.ieee.org
以上显示的是最相近的搜索结果。 查看全部搜索结果