作者
Chris Ding, Xiaofeng He, Hongyuan Zha, Horst D Simon
发表日期
2002/12/9
研讨会论文
2002 IEEE International Conference on Data Mining, 2002. Proceedings.
页码范围
147-154
出版商
IEEE
简介
It is well-known that for high dimensional data clustering, standard algorithms such as EM and K-means are often trapped in a local minimum. Many initialization methods have been proposed to tackle this problem, with only limited success. In this paper we propose a new approach to resolve this problem by repeated dimension reductions such that K-means or EM are performed only in very low dimensions. Cluster membership is utilized as a bridge between the reduced dimensional subspace and the original space, providing flexibility and ease of implementation. Clustering analysis performed on highly overlapped Gaussians, DNA gene expression profiles and Internet newsgroups demonstrate the effectiveness of the proposed algorithm.
引用总数
20012002200320042005200620072008200920102011201220132014201520162017201820192020202120222023202412816131417101822169131517131791872214155
学术搜索中的文章
C Ding, X He, H Zha, HD Simon - 2002 IEEE International Conference on Data Mining …, 2002