作者
Yan-Ming Zhang, Kaizhu Huang, Guanggang Geng, Cheng-Lin Liu
发表日期
2013
研讨会论文
Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2013, Prague, Czech Republic, September 23-27, 2013, Proceedings, Part II 13
页码范围
660-674
出版商
Springer Berlin Heidelberg
简介
The k nearest neighbors (kNN) graph, perhaps the most popular graph in machine learning, plays an essential role for graph-based learning methods. Despite its many elegant properties, the brute force kNN graph construction method has computational complexity of O(n 2), which is prohibitive for large scale data sets. In this paper, based on the divide-and-conquer strategy, we propose an efficient algorithm for approximating kNN graphs, which has the time complexity of O(l(d + logn)n) only (d is the dimensionality and l is usually a small number). This is much faster than most existing fast methods. Specifically, we engage the locality sensitive hashing technique to divide items into small subsets with equal size, and then build one kNN graph on each subset using the brute force method. To enhance the approximation quality, we repeat this procedure for several times to generate multiple basic …
引用总数
20122013201420152016201720182019202020212022202320241413714122017145134
学术搜索中的文章
YM Zhang, K Huang, G Geng, CL Liu - Machine Learning and Knowledge Discovery in …, 2013