High dimensional similarity joins: Algorithms and performance evaluation

N Ukey, Z Yang, B Li, G Zhang, Y Hu, W Zhang - Sensors, 2023 - mdpi.com

k nearest neighbours (kNN) queries are fundamental in many applications, ranging from
data mining, recommendation system and Internet of Things, to Industry 4.0 framework …

被引用次数：49 相关文章所有 7 个版本

[PDF] umd.edu

Spatial join techniques

EH Jacox, H Samet - ACM Transactions on Database Systems (TODS), 2007 - dl.acm.org

A variety of techniques for performing a spatial join are reviewed. Instead of just
summarizing the literature and presenting each technique in its entirety, distinct components …

被引用次数：327 相关文章所有 15 个版本

[PDF] uci.edu

Efficient record linkage in large data sets

L Jin, C Li, S Mehrotra - Eighth International Conference on …, 2003 - ieeexplore.ieee.org

This paper describes an efficient approach to record linkage. Given two lists of records, the
record-linkage problem consists of determining all pairs that are similar to each other where …

被引用次数：321 相关文章所有 16 个版本

[PDF] lmu.de

The k-Nearest Neighbour Join: Turbo Charging the KDD Process

C Böhm, F Krebs - Knowledge and Information Systems, 2004 - Springer

The similarity join has become an important database primitive for supporting similarity
searches and data mining. A similarity join combines two sets of complex objects such that …

被引用次数：200 相关文章所有 16 个版本

[PDF] psu.edu

Metric space similarity joins

EH Jacox, H Samet - ACM Transactions on Database Systems (TODS), 2008 - dl.acm.org

Similarity join algorithms find pairs of objects that lie within a certain distance ε of each other.
Algorithms that are adapted from spatial join techniques are designed primarily for data in a …

被引用次数：176 相关文章所有 10 个版本

[PDF] psu.edu

A fast similarity join algorithm using graphics processing units

MD Lieberman, J Sankaranarayanan… - 2008 IEEE 24th …, 2008 - ieeexplore.ieee.org

A similarity join operation A BOWTIE epsiv B takes two sets of points A, B and a value epsiv
isin Ropf, and outputs pairs of points p isin A, q isin B, such that the distance D (p, q) les …

被引用次数：180 相关文章所有 11 个版本

[PDF] psu.edu

Efficient exact edit similarity query processing with the asymmetric signature scheme

J Qin, W Wang, Y Lu, C Xiao, X Lin - Proceedings of the 2011 ACM …, 2011 - dl.acm.org

Given a query string Q, an edit similarity search finds all strings in a database whose edit
distance with Q is no more than a given threshold t. Most existing method answering edit …

被引用次数：138 相关文章所有 7 个版本

Data redundancy and duplicate detection in spatial join processing

JP Dittrich, B Seeger - Proceedings of 16th International …, 2000 - ieeexplore.ieee.org

The partition-based spatial-merge join (PBSM) of JM Patel and DJ DeWitt (1996) and the
size separation spatial join (S/sup 3/J) of N. Koudas and KC Sevcik (1997) are considered to …

被引用次数：160 相关文章所有 3 个版本

[PDF] sigmodrecord.org

Epsilon grid order: An algorithm for the similarity join on massive high-dimensional data

C Böhm, B Braunmüller, F Krebs, HP Kriegel - ACM SIGMOD Record, 2001 - dl.acm.org

The similarity join is an important database primitive which has been successfully applied to
speed up applications such as similarity search, data analysis and data mining. The …

被引用次数：165 相关文章所有 14 个版本

[PDF] vldb.org

[PDF][PDF] Gorder: an efficient method for knn join processing

C Xia, H Lu, BC Ooi, J Hu - … of the Thirtieth international conference on Very …, 2004 - vldb.org

An important but very expensive primitive operation of high-dimensional databases is the K-
Nearest Neighbor (KNN) similarity join. The operation combines each point of one dataset …

被引用次数：164 相关文章所有 13 个版本

高级搜索

QQ 群