EH Jacox, H Samet - ACM Transactions on Database Systems (TODS), 2007 - dl.acm.org
A variety of techniques for performing a spatial join are reviewed. Instead of just summarizing the literature and presenting each technique in its entirety, distinct components …
L Jin, C Li, S Mehrotra - Eighth International Conference on …, 2003 - ieeexplore.ieee.org
This paper describes an efficient approach to record linkage. Given two lists of records, the record-linkage problem consists of determining all pairs that are similar to each other where …
C Böhm, F Krebs - Knowledge and Information Systems, 2004 - Springer
The similarity join has become an important database primitive for supporting similarity searches and data mining. A similarity join combines two sets of complex objects such that …
EH Jacox, H Samet - ACM Transactions on Database Systems (TODS), 2008 - dl.acm.org
Similarity join algorithms find pairs of objects that lie within a certain distance ε of each other. Algorithms that are adapted from spatial join techniques are designed primarily for data in a …
A similarity join operation A BOWTIE epsiv B takes two sets of points A, B and a value epsiv isin Ropf, and outputs pairs of points p isin A, q isin B, such that the distance D (p, q) les …
Given a query string Q, an edit similarity search finds all strings in a database whose edit distance with Q is no more than a given threshold t. Most existing method answering edit …
JP Dittrich, B Seeger - Proceedings of 16th International …, 2000 - ieeexplore.ieee.org
The partition-based spatial-merge join (PBSM) of JM Patel and DJ DeWitt (1996) and the size separation spatial join (S/sup 3/J) of N. Koudas and KC Sevcik (1997) are considered to …
C Böhm, B Braunmüller, F Krebs, HP Kriegel - ACM SIGMOD Record, 2001 - dl.acm.org
The similarity join is an important database primitive which has been successfully applied to speed up applications such as similarity search, data analysis and data mining. The …
C Xia, H Lu, BC Ooi, J Hu - … of the Thirtieth international conference on Very …, 2004 - vldb.org
An important but very expensive primitive operation of high-dimensional databases is the K- Nearest Neighbor (KNN) similarity join. The operation combines each point of one dataset …