Survey on exact knn queries over high-dimensional data space

N Ukey, Z Yang, B Li, G Zhang, Y Hu, W Zhang - Sensors, 2023 - mdpi.com
k nearest neighbours (kNN) queries are fundamental in many applications, ranging from
data mining, recommendation system and Internet of Things, to Industry 4.0 framework …

Spatial join techniques

EH Jacox, H Samet - ACM Transactions on Database Systems (TODS), 2007 - dl.acm.org
A variety of techniques for performing a spatial join are reviewed. Instead of just
summarizing the literature and presenting each technique in its entirety, distinct components …

Efficient record linkage in large data sets

L Jin, C Li, S Mehrotra - Eighth International Conference on …, 2003 - ieeexplore.ieee.org
This paper describes an efficient approach to record linkage. Given two lists of records, the
record-linkage problem consists of determining all pairs that are similar to each other where …

The k-Nearest Neighbour Join: Turbo Charging the KDD Process

C Böhm, F Krebs - Knowledge and Information Systems, 2004 - Springer
The similarity join has become an important database primitive for supporting similarity
searches and data mining. A similarity join combines two sets of complex objects such that …

Metric space similarity joins

EH Jacox, H Samet - ACM Transactions on Database Systems (TODS), 2008 - dl.acm.org
Similarity join algorithms find pairs of objects that lie within a certain distance ε of each other.
Algorithms that are adapted from spatial join techniques are designed primarily for data in a …

A fast similarity join algorithm using graphics processing units

MD Lieberman, J Sankaranarayanan… - 2008 IEEE 24th …, 2008 - ieeexplore.ieee.org
A similarity join operation A ⋈_ε B takes two sets of points A, B and a value ε ∈ ℝ,
and outputs pairs of points p ∈ A, q ∈ B, such that the distance D(p, q) ≤ …

Efficient exact edit similarity query processing with the asymmetric signature scheme

J Qin, W Wang, Y Lu, C Xiao, X Lin - Proceedings of the 2011 ACM …, 2011 - dl.acm.org
Given a query string Q, an edit similarity search finds all strings in a database whose edit
distance with Q is no more than a given threshold t. Most existing method answering edit …

Data redundancy and duplicate detection in spatial join processing

JP Dittrich, B Seeger - Proceedings of 16th International …, 2000 - ieeexplore.ieee.org
The partition-based spatial-merge join (PBSM) of JM Patel and DJ DeWitt (1996) and the
size separation spatial join (S³J) of N. Koudas and KC Sevcik (1997) are considered to …

Epsilon grid order: An algorithm for the similarity join on massive high-dimensional data

C Böhm, B Braunmüller, F Krebs, HP Kriegel - ACM SIGMOD Record, 2001 - dl.acm.org
The similarity join is an important database primitive which has been successfully applied to
speed up applications such as similarity search, data analysis and data mining. The …

Gorder: an efficient method for knn join processing

C Xia, H Lu, BC Ooi, J Hu - … of the Thirtieth international conference on Very …, 2004 - vldb.org
An important but very expensive primitive operation of high-dimensional databases is the K-
Nearest Neighbor (KNN) similarity join. The operation combines each point of one dataset …