Cache-oblivious high-performance similarity join

M Perdacher, C Plant, C Böhm - … of the 2019 International Conference on …, 2019 - dl.acm.org
A similarity join combines vectors based on a distance condition. Typically, such algorithms
apply a filter step (by indexing or sorting) and then refine pairs of candidate vectors. In this …

Approximate similarity search under edit distance using locality-sensitive hashing

S McCauley - arXiv preprint arXiv:1907.01600, 2019 - arxiv.org
Edit distance similarity search, also called approximate pattern matching, is a fundamental
problem with widespread database applications. The goal of the problem is to preprocess …

Similarity search with tensor core units

T D. Ahle, F Silvestri - International Conference on Similarity Search and …, 2020 - Springer
Abstract Tensor Core Units (TCUs) are hardware accelerators developed for deep neural
networks, which efficiently support the multiplication of two dense m * m matrices, where m is …

Implementing distributed similarity joins using locality sensitive hashing

M Aumüller, M Ceccarello - … of the 25th International Conference on …, 2022 - bia.unibz.it
Similarity joins are a basic primitive in data mining. Given two sets of points, we are
interested in reporting all pairs of points whose similarity is above a user-defined threshold …

Accelerating array joining with integrated value-index

H Xing, G Agrawal - Proceedings of the 31st International Conference on …, 2019 - dl.acm.org
Large-scale multidimensional array processing is becoming an increasingly important
problem with the rise of big data, scientific data processing, and machine learning …

LayerLSH: Rebuilding Locality-Sensitive Hashing Indices by Exploring Density of Hash Values

J Ding, Z Liu, Y Zhang, S Gong, G Yu - IEEE Access, 2022 - ieeexplore.ieee.org
Locality-sensitive hashing (LSH) has attracted extensive research efforts for approximate
nearest neighbors (NN) search. However, most of these LSH-based index structures fail to …

[图书][B] Advanced data structures: theory and applications

S Saha, S Shukla - 2019 - taylorfrancis.com
Advanced data structures is a core course in Computer Science which most graduate
program in Computer Science, Computer Science and Engineering, and other allied …

Mathematical Models for Local Sensing Hashes

L Wang, L Wangner - arXiv preprint arXiv:2111.08344, 2021 - arxiv.org
As data volumes continue to grow, searches in data are becoming increasingly time-
consuming. Classical index structures for neighbor search are no longer sustainable due to …

[PDF][PDF] Implementing Distributed Approximate Similarity Joins using Locality Sensitive Hashing

M Aumüller, M Ceccarello - EDBT: International Conference …, 2022 - openproceedings.org
Similarity joins are a basic primitive in data mining. Given two sets of points, we are
interested in reporting all pairs of points whose similarity is above a user-defined threshold …

Towards a Model for LSH

L Wang - arXiv preprint arXiv:2105.05130, 2021 - arxiv.org
As data volumes continue to grow, clustering and outlier detection algorithms are becoming
increasingly time-consuming. Classical index structures for neighbor search are no longer …