ANN-Benchmarks: A benchmarking tool for approximate nearest neighbor algorithms

M Aumüller, E Bernhardsson, A Faithfull - Information Systems, 2020 - Elsevier
This paper describes ANN-Benchmarks, a tool for evaluating the performance of in-memory
approximate nearest neighbor algorithms. It provides a standard interface for measuring the …

ANN-benchmarks: A benchmarking tool for approximate nearest neighbor algorithms

M Aumüller, E Bernhardsson, A Faithfull - International conference on …, 2017 - Springer
This paper describes ANN-Benchmarks, a tool for evaluating the performance of in-memory
approximate nearest neighbor algorithms. It provides a standard interface for measuring the …

Index structures for fast similarity search for binary vectors

DA Rachkovskij - Cybernetics and Systems Analysis, 2017 - Springer
This article reviews index structures for fast similarity search for objects represented by
binary vectors (with components equal to 0 or 1). Structures for both exact and approximate …

Fast Distributed kNN Graph Construction Using Auto-tuned Locality-sensitive Hashing

C Eiras-Franco, D Martínez-Rego, L Kanthan… - ACM Transactions on …, 2020 - dl.acm.org
The k-nearest-neighbors (k NN) graph is a popular and powerful data structure that is used
in various areas of Data Science, but the high computational cost of obtaining it hinders its …

Index structures for fast similarity search for real-valued vectors. I

DA Rachkovskij - Cybernetics and Systems Analysis, 2018 - Springer
This survey paper considers index structures for fast similarity search for objects represented
by real-valued vectors. Index structures based on locality-sensitive hashing and their …

Non-mergeable sketching for cardinality estimation

S Pettie, D Wang, L Yin - arXiv preprint arXiv:2008.08739, 2020 - arxiv.org
Cardinality estimation is perhaps the simplest non-trivial statistical problem that can be
solved via sketching. Industrially-deployed sketches like HyperLogLog, MinHash, and PCSA …

[PDF][PDF] Simple and Efficient Cardinality Estimation in Data Streams

S Pettie, D Wang, L Yin - arXiv preprint arXiv:2008.08739, 2020 - researchgate.net
We study sketching schemes for the cardinality estimation problem in data streams, and
advocate for measuring the efficiency of such a scheme in terms of its MVP: Memory …

[PDF][PDF] Similarity Search: Algorithms for Sets and other High Dimensional Data

TD Ahle - 2019 - pure.itu.dk
Similarity Search: Algorithms for Sets and other High Dimensional Data Page 1 Similarity
Search: Algorithms for Sets and other High Dimensional Data Thomas Dybdahl Ahle Advisor …

Индексные структуры для быстрого поиска по сходству вещественных векторов. II

ДА Рачковский - Кибернетика и системный анализ, 2018 - dspace.nbuv.gov.ua
Дан обзор индексных структур для быстрого поиска по сходству объектов,
представленных вещественными векторами. Рассмотрены структуры как для точного …

Индексные структуры для быстрого поиска по сходству бинарных векторов

ДА Рачковский - Кибернетика и системный анализ, 2017 - irbis-nbuv.gov.ua
Дан обзор индексных структур для быстрого поиска по сходr ству объектов,
представленных бинарными векторами (с компонентами 0 или 1). Рассмотрены …