A review for weighted minhash algorithms

W Wu, B Li, C Luo, W Nejdl - Proceedings of the Web Conference 2021, 2021 - dl.acm.org

Networks are ubiquitous in the real world. Link prediction, as one of the key problems for
network-structured data, aims to predict whether there exists a link between two nodes. The …

被引用次数：47 相关文章所有 3 个版本

[PDF] acm.org

Discovering similarity inclusion dependencies

Y Kaminsky, EHM Pena, F Naumann - … of the ACM on Management of …, 2023 - dl.acm.org

Inclusion dependencies (INDs) are a well-known type of data dependency, specifying that
the values of one column are contained in those of another column. INDs can be used for …

被引用次数：17 相关文章所有 2 个版本

ProvNet: Networked bi-directional blockchain for data sharing with verifiable provenance

C Chenli, W Tang, F Gomulka, T Jung - Journal of Parallel and Distributed …, 2022 - Elsevier

Data sharing is increasingly popular especially for scientific research and business fields
where large volume of datasets need to be used, but it involves data security and privacy …

被引用次数：20 相关文章所有 2 个版本

[PDF] neurips.cc

Locality sensitive hashing in fourier frequency domain for soft set containment search

I Roy, R Agarwal, S Chakrabarti… - Advances in Neural …, 2023 - proceedings.neurips.cc

In many search applications related to passage retrieval, text entailment, and subgraph
search, the query and each'document'is a set of elements, with a document being relevant if …

被引用次数：3 相关文章所有 4 个版本

[PDF] acm.org

Dothash: Estimating set similarity metrics for link prediction and document deduplication

I Nunes, M Heddes, P Vergés, D Abraham… - Proceedings of the 29th …, 2023 - dl.acm.org

Metrics for set similarity are a core aspect of several data mining tasks. To remove duplicate
results in a Web search, for example, a common approach looks at the Jaccard index …

被引用次数：9 相关文章所有 8 个版本

[PDF] arxiv.org

Fast Comparative Analysis of Merge Trees Using Locality Sensitive Hashing

W Lyu, R Sridharamurthy, JM Phillips… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Scalar field comparison is a fundamental task in scientific visualization. In topological data
analysis, we compare topological descriptors of scalar fields—such as persistence diagrams …

被引用次数：2 相关文章所有 9 个版本

[PDF] acm.org

Parallel index-based structural graph clustering and its approximation

T Tseng, L Dhulipala, J Shun - … of the 2021 International Conference on …, 2021 - dl.acm.org

SCAN (Structural Clustering Algorithm for Networks) is a well-studied, widely used graph
clustering algorithm. For large graphs, however, sequential SCAN variants are prohibitively …

被引用次数：27 相关文章所有 10 个版本

[PDF] ucl.ac.uk

Is it overkill? analyzing feature-space concept drift in malware detectors

Z Chen, Z Zhang, Z Kan, L Yang… - 2023 IEEE Security …, 2023 - ieeexplore.ieee.org

Concept drift is a major challenge faced by machine learning-based malware detectors
when deployed in practice. While existing works have investigated methods to detect …

被引用次数：8 相关文章所有 12 个版本

[PDF] acm.org

Weighted minwise hashing beats linear sketching for inner product estimation

A Bessa, M Daliri, J Freire, C Musco, C Musco… - Proceedings of the …, 2023 - dl.acm.org

We present a new approach for independently computing compact sketches that can be
used to approximate the inner product between pairs of high-dimensional vectors. Based on …

被引用次数：7 相关文章所有 6 个版本

An efficient and privacy-preserving range query over encrypted cloud data

W Wang, Y Jin, B Cao - … Conference on Privacy, Security & Trust …, 2022 - ieeexplore.ieee.org

The growing power of cloud computing prompts data owners to outsource their databases to
the cloud. In order to meet the demand of multi-dimensional data processing in big data era …

被引用次数：12 相关文章所有 2 个版本

高级搜索

QQ 群