Hashing-accelerated graph neural networks for link prediction

W Wu, B Li, C Luo, W Nejdl - Proceedings of the Web Conference 2021, 2021 - dl.acm.org
Networks are ubiquitous in the real world. Link prediction, as one of the key problems for
network-structured data, aims to predict whether there exists a link between two nodes. The …

Discovering similarity inclusion dependencies

Y Kaminsky, EHM Pena, F Naumann - … of the ACM on Management of …, 2023 - dl.acm.org
Inclusion dependencies (INDs) are a well-known type of data dependency, specifying that
the values of one column are contained in those of another column. INDs can be used for …

ProvNet: Networked bi-directional blockchain for data sharing with verifiable provenance

C Chenli, W Tang, F Gomulka, T Jung - Journal of Parallel and Distributed …, 2022 - Elsevier
Data sharing is increasingly popular especially for scientific research and business fields
where large volume of datasets need to be used, but it involves data security and privacy …

Locality sensitive hashing in fourier frequency domain for soft set containment search

I Roy, R Agarwal, S Chakrabarti… - Advances in Neural …, 2023 - proceedings.neurips.cc
In many search applications related to passage retrieval, text entailment, and subgraph
search, the query and each'document'is a set of elements, with a document being relevant if …

Dothash: Estimating set similarity metrics for link prediction and document deduplication

I Nunes, M Heddes, P Vergés, D Abraham… - Proceedings of the 29th …, 2023 - dl.acm.org
Metrics for set similarity are a core aspect of several data mining tasks. To remove duplicate
results in a Web search, for example, a common approach looks at the Jaccard index …

Fast Comparative Analysis of Merge Trees Using Locality Sensitive Hashing

W Lyu, R Sridharamurthy, JM Phillips… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Scalar field comparison is a fundamental task in scientific visualization. In topological data
analysis, we compare topological descriptors of scalar fields—such as persistence diagrams …

Parallel index-based structural graph clustering and its approximation

T Tseng, L Dhulipala, J Shun - … of the 2021 International Conference on …, 2021 - dl.acm.org
SCAN (Structural Clustering Algorithm for Networks) is a well-studied, widely used graph
clustering algorithm. For large graphs, however, sequential SCAN variants are prohibitively …

Is it overkill? analyzing feature-space concept drift in malware detectors

Z Chen, Z Zhang, Z Kan, L Yang… - 2023 IEEE Security …, 2023 - ieeexplore.ieee.org
Concept drift is a major challenge faced by machine learning-based malware detectors
when deployed in practice. While existing works have investigated methods to detect …

Weighted minwise hashing beats linear sketching for inner product estimation

A Bessa, M Daliri, J Freire, C Musco, C Musco… - Proceedings of the …, 2023 - dl.acm.org
We present a new approach for independently computing compact sketches that can be
used to approximate the inner product between pairs of high-dimensional vectors. Based on …

An efficient and privacy-preserving range query over encrypted cloud data

W Wang, Y Jin, B Cao - … Conference on Privacy, Security & Trust …, 2022 - ieeexplore.ieee.org
The growing power of cloud computing prompts data owners to outsource their databases to
the cloud. In order to meet the demand of multi-dimensional data processing in big data era …