Achieving efficient and privacy-preserving set containment search over encrypted data

Y Zheng, R Lu, Y Guan, J Shao… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Set containment search, which aims to retrieve all set records containing a specific query
set, has received considerable attention. Meanwhile, due to the dramatic growth of data …

Adaptive top-k overlap set similarity joins

Z Yang, B Zheng, G Li, X Zhao, X Zhou… - 2020 IEEE 36th …, 2020 - ieeexplore.ieee.org
The set similarity join (SSJ) is core functionality in a range of applications, including data
cleaning, near-duplicate object detection, and data integration. Threshold-based SSJ …

Exploiting GPUs for fast intersection of large sets

C Bellas, A Gounaris - Information Systems, 2022 - Elsevier
The main focus of this work is on large set intersection, which is a pivotal operation in
information retrieval, graph analytics and database systems. We aim to experimentally …

Trie and LOUDS hybrid model for efficient e-commerce processing in cloud environment

L Jia, S Li, Y Zhang, Y Chen, X Yuan, J Ding - … Modelling Practice and …, 2024 - Elsevier
Set superset query is widely used in e-commerce processing and many other domains,
particularly in cloud computing environments. Indexing is an efficient way to model e …

LES3: Learning-based exact set similarity search

Y Li, X Yu, N Koudas - arXiv preprint arXiv:2107.10417, 2021 - arxiv.org
Set similarity search is a problem of central interest to a wide variety of applications such as
data cleaning and web search. Past approaches on set similarity search utilize either heavy …

Clustering geospatial data for multiple reference points

Y Zhong, J Li, S Zhu - IEEE Access, 2019 - ieeexplore.ieee.org
Data clustering plays a significant role in geospatial data management and analytics. In this
light, we propose and study a novel geospatial data clustering method for multiple reference …

Siesta: A scalable infrastructure of sequential pattern analysis

I Mavroudopoulos, A Gounaris - IEEE Transactions on Big Data, 2022 - ieeexplore.ieee.org
Sequential pattern analysis has become a mature topic with a lot of techniques for a variety
of sequential pattern mining-related problems. Moreover, tailored solutions for specific …

KOIOS: Top-k Semantic Overlap Set Search

P Mundra, J Zhang, F Nargesian… - 2023 IEEE 39th …, 2023 - ieeexplore.ieee.org
We study the top-k set similarity search problem using semantic overlap. While vanilla
overlap requires exact matches between set elements, semantic overlap allows elements …

Neighborhood skyline on graphs: Concepts, algorithms and applications

Q Zhang, RH Li, H Qin, Y Dai, Y Yuan… - 2023 IEEE 39th …, 2023 - ieeexplore.ieee.org
Neighborhood inclusion, representing that all the neighbors of a vertex are also adjacent to
another vertex, has been recognized as an important relationship between two vertices in a …

Freshjoin: An efficient and adaptive algorithm for set containment join

J Luo, W Zhang, S Shi, H Gao, J Li, W Wu… - Data Science and …, 2019 - Springer
This paper revisits set containment join (SCJ) problem, which uses the subset relationship
(i.e., ⊆) as condition to join set-valued attributes of two relations and has many …