Towards efficient index construction and approximate nearest neighbor search in high-dimensional spaces

X Zhao, Y Tian, K Huang, B Zheng, X Zhou - Proceedings of the VLDB …, 2023 - dl.acm.org
The approximate nearest neighbor (ANN) search in high-dimensional spaces is a
fundamental but computationally very expensive problem. Many methods have been …

Scalable feature selection using ReliefF aided by locality‐sensitive hashing

C Eiras‐Franco, B Guijarro‐Berdiñas… - … Journal of Intelligent …, 2021 - Wiley Online Library
Feature selection algorithms, such as ReliefF, are very important for processing high‐
dimensionality data sets. However, widespread use of popular and effective such algorithms …

Graph Neural Networks for Tabular Data Learning: A Survey with Taxonomy and Directions

CT Li, YC Tsai, CY Chen, JC Liao - arXiv preprint arXiv:2401.02143, 2024 - arxiv.org
In this survey, we dive into Tabular Data Learning (TDL) using Graph Neural Networks
(GNNs), a domain where deep learning-based approaches have increasingly shown …

VBLSH: Volume-balancing locality-sensitive hashing algorithm for K-nearest neighbors search

S Zhang, H Lai, W Chen, L Zhang, X Lin, R Xiao - Information Sciences, 2022 - Elsevier
K-nearest neighbors search (K-NNS) is a fundamental problem in many areas of machine
learning and data mining. In an attempt to solve NNS problems by locality-sensitive hashing …

ONION: Online Semantic Autoencoder Hashing for Cross-Modal Retrieval

D Zhang, XJ Wu, G Chen - ACM Transactions on Intelligent Systems and …, 2023 - dl.acm.org
Cross-modal hashing (CMH) has recently received increasing attention with the merit of
speed and storage in performing large-scale cross-media similarity search. However, most …

Entropy Causal Graphs for Multivariate Time Series Anomaly Detection

FG Febrinanto, K Moore, C Thapa, M Liu… - arXiv preprint arXiv …, 2023 - arxiv.org
Many multivariate time series anomaly detection frameworks have been proposed and
widely applied. However, most of these frameworks do not consider intrinsic relationships …

Adaptive incremental transfer learning for efficient performance modeling of big data workloads

M Garralda-Barrio, C Eiras-Franco… - Future Generation …, 2025 - Elsevier
The rise of data-intensive scalable computing systems, such as Apache Spark, has
transformed data processing by enabling the efficient manipulation of large datasets across …

Determinants of diffuse solar radiation in urban and peatland areas based on weather and air pollutants

AL Latifah, AA Auliya, I Syafarina… - Journal of Atmospheric …, 2025 - Elsevier
Understanding solar radiation variability is essential for efficiently planning and managing
solar energy systems. The transmission of solar radiation to the ground is generally affected …

Efficient exact k-nearest neighbor graph construction for billion-scale datasets using gpus with tensor cores

Z Ji, CL Wang - Proceedings of the 36th ACM International Conference …, 2022 - dl.acm.org
Approximate nearest neighbor search plays a fundamental role in many areas, and the k-
nearest neighbor graph (KNNG) becomes a promising solution, especially in high …

Toward more efficient locality‐sensitive hashing via constructing novel hash function cluster

S Zhang, J Huang, R Xiao, X Du… - Concurrency and …, 2021 - Wiley Online Library
Locality‐sensitive hashing (LSH) is widely used in the context of nearest neighbor search of
large‐scale high‐dimensions. However, there are serious imbalance problems between the …