Spatial Query Optimization With Learning

X Zhang, A Eldawy - Proceedings of the VLDB Endowment, 2024 - dl.acm.org
Query optimization is a key component in database management systems (DBMS) and
distributed data processing platforms. Recent research in the database community …

Wisk: A workload-aware learned index for spatial keyword queries

Y Sheng, X Cao, Y Fang, K Zhao, J Qi, G Cong… - Proceedings of the …, 2023 - dl.acm.org
Spatial objects often come with textual information, such as Points of Interest (POIs) with
their descriptions, which are referred to as geo-textual data. To retrieve such data, spatial …

[HTML][HTML] Theoretical analysis of learned database operations under distribution shift through distribution learnability

S Zeighami, C Shahabi - Proceedings of machine learning …, 2024 - pmc.ncbi.nlm.nih.gov
Use of machine learning to perform database operations, such as indexing, cardinality
estimation, and sorting, is shown to provide substantial performance benefits. However …

Neurosketch: Fast and approximate evaluation of range aggregate queries with neural networks

S Zeighami, C Shahabi, V Sharan - … of the ACM on Management of Data, 2023 - dl.acm.org
Range aggregate queries (RAQs) are an integral part of many real-world applications,
where, often, fast and approximate answers for the queries are desired. Recent work has …

A Practical Theory of Generalization in Selectivity Learning

P Wu, H Xu, R Marcus, ZG Ives - arXiv preprint arXiv:2409.07014, 2024 - arxiv.org
Query-driven machine learning models have emerged as a promising estimation technique
for query selectivities. Yet, surprisingly little is known about the efficacy of these techniques …

Toward interpretable and actionable data analysis with explanations and causality

S Roy - Proceedings of the VLDB Endowment, 2022 - dl.acm.org
We live in a world dominated by data, where users from different fields routinely collect,
study, and make decisions supported by data. To aid these users, the current trend in data …

In-database query optimization on SQL with ML predicates

Y Guo, G Li, R Hu, Y Wang - The VLDB Journal, 2025 - Springer
Extended SQL with machine learning (ML) predicates, commonly referred to as SQL+ ML,
integrates ML abilities into traditional SQL processing in databases. When processing SQL+ …

PolyCard: A learned cardinality estimator for intersection queries on spatial polygons

Y Ji, D Amagata, Y Sasaki, T Hara - Journal of Intelligent Information …, 2025 - Springer
How can we estimate the result size for a given query on complex spatial objects like
polygons? Estimating a query's result size, also known as the cardinality estimation, plays a …

MulRF: A Multi-dimensional Range Filter for Sublinear Time Range Query Processing

S Han, X Liu, J Li - IEEE Transactions on Knowledge and Data …, 2024 - ieeexplore.ieee.org
Range query is an important operation on big multi-dimensional data. This paper studies the
problem of multi-dimensional range query filtering for speeding up the range query …

Computing data distribution from query selectivities

PK Agarwal, R Raychaudhury, S Sintos… - arXiv preprint arXiv …, 2024 - arxiv.org
We are given a set $\mathcal {Z}=\{(R_1, s_1),\ldots,(R_n, s_n)\} $, where each $ R_i $ is
a\emph {range} in $\Re^ d $, such as rectangle or ball, and $ s_i\in [0, 1] $ denotes its\emph …