BigDAWG polystore query optimization through semantic equivalences

Z She, S Ravishankar, J Duggan - 2016 IEEE High …, 2016 - ieeexplore.ieee.org
A polystore system evaluates queries that span multiple disparate data models; this
character introduces a unique query optimization challenge. Specialized database engines …

Scalable algorithms for nearest-neighbor joins on big trajectory data

Y Fang, R Cheng, W Tang, S Maniu… - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
Trajectory data are prevalent in systems that monitor the locations of moving objects. In a
location-based service, for instance, the positions of vehicles are continuously monitored …

Similarity join over array data

W Zhao, F Rusu, B Dong, K Wu - Proceedings of the 2016 International …, 2016 - dl.acm.org
Scientific applications are generating an ever-increasing volume of multi-dimensional data
that are largely processed inside distributed array databases and frameworks. Similarity join …

Adaptive Quotient Filters

R Wen, H McCoy, D Tench, G Tagliavini… - Proceedings of the …, 2024 - dl.acm.org
Filters trade off accuracy for space and occasionally return false positive matches with a
bounded error. Numerous systems use filters in fast memory to avoid performing expensive …

Cross-engine query execution in federated database systems

AM Gupta, V Gadepally… - 2016 IEEE High …, 2016 - ieeexplore.ieee.org
We have developed a reference implementation of the BigDAWG system: a new architecture
for future Big Data applications, guided by the philosophy that “one size does not fit all” …

Incremental view maintenance over array data

W Zhao, F Rusu, B Dong, K Wu, P Nugent - Proceedings of the 2017 …, 2017 - dl.acm.org
Science applications are producing an ever-increasing volume of multi-dimensional data
that are mainly processed with distributed array databases. These raw arrays …

Lachesis: automatic partitioning for UDF-centric analytics

J Zou, A Das, P Barhate, A Iyengar, B Yuan… - arXiv preprint arXiv …, 2020 - arxiv.org
Persistent partitioning is effective in avoiding expensive shuffling operations. However it
remains a significant challenge to automate this process for Big Data analytics workloads …

Multidimensional array data management

F Rusu - Foundations and Trends® in Databases, 2023 - nowpublishers.com
Multidimensional arrays are a fundamental abstraction to represent data across scientific
domains ranging from astronomy to genetics, medicine, business intelligence, and …

ArrayBridge: Interweaving declarative array processing in SciDB with imperative HDF5-based programs

H Xing, S Floratos, S Blanas, S Byna… - 2018 IEEE 34th …, 2018 - ieeexplore.ieee.org
Scientists are increasingly turning to datacenter-scale computers to analyze massive arrays.
Despite decades of database research that extols the virtues of declarative query …

Optimising queries for pattern detection over large scale temporally evolving graphs

HN Chaudhry, M Rossi - IEEE Access, 2024 - ieeexplore.ieee.org
Large-scale graph processing and Stream processing are two distinct computational
paradigms for big data processing. Graph processing deals with computation on graphs of …