DumpyOS: A data-adaptive multi-ary index for scalable data series similarity search

Z Wang, Q Wang, P Wang, T Palpanas, W Wang - The VLDB Journal, 2024 - Springer
Data series indexes are necessary for managing and analyzing the increasing amounts of
data series collections that are nowadays available. These indexes support both exact and …

Dumpy: A compact and adaptive index for large data series collections

Z Wang, Q Wang, P Wang, T Palpanas… - Proceedings of the ACM …, 2023 - dl.acm.org
Data series indexes are necessary for managing and analyzing the increasing amounts of
data series collections that are nowadays available. These indexes support both exact and …

BestNeighbor: efficient evaluation of kNN queries on large time series databases

O Levchenko, B Kolev, DE Yagoubi… - … and Information Systems, 2021 - Springer
This paper presents parallel solutions (developed based on two state-of-the-art algorithms
iSAX and sketch) for evaluating k nearest neighbor queries on large databases of time …

A “big-data” algorithm for KNN-PLS

M Metz, M Lesnoff, F Abdelghafour, R Akbarinia… - Chemometrics and …, 2020 - Elsevier
A well known issue regarding PLS lies in the difficulty to apprehend nonlinearities. As a
solution, an extension of the method, “KNN-PLS”, was developed. However, this solution is …

Spark-based platform for neurophysiological data storage and processing: a proof of concept

J Zheng, J Zhao, J Li, C Zhan… - 2021 6th International …, 2021 - ieeexplore.ieee.org
An internet of things (IoT) framework can be beneficial to the neural scientific research and
development for massive data acquisition, management and processing in a manner of …

Bridging the Gap Between Algorithmic and Learned Index Structures

A Hadian - 2022 - core.ac.uk
Index structures such as B-trees and bloom filters are the well-established petrol engines of
database systems. However, these structures do not fully exploit patterns in data distribution …

Clustering of time-series balance history data streams using Apache Spark

DQ Dat, PD Hung - … and Engineering: 17th International Conference, CDVE …, 2020 - Springer
Clustering customers, predicting account balances, scoring credits, detecting risk cash flows,
etc. are the problems that have been focused on research in the banking sector. With the …