A data science approach analysing the impact of injuries on basketball player and team performance

V Sarlis, V Chatziilias, C Tjortjis, D Mandalidis - Information Systems, 2021 - Elsevier
The sports industry utilizes science to improve short to long-term team and player
management regarding budget, health, tactics, training, and most importantly performance …

Elpis: Graph-based similarity search for scalable data science

I Azizi, K Echihabi, T Palpanas - Proceedings of the VLDB Endowment, 2023 - dl.acm.org
The recent popularity of learned embeddings has fueled the growth of massive collections of
high-dimensional (high-d) vectors that model complex data. Finding similar vectors in these …

Hercules against data series similarity search

K Echihabi, P Fatourou, K Zoumpatianos… - arXiv preprint arXiv …, 2022 - arxiv.org
We propose Hercules, a parallel tree-based technique for exact similarity search on massive
disk-based data series collections. We present novel index construction and query …

dcam: Dimension-wise class activation map for explaining multivariate data series classification

P Boniol, M Meftah, E Remy, T Palpanas - Proceedings of the 2022 …, 2022 - dl.acm.org
Data series classification is an important and challenging problem in data science.
Explaining the classification decisions by finding the discriminant parts of the input that led …

DumpyOS: A data-adaptive multi-ary index for scalable data series similarity search

Z Wang, Q Wang, P Wang, T Palpanas, W Wang - The VLDB Journal, 2024 - Springer
Data series indexes are necessary for managing and analyzing the increasing amounts of
data series collections that are nowadays available. These indexes support both exact and …

Secure KNN classification scheme based on homomorphic encryption for cyberspace

J Liu, C Wang, Z Tu, XA Wang, C Lin… - Security and …, 2021 - Wiley Online Library
With the advent of the intelligent era, more and more artificial intelligence algorithms are
widely used and a large number of user data are collected in the cloud server for sharing …

Deep learning embeddings for data series similarity search

Q Wang, T Palpanas - Proceedings of the 27th ACM SIGKDD …, 2021 - dl.acm.org
A key operation for the (increasingly large) data series collection analysis is similarity
search. According to recent studies, SAX-based indexes offer state-of-the-art performance …

Fast data series indexing for in-memory data

B Peng, P Fatourou, T Palpanas - The VLDB Journal, 2021 - Springer
Data series similarity search is a core operation for several data series analysis applications
across many different domains. However, the state-of-the-art techniques fail to deliver the …

High-dimensional similarity search for scalable data science

K Echihabi, K Zoumpatianos… - 2021 IEEE 37th …, 2021 - ieeexplore.ieee.org
Similarity search is a core operation of many critical data science applications, involving
massive collections of high-dimensional objects. Similarity search finds objects in a …

ProS: data series progressive k-NN similarity search and classification with probabilistic quality guarantees

K Echihabi, T Tsandilas, A Gogolou, A Bezerianos… - The VLDB Journal, 2023 - Springer
Existing systems dealing with the increasing volume of data series cannot guarantee
interactive response times, even for fundamental tasks such as similarity search. Therefore …