Landscape of High-Performance Python to Develop Data Science and Machine Learning Applications

O Castro, P Bruneau, JS Sottet… - ACM Computing Surveys, 2023 - dl.acm.org
Python has become the prime language for application development in the data science and
machine learning domains. However, data scientists are not necessarily experienced …

Efficient execution of user-defined functions in SQL queries

Y Foufoulas, A Simitsis - Proceedings of the VLDB Endowment, 2023 - dl.acm.org
User-defined functions (UDFs) have been widely used to overcome the expressivity
limitations of SQL and complement its declarative nature with functional capabilities. UDFs …

YeSQL: " you extend SQL" with rich and highly performant user-defined functions in relational databases

Y Foufoulas, A Simitsis, L Stamatogiannakis… - Proceedings of the …, 2022 - dl.acm.org
The diversity and complexity of modern data management applications have led to the
extension of the relational paradigm with syntactic and semantic support for User-Defined …

[PDF][PDF] Daphne: An open and extensible system infrastructure for integrated data analysis pipelines

P Damme, M Birkenbach, C Bitsakos… - … on Innovative Data …, 2022 - pure.itu.dk
Integrated data analysis (IDA) pipelines---that combine data management (DM) and query
processing, high-performance computing (HPC), and machine learning (ML) training and …

Bladedisc: Optimizing dynamic shape machine learning workloads via compiler approach

Z Zheng, Z Pan, D Wang, K Zhu, W Zhao… - Proceedings of the …, 2023 - dl.acm.org
Compiler optimization plays an increasingly important role to boost the performance of
machine learning models for data processing and management. With increasingly complex …

Building a compiled query engine in python

H Shahrokhi, A Shaikhha - Proceedings of the 32nd ACM SIGPLAN …, 2023 - dl.acm.org
The simplicity of Python and its rich set of libraries has made it the most popular language
for data science. Moreover, the interpreted nature of Python offers an easy debugging …

User-defined functions in modern data engines

Y Foufoulas, A Simitsis - 2023 IEEE 39th International …, 2023 - ieeexplore.ieee.org
Modern data management applications involve complex processing tasks over large
volumes of data. Although this falls naturally within the scope of relational databases, many …

Predicate pushdown for data science pipelines

C Yan, Y Lin, Y He - Proceedings of the ACM on Management of Data, 2023 - dl.acm.org
Predicate pushdown is a widely adopted query optimization. Existing systems and prior work
mostly use pattern-matching rules to decide when a predicate can be pushed through …

Containerized execution of UDFs: an experimental evaluation

K Saur, T Mirmira, K Karanasos… - Proceedings of the …, 2022 - dl.acm.org
User-defined functions (UDFs) have long been used as the de facto way to extend the
capabilities of data management systems. However, they are restricted to the specificities of …

[PDF][PDF] Accelerating python udfs in vectorized query execution

S Kläbe - 2022 - db-thueringen.de
Modern analytical database systems offer support for user-defined funtions as a flexible
extension to SQL. Python is one of the most popular UDF languages being easy to use and …