Faa $ T: A transparent auto-scaling cache for serverless applications

F Romero, GI Chaudhry, Í Goiri, P Gopa… - Proceedings of the …, 2021 - dl.acm.org
Function-as-a-Service (FaaS) has become an increasingly popular way for users to deploy
their applications without the burden of managing the underlying infrastructure. However …

End-to-end optimization of machine learning prediction queries

K Park, K Saur, D Banda, R Sen, M Interlandi… - Proceedings of the …, 2022 - dl.acm.org
Prediction queries are widely used across industries to perform advanced analytics and
draw insights from data. They include a data processing part (eg, for joining, filtering …

[PDF][PDF] Comparative analysis of data visualization libraries Matplotlib and Seaborn in Python

AH Sial, SYS Rashdi, AH Khan - International Journal, 2021 - academia.edu
With the tremendous growth in the areas of computing, statistics, and mathematics has led to
the rise of the emerging field of expertise, named 'Data Science'. This paper focuses on the …

Extending relational query processing with ML inference

K Karanasos, M Interlandi, D Xin, F Psallidas… - arXiv preprint arXiv …, 2019 - arxiv.org
The broadening adoption of machine learning in the enterprise is increasing the pressure for
strict governance and cost-effective performance, in particular for the common and …

Improving reproducibility of data science pipelines through transparent provenance capture

L Rupprecht, JC Davis, C Arnold, Y Gur… - Proceedings of the …, 2020 - dl.acm.org
Data science has become prevalent in a large variety of domains. Inherent in its practice is
an exploratory, probing, and fact finding journey, which consists of the assembly, adaptation …

Efficient execution of user-defined functions in SQL queries

Y Foufoulas, A Simitsis - Proceedings of the VLDB Endowment, 2023 - dl.acm.org
User-defined functions (UDFs) have been widely used to overcome the expressivity
limitations of SQL and complement its declarative nature with functional capabilities. UDFs …

Vamsa: Automated provenance tracking in data science scripts

MH Namaki, A Floratou, F Psallidas… - Proceedings of the 26th …, 2020 - dl.acm.org
There has recently been a lot of ongoing research in the areas of fairness, bias and
explainability of machine learning (ML) models due to the self-evident or regulatory …

Lightweight inspection of data preprocessing in native machine learning pipelines

S Grafberger, J Stoyanovich, S Schelter - Conference on Innovative Data …, 2021 - par.nsf.gov
Machine Learning (ML) is increasingly used to automate impactful decisions, and the risks
arising from this wide-spread use are garnering attention from policy makers, scientists, and …

A dataset and analysis of open-source machine learning products

N Nahar, H Zhang, G Lewis, S Zhou… - arXiv preprint arXiv …, 2023 - arxiv.org
Machine learning (ML) components are increasingly incorporated into software products, yet
developers face challenges in transitioning from ML prototypes to products. Academic …

Mlinspect: A data distribution debugger for machine learning pipelines

S Grafberger, S Guha, J Stoyanovich… - Proceedings of the 2021 …, 2021 - dl.acm.org
Machine Learning (ML) is increasingly used to automate impactful decisions, and the risks
arising from this wide-spread use are garnering attention from policymakers, scientists, and …