Amazon Redshift re-invented

N Armenatzoglou, S Basu, N Bhanoori, M Cai… - Proceedings of the …, 2022 - dl.acm.org
In 2013, AmazonWeb Services revolutionized the data warehousing industry by launching
Amazon Redshift, the first fully-managed, petabyte-scale, enterprise-grade cloud data …

Photon: A fast query engine for lakehouse systems

A Behm, S Palkar, U Agarwal, T Armstrong… - Proceedings of the …, 2022 - dl.acm.org
Many organizations are shifting to a data management paradigm called the" Lakehouse,"
which implements the functionality of structured data warehouses on top of unstructured …

End-to-end optimization of machine learning prediction queries

K Park, K Saur, D Banda, R Sen, M Interlandi… - Proceedings of the …, 2022 - dl.acm.org
Prediction queries are widely used across industries to perform advanced analytics and
draw insights from data. They include a data processing part (eg, for joining, filtering …

Cerebro: A platform for {Multi-Party} cryptographic collaborative learning

W Zheng, R Deng, W Chen, RA Popa… - 30th USENIX Security …, 2021 - usenix.org
Many organizations need large amounts of high quality data for their applications, and one
way to acquire such data is to combine datasets from multiple parties. Since these …

EVA: A symbolic approach to accelerating exploratory video analytics with materialized views

Z Xu, GT Kakkar, J Arulraj… - Proceedings of the 2022 …, 2022 - dl.acm.org
Advances in deep learning have led to a resurgence of interest in video analytics. In an
exploratory video analytics pipeline, a data scientist often starts by searching for a global …

Northstar: An interactive data science system

T Kraska - 2021 - dspace.mit.edu
© 2018 VLDB Endowment. In order to democratize data science, we need to fundamentally
rethink the current analytics stack, from the user interface to the “guts.“Most importantly …

Tidy Tuples and Flying Start: fast compilation and fast execution of relational queries in Umbra

T Kersten, V Leis, T Neumann - The VLDB Journal, 2021 - Springer
Although compiling queries to efficient machine code has become a common approach for
query execution, a number of newly created database system projects still refrain from using …

A tensor compiler for unified machine learning prediction serving

S Nakandala, K Saur, GI Yu, K Karanasos… - … USENIX Symposium on …, 2020 - usenix.org
Machine Learning (ML) adoption in the enterprise requires simpler and more efficient
software infrastructure—the bespoke solutions typical in large web companies are simply …

Babelfish: Efficient execution of polyglot queries

PM Grulich, S Zeuch, V Markl - Proceedings of the VLDB Endowment, 2021 - dl.acm.org
Today's users of data processing systems come from different domains, have different levels
of expertise, and prefer different programming languages. As a result, analytical workload …

Efficient execution of user-defined functions in SQL queries

Y Foufoulas, A Simitsis - Proceedings of the VLDB Endowment, 2023 - dl.acm.org
User-defined functions (UDFs) have been widely used to overcome the expressivity
limitations of SQL and complement its declarative nature with functional capabilities. UDFs …