Flash-llm: Enabling cost-effective and highly-efficient large generative model inference with unstructured sparsity

H Xia, Z Zheng, Y Li, D Zhuang, Z Zhou, X Qiu… - arXiv preprint arXiv …, 2023 - arxiv.org
With the fast growth of parameter size, it becomes increasingly challenging to deploy large
generative models as they typically require large GPU memory consumption and massive …

Better together: Unifying datalog and equality saturation

Y Zhang, YR Wang, O Flatt, D Cao, P Zucker… - Proceedings of the …, 2023 - dl.acm.org
We present egglog, a fixpoint reasoning system that unifies Datalog and equality saturation
(EqSat). Like Datalog, egglog supports efficient incremental execution, cooperating …

Building a compiled query engine in python

H Shahrokhi, A Shaikhha - Proceedings of the 32nd ACM SIGPLAN …, 2023 - dl.acm.org
The simplicity of Python and its rich set of libraries has made it the most popular language
for data science. Moreover, the interpreted nature of Python offers an easy debugging …

Efficient Model-Relational Data Management: Challenges and Opportunities

V Sanca, A Ailamaki - IEEE Transactions on Knowledge and …, 2024 - ieeexplore.ieee.org
As modern data pipelines continue to collect, produce, and store various data formats,
extracting and combining value from traditional and context-rich sources becomes …

Compiling structured tensor algebra

M Ghorbani, M Huot, S Hashemian… - Proceedings of the ACM …, 2023 - dl.acm.org
Tensor algebra is essential for data-intensive workloads in various computational domains.
Computational scientists face a trade-off between the specialization degree provided by …

Finch: Sparse and Structured Array Programming with Control Flow

W Ahrens, TF Collin, R Patel, K Deeds, C Hong… - arXiv preprint arXiv …, 2024 - arxiv.org
From FORTRAN to NumPy, arrays have revolutionized how we express computation.
However, arrays in these, and almost all prominent systems, can only handle dense …

Data Management in the Noisy Intermediate-Scale Quantum Era

R Hai, SH Hung, T Coopmans, F Geerts - arXiv preprint arXiv:2409.14111, 2024 - arxiv.org
Quantum computing has emerged as a promising tool for transforming the landscape of
computing technology. Recent efforts have applied quantum techniques to classical …

Fine-tuning data structures for query processing

A Shaikhha, M Kelepeshis, M Ghorbani - Proceedings of the 21st ACM …, 2023 - dl.acm.org
We introduce a framework for automatically choosing data structures for efficient query
processing. Our contributions are twofold. First, we introduce a novel low-level intermediate …

The Duck's Brain: Training and Inference of Neural Networks in Modern Database Engines

ME Schüle, T Neumann, A Kemper - arXiv preprint arXiv:2312.17355, 2023 - arxiv.org
Although database systems perform well in data access and manipulation, their relational
model hinders data scientists from formulating machine learning algorithms in SQL …

A Tensor Algebra Compiler for Sparse Differentiation

A Shaikhha, M Huot… - 2024 IEEE/ACM …, 2024 - ieeexplore.ieee.org
Sparse tensors are prevalent in many data-intensive applications. However, existing
automatic differentiation (AD) frameworks are tailored towards dense tensors, which makes …