GeCo: Quality counterfactual explanations in real time

M Schleich, Z Geng, Y Zhang, D Suciu - arXiv preprint arXiv:2101.01292, 2021 - arxiv.org
Machine learning is increasingly applied in high-stakes decision making that directly affects
people's lives, and this leads to an increased demand for systems to explain their decisions …

Data management for machine learning: A survey

C Chai, J Wang, Y Luo, Z Niu… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Machine learning (ML) has widespread applications and has revolutionized many
industries, but suffers from several challenges. First, sufficient high-quality training data is …

Approaching sales forecasting using recurrent neural networks and transformers

I Vallés-Pérez, E Soria-Olivas, M Martínez-Sober… - Expert Systems with …, 2022 - Elsevier
Accurate and fast demand forecast is one of the hot topics in supply chain for enabling the
precise execution of the corresponding downstream processes (inbound and outbound …

In-database machine learning with SQL on GPUs

M Schule, H Lang, M Springer, A Kemper… - Proceedings of the 33rd …, 2021 - dl.acm.org
In machine learning, continuously retraining a model guarantees accurate predictions based
on the latest data as training input. But to retrieve the latest data from a database, time …

Architecting intermediate layers for efficient composition of data management and machine learning systems

S Abeysinghe, F Wang, G Essertel, T Rompf - arXiv preprint arXiv …, 2023 - arxiv.org
Modern data analytics workloads combine relational data processing with machine learning
(ML). Most DBMS handle these workloads by offloading these ML operations to external …

Optimizing tensor programs on flexible storage

M Schleich, A Shaikhha, D Suciu - … of the ACM on Management of Data, 2023 - dl.acm.org
Tensor programs often need to process large tensors (vectors, matrices, or higher order
tensors) that require a specialized storage format for their memory layout. Several such …

Identifying insufficient data coverage in databases with multiple relations

Y Lin, Y Guan, A Asudeh, HV Jagadish - Proceedings of the VLDB …, 2020 - par.nsf.gov
In today's data-driven world, it is critical that we use appropriate datasets for analysis and
decision-making. Datasets could be biased because they reflect existing inequalities in the …

Functional collection programming with semi-ring dictionaries

A Shaikhha, M Huot, J Smith, D Olteanu - Proceedings of the ACM on …, 2022 - dl.acm.org
This paper introduces semi-ring dictionaries, a powerful class of compositional and purely
functional collections that subsume other collection types such as sets, multisets, arrays …

Indexed Streams: A Formal Intermediate Representation for Fused Contraction Programs

S Kovach, P Kolichala, T Gu, F Kjolstad - Proceedings of the ACM on …, 2023 - dl.acm.org
We introduce indexed streams, a formal operational model and intermediate representation
that describes the fused execution of a contraction language that encompasses both sparse …

Optimizing in-memory database engine for AI-powered on-line decision augmentation using persistent memory

C Chen, J Yang, M Lu, T Wang, Z Zheng… - Proceedings of the …, 2021 - dl.acm.org
On-line decision augmentation (OLDA) has been considered as a promising paradigm for
real-time decision making powered by Artificial Intelligence (AI). OLDA has been widely …