Algebraic data integration

P Schultz, R Wisnesky - Journal of Functional Programming, 2017 - cambridge.org
In this paper, we develop an algebraic approach to data integration by combining
techniques from functional programming, category theory, and database theory. In our …

Fast left kan extensions using the chase

J Meyers, DI Spivak, R Wisnesky - Journal of Automated Reasoning, 2022 - Springer
We show how computation of left Kan extensions can be reduced to computation of free
models of cartesian (finite-limit) theories. We discuss how the standard and parallel chase …

Algebraic property graphs

J Shinavier, R Wisnesky, JG Meyers - arXiv preprint arXiv:1909.04881, 2019 - arxiv.org
We present a case study in applied category theory written from the point of view of an
applied domain: the formalization of the widely-used property graphs data model in an …

Compositionality and functorial invariants in machine learning

D Shiebler - 2023 - ora.ox.ac.uk
The objective of this thesis is to show that studying the underlying compositional and
functorial structure in machine learning systems allows us to better understand them. In …

Cartesian institutions with evidence: Data and system modelling with diagrammatic constraints and generalized sketches

Z Diskin - arXiv preprint arXiv:2306.16284, 2023 - arxiv.org
Data constraints are fundamental for practical data modelling, and a verifiable conformance
of a data instance to a safety-critical constraint (satisfaction relation) is a corner-stone of …

Kan Extensions in Data Science and Machine Learning

D Shiebler - arXiv preprint arXiv:2203.09018, 2022 - arxiv.org
A common problem in data science is" use this function defined over this small set to
generate predictions over that larger set." Extrapolation, interpolation, statistical inference …

Modelling and Management of Multi-Model Data

P Koupil - 2022 - dspace.cuni.cz
Title: Modelling and Management of Multi-Model Data Author: Pavel Koupil (Čontoš)
Department: Department of Software Engineering Supervisor: doc. RNDr. Irena Holubová …

Functorial model management

S Breiner, B Pollard, E Subrahmanian - Proceedings of the Design …, 2019 - cambridge.org
In this paper we use formal tools from category theory to develop a foundation for creating
and managing models in systems where knowledge is distributed across multiple …

Informal data transformation considered harmful

E Daimler, R Wisnesky - arXiv preprint arXiv:2001.00338, 2020 - arxiv.org
In this paper we take the common position that AI systems are limited more by the integrity of
the data they are learning from than the sophistication of their algorithms, and we take the …

An algebraic theory for data linkage

LT Chen, M Roggenbach, JV Tucker - … , WADT 2018, Egham, UK, July 2–5 …, 2019 - Springer
There are countless sources of data available to governments, companies, and citizens,
which can be combined for good or evil. We analyse the concepts of combining data from …