Provenance and scientific workflows: challenges and opportunities

SB Davidson, J Freire - Proceedings of the 2008 ACM SIGMOD …, 2008 - dl.acm.org
Provenance in the context of workflows, both for the data they derive and for their
specification, is an essential component to allow for result reproducibility, sharing, and …

A systematic review of provenance systems

B Pérez, J Rubio, C Sáenz-Adán - Knowledge and Information Systems, 2018 - Springer
Provenance refers to the entire amount of information, comprising all the elements and their
relationships, that contribute to the existence of a piece of data. The knowledge of …

The foundations for provenance on the web

L Moreau - Foundations and Trends® in Web Science, 2010 - nowpublishers.com
Provenance, i.e., the origin or source of something, is becoming an important concern, since it
offers the means to verify data products, to infer their quality, to analyse the processes that …

Provenance as first class cloud data

KK Muniswamy-Reddy, M Seltzer - ACM SIGOPS Operating Systems …, 2010 - dl.acm.org
Digital provenance is meta-data that describes the ancestry or history of a digital object.
Most work on provenance focuses on how provenance increases the value of data to …

Provenance in databases

P Buneman, WC Tan - Proceedings of the 2007 ACM SIGMOD …, 2007 - dl.acm.org
The provenance of data has recently been recognized as central to the trust one places in
data. It is also important to annotation, to data integration and to probabilistic databases …

Efficient provenance storage

AP Chapman, HV Jagadish, P Ramanan - Proceedings of the 2008 ACM …, 2008 - dl.acm.org
As the world is increasingly networked and digitized, the data we store has more and more
frequently been chopped, baked, diced and stewed. In consequence, there is an increasing …

A provenance-based adaptive scheduling heuristic for parallel scientific workflows in clouds

D de Oliveira, KACS Ocaña, F Baião… - Journal of grid …, 2012 - Springer
In the last years, scientific workflows have emerged as a fundamental abstraction for
structuring and executing scientific experiments in computational environments. Scientific …

ProvDB: Lifecycle management of collaborative analysis workflows

H Miao, A Chavan, A Deshpande - Proceedings of the 2nd Workshop on …, 2017 - dl.acm.org
As data-driven methods are becoming pervasive in a wide variety of disciplines, there is an
urgent need to develop scalable and sustainable tools to simplify the process of data …

Provenance collection platform for the weather research and forecasting model

A Tufek, A Gurbuz, OF Ekuklu… - 2018 14th International …, 2018 - ieeexplore.ieee.org
Loss of life and property, disruptions to transportation and trading operations, etc. caused by
meteorological events increasingly highlight the importance of fast and accurate weather …

PROV-IO: An I/O-centric provenance framework for scientific data on HPC systems

R Han, S Byna, H Tang, B Dong, M Zheng - Proceedings of the 31st …, 2022 - dl.acm.org
Data provenance, or data lineage, describes the life cycle of data. In scientific workflows on
HPC systems, scientists often seek diverse provenance (e.g., origins of data products, usage …