Challenges for the repeatability of deep learning models

SS Alahmari, DB Goldgof, PR Mouton, LO Hall - IEEE Access, 2020 - ieeexplore.ieee.org
Deep learning training typically starts with a random sampling initialization approach to set
the weights of trainable layers. Therefore, different and/or uncontrolled weight initialization …

The foundations for provenance on the web

L Moreau - Foundations and Trends® in Web Science, 2010 - nowpublishers.com
Provenance, ie, the origin or source of something, is becoming an important concern, since it
offers the means to verify data products, to infer their quality, to analyse the processes that …

Provenance analytics for workflow-based computational experiments: A survey

W Oliveira, DD Oliveira, V Braganholo - ACM Computing Surveys (CSUR …, 2018 - dl.acm.org
Until not long ago, manually capturing and storing provenance from scientific experiments
were constant concerns for scientists. With the advent of computational experiments …

Securing big data provenance for auditors: The big data provenance black box as reliable evidence

D Appelbaum - Journal of emerging technologies in …, 2016 - publications.aaahq.org
The purpose of this article is to highlight a main issue regarding reliable audit evidence
derived from Big Data—that of secure data provenance. Traditionally, audit evidence …

On-demand minimum cost benchmarking for intermediate dataset storage in scientific cloud workflow systems

D Yuan, Y Yang, X Liu, J Chen - Journal of Parallel and Distributed …, 2011 - Elsevier
Many scientific workflows are data intensive: large volumes of intermediate datasets are
generated during their execution. Some valuable intermediate datasets need to be stored for …

[PDF][PDF] Techniques for efficiently querying scientific workflow provenance graphs.

MK Anand, S Bowers, B Ludäscher - EDBT, 2010 - cs.gonzaga.edu
ABSTRACT A key advantage of scientific workflow systems over traditional scripting
approaches is their ability to automatically record data and process dependencies …

A cost-effective strategy for intermediate data storage in scientific cloud workflow systems

D Yuan, Y Yang, X Liu, J Chen - 2010 IEEE international …, 2010 - ieeexplore.ieee.org
Many scientific workflows are data intensive where a large volume of intermediate data is
generated during their execution. Some valuable intermediate data need to be stored for …

Cleaning structured event logs: A graph repair approach

J Wang, S Song, X Lin, X Zhu… - 2015 IEEE 31st …, 2015 - ieeexplore.ieee.org
Event data are often dirty owing to various recording conventions or simply system errors.
These errors may cause many serious damages to real applications, such as inaccurate …

Similarity search for scientific workflows

J Starlinger, B Brancotte… - Proceedings of the …, 2014 - inria.hal.science
With the increasing popularity of scientific workflows, public repositories are gaining
importance as a means to share, find, and reuse such workflows. As the sizes of these …

A data dependency based strategy for intermediate data storage in scientific cloud workflow systems

D Yuan, Y Yang, X Liu, G Zhang… - … Practice and Experience, 2012 - Wiley Online Library
Many scientific workflows are data intensive where large volumes of intermediate data are
generated during their execution. Some valuable intermediate data need to be stored for …