Provenance data in the machine learning lifecycle in computational science and engineering

R Souza, L Azevedo, V Lourenço… - 2019 IEEE/ACM …, 2019 - ieeexplore.ieee.org
Machine Learning (ML) has become essential in several industries. In Computational
Science and Engineering (CSE), the complexity of the ML lifecycle comes from the large …

Workflow provenance in the lifecycle of scientific machine learning

R Souza, LG Azevedo, V Lourenço… - Concurrency and …, 2022 - Wiley Online Library
Abstract Machine learning (ML) has already fundamentally changed several businesses.
More recently, it has also been profoundly impacting the computational science and …

Executing cyclic scientific workflows in the cloud

M Krämer, HM Würz, C Altenhofen - Journal of Cloud Computing, 2021 - Springer
We present an algorithm and a software architecture for a cloud-based system that executes
cyclic scientific workflows whose structure may change during run time. Existing approaches …

A novel approach to provenance management for privacy preservation

O Can, D Yilmazer - Journal of Information Science, 2020 - journals.sagepub.com
Provenance determines the origin of the data by tracing and recording the actions that are
performed on the data. Therefore, provenance is used in many fields to ensure the reliability …

Keeping track of user steering actions in dynamic workflows

R Souza, V Silva, JJ Camata, ALGA Coutinho… - Future Generation …, 2019 - Elsevier
In long-lasting scientific workflow executions in HPC machines, computational scientists (the
users in this work) often need to fine-tune several workflow parameters. These tunings are …

Provenance of dynamic adaptations in user-steered dataflows

R Souza, M Mattoso - Provenance and Annotation of Data and Processes …, 2018 - Springer
Due to the exploratory nature of scientific experiments, computational scientists need to
steer dataflows running on High-Performance Computing (HPC) machines by tuning …

[HTML][HTML] Distributed in-memory data management for workflow executions

R Souza, V Silva, AAB Lima, D de Oliveira… - PeerJ Computer …, 2021 - peerj.com
Complex scientific experiments from various domains are typically modeled as workflows
and executed on large-scale machines using a Parallel Workflow Management System …

[PDF][PDF] Towards a Human-in-the-Loop Library for Tracking Hyperparameter Tuning in Deep Learning Development.

R Souza, L Neves, L Azevedo, R Luiz, E Tady… - LADaS …, 2018 - researchgate.net
The development lifecycle of Deep Learning (DL) models requires humans (the model
trainers) to analyze and steer the training evolution. They analyze intermediate data, fine …

Software tools to enable immersive simulation

F Newberry, C Wetterer-Nelson, JA Evans… - Engineering with …, 2022 - Springer
There are two main avenues to design space exploration. In the first approach, a simulation
is run, analyzed, the problem modified, and the simulation run again. In the second …

Cross-systems multi-level data pipelines optimization for predicting sunspot emergence

R Kapoor - 2024 - aaltodoc.aalto.fi
The proliferation of big data pipelines has spurred collaborative efforts across multiple
disciplines to explore the intricacies of those domains. One notable collaboration involves …