Toward the Geoscience Paper of the Future: Best practices for documenting and sharing research from data to software to provenance

Y Gil, CH David, I Demir, BT Essawy… - Earth and Space …, 2016 - Wiley Online Library
Geoscientists now live in a world rich with digital data and methods, and their computational
research cannot be fully captured in traditional publications. The Geoscience Paper of the …

OpenCitations, an infrastructure organization for open scholarship

S Peroni, D Shotton - Quantitative Science Studies, 2020 - direct.mit.edu
OpenCitations is an infrastructure organization for open scholarship dedicated to the
publication of open citation data as Linked Open Data using Semantic Web technologies …

A survey on collecting, managing, and analyzing provenance from scripts

JF Pimentel, J Freire, L Murta… - ACM Computing Surveys …, 2019 - dl.acm.org
Scripts are widely used to design and run scientific experiments. Scripting languages are
easy to learn and use, and they allow complex tasks to be specified and executed in fewer …

Production machine learning pipelines: Empirical analysis and optimization opportunities

D Xin, H Miao, A Parameswaran… - Proceedings of the 2021 …, 2021 - dl.acm.org
Machine learning (ML) is now commonplace, powering data-driven applications in various
organizations. Unlike the traditional perception of ML in research, ML production pipelines …

Human-agent collectives

NR Jennings, L Moreau, D Nicholson… - Communications of the …, 2014 - dl.acm.org
Human-agent collectives Page 1 80 COMMUNICATIONS OF THE ACM | DECEMBER 2014 |
VOL. 57 | NO. 12 review articles DOI:10.1145/2629559 HACs offer a new science for exploring …

Lightweight distributed provenance model for complex real–world environments

R Wittner, C Mascia, M Gallo, F Frexia, H Müller… - Scientific Data, 2022 - nature.com
Provenance is information describing the lineage of an object, such as a dataset or
biological material. Since these objects can be passed between organizations, each …

YesWorkflow: a user-oriented, language-independent tool for recovering workflow information from scripts

T McPhillips, T Song, T Kolisnik, S Aulenbach… - arXiv preprint arXiv …, 2015 - arxiv.org
Scientific workflow management systems offer features for composing complex
computational pipelines from modular building blocks, for executing the resulting automated …

[HTML][HTML] Newsreader: Using knowledge resources in a cross-lingual reading machine to generate more knowledge from massive streams of news

P Vossen, R Agerri, I Aldabe, A Cybulska… - Knowledge-Based …, 2016 - Elsevier
In this article, we describe a system that reads news articles in four different languages and
detects what happened, who is involved, where and when. This event-centric information is …

A disaster response system based on human-agent collectives

SD Ramchurn, TD Huynh, F Wu, Y Ikuno, J Flann… - Journal of Artificial …, 2016 - jair.org
Major natural or man-made disasters such as Hurricane Katrina or the 9/11 terror attacks
pose significant challenges for emergency responders. First, they have to develop an …

Capturing and querying fine-grained provenance of preprocessing pipelines in data science

A Chapman, P Missier, G Simonelli… - Proceedings of the VLDB …, 2020 - dl.acm.org
Data processing pipelines that are designed to clean, transform and alter data in preparation
for learning predictive models, have an impact on those models' accuracy and performance …