Provenance Datasets Highlighting Capture Disparities

By G. Coe , R. Doty , M. Allen , Dr. Adriane Chapman

Provenance information is inherently affected by the method of its capture. Different capture mechanisms create very different provenance graphs. This paper describes an academic use case that has corollaries in offices everywhere.

Download Resources


PDF Accessibility

One or more of the PDF files on this page fall under E202.2 Legacy Exceptions and may not be completely accessible. You may request an accessible version of a PDF using the form on the Contact Us page.

​Provenance information is inherently affected by the method of its capture. Different capture mechanisms create very different provenance graphs. In this work, we describe an academic use case that has corollaries in offices everywhere. We also describe two distinct possibilities for provenance capture methods within this domain. We generate three data sets using these two capture methods: the capture methods run individually and a trace of what an omniscient capture agent would see. We describe how the different capture methods lead to different graphs and release the graphs for others to use via the ProvBench effort.​