Layering in Provenance Systems

Title: Layering in Provenance Systems
Author: Seltzer, Margo I.; Muniswamy-Reddy, Kiran-Kumar; Braun, Uri Jacob; Holland, David A; Macko, Peter; Maclean, Diana; Margo, Daniel Wyatt; Smogor, Robin

Note: Order does not necessarily reflect citation order of authors.

Citation: Muniswamy-Reddy, Kiran-Kumar, Uri Braun, David A. Holland, Peter Macko, Diana Maclean, Daniel Margo, Margo Seltzer, Robin Smogor. 2009. Layering in Provenance Systems. In Proceedings of the 2009 USENIX Annual Technical Conference (USENIX '09), June 14-19, 2009, San Diego, California. Berkeley, CA: USENIX Association.
Abstract: Digital provenance describes the ancestry or history of a digital object. Most existing provenance systems, however, operate at only one level of abstraction: the system call layer, a workflow specification, or the high-level constructs of a particular application. The provenance collectable in each of these layers is different, and all of it can be important. Single-layer systems fail to account for the different levels of abstraction at which users need to reason about their data and processes. These systems cannot integrate data provenance across layers and cannot answer questions that require an integrated view of the provenance.
We have designed a provenance collection structure facilitating the integration of provenance across multiple levels of abstraction, including a workflow engine, a web browser, and an initial runtime Python provenance tracking wrapper. We layer these components atop provenance-aware network storage (NFS) that builds upon a Provenance-Aware Storage System (PASS). We discuss the challenges of building systems that integrate provenance across multiple layers of abstraction, present how we augmented systems in each layer to integrate provenance, and present use cases that demonstrate how provenance spanning multiple layers provides functionality not available in existing systems. Our evaluation shows that the overheads imposed by layering provenance systems are reasonable.
Terms of Use: This article is made available under the terms and conditions applicable to Other Posted Material, as set forth at