You are here: University of Vienna PHAIDRA Detail o:294078
Title
Will Formal Preservation Models Require Relative Identity? An exploration of data identity statements
Subtitle (en)
Paper - iPRES 2012 - Digital Curation Institute, iSchool, Toronto
Language
English
Description (en)
The problem of identifying and re–identifying data put the notion of of ”same data” at the very heart of preservation, integration and interoperability, and many other fundamental data curation activities. However, it is also a profoundly challenging notion because the concept of data itself clearly lacks a precise and univocal definition. When science is con- ducted in small communicating groups, with homogeneous data these ambiguities seldom create problems and solutions can be negotiated in casual real-time conversations. However when the data is heterogeneous in encoding, con- tent and management practices, these problems can produce costly inefficiencies and lost opportunities. We consider here the relative identity view which apparently provides the most natural interpretation of common identity statements about digitally–encoded data. We show how this view conflicts with the curatorial and management practice of “data” objects, in terms of their modeling, and common knowledge representation strategies. In what follows we focus on a single class of identity statements about digitally–encoded data: “same data but in a different format”. As a representative example of the use of this kind of statements consider the dataset “Federal Data Center Consolidation Initiative (FDCCI) Data Center Closings 2010-2013”1 , available at Data.gov. Anyone can “Down- load a copy of this dataset in a static format”. The available formats include CSV, RDF, RSS, XLS, and XML. Each of this is presumably an encoding of the “same data”. We explore three approaches to formalization into first order logic and for each we identify distinctive tradeoffs for preservation models. Our analysis further motivates the development of a system that will provide a comprehensive treatment of data concepts.
Keywords (en)
iPRES, iSchool, Toronto, Canada, data, identity, scientific equivalence, data curation, digital preservation
Author of the digital object
Simone  Sacchi
Karen M.  Wickett
Allen H.  Renear
Format
application/pdf
Size
506.7 kB
Licence Selected
CC BY-NC-SA 3.0 AT
Conferences
Conference 2012
Name of Publication (en)
"iPres 2012 - Proceedings of the 9th International Conference on Preservation of Digital Objects." Editors: Reagan Moore, Kevin Ashley, Seamus Ross
From Page
328
To Page
329
Name of Collection/Monograph (en)
"iPres 2012 - Proceedings of the 9th International Conference on Preservation of Digital Objects." Editors: Reagan Moore, Kevin Ashley, Seamus Ross
Publishing Address
140 St. George Street, Toronto, ON M5S3G6
Publisher
Digital Curation Institute, iSchool University of Toronto
Publication Date
2012-11-01
Link to bibliographic information
https://ipres.ischool.utoronto.ca/sites/ipres.ischool.utoronto.ca/files/iPres%202012%20Conference%20Proceedings%20Final.pdf
Content
Details
Object type
PDFDocument
Format
application/pdf
Created
21.06.2013 02:48:37
Metadata