Title
Harvester results in a digital preservation system
Subtitle (en)
Paper - iPRES 2008 - London
Language
English
Description (en)
In the last few years, libraries around the world have built up OAIS-compliant archival systems. The information packages in these systems are often based on METS, and the contents are mainly e-journals and scientific publications. At the same time, Web archiving is becoming increasingly important for libraries. Most member institutions of the International Internet Preservation Consortium (IIPC) use the software Heritrix to harvest selected Web pages or complete domains. The results are stored in the container format ARC or its successor, WARC. The number of files in these archival packages and their sizes differ significantly from those of the other publications in existing archiving systems. This challenges the way archival packages are defined and handled in current OAIS-compliant systems. This paper compares existing approaches to using METS and Web harvesting results in archival systems. It describes the advantages and disadvantages of treating Web harvests in the same way as other digital publications in dedicated preservation systems. METS-based containers are compared with WARC and the possibilities it offers.
Keywords (en)
iPRES, London
Author of the digital object
Tobias Steinke
Format
application/pdf
Size
30.8 kB
Licence Selected
CC BY-SA 3.0 AT
Conferences
Conference 2008
Details
Object type
PDFDocument
Created
24.06.2013 08:53:14