Archivematica can serve as a “dark archive” for a DSpace repository, i.e. it provides back-end preservation functionality while DSpace remains the user deposit and access system.
Archivematica has been tested using exports from DSpace 1.7.x. Ingest has not been tested on exports from DSpace 1.8.x; however, the DSpace AIP export structure did not change between 1.7.x and 1.8.x, so ingest is expected to behave identically.
1. The transfer should be structured similarly to the one in Archivematica’s sample data at sampledata/SampleTransfers/DSpaceExport. This is a standard DSpace export with one DSpace AIP for the collection-level description and one for each of the deposited objects.
The inclusion of the collection-level export (e.g. aip_1314.zip) is not mandatory.
2. A typical zipped item folder will contain the uploaded object plus a license file, a METS file and possibly an OCR text file. An example is shown below:
- bitstream_39691_txt = the OCR text file
- bitstream_8272.pdf = the object deposited in DSpace
- bitstream_8273 = the license file
- mets.xml = the METS file for the item
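The listing above can be checked before starting a transfer with a short script. The following is a minimal sketch, not part of Archivematica: the function names `guess_role` and `describe_item_zip` are hypothetical, and the naming rules are a heuristic inferred only from the example listing (a `mets.xml` file, a name ending in `_txt` or `.txt` for OCR text, a name with an extension for the deposited object, and a name with no extension for the license).

```python
import zipfile
from pathlib import PurePosixPath


def guess_role(name: str) -> str:
    """Guess the role of one entry in a zipped DSpace item folder.

    Heuristic only: the rules mirror the example listing above and are
    an assumption, not documented DSpace or Archivematica behaviour.
    """
    base = PurePosixPath(name).name
    if base == "mets.xml":
        return "METS file"
    if base.endswith(("_txt", ".txt")):
        return "OCR text file"
    if PurePosixPath(base).suffix:
        return "deposited object"
    return "license file"


def describe_item_zip(source) -> dict[str, str]:
    """Map each file in the zip (a path or file-like object) to a guessed role."""
    with zipfile.ZipFile(source) as archive:
        return {
            name: guess_role(name)
            for name in archive.namelist()
            if not name.endswith("/")  # skip directory entries
        }
```

Calling `describe_item_zip` on one of the item zips in the export returns a dictionary of entry names and guessed roles, which makes it easy to spot an item that is missing its `mets.xml` file.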
3. To process, configure the source directory containing the transfer in the Administration tab of the browser. See Administrator manual - Transfer source directories.
4. In the transfer tab, use the dropdown menu to select the DSpace transfer type.
Then, browse to the appropriate source directory and add your DSpace export.
5. When the DSpace transfer has loaded to the dashboard, click Start Transfer to begin processing.
6. At the normalization step, choose “Normalize for preservation”.
7. The METS file for the AIP will show fileGrps for the different file types: original, submissionDocumentation (the mets.xml files), preservation, license and text/ocr.
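The fileGrp breakdown described above can be inspected outside the dashboard. The sketch below is illustrative only: it assumes the AIP METS file uses the standard METS namespace and labels each fileGrp with a `USE` attribute, and the function name `count_files_by_use` is hypothetical.

```python
import xml.etree.ElementTree as ET

# Standard METS namespace URI.
METS_NS = {"mets": "http://www.loc.gov/METS/"}


def count_files_by_use(mets_xml: str) -> dict[str, int]:
    """Count the files directly under each fileGrp, keyed by its USE attribute.

    Assumption: file groups are distinguished by USE (e.g. "original",
    "preservation"); groups without one are reported under a placeholder key.
    """
    root = ET.fromstring(mets_xml)
    counts: dict[str, int] = {}
    for group in root.iter("{http://www.loc.gov/METS/}fileGrp"):
        use = group.get("USE", "(no USE attribute)")
        files = group.findall("mets:file", METS_NS)
        counts[use] = counts.get(use, 0) + len(files)
    return counts
```

Passing the text of the AIP METS file to `count_files_by_use` gives a quick summary of how many files landed in each group, which can be compared against the contents of the original export.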
Each object in the AIP has two dmdSecs. The first contains XPointers to descriptive and rights metadata in the original mets.xml files exported from DSpace. The second reflects the parent-child relationship between a DSpace object and its community/collection, using the handles as identifiers: