Presentation that looks at developing a systematic approach to preserving digitally born collections. The example from Boston College are the Mary O’Hara papers. This was an opportunity for a collaborative project involving the Digital Libraries, Archives, and the Irish Music Center.
- Important elements of the workflow:
- Chain of Custody,
- Digital Forensics,
- Computed initial checksums,
- File/folder names,
- Local Archival Copies, Distributed Digital Preservation
: find directory-name -type f -exec ls -l {} ; >c:\data\MOH\inventory.txt
Local conventions regarding naming files and folders:
- Use English alphabet and numbers 0 - 9
- Avoid punctuation marks other than underscores or hyphens.
- Do not use spaces.
- Limit file/folder names to 31 characters, including the 3 digit extension . Prefer shorter names.
- Decision: They may remediate folder and file names, but only for the working copies.
- Any files off-limits or expendable? System files,
- Personally Identifiable Information (PII)
- Unsupported Formats (Can normalize using Xena)
- They also use a variety of tools, such as: FITS, JHove
- File migrations
- Obsolete file formats
- Proprietary file formats
- Metadata changes
No comments:
Post a Comment