Showing posts with label standards. Show all posts
Showing posts with label standards. Show all posts

Wednesday, March 27, 2019

Next Phase OAIS Review

Next Phase OAIS Review. Barbara Sierman. Digital Preservation Seeds. March 24, 2019.
     The OAIS standard had its 5-year review in 2017, which resulted in over 200 suggestions for change. All the changes were discussed in the DAI Working Group and and most of them were accepted. The next step is for the updated OAIS standard to go to CCSDS and ISO for final approval. The main part of the changes concerned terminology, both clarification and consistency.

Some concepts got a more extensive description, while others have changed. The new OAIS standard shows that a transparent process can lead to a standard that reflects the current practices. The standards group will now have a final opportunity to decide whether all suggested changes are clear and implementable.

Thursday, December 06, 2018

3 Principles for Selecting a Digital Preservation Solution


3 Principles for Selecting a Digital Preservation Solution. Daniel Greenberg. Ex Libris. November 29, 2018.
   This post was in honor of World Digital Preservation Day and lists some important elements to remember when reviewing digital preservation systems:

1. Interoperability different types of data and integrating with other systems
  • Support common protocols for harvesting, publishing and searching, e.g. Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) and SRU (Search/Retrieve via URL).
  • Ingest content with multiple methods and structures; e.g., BagIt, METS, CSV, and XML.
  • Providing well-documented external APIs
  • Integrating with other information systems
2. Follow Industry Standards, particularly standard metadata schemas and communication protocols. Benefits of doing this:
  • Interoperability between new and existing services and applications.
  • Compliance with policies and regulations.
  • Introduction of innovative features.
  • Enable a robust exit strategy, in case the vendor goes out of business.
3. Scalability:
  • Architectural scalability: Start small and grow big. Ability to expand the throughput over time without compromising performance.
  • Operational scalability: Ability to customize the system to the institutions’ needs.
  • Informational scalability: Keep up with latest strategies, practices, tools and policies by an active user community.
  • Organizational scalability: Administer multiple institutions with a single installation; support a flexible consortium model.

Friday, September 15, 2017

Preservation with PDF/A

Preservation with PDF/A (2nd Edition). Betsy A Fanning. DPC Technology Watch Report 17-01. July 2017. [PDF 34pp.]  [Link updated]
     This report is an updated edition of the original Technology Watch Report 08-02, Preserving the Data Explosion: Using PDF (Fanning,2008). It looks at PDF/Archive as digital document file format for long-term preservation. The PDF/A versions of the PDF format have been developed as a family of open ISO Standards to address preservation of PDF files by removing features that pose preservation risks. It is important for preservation purposes to know how closely a file conforms to the  requirements defined in the standard. There are preservation risks that may exist in the standard PDF file format:
  • any file type can be embedded;
  • the primary document can be conformant as a static document, but the embedded files may not be static;
  • embedded files may be infected by computer viruses;
  • embedded files may have extended metadata requirements, may introduce unexpected dependencies or be subject to format obsolescence;
  • embedded files may complicate matters relating to information security, data protection or the management of intellectual property rights.
By restricting some risk features and thus reducing preservation risks, the PDF/A format seeks to maximize:
  • device independence;
  • self-containment;
  • self-documentation.
Some reasons why an organization might use PDF/A to preserve their digital documents, include:
  • its standardized format for storing digital documents for long periods of time;
  • it allows for digitally signed documents using the very latest digital signature software;
  • it reliably displays special characters for mathematics and languages since all are embedded within the file;
  • it displays correctly on any device as the author intended, including the reading order;
  • platform independence;
  • provision of fully searchable documents through Optical Character Recognition.
History and Features of PDF and PDF/A. The Standard was drafted in multiple in order to make it easier to implement the Standard. "Unfortunately, the committee’s philosophy of multiple parts resulted in confusion in the market place, making it more difficult for users to select the optimum file format." Users  may need to do a file format assessment based on their requirements that can help them decide which PDF/A Standard to implement.

Metadata helps effectively manage a file throughout its life cycle, as well assist in document discovery searches. "Establishing a long-term digital document preservation system requires careful consideration of the metadata that will be needed to locate and render documents years from now." Collecting metadata for the PDF/A documents in optional in the standard, except for the identifier, which is generated when the PDF/A file is created. Preservation metadata should:
  • be appropriate to the materials;
  • support interoperability;
  • use standardized controlled vocabulary;
  • include clear statements on the conditions and terms of use;
  • be authoritative and verifiable;
  • support the long-term management of the document.
Just because a file purports to be a PDF/A does not necessarily mean that it is. Format validation of a file can increase confidence a viewer will be able to render the file correctly.  A number of PDF/A validators are available.The development work on the PDF Standards is a continuing effort. There are additional preservation challenges in the format that are in the process of being addressed.

The report lists some recommendations, which are directed at groups that use the standard. They include:
  • For those evaluating PDF/A as a digital preservation solution:
    • Before adopting PDF/A as a preservation solution it is "essential to understand the organizational requirements and how PDF/A will support" the organization needs.
    • PDF/A is not a preservation solution on its own a part of the wider preservation strategy that must be consistent with other components of the preservation infrastructure, such as backups, integrity checks and documentation.
    • Different versions of PDF/A have different purposes, with different capabilities as well as different preservation risks. These should be understood and decisions should be documented and explained.
    • Different vendors offer different tools to manage PDF/A that should be compared against your requirements..
  • For organizations collecting and preserving digital data:
  • While it may not be possible to control or restrict how documents are produced, it may be useful to give document creators guidance on what is desired.
  • Embed PDF/A validation tools into preservation workflows and record the results to help manage the digital preservation risks associated with PDF/A files received.

Thursday, December 08, 2016

OAIS: a cage or a guide?

OAIS: a cage or a guide? Barbara Sierman. Digital Preservation Seeds. December 3, 2016.    
     Post about the OAIS standard and asking if it is a restriction or a guide. OAIS, the functional model, the data model and metrics in OAIS and the related standards like the audit and certification standard. "OAIS is out there for 20 years and we cannot imagine where digital preservation would be, without this standard." It is helpful for discussing preservation by naming the related functions and meta data groups. But it lacks a link to implementation and application for daily activities. OAIS is a lot of common sense put into a standard. The audit and certification standard, ISO 16363, is meant to explain how compliance can be achieved, a more practical approach.

Many organisations are using this standard to answer to the question "Am I doing it right?" People working with digital preservation want to know the approach that others are using, the issues that they have solved. The preservation community needs to "evaluate regularly whether the standards they are using are still relevant in the changing environment" and a continuous debate is required to do this. In addition, we need evidence that practical implementations that follow OAIS are the best way to do digital preservation. Proof of what worked and what did not work is needed in order to adapt standards, and the DPC OAIS community wiki has been set up to gather thoughts related to the practical implementation of OAIS and to provide practical information about the preservation standards,


Wednesday, March 23, 2016

New Report on Web Archiving Available

New Report on Web Archiving Available. Andrea Goethals. IIPC. 21 March 2016.
     Harvard Library recently released a report to:
  • explore and document current web archiving programs
  • identify common practices, needs, and expectations in the collection of web archives
  • identify the provision and maintenance of web archiving infrastructure and services;
  • identify the use of web archives by researchers.
The environmental scan showed 22 opportunities for future research and development, which includes:
  • Dedicate full-time staff to work in web archiving to keep up on latest developments, best practices and be part of the web archiving community.
  • Conduct outreach, training and professional development for existing staff who are being asked to collect web archives.
  • Institutional web archiving programs should be transparent about holdings, terms of use, preservation commitment, are curatorial decisions made for each capture.
  • Develop a collection development tool to show holdings information to researchers and other collecting institutions.
  • Train researchers to be able to analyze big data found in web archives.
  • Establish a standard for describing the curatorial decisions behind collecting web archives.
  • Establish a feedback loop between researchers and the librarians/archivists.
There is also a need to "radically increase communication and collaboration" among all involved in web archiving. Much more communication and collaboration is needed between those collecting web content and researchers who would like to use it.

Friday, March 18, 2016

Applying DP Standards For Assessment & Planning

Applying DP Standards For Assessment & Planning. Bertram Lyons. PASIG 2016. March, 2016.
     ISO 16363:2012. Audit & Certification of Trustworthy Digital Repositories defines recommended practices for assessing the trustworthiness of digital repositories. The document will help those who audit repositories, but also those to design or redesign their digital repository processes. Some highlights from the standard:

3.1 Governance and Organizational viability: The repository shall have a collection policy or other document that specifies the type of information it will preserve, retain, manage, and provide access to. Without the policy the collection scope is unclear and it becomes difficult to say no to out of scope content. The standard expects a policy to exist and be documented.

4.2 Ingest: Creation of AIPs: Organizations should have a description of how AIPs are constructed from SIPs. It should document all changes to the processes, as well as defining what happens to the content (such as normalization of files, etc.)

5.2 Security Risk Management: The repository should have a written disaster preparedness and recovery plan, including at least one off-site backup of all preserved information together with an off-site copy of the recovery plan. This means the organization should be prepared administratively.

The elements are scored as follows
  • 0 - non-compliant or not started
  • 1 - slightly compliant (needs a lot of work to do in address the requirement.
  • 2 - half compliant: partially addressed but still significant work to do
  • 3 - mostly compliant: mostly addressed and working on full compliance.
  • 4 - fully compliant: can demonstrate the requirement is comprehensively addressed.
Elements needed:
  • Documentation: records of policy, procedure, and outcomes of activities
  • Policy: the definition of approaches and protocol for repository functions and procedures
  • Procedures: specification of preservation and infrastructure management activities
  • Software: development or configuration of preservation systems
  • Infrastructure: procurement, monitoring, and management of hardware infrastructure
  • Organization: organizational infrastructure including funding, staffing, and strategy
  • Action Plan

Tuesday, March 08, 2016

Assessing Digital Preservation at the John F. Kennedy Presidential Library

Assessing Digital Preservation at the John F. Kennedy Presidential Library. Alice Sara Prael, Abbey Potter. The Signal. March 2, 2016.
      The question is with all the data in the library holdings, how to preserve the digital files over the long term? The goal of the project is to ‘develop a long-range digital preservation strategy’ to address all digital archival holdings at the Library. This challenging goal consists of three phases:
  1. assess current infrastructure against community standards and make brief recommendations on how to improve digital preservation practices
  2. explore potential solutions to address the recommendations made in the first phase
  3. determine a single path forward based on the solutions explored and create an action plan for how to implement that solution.
Plans started by interviewing archivists and IT personnel about their processes and how they use the systems and then research the systems. Like many cultural heritage institutions, is in need of better documentation. The biggest gap in the existing documentation for digital archives is a digital preservation policy which is a record of decisions, including those that have been made but not documented or documented elsewhere. Important community standards and guidelines include
  • ISO 14721: Reference Model for an Open Archival Information System (OAIS), 
  • ISO 16363: Audit and Certification of Trustworthy Digital Repositories,
  • the National Digital Stewardship Alliance Levels of Digital Preservation
Each of these items gives a slightly different perspective on what is required for digital preservation. With limited resources and staff time it’s important to recognize when to aim for “good enough” digital preservation, which can be defined for each institution by "the available resources, the needs of the collection, and priorities of the institution".  The NDSA Levels of Digital Preservation are not as in depth as but they are easier to understand. The intermediary levels can address digital preservation in a phased approach and also create a way for identifying strengths and weaknesses. Once the NDSR project is complete the Library will have a picture of the digital practices and a clear implementation plan for improved digital preservation.

Friday, November 20, 2015

Developing Best Practices in Digital Library Assessment: Year One Update

Developing Best Practices in Digital Library Assessment: Year One Update. Joyce Chapman, Jody DeRidder, Santi Thompson. D-Lib Magazine. November 2015.
     While research and cultural institutions have increased focus on online access to special collections in the past decade, methods for assessing digital libraries have yet to be standardized. Because of limited resources and increasing demands for online access, assessment has become increasingly important. Library staff do not know how to begin to assess the costs, impact, use, and usability of digital libraries. The Digital Library Federation Assessment Interest Group is working to develop best practices and guidelines in digital library assessment. The definition of a digital library used is "the collections of digitized or digitally born items that are stored, managed, serviced, and preserved by libraries or cultural heritage institutions, excluding the digital content purchased from publishers."

They are considering two basic questions:
  1.     What strategic information do we need to collect to make intelligent decisions?
  2.     How can we best collect, analyze, and share that information effectively?
There are no "standardized criteria for digital library evaluation. Several efforts that are devoted to developing digital library metrics have not produced, as yet, generalizable and accepted metrics, some of which may be used for evaluation. Thus, evaluators have chosen their own evaluation criteria as they went along. As a result, criteria for digital library evaluation fluctuate widely from effort to effort." Not much has changed in the last 10 years in the area in regards to digitized primary source materials and institutional repositories. "Development of best practices and guidelines requires a concerted engagement of the community to whom the outcome matters most: those who develop and support digital libraries". The article shares "what progress we have made to date, as well as to increase awareness of this issue and solicit participation in an evolving effort to develop viable solutions."

Friday, October 23, 2015

Metadata for your Digital Collections

Metadata for your Digital Collections. Jenn Riley. Indiana Cooperative Library Services Authority. March 6, 2007.
    A slideshow about metadata that I came across while preparing a presentation. A summary:
There are many definitions of metadata; generally it can be defined as structured information about an information resource. The presentation looks at the uses, structure and types of metadata:
  • Descriptive metadata
  • Technical metadata
  • Preservation metadata
  • Rights metadata
  • Structural metadata
Each of the various metadata types have their structures, values, benefits, and limitations, including:
  • Dublin core, inability to "provide robust record relationships".                          
  • Qualified Dublin Core
  • MARC
  • MARCXML, "the exact structure of MARC21 in an XML syntax"
  • MODS, "'MARC-like' but intended to be simpler"
  • Others include Visual Resources Association Core, TEI, EAD, FRBR,
The standards are important now because it will help in migrating to other systems later and the collections will be more inter-operable.  Good digital collections are:
  • Inter-operable, shareable and searchable
  • Persistent
  • Re-usable for multiple purposes
It notes that "good metadata promotes good digital collections". To share the metadata it needs to be prepared to map across other formats and systems. A map or 'crosswalk' can be created to do this. It is "good practice to create and store most robust metadata format possible." You need to find the right balance for your metadata. Good shareable metadata should involve:
  • content
  • consistency
  • coherence
  • context
  • communication
  • conformance
That is what the standards help to do.

Wednesday, October 21, 2015

DPC invites members to review the OAIS Standard

DPC invites members to review the OAIS Standard. William Kilbride, et al. Digital Preservation Coalition,  Open Preservation Foundation. October 21, 2015.
"DPC is delighted to welcome members to participate in the review of OAIS, work that will hold our interest for a couple of years and which we aim to build into a platform for collaboration among our diverse members in the future.

The OAIS standard published by both the Consultative Committee for Space Data Systems (CCSDS) and as ISO14721 has been highly influential in the development of digital preservation. As a reference model it provides a common basis for aligning disparate practice in diverse institutional settings. A range of standards have emerged around and related to OAIS including PREMIS (for preservation metadata), ISO16363 (for certification) and PAIMAS (for exchange between Producers and Archives).

Since OAIS was initially proposed the digital preservation community has grown tremendously in absolute numbers and in diversity. OAIS adoption has expanded far beyond the space data community to include cultural heritage, research data centers, commerce, industry and government.

The digital preservation community has – we have! – a responsibility to keep our standards relevant. The upcoming ISO review of the OAIS standard in 2017 offers a chance for a cooperative, transparent review process. It also creates an opportunity for further community building around OAIS and related initiatives.

"The outcome from this activity is not simply a wiki nor is it a set of recommendations. By providing a shared open platform for the community that gathers around the OAIS we aim to ensure on-going dialogue about our standards and their implementation in the future.
In this sense the 2017 review is a milestone on the way to an engaged and empowered community rather than a destination.
  • OAIS Community forum via a wiki: Your feedback and the discussions on this wiki will provide raw material for an editorial committee of the most active participants to formulate recommendations which will result in a formal submission to the 2017 review. So sign in and add your views!
  • Exploring official mechanisms: Official mechanisms for the review of ISO standards are well established via National Standard Bodies and these will be explored and used to give input for the review.
  • Active Interaction: Ensuring inclusion for this large, diverse community will mean collaborative virtual meetings are necessary but we all recognize the value of meeting face to face and will seek to enable this.
Join the community and contribute your views on the wiki here: http://wiki.dpconline.org/index.php?title=OAIS_Community

Tuesday, September 15, 2015

UK Government: What we’re doing on open standards

What we’re doing on open standards. Government Technology Team. 7 September 2015.
     The UK government technology team has been selecting open standards to help government to adapt to changing needs and technologies. "Open Document Format (ODF) is an important standard - by making documents in a format that is open to all, we are ensuring that there are no barriers or bias when we provide services to the public."  The underlying principle is that people should not have to buy new equipment or software in order to read an official document. The technology team looked at the formats that should be used for government documents and chose the Open Document Format. Since then:
  • all of central government has committed to moving to the ODF 1.2 for their editable documents
  • most departments have published their implementation plans
  • the proportion of ODF documents on GOV.UK is increasing steadily
  • software suppliers are providing better support for open formats in their products
A guidance manual has been created to help the government departments a they move to ODF. The manual includes topics such as:
  • Introduction to Open Document Format (ODF)  
  • Procure ODF solutions
  • Validators and compliance testing 
  • Platforms and devices 
  • Accessibility, Privacy and security 
  • Best practices and other information

Monday, July 06, 2015

TIFF/A

TIFF/A. Gary McGath. File Formats Blog.  July 3, 2015.
   The tiff format has been around for a long time. There have been many changes and additions, such that "TIFF today is the sum of a lot of unwritten rules".  A group of academic archivists have been working on a long term readable version, calling it TIFF/A. A white paper discusses the technical issues. Discussions starting in September will hope to create a version to submit for ISO consideration.

Friday, June 19, 2015

Digital Preservation Metadata and Improvements to PREMIS in Version 3.0

Digital Preservation Metadata  and Improvements to PREMIS in  Version  3.0. Angela Dappert. May 27, 2015. [PDF]
This is the notes from a DCMI/ASIS&T joint webinar about PREMIS v. 3. The PDF document has 63 slides which gives an overview of why digital preservation metadata is needed, shows examples of digital preservation metadata, shows how PREMIS can be used to capture this metadata, and shows some of the changes in version 3.0.

Digital preservation metadata is the metadata needed to ensure long-term accessibility of digital resources. Digital objects must be self-descriptive independently from the systems that were used to create them. PREMIS is the de-facto standard for metadata to support the preservation of digital objects and ensure their long-term usability. It is a common data model for organizing/thinking about preservation metadata, or for exchanging information packages between repositories. It is not an out-of-the-box solution, nor all the metadata needed.

Wednesday, June 17, 2015

IFI Irish Film Archive publishes new Digital Preservation and Access Strategy

IFI Irish Film Archive publishes new Digital Preservation and Access Strategy. Seán Brosnan. Irish Film and Television Network. 16 Jun 2015. [PDF]
The Irish Film Archive has published a new Digital Preservation & Access Strategy that addresses archiving vast quantities of moving image material in a digital environment.  “This crucial document ... outlines a long-term plan and a set of guiding principles, flexibility, scalability and sustainability, which will assist us in preserving and providing access to Ireland’s digital moving image material.”

In order to fully achieve its mission, the organization must be able to preserve and make available the digital collections in its care, by doing the following items:
  • Publish a comprehensive Digital Preservation and Access Strategy document.
  • Upgrade the Archive’s technology and equipment to facilitate digital activities.
  • Develop policies and procedures to support the digital strategy and asset management
  • Develop a trusted digital repository
  • Increase access to the collections
  • Secure funding to ensure digital preservation and access are supported and sustainable.
  • Develop specialist digital management and preservation skills
  • Develop the Archive’s existing database to allow the recording necessary metadata
  • Follow international examples of best practice
  • Create and maintain on-going partnerships

The digital strategies and solutions must ensure efficiency and cost effectiveness by being flexible, scalable, and sustainable.  The Digital Preservation and Access Strategy must consider the short-, medium-and long-term needs of the organization in safeguarding and sharing digital collections. The risks and challenges they face include:
  • Technical obsolescence
  • Lack of Standards
  • Expense: Creating and maintaining the infrastructure, equipment and trained people will be an ongoing, costly and resource-heavy activity, but it is vital to ensure the availability of digital collections to future generations. An efficient digital preservation strategy "cannot be achieved with sporadic investment, but must be supported by regular and predictable funding"

The document also describes benefits, opportunities, stakeholders and strategic priorities. The strategic objectives include:
  • To maintain the Archive to the highest international standards
  • To improve resources within the Archive
  • To  improve access to the collections
  • To heighten awareness of the collections for current and future generations

Digital Preservation & Access Strategy Document will be an essential tool in helping the organization become a Digital Archive that meets internationally recognized standards of excellence. This will be a living document and will reviewed regularly to ensure it still meets the strategic objectives.  It will take into account any changes in access or preservation technology, policies, and budget changes. The document will also be supplemented by additional policies over time.

Tuesday, June 16, 2015

Digital Preservation links

Digital Preservation links . Julie C. Swierczek. scholar.harvard.edu website. June 2015.
A very thorough and up to date list of digital preservation resources on the internet. There are many topics, such as:
  • Digitization
  • Digital Forensics
  • Digital Accessioning Workflows
  • Guidelines/best practices
  • Reports/whitepapers/publications/bibliographies/journals
  • Metadata, vocabularies, XML and harvesting
  • Standards
  • Projects of interest
  • Blogs
  • Web archiving
 A very useful reference resource.

Friday, June 12, 2015

TIFF/A Standard Initiative

TIFF/A Standard Initiative. Website. June, 2015.
The TIFF/A standard initiative intends to create an ISO specification of a Archival TIFF Format. TIFF is a widely used format, but it is complex and has some features not suited for long term preservation. The TIFF/A-specification will be enhanced with mandatory and forbidden tags for archival purposes, similar to PDF/A. "This standard will be created in parallel with DPF Manager, an open source TIFF format validator that, in addition to the current TIFF ISO Standards, will be the first conformance checker for the TIFF/A new standard." The group looks to create a community of experts interested in discussing the initiative in order to prepare a proposal to submit to the ISO.

Friday, May 08, 2015

Digital Curation Tools & Techniques

Digital Curation Tools & Techniques. Nancy McGovern.  Webinar: Metropolitan New York Library Council. April 2015.
This is a recording of Nancy's excellent webinar and it goes along with the management tools website.  As organizations build sustainable digital preservation programs, they work toward being a trusted digital repository by following emerging digital preservation standards. There are now a number of tools to help with technical issues and now some management tools to help with organizational issues. The Digital Preservation Management workshop launched a set of management tools and techniques.

This webinar looks at using a set of digital curation and preservation principles to describe a  framework for developing a digital preservation program. The organizational aspects of digital preservation is more challenging than the technological issues. This helps people to work with tools and techniques, develop good practices, emphasize the organizational aspects and apply the 5 stages:
  1. Acknowledge: understanding that digital preservation is a local concern
  2. Act: initiating digital preservation projects
  3. Consolidate: segueing from project to basic program
  4. Institutionalize: rationalizing local efforts to establish a comprehensive program
  5. Externalize: embracing inter-institutional collaboration and dependency
Each tool and technique is built on the DPM model, the three-legged stool (Organizational Leg, Technological Leg, and Resources Framework) with the five stages of development for a sustainable digital preservation program. There is a tool or technique to help organizations in each of these areas:
  1. Principles: Adopt standards-based principles 
  2. Policy: Develop a high-level policy framework 
  3. Scope: Complete a digital content review to define program scope
  4. Workflow: Document workflows to improve and automate 
  5. Preparedness: Extend disaster preparedness to include digital
  6. Self-assessment: Engage in self-assessment to gauge progress

Wednesday, May 06, 2015

Preparing the Workforce for Digital Curation

Preparing the Workforce for Digital Curation. The National Academies Press. 2015.  
This 105 page report focuses on the need for digital curation education and training in order to provide meaningful use of digital information, now and in the future.  [PDF version] This study defines digital curation as: “The active management and enhancement of digital information assets for current and future use.” Digital curation is more than preserving the digital information in secure storage because curation may add value to digital information and increase its utility.

Digital curation is similar to traditional curation. "Regardless of whether a collection is physical or digital, a curator must appraise its value and relevance to the community of potential users; determine the need for preservation; document provenance and authenticity; describe, register, and catalog its content; arrange for long-term storage and preservation; and provide a means for access and use." But it also has many new challenges: the quantities of material to be curated, the need for active and ongoing management, continually changing uses and technology, and the diversity of organizational contexts in which curation occurs. It is more than simply collecting and storing data and information. Active management denotes planned, systematic, coordinated, purposeful, and directed actions that make digital information fit for a purpose. And to ensure that digital information will remain discoverable, accessible, and useable for as long as users have a need and a right to use it.

A new pattern of data usage puts a greater emphasis on the standardization of digital curation practices so that the data can be shared more easily.  Archiving digital data requires a more active management approach, and a more collaborative partnership between producers, archivists and users.

The Loss of Cultural Heritage Through Deterioration of Records and Technological Change: Sound recordings are a striking example of cultural heritage data at high risk of loss. These include music, oral histories, and radio broadcasts preserved in a wide variety of formats and media.

Some benefits of digital curation include:
  • Increased collaboration and cost sharing;
  • Greater use of data in teaching and research training;
  • New opportunities and uses for data, including data mining;
  • Creation of a more complete record of research;
  • Creation of new areas of research, new industries, or new support services.
Some principal conclusions:
  1. Significant opportunities exist to embed digital curation deeply into an organization’s practices to reduce costs and increase benefits. Digital curation will be increasingly in demand across many sectors of society.
  2. Digital curation can be advanced by various organizations that can serve as leaders, models, and sources of good curation practices, and build trust by preserving assets.
  3. Some barriers to digital curation include: lack of sharing of resources and insufficient resources.
  4. There is a need to identify, segregate, and measure the costs of curation tasks in scientific research and business processes.
  5. Standards and existing practices vary greatly, which can lead to a lack of coordination across different sectors. This in turn can lead to limited adoption of consistent standards for digital curation and fragmented dissemination of good practices.
  6. Automation of at least some digital curation tasks is desirable
  7. The knowledge and skills required of those engaged in digital curation are dynamic and highly interdisciplinary.
Some recommendations include:
  1. Research communities, educational institutions, and others should work together to develop and adopt digital curation standards and good practices.
  2. Work to identify and predict the costs associated with digital curation.
  3. Organizations should identify, explain, and measure the benefits derived from digital curation

Friday, April 17, 2015

Trustworthiness of Preservation Systems

Trustworthiness  of  Preservation Systems. David  Minor. PASIG Presentation. March 11, 2015. [PDF]
We  all  want  to  trust  systems, especially preservation  systems. Trust is an iterative process to verify and clarify. The principles of trust include:
  •  Institutional commitment to collections
  •  Infrastructure demands
  •  Technical system and staffing capabilities
  •  Sustainability (particularly funding, technology, collaboration)
  •  Identify and communicate risks to content, examining “what if” questions

There are three levels of auditing
  •  "Basic certification” is a simple self assessment
  •  "Extended certification" represents a plausibility checked assessment
  •  "Formal certification" is an audit driven by external experts

Major auditing frameworks include:
  •  Data Seal of Approval (Basic)
  •  nestor (Extended)
  •  TRAC/ISO 16363 (Formal)
  •  DRAMBORA (Range)

The DRAMBORA Stages
  1.  Identify organizational context
  2.  Document policy and regulatory framework
  3.  Identify activities, assets, and their owners
  4.  Identify risks
  5.  Assess risks
  6.  Manage risks
In the future, we need to know how these audit frameworks apply to distributed digital preservation environments, and how flexible the questions and the audit models are.


Saturday, March 21, 2015

Reaching Out and Moving Forward: Revising the Library of Congress’ Recommended Format Specifications

Reaching Out and Moving Forward: Revising the Library of Congress’ Recommended Format Specifications. Ted Westervelt, Butch Lazorchak. The Signal. Library of Congress. March 16, 2015.
The Library has created the Recommended Format Specifications, which is the result of years of work by experts from across the institution because it is essential to the mission of the institution. The  Library is committed to making the collection available to its patrons now and for generations to come and must be able to determine the physical and technical characteristics needed to fulfill this goal. The Specifications have hierarchies of characteristics, physical and digital, in order to provide guidance and determine the level of effort involved in managing and maintaining content. In order to continue manage the materials, the Specifications must be constantly reviewed and updated and materials and formats change. An example is exploring the potential value of the SIARD format developed by the Swiss Federal Archives as a means of preserving relational databases.