OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA.

27
OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA
  • date post

    22-Dec-2015
  • Category

    Documents

  • view

    221
  • download

    0

Transcript of OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA.

Page 1: OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA.

OAI-PMH at Yale

Report on the DLF OAI Training Session

November 10, 2005Charlottesville, VA

Page 2: OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA.

Overview

• Review of the protocol• OAI best practices• Potential Yale applications• Next steps for the Metadata Committee

Page 3: OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA.

OAI-PMH v.2.0Basic Concepts

• Data provider: administers systems that expose metadata

• Service provider: uses metadata to build value-added services

• Harvester: a client application that issues OAI-PMH requests

• Repository: a network accessible server that can process OAI-PMH requests

Page 4: OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA.

OAI-PMH v.2.0Basic Concepts

• Resource: the physical or digital object that metadata is "about"

• Item: a constituent of a repository from which metadata about a resource can be disseminated

• Record: metadata in a specific format• Identifier: a unique identifier that

unambiguously identifies an item in a repository; must conform to URI syntax

Page 5: OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA.

OAI-PMH v.2.0Harvesting

• Deleted records• Sets• Datestamps

– ISO 8601– UTC

• Selective harvesting

Page 6: OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA.

OAI-PMH v.2.0Protocol Features: HTTP

• Request– GET baseURL?key=value&….&key=value– POST baseURL

Content-Type: application/x-www-form-urlencodedContent-Length: number of characterskey=value&…&key=value

• Response– XML document in message body or error code

Page 7: OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA.

OAI-PMH v.2.0Protocol Features: XML Response

• XML declaration<?xml version="1.0" encoding="UTF-8"?>

• OAI-PMH root element with these attributes:– Default namespace declaration

xmlns=“http://www.openarchives.org/OAI/2.0/”

– Schema instance declarationxmlns:xsi=“http://www.w3.org/2001/XMLSchema-instance”

– Schema locationxsi:schemaLocation=“http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd”

Page 8: OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA.

OAI-PMH v.2.0Protocol Features: XML Response

• responseDate element– YYYY-MM-DDThh:mm:ssZ

• request element<request key=“value” key=“value” key=“value”>baseURL</request>

• response element– It has the same name as the verb used in

the request.

Page 9: OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA.

OAI-PMH v.2.0Protocol Features

• Multiple metadata formats– metadataPrefix

• Flow control– resumptionToken

Page 11: OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA.

Current Work: Resource Harvesting within the OAI-PMH Framework• Datestamps

– Updated record vs. updated resource

• Locating the resource– Multiple URLs: splash page, resource,

etc.– Multiple elements used inconsistently:

dc.identifier, dc.format, dc.relation

Page 13: OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA.

OAI Best PracticesDLF OAI Implementers Workshop

Handouts from the session1. Project Abstract2. The Case for OAI3. OAI “Cheat Sheet”: A Taxonomy of Rapid OAI

Deployment Strategies4. Summary of OAI Metadata Best Practices5. Summary of the DLF Aquifer MODS Profile6. OAI Tools7. OAI Implementation: Administrative Planning

Page 14: OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA.

OAI Best PracticesImplementation Decisions

• Collections– Develop criteria. Prioritize according to

ease of implementation, associated risk, logical dependencies among items, etc.

• Metadata formats– Decide which formats to support.

• Technical infrastructure– E.g., use a gateway that provides a base

URL for multiple individual collections.

Page 16: OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA.

OAI Best Practicesfor Data Providers

• Identifiers– Should be persistent & unique.– Should not be reused.– Specification and XML Schema

• Datestamps– Use UTC.– Support seconds granularity, if possible.

• Deleted records– Provide persistent support, if possible.

Page 17: OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA.

OAI Best Practicesfor Data Providers

• Resumption tokens– For repositories > 2 MB

• Sets– Service providers harvest by set.– How should sets be organized?

• About containers– Rights– Provenance (for 3rd party aggregators)

Page 18: OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA.

Implementation Guidelines

Includes:Guidelines for Repository ImplementersGuidelines for Harvester Implementers

Page 19: OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA.

OAI Validation

• Reap: OAI command line harvesting• Repository explorer: for data providers

& service providers to test harvesting & searching

• W3C validator for XML schema• Utf8conditioner: for character encoding

problems• See OAI Tools handout for more info.

Page 20: OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA.

OAI Best Practicesfor Shareable Metadata

The four C’s of shareable metadata• Consistency• Coherence• Context• Conformance

Page 21: OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA.

OAI Best Practicesfor Shareable Metadata

• Metadata in a shared environment– Context & coherence– Don’t assume a local user.

• Granularity of description– Appropriate for access to the resource– Don’t expose records for subordinate items.

• Use of multiple metadata formats– Need to be expressed as XML schema– Stepped crosswalking to simpler formats.

Page 22: OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA.

OAI Best Practicesfor Shareable Metadata

• Relating versions of a resource– One-to-One Principle– Multiple strategies/compromises

• Document metadata creation practices– In OAI responses– In external documentation

• Communication with service providers

Page 23: OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA.

Potential Applications at YaleImplementation Goals

• Improve user experience– Federated search

• Improve management of resources– Finding aids

• Collaborate with institutional partners– AMEEL

• Develop digital library infrastructure– At Yale and beyond

Page 24: OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA.

Potential Applications at YaleResources & Roles

• Resources– Commitment of stakeholders– Analysis of deployment options– Server infrastructure– Staff hours

• Roles– OAI-PMH Implementation Manager– Programmers & technical staff– Metadata specialists– Digital collection curators

Page 25: OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA.

Potential Applications at YaleSharing Metadata

• 3rd Party Aggregators– OAIster– DLF Portal– MODS Portal

• Registries– Registered OAI repositories– Institutional Archives Registry– OAI Registry at UIUC

Page 26: OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA.

Next Steps for the Metadata Committee

• Centralized implementation at Yale?If yes, – Relate to other digital library initiatives.– Create buy-in.

• Service provider needs– Consult with IAC committees.

• Data provider needs– Consult with digital collection curators.

Page 27: OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA.

Next Steps for the Metadata Committee

• Metadata recommendations– Recommend multiple formats– Decide upon a common format

• YES? MODS?• Stepped crosswalking from other formats

– Content & encoding guidelines– Metadata creation tools– Staffing