Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories...

28
Future Search Technical Services Master Class Karen Calhoun Glenn Patton VP Metadata, OCLC Director, WorldCat Quality Marion van Brunschot Lars Svensson Manager, Acquisitions & Metadata IT Specialist University of Amsterdam Deutsche Nationalbibliothek EMEA Regional Council Meeting 2 March 2011

Transcript of Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories...

Page 1: Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories are growing in number, visibility, relevance and use 7. The metadata for digital library

Future Search

Technical Services

Master Class

Karen Calhoun Glenn PattonVP Metadata, OCLC Director, WorldCat Quality

Marion van Brunschot Lars SvenssonManager, Acquisitions & Metadata IT Specialist

University of Amsterdam Deutsche Nationalbibliothek

EMEA Regional

Council

Meeting

2 March 2011

Page 2: Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories are growing in number, visibility, relevance and use 7. The metadata for digital library

Q2

Q1

Q3

Q4

Q5

Q6

Q2

Q3

Q4

Q5

Q6

Q1

Q2

Q1

Q3

Q4

Q5

Q6

Q2

Q3

Q4

Q5

Q6

Q1

Q2

Q1

Q3

Q4

Q5

Q6

Q2

Q3

Q4

Q5

Q6

Q1

ROW 1 ROW 2 ROW 3

Future Search Conference Set Up

Warzynski, Chester C. 2004. "Future-Search Conferences at Cornell

University“ New Directions for Institutional Research. (123): 105-112.

Page 3: Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories are growing in number, visibility, relevance and use 7. The metadata for digital library

Typical Future Search Questions

• What are the most significant trends in the environment?

• How can we prepare to address them?

• What are some issues we need to acknowledge and

address?

• What communities do we serve?

• What do these communities need from us?

Page 4: Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories are growing in number, visibility, relevance and use 7. The metadata for digital library

Outline of Master Class

• OCLC strategic planning for metadata services

• Environmental scan: Sketch of most significant trends

• High level view of OCLC plans: technical services to metadata

management

• Deeper investigations of certain issues or trends

• Metadata quality – who uses library metadata, and what are

their needs? (Glenn - WorldCat case study)

• Redesigning workflows (Marion - University of Amsterdam)

• Innovation in knowledge organization (Lars - DDC as linked

data, DNB)

Page 5: Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories are growing in number, visibility, relevance and use 7. The metadata for digital library

What’s the first thingyou think of when you think of a library?

Some issues to acknowledge and address

De Rosa, Cathy. 2005. Perceptions of libraries and information

resources: a report to the OCLC membership. Dublin, Ohio:

OCLC Online Computer Library Center. p 3-32.

http://www.oclc.org/reports/pdfs/Percept_pt3.pdf

De Rosa, Cathy, et al. 2011. Perceptions of libraries, 2010:

context and community : a report to the OCLC membership.

Dublin, Ohio: OCLC. p. 38.

http://www.oclc.org/reports/2010perceptions.htm

Page 6: Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories are growing in number, visibility, relevance and use 7. The metadata for digital library

Books = ~85% of member-contributed

WorldCat bibliographic data

WorldCat by Type of Material Described,

1999-2010

E-books/books

2009: 1.8%

2010: 2.9%

Page 7: Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories are growing in number, visibility, relevance and use 7. The metadata for digital library

Brief summary of the results of our

environment scan -1-

1. Collections of interest to the communities that libraries

serve go well beyond the physical collections that are

housed in library buildings

2. Available cataloguing services (including OCLC’s) were

built to support conventional title-by-title library

workflows for physical library collections, with limited

support for new metadata creation and management

methods for new types of collections.

Page 8: Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories are growing in number, visibility, relevance and use 7. The metadata for digital library

WorldCat Cataloging Partners

Page 9: Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories are growing in number, visibility, relevance and use 7. The metadata for digital library

WorldCat Collection Sets

• Save staff time and expense of

cataloging hundreds or

thousands of titles

• MARC records for vendor sets

of microform and electronic

collections ready to load

• Holdings automatically set in

WorldCat

Page 10: Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories are growing in number, visibility, relevance and use 7. The metadata for digital library

Environmental scan results -2-

3. Academic libraries have significantly increased their

spend on e-resources, which are heavily used; metadata

for e-resources is often obtained and managed at the

collection level (not title-by-title)

4. Libraries have redundantly built complex and costly local

infrastructures for e-resource management, including e-

resource metadata management

5. Alternative methods for metadata creation and

management do exist but they are not part of shared,

network-level, cooperative cataloging services

Page 11: Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories are growing in number, visibility, relevance and use 7. The metadata for digital library

Environmental scan results -3-

6. Digital library collections and open access repositories

are growing in number, visibility, relevance and use

7. The metadata for digital library collections and open

access repositories is not aggregated or managed in any

systematic way on the network that would offer libraries

a way to cost-effectively aggregate this metadata with

other library collections

8. Open source, open data, and open access solutions have

wide support and acceptance in the library community.

Page 12: Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories are growing in number, visibility, relevance and use 7. The metadata for digital library

If we were building a cooperative,

network-level metadata creation and

management service today, what would

it look like?

Page 13: Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories are growing in number, visibility, relevance and use 7. The metadata for digital library

A Five Year Vision For OCLC Metadata

Management and Collections Services

• A new WorldCat: representing physical, licensed, and digital

library collections

• Innovation through more open data and services

• Easier, faster ways to ingest an OCLC member’s or group’s

data and keep it synchronized with WorldCat and syndicated

on partner sites (e.g., Google Book Search)

• Functionality to efficiently create, manage and share

metadata both at the collection level and at the level of a

single information object

-CONTINUED-

Page 14: Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories are growing in number, visibility, relevance and use 7. The metadata for digital library

Five Year Vision -2-

• Continuous improvement in quality and cohesiveness of

WorldCat data for metadata experts, end-users and a variety

of other constituencies (more from Glenn)

• A service-neutral open platform (more in another program

from Robin Murray)

Page 15: Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories are growing in number, visibility, relevance and use 7. The metadata for digital library

Member Libraries’ Collective Collections

WorldCat bibliographic

records

1.6 billion holdings

WorldCat

WorldCat.org

WorldCat Local

WorldCat Knowledge Base

Google Books, HathiTrust,

OAIster

Local special collections

Open access repositories

Digital Collections Gateway

Physical holdings

of libraries

Licensed e-content Digitized and born digital

content

1. Build the New WorldCat: Representing Physical,

Licensed, and Digital Library Collections

Page 16: Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories are growing in number, visibility, relevance and use 7. The metadata for digital library

2. Enable more access to WorldCat to

spur innovation

• WorldCat Rights and Responsibilities for the OCLC

Cooperative - applies to copies of WorldCat data in other

systems http://www.oclc.org/worldcat/recorduse/policy/default.htm

• WorldCat data in WorldCat - expose through a

standardized and consistent set of data services and

interface mechanisms. These mechanisms include

syndication, Web-service / API access and linked data.

Page 17: Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories are growing in number, visibility, relevance and use 7. The metadata for digital library

Early examples of WorldCat data sharing

• Data syndication – particularly to the internet search engines. Get more

attention for library collections.

• WorldCat API - has seen a 20-fold increase in traffic in the past year and now

generates around 5 million transactions per month.

• WorldCat Knowledge Base API –provides programmatic access to the

knowledge base

• The XID services - provide identifier mapping between related ISBN’s and

ISSN’s (generates around 5 million transactions per month from external

services).

• The Dewey linked data service (http://dewey.info)

• VIAF linked data service

• More on this topic: Robin Murray blog post on the open data ecosystem:

http://community.oclc.org/cooperative/author/robin_murray/2010/12/

Page 18: Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories are growing in number, visibility, relevance and use 7. The metadata for digital library

3. Making data ingest easier – for every type

of collection

Currently, it is expensive and complex for

libraries to represent their collections in

WorldCat. OCLC Research calls this the ―stitching

cost.‖

There is a need for solutions that reduce the

stitching cost to near zero.

Page 19: Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories are growing in number, visibility, relevance and use 7. The metadata for digital library

Improve data ingest

DefinitionFoundation for data ingest and exchange between

OCLC services, members and partners.

Key Feature

Ingesting or harvesting holdings and metadata

Library

Impact

Efficiency: Easier synchronization with WorldCat

Visibility: More ways for the library’s users to find the library’s collections

Data

Ingest

Page 20: Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories are growing in number, visibility, relevance and use 7. The metadata for digital library

Synchronizing ―Group‖ and ―Local‖ Catalogs

with WorldCat

CBS Union Database

SRU

Central Library

District Library

Tech School Library

SRU

Territory Library

Design School Library

Page 21: Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories are growing in number, visibility, relevance and use 7. The metadata for digital library

Other solutions for (mostly) physical library

collections

Traditional Library System

Z39.50 Client

SQL Query

OAI PMH

ILS API

Local Scripts

SRUILS

Synchronization Gateway

Page 22: Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories are growing in number, visibility, relevance and use 7. The metadata for digital library

WorldCat Knowledge Base: network level e-

resource management (including metadata

management)

• Collection data of

databases or packages

•Title data and identifier

metadata

•Holdings data inventory

of titles subscribed to by

your library or consortia

•Linking logic

6.3m records for e-materials from 125 providers in 9 countries

Independent of any particular application

Page 23: Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories are growing in number, visibility, relevance and use 7. The metadata for digital library

OCLC Digital Collections Gateway – metadata

aggregation and discovery for digital library

collections

A Web-based, no-fee, self-service tool to contribute digital

repository metadata to WorldCat

OAI (Open Archives Initiative) compliant repositories

(available since July 2010)

Discovery in WorldCat.org,

WorldCat Local

―The function of searching across

collections is a dream frequently

discussed but seldom realized at a

robust level. This paper …

discusses how we might move

from isolated digital collections to

interoperable digital libraries.‖

—Howard Besser

Page 24: Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories are growing in number, visibility, relevance and use 7. The metadata for digital library

4. Collection Builder: Provide for metadata

management at the collection level

DefinitionInterface that allows libraries to select their content and service

providers and configure services so that OCLC can deliver metadata and updates at the collection level

Key FeatureCollection-level metadata management for physical, licensed, and

digital content

Library

Impact

Reduces complexity; lowers costs; improves user experience with library systems and services

Page 25: Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories are growing in number, visibility, relevance and use 7. The metadata for digital library

5. Improve the quality and cohesiveness

of WorldCat data for end users and

librarians – around the world!

• Glenn’s presentation on WorldCat Quality is next

• The wealth of data from other countries is also an

unprecedented opportunity to create new, multilingual

services around works, names, terminologies, the DDC, and

more

Page 26: Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories are growing in number, visibility, relevance and use 7. The metadata for digital library

Ap

plic

ati

on

Pla

tfo

rmD

ata

Management Environment

Data Ingest Platform

End User Environment

Data Layer

Business Logic

Services

Core Data Services

Registry KBWC WorldCat Identifiers X-ID …

Management ApplicationsEnd User Apps

Licensed Supply Chain Digital Supply ChainPurchased Supply Chain

6. An open, service-neutral platform

31

Robin Murray

VP, Global Product Management

“OCLC’s Open Platform

Strategy”

Plenary Session

3 March 2011

8:50-10:15 am

Page 27: Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories are growing in number, visibility, relevance and use 7. The metadata for digital library

CBS (Central Bibliographic System) Partners

and WorldCat

•Support strong frameworks

for national or regional

union catalogs

•Loading in WorldCat gives

more visibility to citizens,

students and scholars

around the world

•Automated data

synchronization and

syndication

GGC (Netherlands)

UnityUK

GBV (Germany)

Hebis (Germany)

BSZ (Germany)

ABES (France)

Libraries Australia

SwissBib

32

Page 28: Karen Calhoun Glenn Patton - OCLC · 6. Digital library collections and open access repositories are growing in number, visibility, relevance and use 7. The metadata for digital library

Summary of Plan

Serve the whole cooperative

• Continue to serve individual libraries, national libraries, regional service providers and consortia; offer a service-neutral open platform.

• Libraries will participate in multiple consortia and their metadata needs will be multi-dimensional.

Build the new WorldCat

• Represent physical, licensed and digital library collections

• Evolve data exchange, representation models and policies to support a more distributed, open metadata environment

• Make data exchange easier

Improve WorldCat quality

• Global perspective; multilingual services

• Address issues of duplication and holdings scatter

• Improve the WorldCat Registry and Knowledge Base

Add value

• Transition from a record supply service to a collections metadata management service

• Revitalize the online cataloging service (not discussed today!)

• Significantly increase multilingual services --names, terminologies, DDC, more