Beyond Zebra: Taking RDA beyond MARC - Dryadwiki.datadryad.org/images/1/19/GreenbergALA-2011.pdf ·...

Jane Greenberg ([email protected]) Metadata Research Center/School of Info. + Lib. Sci. University of North Carolina at Chapel Hill Beyond Zebra: Taking RDA beyond MARC ALA Annual Conference June 25, 2011


Page 1

Jane Greenberg ([email protected])

Metadata Research Center/School of Info. + Lib. Sci.

University of North Carolina at Chapel Hill

Beyond Zebra: Taking RDA beyond MARC

ALA Annual Conference

June 25, 2011

Page 2

Overview

1. MARC moment

2. Bibliographic universe world of information output

– Visual aids

3. DRYAD a case study

– Introduce Dryad

– Why not MARC

– RDA potential

4. Concluding remarks

Page 3

A MARC Moment

MARC MARC Authority Sources MARBI Concise MARC format

MARC Forum (listserv) MARC Relators MARC FAQ

Unicode-MARC Forum MARC-XML Understanding MARC

Page 4

Bibliographic universe ≈??

Traditional: World of recorded knowledge

Bibliographic entities

– Books

– Sound recordings

– Images

– Archives

– Music

Evolved/evolving+: World of information output (data, information, knowledge…)

Information objects

– Bibliographic entities

– People

– Activities/events

– Data

– Relationships

– Places

Page 5

Page 6

RDF graphs, graph theory, information, links, relationship, context…

from IA3 (Adaptive Information, Adaptive Innovation, Adaptive Infrastructure), Mike Bergman, http://www.mkbergman.com

Page 7

Also from IA3, Mike Bergman

Page 8

Questions…and the Dryad repository

Graph images are nice, but how do we get there?

Where does RDA fit in?

Consequences?

Page 9

ALA Annual Conference 2011

Enter Dryad…

- What is Dryad?

- Why not MARC?

Page 10


Data underlying peer-reviewed articles in the basic and applied biosciences

As of Jun 25, 2011, Dryad contains 769 data packages and 1856 data files, associated with articles in 81 journals

Page 11


Why not MARC?

- automatic propagation of metadata

- author generated metadata (low burden)

- handshaking/linking and sharing metadata

- promoting data-reuse, and tracking it

~ versioning

Page 12

From: [email protected]

Date: April 19, 2011 3:09:22 PM EDT

To: Author

Cc: [email protected]

Subject: Dryad entry for MEC-11-0140.R1

Dear Author

Many thanks for agreeing to participate in the Dryad project. To upload your data, please click the link below- it will take you directly to your entry in the Dryad database.

http://datadryad.org/submit?journalID=MolEcol&manu=223330

<deleted text>

Once you have uploaded your data please include the Dryad identifier in your manuscript. Please let me know if you have any questions about this process.

All the best,

Tim Vines,

Managing Editor, Molecular Ecology
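
The submission link in this message already carries a journal code and a manuscript number, which is one plausible way metadata could be propagated automatically into the deposit form. The following is a minimal sketch of that idea only; the function name `prefill_from_link` and the output field names are hypothetical, and nothing on the slide shows how Dryad actually processes the link.

```python
from urllib.parse import urlparse, parse_qs


def prefill_from_link(link: str) -> dict:
    """Pull the journal and manuscript identifiers out of a submission link.

    Hypothetical helper: the returned keys are illustrative names,
    not fields of any real Dryad API.
    """
    query = parse_qs(urlparse(link).query)
    return {
        "journal_id": query.get("journalID", [None])[0],
        "manuscript_id": query.get("manu", [None])[0],
    }


# The link quoted in the email above.
link = "http://datadryad.org/submit?journalID=MolEcol&manu=223330"
fields = prefill_from_link(link)
print(fields)
```

With identifiers like these in hand, a form could look up the article record and fill in title and authors before the depositor types anything, which is the low-burden approach the "Why not MARC?" slide lists.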

Page 13

Pre-populated metadata field

Page 14

Page 15

Page 16

[Diagram]

DRYAD

– DATA PACKAGE METADATA: metadata describing data package recorded here; data package identifier; related article citation displayed on package page

– DATA FILE METADATA (shown twice, once per data file): metadata describing data file recorded here; data file identifier; data package identifier

– DATA FILE (shown twice)

NOT DRYAD

– ARTICLE

– ARTICLE METADATA: metadata describing article recorded here; article identifier (usually DOI)

– GENBANK OBJECT: Genbank ID

– TREEBASE OBJECT: Tree Base ID

– OTHER OBJECT: URL
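
The relationships in the diagram on this slide can be sketched as a small data model. This is an illustration only: the class and attribute names are assumptions made for the example and are not taken from Dryad's code, and the file identifier used below is invented.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class DataFile:
    identifier: str           # data file identifier
    package_identifier: str   # identifier of the containing data package
    metadata: dict = field(default_factory=dict)


@dataclass
class ExternalObject:
    kind: str         # e.g. "GenBank", "TreeBASE", "other"
    identifier: str   # GenBank ID, TreeBASE ID, or URL


@dataclass
class DataPackage:
    identifier: str                            # data package identifier
    article_identifier: Optional[str] = None   # usually a DOI
    article_citation: Optional[str] = None     # shown on the package page
    metadata: dict = field(default_factory=dict)
    files: List[DataFile] = field(default_factory=list)
    external_objects: List[ExternalObject] = field(default_factory=list)

    def add_file(self, file_identifier: str, **metadata) -> DataFile:
        """Attach a data file that points back to this package."""
        data_file = DataFile(file_identifier, self.identifier, dict(metadata))
        self.files.append(data_file)
        return data_file


# Identifiers for the package and article come from the RDF example in this
# deck; the file identifier is hypothetical.
package = DataPackage(
    identifier="http://hdl.handle.net/10255/dryad.82",
    article_identifier="doi:10.1098/rspb.2007.0505",
)
data_file = package.add_file("example-file-1", format="csv")
package.external_objects.append(
    ExternalObject("TreeBASE", "http://purl.org/phylo/treebase/phylows/study/TB2:S1271?format=html")
)
```

Each data file carries both its own identifier and the package identifier, mirroring the two identifiers drawn on every DATA FILE METADATA box.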

Page 17


Results of a keyword search in Dryad

Page 18

<?xml version="1.0" encoding="UTF-8" ?>
<rdf:RDF xmlns="http://datadryad.org/"
         xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:dc="http://purl.org/dc/elements/1.1/"
         xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#">
  <rdf:Description rdf:about="http://hdl.handle.net/10255/dryad.82">
    <dc:title>Data from: Hunting to extinction: biology and regional economy influence extinction risk and the impact of hunting in artiodactyls</dc:title>
    <dc:creator>Price, Samantha A.</dc:creator>
    <dc:creator>Gittleman, John L.</dc:creator>
    <dc:subject>phylogenetic comparative methods</dc:subject>
    <!-- snip -->
    <dc:description>Half of all artiodactyls (even-toed hoofed mammals) are…</dc:description>
    <dc:publisher>Royal Society Publishing</dc:publisher>
    <dc:date>2008-02-27T17:42:57Z</dc:date>
    <dc:relation>Proceedings of the Royal Society</dc:relation>
    <!-- snip -->
    <dc:relation>doi:10.1098/rspb.2007.0505</dc:relation>
    <dc:relation>http://purl.org/phylo/treebase/phylows/study/TB2:S1271?format=html</dc:relation>
  </rdf:Description>
  <rdf:Description rdf:about="http://hdl.handle.net/10255/dryad.234">
    <dc:title>Data from: Towards a worldwide wood economics spectrum</dc:title>
    <dc:creator>Zanne, Amy E.</dc:creator>
    <!-- record continues beyond the slide -->
  </rdf:Description>
</rdf:RDF>
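
A record like the one on this slide can be read with a standard XML parser. The sketch below uses Python's built-in `xml.etree.ElementTree` on a shortened, well-formed copy of the first record; the `SAMPLE` string and the `summarize` function are written for this illustration and are not part of Dryad.

```python
import xml.etree.ElementTree as ET

# Namespace URIs as declared on the slide.
NS = {
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "dc": "http://purl.org/dc/elements/1.1/",
}

# Shortened copy of the first record; values are taken from the slide.
SAMPLE = """
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:dc="http://purl.org/dc/elements/1.1/">
  <rdf:Description rdf:about="http://hdl.handle.net/10255/dryad.82">
    <dc:creator>Price, Samantha A.</dc:creator>
    <dc:creator>Gittleman, John L.</dc:creator>
    <dc:relation>doi:10.1098/rspb.2007.0505</dc:relation>
  </rdf:Description>
</rdf:RDF>
"""


def summarize(rdf_xml: str) -> list:
    """Return one dict per rdf:Description with its creators and relations."""
    root = ET.fromstring(rdf_xml)
    records = []
    for desc in root.findall("rdf:Description", NS):
        records.append({
            # Attributes are looked up by their fully qualified name.
            "about": desc.get("{%s}about" % NS["rdf"]),
            "creators": [e.text for e in desc.findall("dc:creator", NS)],
            "relations": [e.text for e in desc.findall("dc:relation", NS)],
        })
    return records


records = summarize(SAMPLE)
for record in records:
    print(record["about"], record["creators"], record["relations"])
```

Plain `dc:relation` values mix journal names, DOIs, and URLs in one field, so a consumer has to guess what each string is; the typed `dcterms` relations later in the deck address that.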

Page 19

<rdf:RDF
  xmlns:dryadt="http://rio.cs.utep.edu/ciserver/ciprojects/sdata/DryadTypes.owl#"
  xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
  xmlns:owl="http://www.w3.org/2002/07/owl#"
  xmlns:xsd="http://www.w3.org/2001/XMLSchema#"
  xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" >
  <rdf:Description rdf:nodeID="A0">
    <wdo:hasOutput rdf:resource="http://rio.cs.utep.edu/ciserver/ciprojects/CIMiner/ciminer-workflow.owl#i20"/>
    <rdf:type rdf:resource="http://rio.cs.utep.edu/ciserver/ciprojects/sdata/DryadTypes.owl#DCType"/>
  </rdf:Description>
  …
  EN"></rdfs:comment>
  </rdf:Description>
</rdf:RDF> ….

Openlink Data Explorer (ODE) / LOD4DataONE

https://notebooks.dataone.org/lod4dataone/

Aida Gandara, DataONE

Page 20

Data reuse and data object relationships

Equivalence: A (=data set in Excel); A (=same data set in SAS); A (=same data set on paper)

Derivative: A (=data set); B (=data set A annotated); C (=data set A revised)

Whole-part: A (=data set); A1 (=a subset of A)

Sequential: A1 (=part 1 of a data set); A2 (=part 2 of a data set)

Page 21

DataCite, ver 2.1 (RDA vocabulary) http://www.datacite.org/schema/DataCite-MetadataKernel_v2.1.pdf

dcterms:relation

dcterms:conformsTo

dcterms:isReferencedBy

dcterms:references

dcterms:isVersionOf

dcterms:hasVersion

dcterms:isFormatOf

dcterms:hasFormat

dcterms:isPartOf

dcterms:hasPart

dcterms:isReplacedBy

dcterms:replaces

dcterms:source
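
Most of the properties listed above come in inverse pairs (for example `isPartOf` and `hasPart`), so a link recorded on one object implies the reverse link on the other. The sketch below shows one way to derive that reverse statement; the helper `with_inverse` and the `dataset:` identifiers are hypothetical and serve only to illustrate the pairing.

```python
# Inverse pairs among the Dublin Core terms listed on this slide.
INVERSE_PAIRS = [
    ("isReferencedBy", "references"),
    ("isVersionOf", "hasVersion"),
    ("isFormatOf", "hasFormat"),
    ("isPartOf", "hasPart"),
    ("isReplacedBy", "replaces"),
]

# Build a lookup that works in both directions.
INVERSE = {}
for left, right in INVERSE_PAIRS:
    INVERSE[left] = right
    INVERSE[right] = left


def with_inverse(subject: str, prop: str, obj: str) -> list:
    """Return the stated triple plus its inverse, when the property has one."""
    triples = [(subject, "dcterms:" + prop, obj)]
    if prop in INVERSE:
        triples.append((obj, "dcterms:" + INVERSE[prop], subject))
    return triples


# Hypothetical identifiers for a data set A and its subset A1.
triples = with_inverse("dataset:A1", "isPartOf", "dataset:A")
for triple in triples:
    print(triple)
```

Properties without a listed partner, such as `relation`, `conformsTo`, and `source`, pass through unchanged.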

RDA at play

- Data sets are works

- Authors are entities / ORCID

Page 22


Back to…Why not MARC?

- automatic propagation of metadata

- author generated metadata (low burden)

- handshaking/linking and sharing metadata

- promoting data-reuse, and tracking it

~ versioning

Page 23

Dryad DCAP (Dublin Core Application Profile), ver. 3.0 https://www.nescent.org/wg/dryad/images/8/8b/Dryad3.0.pdf

Baker, T. (2007), Singapore Framework

– bibo (The Bibliographic Ontology)

– dcterms (Dublin Core terms)

– dryad (Dryad) (property: Dryadstatus)

– DwC (Darwin Core)

Simple: automatic metadata generation; heterogeneous datasets

Interoperable: harvesting, cross-system searching

Semantic Web compatible: sustainable; supporting machine processing

Data-package centric

2 pronged approach ~ DSpace (Greenberg, et al, 2009)

Next steps: Alignment with Dryad-UK scheme (Shotton, et al, 2011)

Map to DataCite; ORCID

Page 24

Concluding remarks

Challenges

Alignment of research and implementation goals (more immediate needs may not be the most interesting, and vice versa)

– Priorities, language barriers, large team

Infrastructure not “fully” there; planning for the future

Pros, Benefits

Synergy between implementation and research (a live lab)

Preparing for new potential

Seeing some benefits…

Intellectually exciting

Page 25

Many people and organizations to acknowledge

Dryad Consortium Board, journal partners, and data authors

NESCent: Kevin Clarke, Hilmar Lapp, Heather Piwowar, Peggy Schaeffer, Ryan Scherle, Todd Vision

UNC-CH <Metadata Research Center>: Jose R. Pérez-Agüera, Sarah Carrier, Elena Feinstein, Jane Greenberg, Lina Huang, Robert Losee, Hollie White, Craig Willis

U British Columbia: Michael Whitlock

NCSU Digital Libraries: Kristin Antelman

HIVE: Library of Congress, USGS, and The Getty Research Institute; and workshop hosts

Yale/TreeBASE: Youjun Guo, Bill Piel

DataONE: Rebecca Koskela, Bill Michener, Dave Vieglais, Aida Gandara, and many others

British Library: Lee-Ann Coleman, Adam Farquhar, Brian Hole

Oxford University: David Shotton

Atmire.com: Mark Diggory

http://datadryad.org

http://blog.datadryad.org

http://datadryad.org/wiki

http://code.google.com/p/dryad

Facebook: Dryad

Twitter: @datadryad