Beyond Zebra: Taking RDA beyond MARC - Dryadwiki.datadryad.org/images/1/19/GreenbergALA-2011.pdf ·...

Post on 25-Jun-2020

12 views 0 download

Transcript of Beyond Zebra: Taking RDA beyond MARC - Dryadwiki.datadryad.org/images/1/19/GreenbergALA-2011.pdf ·...

Jane Greenberg (janeg@email.unc.edu)

Metadata Research Center/School of Info. + Lib. Sci.

University of North Carolina at Chapel Hill

Beyond Zebra: Taking RDA

beyond MARC

ALA Annual Conference

June 25, 2011

Overview

1. MARC moment

2. Bibliographic universe world of information

output

– Visual aids

3. DRYAD a case study

– Introduce Dryad

– Why not MARC

– RDA potential

4. Concluding remarks

A MARC Moment

MARC MARC Authority Sources MARBI Concise MARC format

MARC Forum (listserv) MARC Relators MARC FAQ

Unicode-MARC Forum MARC-XML Understanding MARC

Bibliographic universe ≈??

Traditional Evolved/evolving+

World of information output

data, information, knowledge… World of recorded knowledge

Bibliographic entities Information objects

Books

Sound recordings

Images

Archives

Music

Bibliographic entities

People

Activities/events

Data

Relationships

Places

RDF graphs,

graph theory, information, links,

relationship,

context…

from IA3 (Adaptive

Information, Adaptive

Innovation, Adaptive

Infrastructure)

Mike Bergman http://www.mkbergman.com

Also from

IA3, Mike

Bergman

Questions…and the Dryad repository

Graph images are nice, but how to we get there?

Where does RDA fit in?

Consequences?

ALA Annual Conference 2011

Enter Dryad…

- What is Dryad?

- Why not MARC?

10

Data underlying peer-reviewed articles in the

basic and applied biosciences

As of Jun 25, 2011, Dryad contains 769 data packages and

1856 data files, associated with articles in 81 journals

ALA Annual Conference 2011

Why not MARC?

- automatic propagation of metadata

- author generated metadata (low burden)

- handshaking/linking and sharing metadata

- promoting data-reuse, and tracking it

~ versioning

From: managing.editor@molecol.com

Date: April 19, 2011 3:09:22 PM EDT

To: Author

Cc: journal-submit@datadryad.org

Subject: Dryad entry for MEC-11-0140.R1

Dear Author

Many thanks for agreeing to participate in the Dryad project. To upload your data, please click the link below- it will take you directly to your entry in the Dryad database.

http://datadryad.org/submit?journalID=MolEcol&manu=223330

<deleted text>

Once you have uploaded your data please include the Dryad identifier in your manuscript. Please let me know if you have any questions about this process.

All the best,

Tim Vines,

Managing Editor, Molecular Ecology

Pre-populated

metadata

field

DATA

FILE

DATA PACKAGE METADATA

ARTICLE METADATA

ARTICLE

GENBANK

OBJECT

TREEBASE

OBJECT

DRYAD NOT

DRYAD

Data file

identifier

Metadata describing data package

recorded here

Related article citation displayed on

package page

Genbank ID

Tree Base ID

URL

Metadata describing article

recorded here

Article identifier (usually DOI)

OTHER

OBJECT

Data package identifier

DATA FILE METADATA

metadata describing data

file recorded here

DATA

FILE

Data file

identifier

Data package identifier

DATA FILE METADATA

metadata describing data

file recorded here

ALA Annual Conference 2011

Results of a keyword search in Dryad

<?xml version="1.0" encoding="UTF-8" ?>

- <rdf:RDF xmlns="http://datadryad.org/" xmlns:rdf="http://www.w3.org/1999/02/22-

rdf-syntax-ns#" xmlns:dc="http://purl.org/dc/elements/1.1/"

xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#">

- <rdf:Description rdf:about="http://hdl.handle.net/10255/dryad.82">

<dc:title>Data from: Hunting to extinction: biology and regional economy

influence extinction risk and the impact of hunting in artiodactyls</dc:title>

<dc:creator>Price, Samantha A.</dc:creator>

<dc:creator>Gittleman, John L.</dc:creator>

<dc:subject>phylogenetic comparative methods</dc:subject>

<snip>

<dc:description>Half of all artiodactyls (even-toed hoofed mammals) are…

<dc:publisher>Royal Society Publishing</dc:publisher>

<dc:date>2008-02-27T17:42:57Z</dc:date>

<dc:relation>Proceedings of the Royal Society</dc:relation>

<snip>

<dc:relation>doi:10.1098/rspb.2007.0505</dc:relation>

<dc:relation>http://purl.org/phylo/treebase/phylows/study/T

B2:S1271?format=html</dc:relation> </rdf:Description>

- <rdf:Description rdf:about="http://hdl.handle.net/10255/dryad.234">

<dc:title>Data from: Towards a worldwide wood economics spectrum</dc:title>

<dc:creator>Zanne, Amy E.</dc:creator>

<rdf:RDF

xmlns:dryadt="http://rio.cs.utep.edu/ciserver/ciprojects/s

data/DryadTypes.owl#"

xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-

ns#" xmlns:owl="http://www.w3.org/2002/07/owl#"

xmlns:xsd="http://www.w3.org/2001/XMLSchema#"

xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" >

<rdf:Description rdf:nodeID="A0"> <wdo:hasOutput

rdf:resource="http://rio.cs.utep.edu/ciserver/ciprojects/CI

Miner/ciminer-workflow.owl#i20"/> <rdf:type

rdf:resource="http://rio.cs.utep.edu/ciserver/ciprojects/sd

ata/DryadTypes.owl#DCType"/> </rdf:Description>

EN"></rdfs:comment> </rdf:Description> </rdf:RDF> ….

Openlink Data Explorer (ODE) / LOD4DataONE

https://notebooks.dataone.org/lod4dataone/

Aida Gandara, Data One

Data reuse and data object

relationships

Equivalence Derivative

Whole-part

Sequential

A (=same

data set

on paper)

A (=data

set in

Excel)

A (=same

data set

in SAS)

A1 (=part 1

of a data set)

C

(=data set

A revised)

B (=data

set A annotated)

A

(=data set)

A (=data set)

A1 (=a subset

of A)

A2 (=part 2

of a data set)

DataCite, ver 2.1 (RDA vocabulary) http://www.datacite.org/schema/DataCite-MetadataKernel_v2.1.pdf

dcterms:relation

dcterms:conformsTo:

dcterms:isReferencedBy

dcterms:references

dcterms:isVersionOf

dcterms:hasVersion

dcterms:isFormatOf

dcterms:hasFormat

dcterms:isPartOf

dcterms:hasPart

dcterms:isReplacedBy dcterms:replaces

dcterms:source

RDA at play

- Data sets are

works

- Authors are

entities /

ORCID

ALA Annual Conference 2011

Back to…Why not MARC?

- automatic propagation of metadata

- author generated metadata (low burden)

- handshaking/linking and sharing metadata

- promoting data-reuse, and tracking it

~ versioning

Baker, T. (2007), Singapore Framework

Dryad DCAP (Dublin Core

Application Profile), ver. 3.0 https://www.nescent.org/wg/dryad/images/8/8b/Dryad3.0.pdf

bibo (The Bibliographic

Ontology)

dcterms (Dublin Core terms)

dryad (Dryad) (property:

Dryadstatus)

DwC (Darwin Core) Simple: automatic metadata gen;

heterogeneous datasets

Interoperable: harvesting, cross-system

searching

Semantic Web compatible: sustainable;

supporting machine processing

Data-package centric

2 pronged approach ~ DDpace

(Greenberg, et al, 2009)

Next steps: Alignment with Dryad-UK scheme

(Shotton, et al, 2011)

Map to DataCite; ORCID

Concluding remarks

Alignment of research and

implementation goals (more

immediate needs may not be

the most interesting,

vice/versa)

– Priorities, language barriers,

large team

Infrastructure not “fully”

there; planning for the

future

Synergy between implementation and research (a live lab)

Preparing for new potential

Seeing some benefits…

Intellectually exciting

Challenges

Pros, Benefits

Many people and organizations to acknowledge

Dryad Consortium Board, journal partners, and data authors: NESCent: Kevin Clarke, Hilmar Lapp, Heather Piwowar, Peggy Schaeffer, Ryan

Scherle, Todd Vision UNC-CH <Metadata Research Center>: Jose R. Pérez-Agüera, Sarah Carrier,

Elena Feinstein, Jane Greenberg, Lina Huang, Robert Losee, Hollie White, Craig Willis

U British Columbia: Michael Whitlock / NCSU Digital Libraries: Kristin Antelman HIVE: Library of Congress, USGS, and The Getty Research Institute; and

workshop hosts Yale/TreeBASE: Youjun Guo, Bill Piel DataONE: Rebecca Koskela, Bill Michener, Dave Veiglais, Aida Gandara, and

many others British Library: Lee-Ann Coleman, Adam Farquhar, Brian Hole Oxford University: David Shotton Atmire.com: Mark Diggory

http://datadryad.org http://blog.datadryad.org http://datadryad.org/wiki http://code.google.com/p/dryad Facebook: Dryad Twitter: @datadryad