Metadata as Standard: improving Interoperability through the Research Data Alliance

29
Metadata INTEREST GROUP Metadata as standard Improving interoperability through the Research Data Alliance Alex Ball University of Bath 8 September 2017 8 September 2017 rd-alliance.org

Transcript of Metadata as Standard: improving Interoperability through the Research Data Alliance

Page 1: Metadata as Standard: improving Interoperability through the Research Data Alliance

MetadataINTEREST GROUP

Metadata as standardImproving interoperability through the Research Data Alliance

Alex Ball

University of Bath

8 September 2017

8 September 2017 rd-alliance.org

Page 2: Metadata as Standard: improving Interoperability through the Research Data Alliance

Metadata

Page 3: Metadata as Standard: improving Interoperability through the Research Data Alliance

Metadata basics

What is metadata?

∠ Literally ‘data about data’

∠ Information that helps you work with other information

In the research context, we are mostly concerned with

∠ Discoverymetadata – help other researchers find the data, and givecredit for them→ impact

∠ Contextual metadata – keeping the institution and funder happy,conveying quality and relevance

∠ Structural & semantic metadata – ensure that researchers canunderstand and use/reuse the data

8 September 2017 rd-alliance.org

Page 4: Metadata as Standard: improving Interoperability through the Research Data Alliance

Why should I use a metadatastandard?

Page 5: Metadata as Standard: improving Interoperability through the Research Data Alliance

Better discovery

versus

8 September 2017 rd-alliance.org

Page 6: Metadata as Standard: improving Interoperability through the Research Data Alliance

Better context

versus

8 September 2017 rd-alliance.org

Page 7: Metadata as Standard: improving Interoperability through the Research Data Alliance

Better reuse

|

versus

|

8 September 2017 rd-alliance.org

Page 8: Metadata as Standard: improving Interoperability through the Research Data Alliance

Better ecosystem

Less working things out from scratch

More complete metadata

Benefits of practising

Better documentation of the standards

Concentration of development attention and effort

Better time-saving tools

etc., etc.

8 September 2017 rd-alliance.org

Page 9: Metadata as Standard: improving Interoperability through the Research Data Alliance

So why doesn’t everyone use ametadata standard?

Page 10: Metadata as Standard: improving Interoperability through the Research Data Alliance

No suitable standard?

0 100 200 300 400 500 600 700

DIFDwCDC

OtherFDGCEML

Open GISISO

My labNone 56.1%

22.1%8.0%8.0%7.9%7.9%6.8%

2.2%1.7%1.0%

Responses (N = 1205/1329)

Metad

atastan

dardsu

sed

Source: Tenopir, C. et al. (2011), ‘Data Sharing by Scientists: Practices and Perceptions’, PLoSONE 6/6: e21101. doi: 10.1371/journal.pone.0021101

8 September 2017 rd-alliance.org

Page 11: Metadata as Standard: improving Interoperability through the Research Data Alliance

Toomany standards?

Source: cbn Randall Munroe

The nice thing about standards is thatyou have so many to choose from

Source: Tanenbaum, A. S. (1988), Computer Networks, (2nd edn., UpperSaddle River, NJ: Prentice-Hall): p. 254

8 September 2017 rd-alliance.org

Page 12: Metadata as Standard: improving Interoperability through the Research Data Alliance

Isn’t that, like, really hard?

Just fill out this simple form . . .

<mods xmlns="http://www.loc.gov/mods/v3" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"xsi:schemaLocation="http://www.loc.gov/mods/v3http://www.loc.gov/standards/mods/v3/mods-3-4.xsd"> <titleInfo> <title> Title goes here </title></titleInfo> <name type="personal"> <namePart>Author name goes here</namePart> <role> <roleTermtype="text">Author</roleTerm> </role> </name> <typeOfResource>dataset</typeOfResource><genre>Dataset</genre> <originInfo> <publisher>Publisher name goes here</publisher></originInfo> <language> <languageTerm type="text">Language name</languageTerm> <languageTermtype="code" authority="iso639-2b">ISO 639-2b code</languageTerm> </language><physicalDescription> <internetMediaType>MIME type goes here, repeat asnecessary</internetMediaType> <digitalOrigin>born digital</digitalOrigin> <extent>Number ofrecords in your database, or size of file in bytes</extent> </physicalDescription> <abstract>Abstract goes here </abstract> <subject authority="scheme name goes here"> <topic>Keyword goeshere, repeat as necessary</topic> <cartographics>Spatial coordinates<cartographics/><temporal>Temporal extente</temporal> <geographic>Spatial extent in words</geographic></subject> <identifier>ID goes here</identifier> <location> <url usage="primary display"access="object in context">Location of record</url> <url access="raw object">Location fordownload</url> </location> <accessCondition type="useAndReproduction"> Usage restrictions orpermissions </accessCondition> <relatedItem ID="relatedMaterials"> <location> <urlusage="primary display" access="object in context">Record of related item</url> </location></relatedItem> <note type="citation"> Sample citation goes here </note> <notetype="software">Required software goes here</note> <subject ID="location"displayLabel="Description of spatial extent again"> <cartographics> <coordinates> List ofcoordinates, comma separated </coordinates> </cartographics> <topic>Type of coordinates goeshere</topic> </subject> </mods>

8 September 2017 rd-alliance.org

Page 13: Metadata as Standard: improving Interoperability through the Research Data Alliance

Metadata Standards Catalog

Page 14: Metadata as Standard: improving Interoperability through the Research Data Alliance

RDA Metadata Standards Directory WG

Key facts

∠ Ran 1 August 2013 – 1 February 2015

∠ 150 members frommany countries and disciplines

Goals1. Develop an RDAMetadata Standards Directory listing standards

relevant for research data– Comprehensive– Easy for anyone to contribute or update

2. Define and develop use cases for research metadata

3. Develop plan for long-term growth andmaintenance of the directory

8 September 2017 rd-alliance.org

Page 15: Metadata as Standard: improving Interoperability through the Research Data Alliance

Existing work

DCCDisciplinaryMetadataCatalogue

Science DataLiteracyProject

SDAPSS

SeeingStandards

MMI ContentStandardReferences

BioSharing

GEOSS SIR

specialist

general

static

updated

8 September 2017 rd-alliance.org

Page 16: Metadata as Standard: improving Interoperability through the Research Data Alliance

The Metadata Standards Directory

Disciplinary Metadata

Contact us

Search

Home > Resources for digital curators > Disciplinary Metadata

Disciplinary Metadata

While data curators, and increasingly researchers, know that good metadata is key for research data access andre-use, figuring out precisely what metadata to capture and how to capture it is a complex task. Fortunately, manyacademic disciplines have supported initiatives to formalise the metadata specifications the community deems to berequired for data re-use. This page provides links to information about these disciplinary metadata standards, includingprofiles, tools to implement the standards, and use cases of data repositories currently implementing them.

For those disciplines that have not yet settled on a metadata standard, and for those repositories that work with dataacross disciplines, the General Research Data section links to information about broader metadata standards that havebeen adapted to suit the needs of research data.

Search by Discipline

Biology Earth Science General Research Data

Physical Science Social Science & Humanities

Search by Resource TypeMetadata Standards

Specifications for the minimum information that should be collected about research data in order for it to be re-used.

Profiles and ExtensionsStandards that have been adapted for use in particular types of repositories, or for particular types of data.

Use casesInstitutional repositories and data portals using standards to determine which metadata should be collected upondata deposit.

ToolsSoftware that has been developed to capture or store metadata conforming to a specific standard.

In this sectionBriefing Papers

How-to Guides

Developing RDM Services

Curation Lifecycle Model

Curation Reference Manual

Policy and legal

Data Management Plans

Tools

Case studies

Repository audit and assessment

Standards

Disciplinary Metadata

DIFFUSE

Publications and presentations

Roles

Curation journals

Informatics research

External resources

Home Digital curation About us News Events Resources Training Projects Community

Disciplinary Metadata | Digital Curation Centre http://www.dcc.ac.uk/resources/metadata-standards

1 of 2 08/01/14 17:25

RDA Metadata Standards Directory

http://www.dcc.ac.uk/resources/metadata-standards

http://rd-alliance.github.io/metadata-directory/

8 September 2017 rd-alliance.org

Page 17: Metadata as Standard: improving Interoperability through the Research Data Alliance

But there is more to be done . . .

∠ Search, not just browse

∠ Access data with machine-to-machine protocols

∠ Richer information

– versions, mapping directionality, endorsements

– greater use of entity relationships

∠ More services

– Extracting what you need from compliant metadata . . .

– Calculating migration pathways . . .

– Comparing elements in different schemes . . .

– Generating ‘first-pass’ converters . . .

8 September 2017 rd-alliance.org

Page 18: Metadata as Standard: improving Interoperability through the Research Data Alliance

The Metadata Standards Catalog

Is this the right one for me?

∠ Name

∠ Description

∠ Research area

∠ Data type

∠ Maintainer, funder

∠ Endorsements

How do I use it?

∠ User guide

∠ Specification

8 September 2017 rd-alliance.org

Page 19: Metadata as Standard: improving Interoperability through the Research Data Alliance

The Metadata Standards Catalog

How do I refer to it/find it again?

∠ Identifiers

Is this the right one for me?

∠ Version history

∠ Parent/child schemes

Can I convert existing metadatato it? Will I be locked in?

∠ Mappings to/from otherschemes

8 September 2017 rd-alliance.org

Page 20: Metadata as Standard: improving Interoperability through the Research Data Alliance

The Metadata Standards Catalog

How do I use it?

∠ Software

∠ Services

∠ Known users

∠ Sample records

8 September 2017 rd-alliance.org

Page 21: Metadata as Standard: improving Interoperability through the Research Data Alliance

The Metadata Standards Catalog

8 September 2017 rd-alliance.org

Page 22: Metadata as Standard: improving Interoperability through the Research Data Alliance

The Metadata Standards Catalog

8 September 2017 rd-alliance.org

Page 23: Metadata as Standard: improving Interoperability through the Research Data Alliance

Future developmentsGU

I

Highlight standards bodies

GUI

Dynamic filtering while browsing

GUI

Side-by-side specifications

API

Query standards by their elements

GUI

Version history as timeline

GUI

Search by article DOI

GUI

Showmaturity rating for schemes

API

Query by element value encoding

API

Query by article DOI

API

Calculate crosswalks

https://www.rd-alliance.org/groups/metadata-standards-catalog-working-group.html

8 September 2017 rd-alliance.org

Page 24: Metadata as Standard: improving Interoperability through the Research Data Alliance

Canonical metadata packages

Page 25: Metadata as Standard: improving Interoperability through the Research Data Alliance

Recommended Metadata Element Set

Dataset

Unique IdentifierName/titleDescriptionKeywordsSpatial coordinatesTemporal coordinatesLocation (e.g. URL)Medium/formatAvailability (e.g. licence)SchemaQualityProvenance

Person

Originator

Activity

Project

Related publicationsRelated softwareCitations

FacilityEquipment

8 September 2017 rd-alliance.org

Page 26: Metadata as Standard: improving Interoperability through the Research Data Alliance

Unpacking the elements

Example: spatial coordinates

∠ X, Y, Z in declared coordinate system– May be connected with temporal coordinate

∠ Precision

∠ Accuracy

∠ Resolution

Need to unpack all elements and validate the result

∠ Join in: https://www.rd-alliance.org/groups/metadata-ig.html

∠ Hope to publish as an RDA output

∠ Basis for converters?

8 September 2017 rd-alliance.org

Page 27: Metadata as Standard: improving Interoperability through the Research Data Alliance

Final thoughts

Page 28: Metadata as Standard: improving Interoperability through the Research Data Alliance

Metadata→ better data

∠ Even bad documentation is better than nothing

∠ Themore structure, the better

– Clear headings and sections in documentation

– Consistent metadata

∠ Look formetadata standards you can use

– Metadata Standards Directory/Catalog

∠ Not an exact fit? Create a local profile

– Avoid completely bespoke schemes

∠ Be consistent

8 September 2017 rd-alliance.org

Page 29: Metadata as Standard: improving Interoperability through the Research Data Alliance

MetadataINTEREST GROUP

Thank you for your attention

Metadata Interest Group:https://www.rd-alliance.org/groups/metadata-ig.html

8 September 2017 rd-alliance.org