RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use...

31
Webinar Mapping the landscape of data standards, data repositories and data policies: Use case – The Life Sciences (biomed, agri, environ) Peter McQuilton, PhD On behalf of the joint RDA/Force11 BioSharing WG

Transcript of RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use...

Page 1: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences

WebinarMapping the landscape of data standards, data repositories and

data policies: Use case – The Life Sciences (biomed, agri, environ)

Peter McQuilton, PhD

On behalf of the joint RDA/Force11 BioSharing WG

Page 2: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences

What were the aims of the RDA/Force11 BioSharing WG

• To develop guidelines for linking information on databases, content standards and journal and funder data policies in the life sciences

• To develop a curated registry (running since 2011),to access and cross-search this information, suchthat a variety of stakeholders can make decisionson which standards and databases to use orendorse

https://www.biosharing.org

Page 3: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences

What were the aims of the RDA/Force11 BioSharing WG

• To develop guidelines for linking information on databases, content standards and journal and funder data policies in the life sciences

• To develop a curated registry (running since 2011),to access and cross-search this information, suchthat a variety of stakeholders can make decisionson which standards and databases to use orendorse

https://www.biosharing.orghttps://www.FAIRsharing.org

In preparation over the summer

Page 4: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences
Page 5: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences

• A web-based, curated and searchable portal that monitors the

development and evolution of standards, their use in

databases and the adoption of both in data policies,

to inform and educate

Page 6: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences

Data policies Databases

Content standards

Formats Terminologies Guidelines

Models/Formats = Conceptual

model, conceptual schema,

exchange formats

Terminologies = Controlled

vocabularies, taxonomies,

thesauri, ontologies etc.

Guidelines = Minimum information

reporting requirements, checklists

Mapping a complex and evolving landscape

Page 7: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences

Content standards

Data policies Databases

Formats Terminologies Guidelines

By grass-roots

groups, e.g.:By standard

organizations, e.g.:

Mapping a complex and evolving landscape

Page 8: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences

Formats Terminologies Guidelines

225

115

554

source sourcesource

MIAME

MIRIAMMIQAS

MIXMIGEN

ARRIVEMIAPE

MIASE

MIQE

MISFISHIE….

REMARK

CONSORT

SRAxml

SOFT FASTA

DICOM

MzMLSBRML

SEDML…

GELML

ISA

CML

MITAB

AAO

CHEBIOBI

PATO ENVO

MOD

BTO

IDO…

TEDDY

PRO

XAO

DO

VO

Content standards in numbers

Statistics as of May 2017

Page 9: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences

Understanding how standards are used

Page 10: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences

Guideline

Understanding how standards are used

Page 11: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences

Formats

Guideline

Understanding how standards are used

Page 12: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences

Formats

Guideline

Formats

Understanding how standards are used

Page 13: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences

Formats

Guideline

Formats

Terminology

Understanding how standards are used

Page 14: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences

My funder’s data policy recommends the use of established standards, but which are widely endorsed and applicable to my crop data?

Am I using the most up-to-date version of this terminology to annotate cell-based assays?

I understand this format has been deprecated; what has it been replaced by and is it appropriate for this data?

Which are the mature standardsand standards-compliant databasesthat we should recommend to our authors?

Helping users make the right decision

Page 15: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences
Page 16: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences
Page 17: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences

Training

Page 18: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences

Linking standards and databases to training material

Page 19: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences
Page 20: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences

Ready for use, implementation, or recommendation

In development

Status uncertain

Deprecated as subsumed or superseded

Manually curated and verified

by the community behind each

resource

Mapping a complex and evolving landscape – indicators of status

Page 21: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences

Collections group together one or

more types of resource by

domain, project or organization.

Recommendations are a core-set

of resources that are selected and

recommended by a funder or

journal data policy.

All collections and

recommendations created

between BioSharing and the

collection/recommendation

owner.

Grouping records

Page 22: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences
Page 23: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences

Data Policy

Visualizing relationships

Page 24: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences
Page 25: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences

“The interactive browser will allow us to discover which databases and editorial standards are not currently included in our author guidelines, enabling us to regularly monitor and refine our policies as appropriate, in support of our mission to help our authors enhance the reproducibility of their work.” – Holly, F1000

Page 26: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences

Collections group together one or

more types of resource by

domain, project or organization.

Recommendations are a core-set

of resources that are selected and

recommended by a funder or

journal data policy.

All collections and

recommendations created

between BioSharing and the

collection/recommendation

owner.

Grouping records

Page 27: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences
Page 28: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences

Standard developing groups:Journal, publishers:

Cross-links, data exchange:

Societies and organisations: Institutional RDM services:

Projects, programmes:

Working with the community/adopters

Page 29: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences
Page 30: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences

Acknowledgements

Page 31: RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use case - Life Sciences

Thank-you!Any questions?

[email protected]

Acknowledgements