RDA Webinar - BioSharing - mapping the landscape of data standards, repositories and policies - use...

Post on 21-Jan-2018


Webinar: Mapping the landscape of data standards, data repositories and data policies: Use case – The Life Sciences (biomed, agri, environ)

Peter McQuilton, PhD

On behalf of the joint RDA/Force11 BioSharing WG

What were the aims of the RDA/Force11 BioSharing WG

• To develop guidelines for linking information on databases, content standards and journal and funder data policies in the life sciences

• To develop a curated registry (running since 2011), to access and cross-search this information, such that a variety of stakeholders can make decisions on which standards and databases to use or endorse

https://www.biosharing.org

https://www.FAIRsharing.org

In preparation over the summer

• A web-based, curated and searchable portal that monitors the development and evolution of standards, their use in databases and the adoption of both in data policies, to inform and educate

Data policies Databases

Content standards

Formats Terminologies Guidelines

• Models/Formats = Conceptual model, conceptual schema, exchange formats

• Terminologies = Controlled vocabularies, taxonomies, thesauri, ontologies etc.

• Guidelines = Minimum information reporting requirements, checklists

Mapping a complex and evolving landscape

Content standards

Data policies Databases

Formats Terminologies Guidelines

By grass-roots groups, e.g.:

By standard organizations, e.g.:

Mapping a complex and evolving landscape

Formats Terminologies Guidelines

225

115

554

Guidelines: MIAME, MIRIAM, MIQAS, MIX, MIGEN, ARRIVE, MIAPE, MIASE, MIQE, MISFISHIE, REMARK, CONSORT, …

Formats: SRAxml, SOFT, FASTA, DICOM, MzML, SBRML, SEDML, GELML, ISA, CML, MITAB, …

Terminologies: AAO, CHEBI, OBI, PATO, ENVO, MOD, BTO, IDO, TEDDY, PRO, XAO, DO, VO, …

Content standards in numbers

Statistics as of May 2017

Understanding how standards are used

Formats

Guideline

Formats

Terminology

Understanding how standards are used

My funder’s data policy recommends the use of established standards, but which are widely endorsed and applicable to my crop data?

Am I using the most up-to-date version of this terminology to annotate cell-based assays?

I understand this format has been deprecated; what has it been replaced by and is it appropriate for this data?

Which are the mature standards and standards-compliant databases that we should recommend to our authors?

Helping users make the right decision

Training

Linking standards and databases to training material

• Ready for use, implementation, or recommendation

• In development

• Status uncertain

• Deprecated as subsumed or superseded

Manually curated and verified by the community behind each resource

Mapping a complex and evolving landscape – indicators of status
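
The four status indicators above could be modelled as a small enumeration attached to each record. This is only a minimal sketch: the names `Status`, `Record` and `usable`, and the placeholder entries, are assumptions for illustration and are not taken from BioSharing's actual data model or API.

```python
from dataclasses import dataclass
from enum import Enum


class Status(Enum):
    """The four status indicators listed on the slide."""
    READY = "Ready for use, implementation, or recommendation"
    IN_DEVELOPMENT = "In development"
    UNCERTAIN = "Status uncertain"
    DEPRECATED = "Deprecated as subsumed or superseded"


@dataclass
class Record:
    """Hypothetical registry record; field names are illustrative only."""
    name: str
    status: Status


def usable(records):
    """Return the names of records marked as ready for use."""
    return [r.name for r in records if r.status is Status.READY]


# Placeholder records, not real registry entries.
records = [
    Record("format-a", Status.READY),
    Record("format-b", Status.DEPRECATED),
    Record("guideline-c", Status.IN_DEVELOPMENT),
]
print(usable(records))
```

A filter like `usable` reflects the kind of question the indicators are meant to answer, for example whether a format has been deprecated before it is recommended to authors.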

Collections group together one or more types of resource by domain, project or organization.

Recommendations are a core set of resources that are selected and recommended by a funder or journal data policy.

All collections and recommendations are created jointly by BioSharing and the collection/recommendation owner.

Grouping records
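
One way to picture the distinction: a collection is a named group of records, while a recommendation is a group selected by a funder or journal data policy. The following is a minimal sketch under stated assumptions; the `Group` class, its fields, and every entry are hypothetical placeholders, not BioSharing's own schema or content.

```python
from dataclasses import dataclass, field


@dataclass
class Group:
    """Hypothetical grouping of registry records."""
    name: str
    owner: str   # domain, project, organization, funder or journal
    kind: str    # "collection" or "recommendation"
    records: set = field(default_factory=set)


def groups_containing(record, groups):
    """List the names of groups that include a given record, sorted."""
    return sorted(g.name for g in groups if record in g.records)


# Placeholder groups, not real collections or recommendations.
groups = [
    Group("Example project collection", "example-project", "collection",
          {"standard-a", "database-b"}),
    Group("Example journal policy", "example-journal", "recommendation",
          {"standard-a"}),
]
print(groups_containing("standard-a", groups))
```

Looking up the groups that contain a record mirrors how a user might check which policies or projects endorse a given standard or database.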

Data Policy

Visualizing relationships

“The interactive browser will allow us to discover which databases and editorial standards are not currently included in our author guidelines, enabling us to regularly monitor and refine our policies as appropriate, in support of our mission to help our authors enhance the reproducibility of their work.” – Holly, F1000

Standard developing groups:

Journals, publishers:

Cross-links, data exchange:

Societies and organisations:

Institutional RDM services:

Projects, programmes:

Working with the community/adopters

Acknowledgements

Thank you! Any questions?

Biosharing-contact-us@lists.sf.net
