FAIRness through a novel combination of Web technologies

33
RDA FAIR data management November 14-15, 2016 FAIRness through a novel combination of Web technologies Dr. Erik Schultes LCDS

Transcript of FAIRness through a novel combination of Web technologies

Page 1: FAIRness through a novel combination of Web technologies

RDA FAIR data managementNovember 14-15, 2016

FAIRness through a novel combination of Web technologies

Dr. Erik SchultesLCDS

Page 2: FAIRness through a novel combination of Web technologies

RDA FAIR data managementNovember 14-15, 2016

Findable: F1. (meta)data are assigned a globally unique and persistent identifier;

F2. data are described with rich metadata;

F3. metadata clearly and explicitly include the identifier of the data it describes;

F4. (meta)data are registered or indexed in a searchable resource;

Accessible: A1. (meta)data are retrievable by their identifier using a standardized communications protocol;

A1.1 the protocol is open, free, and universally implementable;

A1.2. the protocol allows for an authentication and authorization procedure, where necessary;

A2. metadata are accessible, even when the data are no longer available;

Interoperable: I1. (meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation.

I2. (meta)data use vocabularies that follow FAIR principles;

I3. (meta)data include qualified references to other (meta)data;

Reusable: R1. meta(data) are richly described with a plurality of accurate and relevant attributes;

R1.1. (meta)data are released with a clear and accessible data usage license;

R1.2. (meta)data are associated with detailed provenance;

R1.3. (meta)data meet domain-relevant community standards;

What is FAIR ?

Page 3: FAIRness through a novel combination of Web technologies

RDA FAIR data managementNovember 14-15, 2016

FAIR is not a standard FAIR is not equal to RDF FAIR is not for humans only FAIR is not equal to ‘Semantic Web’ FAIR is not equal to ‘Open’

What FAIR is not.

Page 4: FAIRness through a novel combination of Web technologies

RDA FAIR data managementNovember 14-15, 2016

Page 5: FAIRness through a novel combination of Web technologies

RDA FAIR data managementNovember 14-15, 2016

Page 6: FAIRness through a novel combination of Web technologies

RDA FAIR data managementNovember 14-15, 2016

Page 7: FAIRness through a novel combination of Web technologies

Data Owners

Data Consumers

3rd Party Application & Services

Page 8: FAIRness through a novel combination of Web technologies

Data Owners

Data Consumers

3rd Party Application & Services

Interoperable Data

Layer

Page 9: FAIRness through a novel combination of Web technologies

FAIR Data Resourcesdatasets

ontologiescontrolled vocabularies

taxonomies…

FAIR Data applicationsFAIR Data Point

FAIRifierData FAIRport

FAIR Data search engine…

FAIR Data ProtocolFAIR Profiles

FAIR accessorAPI spec

PIDsLDP…

Any task:email, www, phoneSMTP, HTTP, RTP

TCP, UDP

Any medium:

Internet Protocol (IP)

ethernet, PPPCMSA, async, sonetcopper, glass, radio

FAIR Data inspired by Internet

“Hourglass Architecture”

Page 10: FAIRness through a novel combination of Web technologies

FAIR Data Resourcesdatasets

ontologiescontrolled vocabularies

taxonomies…

FAIR Data applicationsFAIR Data Point

FAIRifierData FAIRport

FAIR Data search engine…

FAIR Data ProtocolFAIR Profiles

FAIR accessorAPI spec

PIDsLDP…

Any application: make data actionable

Any data: large volumes

heterogeneouscomplex

FAIR Principlesuses existing technology& standards

FAIR Data inspired by Internet

Page 11: FAIRness through a novel combination of Web technologies

FAIR Data Resourcesdatasets

ontologiescontrolled vocabularies

taxonomies…

FAIR Data applicationsFAIR Data Point

FAIRifierData FAIRport

FAIR Data search engine…

FAIR Data ProtocolFAIR Profiles

FAIR accessorAPI spec

PIDsLDP…

FAIR Data inspired by Internet

Page 12: FAIRness through a novel combination of Web technologies

FAIR Data Resourcesdatasets

ontologiescontrolled vocabularies

taxonomies…

FAIR Data applicationsFAIR Data Point

FAIRifierData FAIRport

FAIR Data search engine…

FAIR Data ProtocolFAIR Profiles

FAIR accessorAPI spec

PIDsLDP…

• Easy part: combines existing methods & technology

• Hard Part: ontology engineering & data modeling

• Uptake: primarily social challenge (top down / bottom up)

• Internet of Data

• Open Science

Page 13: FAIRness through a novel combination of Web technologies

RDA FAIR data managementNovember 14-15, 2016

FAIR data point specificationRelease Beta 1.0 https://docs.google.com/document/d/1WZE9oTUn_aPSRZhOur1GjWZ5vYxLsKJQqzmpWMxNpcs/edit

metadata-controller http://dev-vm.fair-dtls.surf-hosted.nl:8082/fdp/swagger-ui.html

FAIR data point API

Page 14: FAIRness through a novel combination of Web technologies

FAIR Data Point

@prefix dcat: <http://www.w3.org/ns/dcat#> . @prefix dct: <http://purl.org/dc/terms/> . @prefix lang: <http://id.loc.gov/vocabulary/iso639-1/> . @prefix fdpo: <http://www.dtls.nl/ontologies/fdpDummy/> . @prefix ldp: <http://www.w3.org/ns/ldp#> . @prefix xsd: <http://www.w3.org/2001/XMLSchema#> . @prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> . @prefix fdp: <http://localhost/fdp/> .

<> dct:title "RD connect FAIR data point(beta)" ; dct:language lang:en; ldp:contains fdp:textmining ; dct:identifier "rdc-fdp" ; rdfs:label "RD connect FAIR data point(beta)" ; dct:description "This FDP provides metadata on textmining data sets." ; dct:title "DTL FAIR data point(beta)"; dct:license <http://rdflicense.appspot.com/rdflicense/cc-by-nc-nd3.0>; dct:hasVersion "0.1"; dct:publisher <http://orcid.org/0000-0002-1215-167X>; fdpo:contact <http://orcid.org/0000-0002-1215-167X>; fdpo:APIVersion "0.1 Beta" .

Page 15: FAIRness through a novel combination of Web technologies

RDA FAIR data managementNovember 14-15, 2016

FAIR Data PointCatalog 1

API

DATASET 1

DIST 1

Catalog 2

DATASET 2 DATASET 3

DIST 2 DIST 3 DIST 4 DIST 5

Page 16: FAIRness through a novel combination of Web technologies

RDA FAIR data managementNovember 14-15, 2016

FDPtitle language identifier label description license version publisher contant API version

Cataloglabel title identifier language version

Datasettitle label identifier language version description keywords landingpage publisher theme creator

Distributiontitle label identifier version license access url mediatype

FAIR Data Point Metadata

Page 17: FAIRness through a novel combination of Web technologies

RDA FAIR data managementNovember 14-15, 2016

Page 18: FAIRness through a novel combination of Web technologies

RDA FAIR data managementNovember 14-15, 2016

inhibitors p65

Page 19: FAIRness through a novel combination of Web technologies

RDA FAIR data managementNovember 14-15, 2016

FAIR Data Query Siri?

Yes, master!

Please find me all known low molecular weight inhibitors of the Human p65 Protein. Please separate the list based on those that were found in curated databases, and those that were found in self-deposited data archives. Also, keep track of the license and citation information for each one. If you find data that is relevant, but not public, please provide me with the contact information for the person I need to request the data from.

Mark D. Wilkinson CBGP-UPM/INIA, Madrid

Page 20: FAIRness through a novel combination of Web technologies

RDA FAIR data managementNovember 14-15, 2016

FAIRfor

data

Page 21: FAIRness through a novel combination of Web technologies

RDA FAIR data managementNovember 14-15, 2016

PRIMED

FAIR

for alg

orithms

for data

Page 22: FAIRness through a novel combination of Web technologies

RDA FAIR data managementNovember 14-15, 2016

FAIR Data Training

Track 1: Awareness building - Problems & Responsibilities - What is data stewardship ? - What is FAIR ?

Track 2: Skill building - BYOD for data - FAIR Hackathons for applications - Data stewardship planing hackathons

http://www.dtls.nl/fair-data/fair-data-training/

Page 23: FAIRness through a novel combination of Web technologies

3

PRESENTERS

https://www.youtube.com/watch?v=sSQPY5Mc5Rs

Page 24: FAIRness through a novel combination of Web technologies

RDA FAIR data managementNovember 14-15, 2016

The NAR online Molecular Biology Database Collection has been updated with the addition of 88 new resources and removal of 23 obsolete websites, which brought the current listing to 1685 databases.

BYOD

Page 25: FAIRness through a novel combination of Web technologies

RDA FAIR data managementNovember 14-15, 2016

Five FAIR Use Cases

Andrew Stubbs

David van Enckevort

Jildau Bouwman

Arie Baak

born FAIR

https://www.youtube.com/watch?v=hioPJzTsdtg

FAIR Hackathon

becoming FAIR

not yet FAIR

Derk Arts

FAIR

ness

Page 26: FAIRness through a novel combination of Web technologies

RDA FAIR data managementNovember 14-15, 2016

Bedankt

Page 27: FAIRness through a novel combination of Web technologies

RDA FAIR data managementNovember 14-15, 2016

1. FAIR Principles address barriers to data reuse.

2. FAIR Principles stand on the shoulders of giants. FAIR technology is largely combinations of existing technology.

3. FAIR Enough! There are easy ways to make any dataset ‘more FAIR’.

4. In FAIR, no one is a follower, everyone is a collaborator.

Page 28: FAIRness through a novel combination of Web technologies

RDA FAIR data managementNovember 14-15, 2016

Page 29: FAIRness through a novel combination of Web technologies

RDA FAIR data managementNovember 14-15, 2016

Page 30: FAIRness through a novel combination of Web technologies

RDA FAIR data managementNovember 14-15, 2016

Page 31: FAIRness through a novel combination of Web technologies

RDA FAIR data managementNovember 14-15, 2016

Page 32: FAIRness through a novel combination of Web technologies

https://vimeo.com/138977162 http://www.dtls.nl/events/personal-health-train-workshop/

Personal Health Train

Page 33: FAIRness through a novel combination of Web technologies

RDA FAIR data managementNovember 14-15, 2016

kinds of data- files (excel)- databases- document base (database of files)- repositories- streaming data