BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May 2017
![Page 1: BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May 2017](https://reader034.fdocuments.net/reader034/viewer/2022051521/5a64c37f7f8b9a6d5d8b48af/html5/thumbnails/1.jpg)
Susanna-Assunta Sansone, Associate Director, Oxford e-Research Centre, University of Oxford, UK
dx.doi.org/10.6084/m9.figshare.4055496.v1
@biosharing
bioCADDIE – DATS and CDEs Workshop, Bethesda, 8 May 2017
![Page 2: BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May 2017](https://reader034.fdocuments.net/reader034/viewer/2022051521/5a64c37f7f8b9a6d5d8b48af/html5/thumbnails/2.jpg)
Formats Terminologies Guidelines
Common Data Elements
Types of content standards
Content standards: descriptors essential for interpretation, verification, reproducibility, sharing etc. of datasets
![Page 3: BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May 2017](https://reader034.fdocuments.net/reader034/viewer/2022051521/5a64c37f7f8b9a6d5d8b48af/html5/thumbnails/3.jpg)
Guidelines: minimum information reporting requirements, checklists
o Report the same core, essential information
o e.g. MIAME guidelines
Terminologies: controlled vocabularies, taxonomies, thesauri, ontologies etc.
o Unambiguous identification and definition of concepts
o e.g. Gene Ontology
Formats: conceptual model, schema, exchange formats etc.
o Define the structure and interrelation of information, and the transmission format
o e.g. FASTA
Types of content standards
Common Data Elements
![Page 4: BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May 2017](https://reader034.fdocuments.net/reader034/viewer/2022051521/5a64c37f7f8b9a6d5d8b48af/html5/thumbnails/4.jpg)
de jure: standard organizations
de facto: grass-roots groups
Nanotechnology Working Group
Formats Terminologies Guidelines
Community-driven efforts, just a few examples
![Page 5: BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May 2017](https://reader034.fdocuments.net/reader034/viewer/2022051521/5a64c37f7f8b9a6d5d8b48af/html5/thumbnails/5.jpg)
Formats Terminologies Guidelines
224
115
500+
Guidelines: MIAME, MIRIAM, MIQAS, MIX, MIGEN, ARRIVE, MIAPE, MIASE, MIQE, MISFISHIE, REMARK, CONSORT, …
Formats: SRAxml, SOFT, FASTA, DICOM, MzML, SBRML, SEDML, GELML, ISA, CML, MITAB, …
Terminologies: AAO, CHEBI, OBI, PATO, ENVO, MOD, BTO, IDO, TEDDY, PRO, XAO, DO, VO, …
Content standards in numbers
![Page 6: BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May 2017](https://reader034.fdocuments.net/reader034/viewer/2022051521/5a64c37f7f8b9a6d5d8b48af/html5/thumbnails/6.jpg)
![Page 7: BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May 2017](https://reader034.fdocuments.net/reader034/viewer/2022051521/5a64c37f7f8b9a6d5d8b48af/html5/thumbnails/7.jpg)
![Page 8: BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May 2017](https://reader034.fdocuments.net/reader034/viewer/2022051521/5a64c37f7f8b9a6d5d8b48af/html5/thumbnails/8.jpg)
A web-based, curated and searchable portal that monitors the development and evolution of standards, their use in databases and the adoption of both in data policies, to inform and educate the user community
![Page 9: BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May 2017](https://reader034.fdocuments.net/reader034/viewer/2022051521/5a64c37f7f8b9a6d5d8b48af/html5/thumbnails/9.jpg)
Data policies by funders, journals and other organizations
Content standards
Formats Terminologies Guidelines
Map this complex and evolving landscape
Databases
![Page 10: BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May 2017](https://reader034.fdocuments.net/reader034/viewer/2022051521/5a64c37f7f8b9a6d5d8b48af/html5/thumbnails/10.jpg)
Data policies by funders, journals and other organizations
Databases
Content standards
Formats Terminologies Guidelines
Using indicators to describe ‘status’
Ready for use, implementation, or recommendation
In development
Status uncertain
Deprecated as subsumed or superseded
All records are manually curated in-house and verified by the community behind each resource
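The four status indicators above could be modeled as an enumeration, for example to filter a list of records down to those ready for use. This is a minimal sketch only: `Status`, `Record`, `usable` and the sample records are hypothetical names invented for illustration, not BioSharing's actual data model or API.

```python
from dataclasses import dataclass
from enum import Enum


class Status(Enum):
    """The four status indicators listed on the slide."""
    READY = "Ready for use, implementation, or recommendation"
    IN_DEVELOPMENT = "In development"
    UNCERTAIN = "Status uncertain"
    DEPRECATED = "Deprecated as subsumed or superseded"


@dataclass(frozen=True)
class Record:
    """Hypothetical registry record; not BioSharing's real schema."""
    name: str
    status: Status


def usable(records):
    """Return only the records flagged as ready."""
    return [r for r in records if r.status is Status.READY]


# Made-up records, for illustration only.
records = [
    Record("standard-a", Status.READY),
    Record("standard-b", Status.IN_DEVELOPMENT),
    Record("standard-c", Status.DEPRECATED),
]
ready_names = [r.name for r in usable(records)]
print(ready_names)
```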
![Page 11: BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May 2017](https://reader034.fdocuments.net/reader034/viewer/2022051521/5a64c37f7f8b9a6d5d8b48af/html5/thumbnails/11.jpg)
![Page 12: BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May 2017](https://reader034.fdocuments.net/reader034/viewer/2022051521/5a64c37f7f8b9a6d5d8b48af/html5/thumbnails/12.jpg)
![Page 13: BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May 2017](https://reader034.fdocuments.net/reader034/viewer/2022051521/5a64c37f7f8b9a6d5d8b48af/html5/thumbnails/13.jpg)
Understanding how standards are used
![Page 14: BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May 2017](https://reader034.fdocuments.net/reader034/viewer/2022051521/5a64c37f7f8b9a6d5d8b48af/html5/thumbnails/14.jpg)
Understanding how standards are used
Guideline
![Page 15: BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May 2017](https://reader034.fdocuments.net/reader034/viewer/2022051521/5a64c37f7f8b9a6d5d8b48af/html5/thumbnails/15.jpg)
Understanding how standards are used
Formats
Guideline
![Page 16: BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May 2017](https://reader034.fdocuments.net/reader034/viewer/2022051521/5a64c37f7f8b9a6d5d8b48af/html5/thumbnails/16.jpg)
Understanding how standards are used
Formats
Guideline
Formats
![Page 17: BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May 2017](https://reader034.fdocuments.net/reader034/viewer/2022051521/5a64c37f7f8b9a6d5d8b48af/html5/thumbnails/17.jpg)
Understanding how standards are used
Formats
Guideline
Formats
Terminology
![Page 18: BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May 2017](https://reader034.fdocuments.net/reader034/viewer/2022051521/5a64c37f7f8b9a6d5d8b48af/html5/thumbnails/18.jpg)
Technologically-delineated views of the world: transcriptomics, proteomics, metabolomics
Biologically-delineated views of the world: plant biology, epidemiology, microbiology
Generic features ('common core'):
o description of source biomaterial
o experimental design components
Technology labels in the diagram: arrays, scanning, columns, gels, MS, FTIR, NMR
Duplications & lack of interoperability among standards
![Page 19: BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May 2017](https://reader034.fdocuments.net/reader034/viewer/2022051521/5a64c37f7f8b9a6d5d8b48af/html5/thumbnails/19.jpg)
Hard to use them in combination, e.g. to represent:
Proteomics-based gut microbiota profiling
Proteomics- and metabolomics-based gut microbiota profiling
![Page 20: BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May 2017](https://reader034.fdocuments.net/reader034/viewer/2022051521/5a64c37f7f8b9a6d5d8b48af/html5/thumbnails/20.jpg)
Enhancing modularization
Proteomics-based gut microbiota profiling
Proteomics- and metabolomics-based gut microbiota profiling
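One possible reading of modularization is that each standard is split into a shared common core plus technology-specific and domain-specific modules, and a study's checklist is the union of the modules it needs. The sketch below illustrates that idea under stated assumptions: the module names follow the slide, but every descriptor inside the modules and the function `build_checklist` are invented placeholders, not taken from any published checklist.

```python
# Hypothetical modules: descriptor names are placeholders for illustration.
COMMON_CORE = {"source biomaterial", "experimental design"}

TECHNOLOGY_MODULES = {
    "proteomics": {"gel run", "MS acquisition"},
    "metabolomics": {"MS acquisition", "NMR acquisition"},
}

DOMAIN_MODULES = {
    "microbiology": {"host", "sampling site"},
}


def build_checklist(technologies, domains):
    """Union the common core with the requested modules.

    Using sets means a descriptor shared by several modules
    (here "MS acquisition") appears only once in the result.
    """
    checklist = set(COMMON_CORE)
    for tech in technologies:
        checklist |= TECHNOLOGY_MODULES[tech]
    for domain in domains:
        checklist |= DOMAIN_MODULES[domain]
    return sorted(checklist)


# Proteomics- and metabolomics-based gut microbiota profiling
combined = build_checklist(["proteomics", "metabolomics"], ["microbiology"])
print(combined)
```

The point of the set union is that overlapping requirements are stated once in a module rather than duplicated across standards.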
![Page 21: BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May 2017](https://reader034.fdocuments.net/reader034/viewer/2022051521/5a64c37f7f8b9a6d5d8b48af/html5/thumbnails/21.jpg)
Proteomics-based gut microbiota profiling
Proteomics- and metabolomics-based gut microbiota profiling
Enhancing modularization
![Page 22: BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May 2017](https://reader034.fdocuments.net/reader034/viewer/2022051521/5a64c37f7f8b9a6d5d8b48af/html5/thumbnails/22.jpg)
High-level information about the metadata standards
o MINSEQE, MIMARKS
o biosharing:ReportingGuideline
o bsg-000174, bsg-000161
Representations of the standards elements
o sample information, sample identifier, taxonomy identifier, sequence read, geo location
Template elements for
o el-000001, el-000002, el-000003
o provenance: MINSEQE; provenance: MINSEQE and MIMARKS; provenance: MIMARKS
• Serve machine-readable content metadata standards, providing provenance for their elements
• Inform the creation of metadata templates, rendering standards invisible to the researchers
Modularize and combine
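The idea of combining standards while keeping provenance for each element can be sketched as follows. The element names come from the slide, but which standard requires which element is an assumption made purely for this illustration, and `STANDARD_ELEMENTS` and `combine` are hypothetical names, not part of any BioSharing service.

```python
from collections import defaultdict

# Element names appear on the slide; their assignment to each standard
# is an ASSUMPTION made for this sketch.
STANDARD_ELEMENTS = {
    "MINSEQE": ["sample identifier", "sequence read"],
    "MIMARKS": ["sample identifier", "geo location"],
}


def combine(standards):
    """Merge element lists, recording which standards each element came from."""
    provenance = defaultdict(list)
    for standard in standards:
        for element in STANDARD_ELEMENTS[standard]:
            provenance[element].append(standard)
    return dict(provenance)


template = combine(["MINSEQE", "MIMARKS"])
for element, sources in template.items():
    print(f"{element}: provenance {' and '.join(sources)}")
```

A template built this way shows each shared element once, while still recording every standard that asks for it.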
![Page 23: BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May 2017](https://reader034.fdocuments.net/reader034/viewer/2022051521/5a64c37f7f8b9a6d5d8b48af/html5/thumbnails/23.jpg)
Standard developing groups:
Journals, publishers:
Cross-links, data exchange:
Societies and organisations:
Institutional RDM services:
Projects, programmes: