How to publish genomic Data papers based on BOL data - Biodiversity Data Journal Lyubomir Penev Bulgarian Academy of Sciences & Pensoft Publishers ViBRANT ViBRANT Tools for DNA taxonomists, 11 June 2013, Brussels

Page 1:

How to publish genomic Data papers based on BOL data - Biodiversity Data Journal

Lyubomir Penev

Bulgarian Academy of Sciences & Pensoft Publishers

ViBRANT

ViBRANT Tools for DNA taxonomists, 11 June 2013, Brussels

Page 2:

Life cycle of data associated with biodiversity manuscripts

BIODIVERSITY MANUSCRIPT

Occurrence data

Genomic data

Image galleries

Morphometric data

Environmental data

Phylogenetic data

Any other data

XML MARK UP

Structured text (data!)

ARTICLES

Occurrence data

Taxon names

Taxon treatments

Plazi

BHL

Wiki COL

Bibliographies

Page 3:

The problem?

Page 4:

Primary data

Drawings: slavenapeneva.com

Page 5:

Primary data

Publishing and sharing of primary data

RE-USE of CONTENT

Page 6:

Key features

• Collaborative article authoring
• Online peer-review and editing
• Community peer review; options for “open” and “public” review
• Standard-compliant (DwC, NLM DTD)
• Biological Codes compliant article templates
• No lower/upper limit of manuscript size
• Semantically enhanced “articles of the future”
• Integrated with GBIF, EOL, Dryad, Scratchpads, etc.

ALL DATA MATTERS!

Page 7:

Multiple Data Publishing Model of BDJ

1. Supplementary data files downloadable from the journal’s website

2. Data deposited at specialized data repositories (Dryad, Pangaea)

3. Data published through data repositories but indexed and collated with other data (GenBank, GBIF IPT)

4. Data published in the form of marked-up and machine-readable text.

5. Extended use of multimedia and semantic enhancements

Page 8:

Types of genomic data publishing

• Data papers describing genome datasets
• Descriptions of “dark taxa”
• More??

Page 9:

For genomic data, let’s build on:

GBIF-Pensoft workflow for publishing occurrence data: data papers

Page 10:
Page 11:

Metadata based on the Ecological Metadata Language (EML)
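
A minimal sketch of what reading EML-based metadata could look like, assuming a heavily simplified, un-namespaced EML-style record. The sample XML, its values and the `read_eml` helper are illustrative assumptions, not output of any GBIF or Pensoft tool; real EML documents are namespaced and much richer.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified EML-style record (illustrative values only).
SAMPLE_EML = """<eml>
  <dataset>
    <title>Example barcode dataset</title>
    <creator>
      <individualName><givenName>Ada</givenName><surName>Example</surName></individualName>
    </creator>
    <abstract><para>Placeholder abstract text.</para></abstract>
  </dataset>
</eml>"""


def read_eml(xml_text):
    """Return title, creator names and abstract from a simplified EML string."""
    dataset = ET.fromstring(xml_text).find("dataset")
    creators = []
    for creator in dataset.findall("creator"):
        given = creator.findtext("individualName/givenName", "")
        sur = creator.findtext("individualName/surName", "")
        creators.append(f"{given} {sur}".strip())
    return {
        "title": dataset.findtext("title", ""),
        "creators": creators,
        "abstract": dataset.findtext("abstract/para", ""),
    }


metadata = read_eml(SAMPLE_EML)
```

The resulting dictionary holds the title, creator list and abstract, which are the kind of fields a data paper manuscript would reuse.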

Page 12:

Data publishing through Data Papers

Page 13:
Page 14:
Page 15:
Page 16:

We need to:

Identify the specific metadata descriptors used for genomic data and integrate these into the data paper concept

Page 17:

Then we need to:

Map the existing metadata descriptors for genomic data and automatically generate data paper manuscripts from them
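
The mapping step could be sketched roughly as below. The descriptor names in `SECTION_MAP` (`marker`, `repository`, and so on) and the `generate_manuscript` helper are hypothetical placeholders; the actual genomic descriptors still have to be identified, as noted above.

```python
# Hypothetical mapping from metadata descriptors to data paper section headings.
SECTION_MAP = [
    ("title", "Title"),
    ("abstract", "Abstract"),
    ("taxonomic_coverage", "Taxonomic coverage"),
    ("marker", "Genetic marker"),
    ("repository", "Data repository"),
]

PLACEHOLDER = "[to be completed by the authors]"


def generate_manuscript(metadata):
    """Render a plain-text data paper skeleton from a metadata dictionary.

    Descriptors that are missing or empty are flagged with a placeholder
    so that authors can fill them in before submission.
    """
    sections = []
    for key, heading in SECTION_MAP:
        value = metadata.get(key) or PLACEHOLDER
        sections.append(f"{heading}\n{value}")
    return "\n\n".join(sections)


# Illustrative input only; not a real dataset.
example = {"title": "Example barcode dataset", "marker": "COI", "repository": "GenBank"}
draft = generate_manuscript(example)
```

Keeping the mapping in a single table means new descriptors can be added without touching the rendering logic.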

Page 18:

The testbed

ViBRANT special issue:

DNA barcoding: a practical tool for fundamental and applied biodiversity research

Edited by: Zoltan Nagy, Kurt Jordaens, Marc de Meyer, Thierry Backeljau

Page 19:
Page 20:
Page 21:

Rod Page’s ‘Dark Taxa’:

R. Page, iPhylo blogspot, 12 April 2011

Page 22:

Cumulative Records in GenBank

Growth of COI BARCODE vs. Cyt B, all taxa

Courtesy: David Schindel

Page 23:

Bishop Museum – 19 June 2008

Barcode Sequence

Voucher Specimen

Species Name

Specimen Metadata

Literature (link to content or citation)

BARCODE Records

Indices - Catalogue of Life - GBIF/ECAT

Nomenclators - Zoo Record - IPNI - NameBank

Publication links - New species

Georeference

Habitat

Character sets

Images

Behavior

Other genes

Trace files

Other Databases

Phylogenetic

Pop’n Genetics

Ecological

Primers

Databases - Provisional sp.
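
As a rough illustration, the core of a BARCODE record as laid out on this slide could be modelled like this. The `BarcodeRecord` class, its field names and the `missing_core_fields` check are assumptions made for this sketch and do not reproduce any official BARCODE schema.

```python
from dataclasses import dataclass, field


@dataclass
class BarcodeRecord:
    """Hypothetical container for the core elements named on the slide."""
    sequence: str
    voucher_specimen: str
    species_name: str = ""  # may be empty for a not yet named ("dark") taxon
    specimen_metadata: dict = field(default_factory=dict)
    literature: list = field(default_factory=list)

    def missing_core_fields(self):
        """List core fields that are still empty."""
        core = ("sequence", "voucher_specimen", "species_name")
        return [name for name in core if not getattr(self, name)]


# Illustrative values only.
record = BarcodeRecord(sequence="ACGT", voucher_specimen="VOUCHER-0001")
gaps = record.missing_core_fields()
```

A record with a sequence and a voucher but no species name is the situation the “dark taxa” slides address.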

Page 24:

Description of “dark” taxa

Dark taxon sequenced

Metadata: voucher specimen, images, locality, etc.

Automated submission to Pensoft Writing Tool

PWT – COLLABORATIVE ARTICLE AUTHORING TOOL

MANUSCRIPT FINALISATION & SUBMISSION

BDJ – PEER-REVIEW

MANUSCRIPT PUBLISHED

Automated update of bibliographic metadata, taxon name, ZooBank record, etc.

Page 25:

Diagnostic characters of Study Specimens:
• Traits, Sequences

Taxonomic Units:
• Clusters, OTUs

Formal Names (sometimes, with varying certainty)

Reverse Taxonomy

Diagnostic characters In Reference Databases:
• GenBank, MorphBank

Nomenclatural Precedence

Courtesy: David Schindel

Page 26:

We have some tools and workflows for that…

Page 27: