Sharing, linking and publishing biodiversity data the ViBRANT way

Scratchpad virtual research environments: sharing, linking and publishing biodiversity data the ViBRANT way Vince Smith 1 , Dave Roberts 1 & Lyubomir Penev 2 1. Natural History Museum, London 2. Pensoft Publishers, Sofia, Bulgaria

Presentation given by Dave Roberts at the pro-ibiosphere conference in Leiden on February 11, 2013

Transcript of Sharing, linking and publishing biodiversity data the ViBRANT way

Page 1: Sharing, linking and publishing biodiversity data the ViBRANT way

Scratchpad virtual research environments:

sharing, linking and publishing biodiversity data the ViBRANT way

Vince Smith¹, Dave Roberts¹ & Lyubomir Penev²

1. Natural History Museum, London

2. Pensoft Publishers, Sofia, Bulgaria

Page 2: Sharing, linking and publishing biodiversity data the ViBRANT way

Our informatics grand challenge…

“Link together evolutionary data… by developing analytical tools and proper documentation and then use this framework to conduct comparative analyses, studies of evolutionary process and biodiversity analyses”

Cyndy Parr, Rob Guralnick, Nico Cellinese and Rod Page. TREE. doi:10.1016/j.tree.2011.11.001

Page 3: Sharing, linking and publishing biodiversity data the ViBRANT way

This requires data, information & knowledge to be…

• Digital, not printed paper

• Openly accessible, not behind barriers

• Linked-up, not in silos

Page 4: Sharing, linking and publishing biodiversity data the ViBRANT way

• 15-20k new spp. described annually (2M total)¹

• 30k nomenclatural acts (12M total)¹

• 20k phylogenies (750k total)²

• 31k taxa sequenced (360k taxa total)³

• 800k BioMed papers (40M total pp. of taxonomy)⁴

• Countless specimens, images, maps, keys…

Most of our output is not digital, open or linked

Typically generated by small communities for “local” research projects

Figures from 1) Zhang, Zootaxa 2011 4, 1-4; 2) Web-of-Science; 3) Genbank and 4) PubMed.

Page 5: Sharing, linking and publishing biodiversity data the ViBRANT way

ViBRANT: Virtual Biodiversity

e-infrastructure, Seventh Framework Programme

Your data → Magic → Your web site

A website for you & your community

Page 6: Sharing, linking and publishing biodiversity data the ViBRANT way

What are Scratchpads?

• Hosted websites for biodiversity data

• Virtual research & publication platform

• Completely open access & open source

• Modular & flexible

Page 7: Sharing, linking and publishing biodiversity data the ViBRANT way

What Scratchpads are not!

• A single biodiversity database

• Restricted thematically, geographically or taxonomically

• A tool just for taxonomists

• Owned or controlled by anyone other than the data creator

Page 8: Sharing, linking and publishing biodiversity data the ViBRANT way

How are Scratchpads funded?

[Timeline: 2007, 2011, 2014, with funder logos including ViBRANT]

Page 9: Sharing, linking and publishing biodiversity data the ViBRANT way

Taxonomy & Literature: Lice, mosquitos, freeloader flies, ... (rapid upload and management of names, synonyms & bibliographic data)

Taxon descriptions & Publications: Freeloader flies, fungus gnats, ... (publication of Scratchpad data in the ZooKeys journal and export to Encyclopedia of Life)

eJournals: European Mosquito Bulletin, Phasmid Studies, ... (submission, review & dissemination of articles)

Characters, Phylogeny & Specimens: Termites, bryozoa, ... (character matrices exporting to SDD and Nexus format, phylogenies, specimen records & maps)

Image Galleries: Dragon trees, nanno fossils, cockroaches, fungi, polychaetes, ... (rapid upload, annotation & display of images)

Societies, Organisations & Projects: ICZN, GBIF, Sampled Red List Index for Plants, Global Plants Initiative ... (space for data collection, services, discussion & organisation)

[Chart: number of Scratchpads sites (axis 20 to 400) and active users (axis 500 to 7,000), 2007 to 2012, with ViBRANT and Scratchpads 2 marked]

Page 10: Sharing, linking and publishing biodiversity data the ViBRANT way

ViBRANT Goals

Vision: Connecting the people, data & science of biodiversity

Position: Open & sustainable development of a federated network of biodiversity informatics infrastructures

Mission: Facilitate the mobilisation, sharing, reuse and publication of biodiversity data

http://vbrant.eu

[Diagram: Scratchpads Virtual Research Environment]

Component labels: Bioclimatic modelling, Manuscript publishing, Sustainability, Data mining, Citizen science, Field recording, Sociology, Support services, Training & outreach, Data standards, Visualisation, Controlled vocabulary, Data aggregation, GBIF integration, Scratchpad hosting, Software integration, Matrix data editor, Data publishing, Communal literature, Literature mark up, Phylogeny tools, Identification tools

Group labels: Networking, Training, Standards, Mobilisation, Service, Data, Publishing, Research, Architecture, Literature

Page 11: Sharing, linking and publishing biodiversity data the ViBRANT way

Nexus

DwCA

CSV/tab

Newick

EoL Transfer schema (SPM) XML

SDD, Lucid, Nexus

RDF

Taxonomic Concept Schema XML

Excel file

CSV, XLS, Microsoft Word .DOC, TXT
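
Of the formats listed on this slide, Newick is compact enough to sketch by hand. The following is a minimal illustration only, not the Scratchpads exporter: it assumes a tree held as nested Python tuples with taxon names as strings. The `to_newick` helper is hypothetical, and the example topology of three louse genera is made up for illustration, not a published result.

```python
def to_newick(node):
    """Serialise a nested-tuple tree to a Newick subtree string.

    A leaf is a plain string (the taxon name); an internal node is a
    tuple of child nodes. Branch lengths are not handled in this sketch.
    """
    if isinstance(node, str):
        return node
    return "(" + ",".join(to_newick(child) for child in node) + ")"


# Illustrative topology only (assumption, not a published phylogeny).
tree = (("Pediculus", "Pthirus"), "Haematopinus")

# A complete Newick string ends with a semicolon.
newick = to_newick(tree) + ";"
print(newick)
```

Names containing spaces or punctuation would need quoting before being written this way, which the sketch leaves out.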

Page 12: Sharing, linking and publishing biodiversity data the ViBRANT way

What can Scratchpads do?

• Taxon pages (generated from tagged content)

• Distribution maps (from specimens and TDWG regional distributions - Brummitt, 2001)

• Specimen records

• Bibliography management

• Images, video and sound (bulk import)

• Excel spreadsheet import

• Tabular data editing & Character matrixes

• Custom content

• User management

• Custom webforms

• Analytics

• Darwin Core Archive export (links to eMonocot Portal and EOL)

• EOL data import (taxonomy, species information)

• GBIF Map integration
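
To give a feel for the Darwin Core Archive export mentioned above, here is a rough sketch of the tab-delimited data file such an archive typically carries. It is not how Scratchpads implements the export: the `records_to_dwc_tsv` helper and the sample record are hypothetical, and only the column names are standard Darwin Core terms. A real archive would also need a `meta.xml` descriptor and would be zipped.

```python
import csv
import io

# Standard Darwin Core term names used as column headers.
FIELDS = ["occurrenceID", "scientificName", "decimalLatitude", "decimalLongitude"]


def records_to_dwc_tsv(records):
    """Write specimen records (a list of dicts) as tab-delimited text."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=FIELDS, delimiter="\t", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


# Made-up sample record, for illustration only.
records = [
    {
        "occurrenceID": "example-1",
        "scientificName": "Pediculus humanus",
        "decimalLatitude": "51.4967",
        "decimalLongitude": "-0.1764",
    }
]

tsv = records_to_dwc_tsv(records)
print(tsv)
```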

Page 13: Sharing, linking and publishing biodiversity data the ViBRANT way

http://www.comber.hcmr.gr

Page 14: Sharing, linking and publishing biodiversity data the ViBRANT way

Oxford Batch Operations Engine

https://oboe.oerc.ox.ac.uk/

Page 15: Sharing, linking and publishing biodiversity data the ViBRANT way

BDJ: The Biodiversity Data Journal

Making small data big!

Page 16: Sharing, linking and publishing biodiversity data the ViBRANT way

Biodiversity Data Journal

1t2011

ISSN 1314-2828 (online) ISSN 1314-2836 (print)

Launched to accelerate biodiversity data journal

http://www.pensoft.net/biodiversitydata

A peer-reviewed open-access journal

Editor-in-Chief: Vincent Smith, Natural History Museum, London, UK

Plazi

IPNI

1. Define the publication

2. Enter metadata

3. Select taxa & content

4. Organise manuscript

5. Submit to journal

Articles

Bibliographies

Occurrence

Taxon treatments

Taxon names

Page 17: Sharing, linking and publishing biodiversity data the ViBRANT way

Acknowledgements

• Scratchpad technical development - Simon Rycroft, Ben Scott, Ed Baker, Alice Heaton & Katherine Boulton

• Scratchpad outreach - Laurence Livermore & Dimitris Koureas

• eMonocot - Paul Wilkin & the Kew team, Charles Godfray & the Oxford team

• ViBRANT - Vince Smith, Dave Roberts & Lucy Reeve

• Our 7,000+ users

Page 18: Sharing, linking and publishing biodiversity data the ViBRANT way

Thank you for your attention.

Any questions?

e-mail: [email protected]

e-mail: [email protected]

http://vbrant.eu http://scratchpads.eu