Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates,...

27
Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative http://dx.doi.org/10.6084/m9.figshare.722897

Transcript of Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates,...

Page 1: Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative .

Biodiversity Informatics at the Natural History Museum

Ed BakerTerrestrial Invertebrates, Department of Life Sciences& NHM Informatics Initiative

http://dx.doi.org/10.6084/m9.figshare.722897

Page 2: Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative .

Science as a Slow Cooker• Only the surface visible

• Lid kept on for extended periods of time

• Uses cheap cuts of raggy meat

• Ingredient lose their nutritional value

• Children at risk due to high temperatures

http://ispiders.blogspot.co.uk/2011/11/realtime-web.html

Page 3: Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative .

We like data• 70 million+ specimens collected over 400 years

• 350,000+ books

• ??? Unpublished datasets in archive, notebooks, computers

• ??? In the minds of staff

Page 4: Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative .

How do we provide access?• Digitisation of specimens and associated data

• Scanning and transcribing books, journals, archives

• Providing tools for managing the data life cycle

• Changing the way we publish: data publication

Page 5: Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative .

Flowing Data

Publication

Collection Curation Use

Page 6: Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative .

Flowing Data

Collection Curation

Somebody retires Somebody dies Project is cancelled

Sits in desk drawer or on a hard drive until….

Page 7: Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative .

Flowing Data

Collection Curation Use

Data Publication

Re-use

Publication

Re-use Re-use Re-use

Page 8: Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative .

Flowing Data: from collection to reuse

Collection Curation Use

Data Publication

Re-use

Publication

Re-use Re-use Re-use

Page 9: Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative .

Collection

Citizen Science

Automated identification and monitoring

Traditional taxonomic sources

Page 10: Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative .

Flowing Data: from collection to reuse

Curation Use

Data Publication

Re-use

Publication

Re-use Re-use Re-use

Page 11: Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative .

Curation

Websites for communities to publish and curate:• Taxonomy / nomenclature• Bibliographies• Specimen information• Character matricies

Page 12: Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative .

Flowing Data: from collection to reuse

Use

Data Publication

Re-use

Publication

Re-use Re-use Re-use

Page 13: Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative .

Use: Oboe

Page 14: Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative .

Use: Oboe

Page 15: Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative .

Flowing Data: from collection to reuse

Data Publication

Re-use

Publication

Re-use Re-use Re-use

Page 16: Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative .

Publication (Data)

• Datasets

• Single species descriptions

• Checklists

• Software

Page 17: Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative .

Flowing Data: from collection to reuse

Re-use

Publication

Re-use Re-use Re-use

Page 18: Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative .

Publication (Research)

• Traditional research

• Systematic zoology

• Phylogeny

• Biogeography

Page 19: Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative .

Flowing Data: from collection to reuse

Re-use Re-use Re-use Re-use

Page 20: Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative .

The Problem of Scale

Data is being generated by tens of thousands of researchers, in thousands of institutions

• Hard to find what you need

• Hard to know if what you need actually exists

• Impossible to go through researcher by researcher

Page 21: Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative .

NHM Data Portal

• Aggregator for NHM science data

• Visualisation tools for datasets

• Allows export of NHM data for re-use

Page 22: Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative .

The Informatics Landscape

>18K specimen records(local small scale coverage)

>276M specimen records(worldwide coverage)

Page 23: Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative .

The Informatics Landscape

A webpage for every species

Aggregate specimen and observation data globally

Page 24: Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative .

Wikimedian in Residence

• Make NHM content available under open licenses for use on Wikimedia projects (and elsewhere)

• Reach of Wikipedia: BBC, Encyclopedia of Life

• Wikisource: Transcription and translation crowd-sourcing

Page 25: Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative .

Flowing Data: from collection to reuse

?

Page 26: Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative .

"Everybody makes mistakes. And if you don't expose your raw data, nobody will find your

mistakes." Jean-Claude Bradley

http://bit.ly/146ugIv

Page 27: Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative .