The Central Role of Data ‘Capturing and Sharing Chemistry Research Data’ Simon Coles

The Central Role of Data ‘Capturing and Sharing Chemistry Research Data’ Simon Coles School of Chemistry, University of Southampton, U.K. [email protected] This work is licensed under a Creative Commons Licence Attribution-ShareAlike 3.0 http://creativecommons.org/licenses/by-sa/3.0/


Transcript of The Central Role of Data ‘Capturing and Sharing Chemistry Research Data’ Simon Coles

Page 1

The Central Role of Data

‘Capturing and Sharing Chemistry Research Data’

Simon Coles

School of Chemistry,

University of Southampton, U.K.

[email protected]

This work is licensed under a Creative Commons Licence Attribution-ShareAlike 3.0

http://creativecommons.org/licenses/by-sa/3.0/

Page 2

Current Situation - Data Generation

Synthesis Characterisation

Page 3

Current Situation – Data Management

“Data from experiments conducted as recently as six months ago might be suddenly deemed important, but those researchers may never find those numbers – or if they did might not know what those numbers meant”

“Lost in some research assistant’s computer, the data are often irretrievable or an undecipherable string of digits”

“To vet experiments, correct errors, or find new breakthroughs, scientists desperately need better ways to store and retrieve research data”

“Data from Big Science is … easier to handle, understand and archive. Small Science is horribly heterogeneous and far more vast. In time Small Science will generate 2-3 times more data than Big Science.”

‘Lost in a Sea of Science Data’, S. Carlson, The Chronicle of Higher Education (23/06/2006)

Page 4

Current Situation – Data and Publishing

Page 5

Separating Data from Interpretations

Underlying data (Institutional data repository)

Intellect & Interpretation (Journal article, report, etc)

Page 6

Smart Labs

Page 7

Laboratory IRs and Information Management

Page 8

The R4L Repository

Deposit

Search / Browse

Create new compound

Add experiment data and metadata

Page 9

Blogging Experiments

A repository can…

• Allow one to put, store and get digital objects

• Provide minimal search and browse functions

• NOT provide the presentation and discussion functions essential to a scientific study

• Social networking tools and approaches can provide a way…
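The put, store and get functions and the minimal search and browse listed above can be sketched in a few lines of Python. This is an illustrative toy only, not the R4L or eCrystals implementation: the class name `Repository`, its method names and the metadata key `technique` are all assumptions made for the example.

```python
from dataclasses import dataclass, field


@dataclass
class Repository:
    """Toy in-memory repository (hypothetical): put, store, get, search, browse."""
    _objects: dict = field(default_factory=dict)

    def put(self, object_id: str, data: bytes, metadata: dict) -> None:
        # Store the digital object together with a copy of its metadata.
        self._objects[object_id] = (data, dict(metadata))

    def get(self, object_id: str) -> bytes:
        # Return the stored bytes; raises KeyError if the id is unknown.
        return self._objects[object_id][0]

    def browse(self) -> list:
        # Minimal browse: list every object id in sorted order.
        return sorted(self._objects)

    def search(self, key: str, value: str) -> list:
        # Minimal search: exact match on a single metadata field.
        return sorted(
            oid for oid, (_, meta) in self._objects.items()
            if meta.get(key) == value
        )


repo = Repository()
repo.put("exp-001", b"raw diffraction data", {"technique": "crystallography"})
repo.put("exp-002", b"nmr spectrum", {"technique": "NMR"})
print(repo.browse())
print(repo.search("technique", "NMR"))
```

Nothing in this sketch supports presentation or discussion of the stored objects, which is the gap the slide suggests social networking tools could fill.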

Page 10

Facilitating Research

• Facilitates ‘geographically distributed collaborative research’

• Useful approach for sharing ‘failed’ experiments?

Page 11

Machines Blogging Experiments

• Automatic upload by scientific instrument

Page 12

Comments and Annotation

• A picture says a thousand words!

• Chemists like to sketch!

• Need for more advanced Blog tools / technology

Page 13

Current Situation - Data Deluge

[Figure: chemical structure diagrams; only the atom labels Cl, O, N and N+ survive in this transcript]

30,000,000

1,500,000

450,000

Page 14

Laboratory Data Management and Archive

Page 15

The eCrystals Public Data Archive

http://ecrystals.chem.soton.ac.uk

Page 16

NCS Data Publication Policy

• Joint publication: Timed release of data tied to conventional journal article

• Separate publication: Independent release of data so that it can be cited e.g. from a journal article, grant report, poster

• ‘Accidental’ or ‘undesired’ results: Immediate release after agreement with concerned parties

• Results never to be formally published: Automatic release after three years

• Embargo feature: default 3 years, but timescale can be defined by depositor

• Record can be made public at any time (following agreement from all concerned parties)

• Roles of all concerned parties defined (originator, etc)

• Data citation, DOI, Rights

http://www.ncs.chem.soton.ac.uk/pub_pol.htm
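The embargo rules above (a default of three years, a timescale the depositor can change, and earlier release once all concerned parties agree) could be expressed roughly as follows. This is a hedged sketch, not the NCS system itself: the `Record` class, its field names and the leap-day handling are assumptions made for illustration.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional

DEFAULT_EMBARGO_YEARS = 3  # default embargo stated in the policy


@dataclass
class Record:
    # All field names are hypothetical, chosen for this example.
    deposited: date
    embargo_years: int = DEFAULT_EMBARGO_YEARS   # depositor may override
    released_early: Optional[date] = None        # set once all parties agree


def release_date(record: Record) -> date:
    """Return the date on which the record becomes public."""
    d = record.deposited
    try:
        embargo_end = d.replace(year=d.year + record.embargo_years)
    except ValueError:
        # Deposited on 29 February and the target year is not a leap year:
        # fall back to 28 February (an assumption, not part of the policy).
        embargo_end = d.replace(year=d.year + record.embargo_years, day=28)
    if record.released_early is not None:
        # An agreed early release takes effect if it precedes the embargo end.
        return min(record.released_early, embargo_end)
    return embargo_end


def is_public(record: Record, today: date) -> bool:
    """True once the release date has been reached."""
    return today >= release_date(record)


record = Record(deposited=date(2008, 1, 15))
print(release_date(record))
print(is_public(record, date(2010, 1, 1)))
```

Storing the agreed early-release date on the record, rather than a simple flag, keeps the release date reproducible for later data citation.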

Page 17

Linking and aggregating

• Link data and associated ‘publications’

• Dataset annotated with metadata

• Semantic publishing on WWW and in journals

http://www.rsc.org/Publishing/Journals/ProjectProspect/index.asp

http://www.ukoln.ac.uk/projects/ebank-uk/pilot/

Page 18

eCrystals ‘Global Federation’ Model

[Diagram; the layout is not recoverable from this transcript, only the labels below]

Components: Data creation & capture in “Smart lab”; Laboratory repository; Institutional data repositories; Institution Library & Information Services; Subject Repository; Aggregator services; Presentation services / portals; Publishers: peer-review journals, conference proceedings, etc

Activity and arrow labels: Deposit; Deposit, Validation; Publication; Validation; Data analysis; Search, harvest; Data discovery, linking, citation; Curation; Preservation

Page 19

Changing Times!

Information Providers

Information Consumers

All I am saying is that now is the time to develop the technology to deflect an asteroid