Linking chemistry: wider lessons for how we publish research

48
Linking chemistry: wider lessons for how we publish research

Transcript of Linking chemistry: wider lessons for how we publish research

Page 1: Linking chemistry: wider lessons for how we publish research

Linking chemistry: wider lessons for how we publish research

Page 2: Linking chemistry: wider lessons for how we publish research

Anomalocarisfloreslivroselua.files.wordpress.com

Page 3: Linking chemistry: wider lessons for how we publish research
Page 4: Linking chemistry: wider lessons for how we publish research

What have we gained?

Very interesting Very clever

We’re all innovating into niches, growing extra legs, fins, ears...

news.sky.com

Page 5: Linking chemistry: wider lessons for how we publish research

Who has this been for?

US!

All good publicity, shows we’ll all good at our jobs, pats on the back all round

Have any of these widgets solved any more research problems? Saved time? Money?

And how much are they used? Really?

But helps become flexible to try out new things, test models, approaches, technologies

Page 6: Linking chemistry: wider lessons for how we publish research

OK, what do we think? 2007 version

Publishing methods are changing Reading methods are changing We know what our readers want We know how the researchers work

How many did we get right?

Page 7: Linking chemistry: wider lessons for how we publish research

RSC Prospect – semantic publishing

What were we trying to improve? Discoverability Use Understanding Linking

Steering how the chemical sciences develop on the web...

Page 8: Linking chemistry: wider lessons for how we publish research
Page 9: Linking chemistry: wider lessons for how we publish research
Page 10: Linking chemistry: wider lessons for how we publish research

What do we learn with Prospect?

This is probably the way to go How do we cover all subjects? Scale-up in manual QA Scale-up during huge growth and scope

How to use all that real chemistry data Pump prime to change what

we ask from authors Is our vision the day-glo article?

“Free headache for every user”

blogs.guardian.co.uk

Page 11: Linking chemistry: wider lessons for how we publish research

Do we know what’s going on?

Page 12: Linking chemistry: wider lessons for how we publish research

Vision …

http://www.optimumengineering.com.au/cyclone_silos/cyclone_silo_4.jpg

Page 13: Linking chemistry: wider lessons for how we publish research

Market Research

Page 14: Linking chemistry: wider lessons for how we publish research

Students

“Starting point for finding information is Wikipedia” “I’d like to use images from articles” “Colleagues tell me about papers so I don’t really

need to go looking for them”

Not very sophisticated outlook: have access to expensive services and content but don’t use.

Page 15: Linking chemistry: wider lessons for how we publish research

Academic Faculty

“Too busy to spend much time online” “My students tell me about everything that I need

to know” “I use email alerts to find content”

Page 16: Linking chemistry: wider lessons for how we publish research

Industrial Researchers

“Need to find information any way we can” “Don’t have access to expensive systems”

Most ingenious and sophisticated users.

Page 17: Linking chemistry: wider lessons for how we publish research

Librarians

“Don’t change any URLs” “Users are confused when they don’t have access

to all the content” “When things go wrong, we have to sort it out” “I need my customers to know that the library has

paid for this content”

Page 18: Linking chemistry: wider lessons for how we publish research

Key Findings

Page 19: Linking chemistry: wider lessons for how we publish research

Key Findings

Page 20: Linking chemistry: wider lessons for how we publish research

So what have we built?

Simple interface to fulfill the main use – come in, find a paper

RSC Publishing platform

All our journal, books, databases XML, in Marklogic. Normalised. A single query interface.

Page 21: Linking chemistry: wider lessons for how we publish research

The article of the future

Page 22: Linking chemistry: wider lessons for how we publish research

PDFs

Page 23: Linking chemistry: wider lessons for how we publish research
Page 24: Linking chemistry: wider lessons for how we publish research

Dumbing down can be good

Lowest common denominator PDF Word & ChemDraw JPG

Enables participation, exchange & comparison familiarity for the user

Page 25: Linking chemistry: wider lessons for how we publish research

Authors?

number

tech ability/interest of authors & readers

Page 26: Linking chemistry: wider lessons for how we publish research

Rzepa,H.

Page 27: Linking chemistry: wider lessons for how we publish research
Page 28: Linking chemistry: wider lessons for how we publish research

• Phil Bourne

• Lynn Fink

Source code and binary:http://research.microsoft.com/ontology/

Relationships: Ontology browser

Relationships: Ontology browser

Intent: Term recognition & disambiguation based on OBO or OWL formats

Intent: Term recognition & disambiguation based on OBO or OWL formats

John WilbanksServices: Ontology download web service

Ontology Add-in for Word 2007

Page 29: Linking chemistry: wider lessons for how we publish research

<?xml version="1.0" ?><cml version="3" convention="org-synth-report" xmlns="http://www.xml-cml.org/schema"> <molecule id="m1"> <atomArray> <atom id="a1" elementType="C" x2="-2.9149999618530273" y2="0.7699999809265137" /> <atom id="a2" elementType="C" x2="-1.5813208400249916" y2="1.5399999809265137" /> <atom id="a3" elementType="O" x2="-0.24764171819695613" y2="0.7699999809265134" /> <atom id="a4" elementType="O" x2="-1.5813208400249912" y2="3.0799999809265137" /> <atom id="a5" elementType="H" x2="-4.248679083681063" y2="1.5399999809265137" /> <atom id="a6" elementType="H" x2="-2.914999961853028" y2="-0.7700000190734864" /> <atom id="a7" elementType="H" x2="-4.248679083681063" y2="-1.907348645691087E-8" /> <atom id="a8" elementType="H" x2="1.0860374036310796" y2="1.5399999809265132" /> </atomArray> <bondArray> <bond atomRefs2="a1 a2" order="1" /> <bond atomRefs2="a2 a3" order="1" /> <bond atomRefs2="a2 a4" order="2" /> <bond atomRefs2="a1 a5" order="1" /> <bond atomRefs2="a1 a6" order="1" /> <bond atomRefs2="a1 a7" order="1" /> <bond atomRefs2="a3 a8" order="1" /> </bondArray> </molecule></cml>

<?xml version="1.0" ?><cml version="3" convention="org-synth-report" xmlns="http://www.xml-cml.org/schema"> <molecule id="m1"> <atomArray> <atom id="a1" elementType="C" x2="-2.9149999618530273" y2="0.7699999809265137" /> <atom id="a2" elementType="C" x2="-1.5813208400249916" y2="1.5399999809265137" /> <atom id="a3" elementType="O" x2="-0.24764171819695613" y2="0.7699999809265134" /> <atom id="a4" elementType="O" x2="-1.5813208400249912" y2="3.0799999809265137" /> <atom id="a5" elementType="H" x2="-4.248679083681063" y2="1.5399999809265137" /> <atom id="a6" elementType="H" x2="-2.914999961853028" y2="-0.7700000190734864" /> <atom id="a7" elementType="H" x2="-4.248679083681063" y2="-1.907348645691087E-8" /> <atom id="a8" elementType="H" x2="1.0860374036310796" y2="1.5399999809265132" /> </atomArray> <bondArray> <bond atomRefs2="a1 a2" order="1" /> <bond atomRefs2="a2 a3" order="1" /> <bond atomRefs2="a2 a4" order="2" /> <bond atomRefs2="a1 a5" order="1" /> <bond atomRefs2="a1 a6" order="1" /> <bond atomRefs2="a1 a7" order="1" /> <bond atomRefs2="a3 a8" order="1" /> </bondArray> </molecule></cml>

Relationships: Navigate and link referenced chemistry

Relationships: Navigate and link referenced chemistry

Available soon:http://research.microsoft.com/chem4word/

Data: Semantics stored in Chemistry Markup Language

Data: Semantics stored in Chemistry Markup Language

Intent: Recognizes chemical dictionary and ontology terms

Intent: Recognizes chemical dictionary and ontology terms

Author and edit 1D and 2D chemistry. Author and edit 1D and 2D chemistry.

Intelligence: Verifies validity of authored chemistry

Intelligence: Verifies validity of authored chemistry

Authoring: Chem4Word – Chemistry Drawing in Word

Page 30: Linking chemistry: wider lessons for how we publish research

We forget our place

We assume our readers spend their lives in our platforms

www.visitingdc.com

Page 31: Linking chemistry: wider lessons for how we publish research

media.obsessable.com

As few interfaces as possible

What do humans want?

Page 32: Linking chemistry: wider lessons for how we publish research

What do computers want?

Web services

flickr.com/photos/microcosmos

Page 33: Linking chemistry: wider lessons for how we publish research

What is ChemSpider?

+ +

Free to use

Page 34: Linking chemistry: wider lessons for how we publish research

Quick history Launched 2007 with a vision Bedroom project for 2 years Acquired by RSC in 2009

Since acquisition Infrastructure and development resource Community and user support added Integration with RSC Publishing resources Engagement with RSC’s community of influence

Page 35: Linking chemistry: wider lessons for how we publish research

ChemSpider as aggregator Public and non-public compound sets Publications Domain resources – DailyMed, Wikipedia Personal data sets SureChem patents Multimedia additions, blogs, Open Notebook Sci

Data normalised, errors rejected

Page 36: Linking chemistry: wider lessons for how we publish research

ChemSpider as chemical search

Single search across 400 data sources Text, structure and substructure search tools

Simple and very advanced Validated names for external search expansion

Google Scholar, PubMed, RSC, Google patents Links to original sources Mobile version

Page 37: Linking chemistry: wider lessons for how we publish research

ChemSpider as wikiAdds quality and quantity

Users can comment, others action Registered users can curate

names, add links, spectra, data and multimedia resources

Depositors can load data sets, curated on load

Curators and Master Curators approve additions and decide what is the correct structure

Page 38: Linking chemistry: wider lessons for how we publish research

ChemSpider as resource Download and reuse structures Download and reuse search structure set 2D and 3D images Embed and link - never draw a structure again Predicted properties

Physical, Environmental, Biological Spectra, crystal structures ChemSpider SyntheticPages Spectral game

Page 39: Linking chemistry: wider lessons for how we publish research

ChemSpider as compound hub Extensive web services

Utility conversion services MassSpec services Name + structure lookups Prediction services

Validated name:structure pairs provides public disambiguation service

InChI Resolver – links InChIKeys to compounds: MYPYJXKWCTUITO-LYRMYLQWSA-N

Page 40: Linking chemistry: wider lessons for how we publish research

For the RSC (more selfishly) Critical mass for a chemistry portal

Support for semantic developments Proven user need to search for compounds Shop window for our other publications

Ability to test other business models (some of which are a little scary)

Engagement with new publishing methods

Page 41: Linking chemistry: wider lessons for how we publish research

ChemSpider as chemistry portal

Page 42: Linking chemistry: wider lessons for how we publish research

What will the RSC be doing?

Facilitating scientific communication

Publish articles – with the semantics

But what else can be part of this?

Partial publication syntheses, blogs, unpublished data

How does all this sit alongside traditional articles?

Page 43: Linking chemistry: wider lessons for how we publish research

Micropublication?

Page 44: Linking chemistry: wider lessons for how we publish research

Nanopublication?

Slide from Jan Velterop

Page 45: Linking chemistry: wider lessons for how we publish research

Exploring standards and extraction of semantic assertions

Pistoia Alliance“An initiative to provide an open foundation of data

standards, ontologies and web-services to streamline the Pharmaceutical Drug Discovery workflow”

Semantic Enrichment of the Scientific Literature (SESL) Oct09-Oct10

Pistoia Alliance-funded EBI Elsevier, NPG, OUP, RSC

Page 46: Linking chemistry: wider lessons for how we publish research
Page 47: Linking chemistry: wider lessons for how we publish research

Do we know what the answer is?

Probably not, but RSC Publishing & ChemSpider allows us to cover pretty much everything. Serve existing behaviours Allow experimentation

Science – keying into the readers as well as the researchers is more important than the technology.

Page 48: Linking chemistry: wider lessons for how we publish research