Building an integrated system for chemistry markup and online publishing integrated to online...
-
Upload
orcid-0000-0002-2668-4821 -
Category
Technology
-
view
1.676 -
download
0
description
Transcript of Building an integrated system for chemistry markup and online publishing integrated to online...
![Page 1: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/1.jpg)
Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources
![Page 2: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/2.jpg)
Electronic Publishing
Publishers know that embracing electronic publication is a must.
“Cell Press and Elsevier have launched a project called Article of the Future … to redefine how the scientific article is presented online. ” Prototype: http://beta.cell.com/
![Page 3: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/3.jpg)
Publishers are experimenting
…invited researchers to prototype tools dealing with the ever-increasing amount of online life sciences information
The winners built: Reflect: Automated Annotation of Scientific
Terms : http://reflect.ws
![Page 4: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/4.jpg)
Reflect
![Page 5: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/5.jpg)
Entity-Extraction, Mark-up, Annotate
![Page 6: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/6.jpg)
Entity-Extraction, Mark-up, Annotate
![Page 7: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/7.jpg)
And linked to STITCH…
![Page 8: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/8.jpg)
Success Depends on Dictionaries
![Page 9: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/9.jpg)
Concept Web Alliance, Knewco, Others
![Page 10: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/10.jpg)
NextBio and ScienceDirect
![Page 11: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/11.jpg)
Semantic Mark-up for Chemistry
Semantic mark-up for chemistry is here
RSC project prospect (structure linking, IUPAC Gold Book ontology and other ontologies
Nature publishing group compound linking
ChemSpider Journal of Chemistry
![Page 12: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/12.jpg)
Nature Chemistry Compound Pages
![Page 13: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/13.jpg)
Project Prospect
![Page 14: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/14.jpg)
ChemSpider and Publishing
The curation efforts on ChemSpider led to a set of validated dictionaries
Integrate best-in-class entity extraction (SureChem) with validated name dictionaries
Additional dictionaries gave reactions, groups, families, hardware and software vendors etc
![Page 15: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/15.jpg)
ChemMantis and CJOC
![Page 16: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/16.jpg)
Name-Structure Pairs
![Page 17: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/17.jpg)
Converting Detected Names…
Names are searched against a validated dictionary (this expands as ChemSpider is curated)
If not found then they are passed through a Name to Structure algorithm
If they cannot convert then ChemSpider is searched for non-validated names
![Page 18: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/18.jpg)
Manual Curation is Necessary
![Page 19: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/19.jpg)
Deposit Structures
![Page 20: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/20.jpg)
Custom Dictionaries
Entity Extraction built around modified algorithms from SureChem
Optimized for “publications”
Dictionaries for chemical entities, groups, reactions, elements, families, species…
Dictionaries can be expanded
![Page 21: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/21.jpg)
Dictionaries are Easily Enhanced
Copy-Paste into appropriate Entity Dictionary
Impacts all future markups
Expanding knowledge bases of information
![Page 22: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/22.jpg)
Build Dictionaries
![Page 23: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/23.jpg)
Species – linked to Wikipedia
![Page 24: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/24.jpg)
Semantic Linking of Structures
What would you want to link off a structure? Chemical suppliers Other publications Analytical Data Related Reactions Wikipedia Patents “Everything”
![Page 25: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/25.jpg)
ChemSpider and its content
ChemSpider is:
A link farm for > 21 million compounds and 200 data sources
A curation platform to improve the quality of data online
A deposition platform for chemicals and content
![Page 26: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/26.jpg)
Link off a structure in ChemSpider
Chemical suppliers Other publications Analytical Data Related Reactions Wikipedia Patents “Everything”
![Page 27: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/27.jpg)
SureChem Services
The SureChem Portal is a gateway for patent searching – can be searched by structure/substructure
ChemSpider previously integrated by depositing structures and linking out
![Page 28: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/28.jpg)
SureChem Services
Previous integration lacked any sense of numbers of patents, titles etc
New integration:
![Page 29: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/29.jpg)
SureChem Services
![Page 30: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/30.jpg)
Pubmed Articles Linked
![Page 31: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/31.jpg)
From Compounds to Syntheses
ChemSpider will support synthesis procedures moving forward
The ChemSpider Journal of Chemistry is an ideal platform for text-based procedures
![Page 32: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/32.jpg)
Org Prep Daily (Blog)
![Page 33: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/33.jpg)
Molbank (Open Access Journal)
![Page 34: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/34.jpg)
Synthetic Pages (Website)
![Page 35: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/35.jpg)
RSC Supplementary Info
![Page 36: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/36.jpg)
RSC Supplementary Info
![Page 37: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/37.jpg)
ChemSpider Synthesis
ChemSpider Synthesis will be a home for all things “synthetic”
An online resource for synthetic procedures from blogs, other online resources, RSC supplementary info, other publishers etc.
We will mine the RSC supplementary info backfile for reactions
Public peer-review and feedback for synthetic procedures
![Page 38: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/38.jpg)
Online Journals and Live Data
![Page 39: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/39.jpg)
Moving forward
Integrate ChemMantis and Project Prospect as appropriate – best of both worlds
Expand ChemSpider validated dictionaries with RSC content
Expand information in “Compound Boxes” after markup – take advantage of all ChemSpider resources
Invite the community to help build ChemSpider Synthesis
![Page 40: Building an integrated system for chemistry markup and online publishing integrated to online chemistry resources](https://reader035.fdocuments.net/reader035/viewer/2022070315/554e8e9ab4c90526358b4c94/html5/thumbnails/40.jpg)
Acknowledgments
SureChem (Nicko Goncharoff and Richard Koks)
RSC – Richard Kidd and Colin Batchelor
CJOC – multiple authors and reviewers ChemSpider curation – a cast of many