USUGM 2014 - András Strácz (ChemAxon): Creation & Acquisition in Evolution of the ChemAxon Product...
Creation and Acquisition
Transcript of USUGM 2014 - András Strácz (ChemAxon): Creation & Acquisition in Evolution of the ChemAxon Product...
Chemical file formats
.mol/.sdf, smiles, .skc, .cdx
IUPAC, InChI, common names
.rdf, smarts, smirks, .rgf
fasta, sequence, helm, xhelm
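As a rough illustration of how the file-based formats listed above might be told apart before import, here is a minimal sketch that guesses a format from a file extension. The lookup table and the `guess_format` helper are hypothetical and written only for this example; they are not ChemAxon's format registry or API, and text-based notations such as SMILES or InChI are not covered because they are not identified by extension.

```python
import os

# Extensions named on the slide, mapped to commonly used format names.
# Illustrative table only (an assumption), not ChemAxon's own format list.
FORMAT_BY_EXTENSION = {
    ".mol": "MDL Molfile",
    ".sdf": "MDL SDfile",
    ".rdf": "MDL RDfile",
    ".rgf": "MDL RGfile",
    ".skc": "ISIS/Draw sketch",
    ".cdx": "ChemDraw binary",
}


def guess_format(path):
    """Return a format name for a known extension, or None if unknown."""
    ext = os.path.splitext(path)[1].lower()
    return FORMAT_BY_EXTENSION.get(ext)


print(guess_format("aspirin.MOL"))
print(guess_format("notes.txt"))
```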
Loading data
name,type,page #,context
leucine,common,Page 1,… X-ray coordinates of the leucine transporter LeuT, a bacterial ...
IUPAC, common names, InChI, CAS, SMILES
Corporate ID
OCR with error correction
OSR for structure images
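The name,type,page #,context rows shown above can be read back with a few lines of code. The sketch below assumes the output really is plain comma-separated text with exactly those four columns and no quoting, which the slide does not confirm; the `NameHit` type and `parse_hit` helper are hypothetical names introduced for this example. Because the context column is free text that may itself contain commas, the row is split on the first three commas only.

```python
from typing import NamedTuple


class NameHit(NamedTuple):
    """One extracted chemical name (hypothetical record type)."""
    name: str
    kind: str
    page: str
    context: str


def parse_hit(line: str) -> NameHit:
    # Split on the first three commas only, so that commas inside the
    # free-text context column are preserved.
    name, kind, page, context = line.rstrip("\n").split(",", 3)
    return NameHit(name.strip(), kind.strip(), page.strip(), context.strip())


row = ("leucine,common,Page 1,… X-ray coordinates of the leucine "
       "transporter LeuT, a bacterial ...")
hit = parse_hit(row)
print(hit.name, hit.kind, hit.page)
```

If the real export quotes its fields, the standard `csv` module would be the safer choice than a manual split.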
Data mining
New language support
Chinese Name to Structure
• 2-(乙酰氧基)苯甲酸 (2-(acetyloxy)benzoic acid)
• 阿司匹林 (aspirin)
Japanese Name to Structure v6.3
• 2-(アセチルオキシ)安息香酸 (2-(acetyloxy)benzoic acid)
• アスピリン (aspirin)