Www.ccdc.cam.ac.uk Insight into Molecular Geometry and Interactions using Small Molecule...

31
www.ccdc.cam.ac.u k Insight into Molecular Geometry and Interactions using Small Molecule Crystallographic Data John Liebeschuetz Cambridge Crystallographic Data Centre, 12 Union Road, Cambridge, UK [email protected]

Transcript of Www.ccdc.cam.ac.uk Insight into Molecular Geometry and Interactions using Small Molecule...

www.ccdc.cam.ac.uk

Insight into Molecular Geometry and

Interactions using Small Molecule

Crystallographic Data

John LiebeschuetzCambridge Crystallographic Data Centre,

12 Union Road, Cambridge, UK

[email protected]

www.ccdc.cam.ac.uk

How much Data is Available?

CSD Growth 1970-2006

419,768 entries June 2007

0

100000

200000

300000

400000

500000

600000

2001 2003 2005 2007 2009

Growth of the Cambridge Structural Database over 40 years

Predicted Growthto 2010

>500,000 entries during 2009

www.ccdc.cam.ac.uk

CSD Data Content

Literature Reference:

G. Bringmann, M. Ochse,

K. Wolf, J. Kraus, K. Peters,

E-M. Peters, M. Herderich,

L. Ake, F. Tayman

Phytochemistry 51,1999, 271

Other text:R-factor: .0506

Colour: pale yellow

Habit: acicular

Polymorph: Form IV

Source: Rothmannia longiflora

4-Oxonicotinamide-1-(1’-beta-D-2’,3’,5’-tri-O-acetyl-

ribofuranoside)

C17 H20 N2 O9

www.ccdc.cam.ac.uk

Molecular Interactions as well as Geometry

HEPPEX

www.ccdc.cam.ac.uk

Cambridge Structural Database System

CambridgeStructuralDatabase

PreQuestDatabase Production

VISTAStatisticalanalysis

MercuryGraphical display,packing analysis

ConQuestDatabase

Search

MogulLibrary of

Molecular Geometry

IsoStarLibrary of

Intermolecular Interactions

Knowledge Bases

www.ccdc.cam.ac.uk

Using Structural Data in molecular modelling for pharmaceutical design

• Intramolecular – 3D geometry– Designing in the desired Conformer

– Validation that models have correct geometry

• Intermolecular – Interactions between molecules– Design of pharmacophores

– Validation of interactions found during modelling

– Identification of new ways to satisfy binding motifs

– Knowledge-based scoring functions for docking

www.ccdc.cam.ac.uk

Designing in the right Conformation (1)

Brameld. K.A., Kuhn, B., Reuter, D.C. and Stahl, M. J. Chem. Inf. Mod, 48(1), 1-24 2008)

It is possible using Conquest to generate incidence histograms for any geometric feature , for any substructure, if sufficient high quality structures including that substructure, are present in the CSD

www.ccdc.cam.ac.uk

Designing in the right Conformation (2)

Sulphonamide is common in drug molecules. Its conformational behaviour well captured by CSD

Ortho Substitution (Blue histogram) shifts the maximum

Pyramidalisation of the N of the sulphonamide can also be explored. This is a common effect in sulphonamides (and piperidines) but is poorly reproduced by modelling software

Brameld. K.A., Kuhn, B., Reuter, D.C. and Stahl, M. J. Chem. Inf. Mod, 48(1), 1-24 2008)

www.ccdc.cam.ac.uk

Designing in the right Conformation (3) : Example 1:

CSD analysis indicates the bioactive conformation is stable only for most active structureBrameld. K.A., Kuhn, B., Reuter, D.C. and Stahl, M. J. Chem. Inf. Mod, 48(1), 1-24 2008)

www.ccdc.cam.ac.uk

Rapid access of geometric information from the CSD

Incorporates pre-computed libraries of bond lengths, valence angles and torsion angles

>20 million individual geometrical parameters derived entirely from the CSD and updated annually

Sketch or import molecule, then click on feature of interest to view distribution, mean values and statistics

Validation of Model Geometry: Mogul

Bruno et al., J. Chem. Inf. Comput. Sci., 44, 2133-2144, 2004

www.ccdc.cam.ac.uk

Select fragment

www.ccdc.cam.ac.uk

Validation of PDB Ligand Geometry

PDB structures suffer from less well defined electron density

Protein X-Ray refinement force fields often are poorly parameterised to reproduce ligand geometries

Sometimes protein crystallographers start with a poor ligand model

www.ccdc.cam.ac.uk

Validation of Ligand structures in the PDB via Protein/Ligand analysis tool Relibase+*

*www.ccdc.cam.ac.uk

Validation of ligand structures found in the PDB

Ligand from 1HAK, Two abnormal torsions indicated

Further examination reveals the piperidine to be Boat form

www.ccdc.cam.ac.uk

• 15% of 100 recent PDB entries have ligand geometry that are almost certainly in significant error (in house analysis using Relibase+/Mogul)

• The good news - For structures deposited before 2000 the figure is 26%

evaluation of pdb ligand dataset from 1990's with Mogul and Relibase

correct34%

wrong26%

not unusual40% correct

wrong

not unusual

evaluation of most recent pdb ligand dataset with Mogul and Relibase

correct29%

wrong16%

not unusual55%

correct

wrong

notunusual

Pre 2000 2006

Validation of ligand structures found in the PDB using Mogul

www.ccdc.cam.ac.uk

Designing in the right Conformation (3) : Large Rings

Brameld. K.A., Kuhn, B., Reuter, D.C. and Stahl, M. J. Chem. Inf. Mod, 48(1), 1-24 2008)

www.ccdc.cam.ac.uk

Validation of ligand structures found in the PDB

Ligand from 1HAK, Two abnormal torsions indicated

Further examination reveals the piperidine to be Boat form

www.ccdc.cam.ac.uk

Mogul 1.3: Ring Conformations

• Mogul currently holds data on bonds, angles and torsions.

• In the 2010 release of the Cambridge Structural Database System Mogul will also contain a comprehensive ring knowledge base

• Ring libraries from α-Mogul 1.3 have been introduced into Gold 4.1 to allow knowledge- based ring-flexing during docking

www.ccdc.cam.ac.uk

A Knowledge Base of Intermolecular Interactions

Experimental data from:

Cambridge Structural Database

Protein Data Bank (protein-ligand complexes only)

Theoretical potential energy minima (DMA, IMPT)

Typical Uses:

Probability of an interaction occurring

Preferred geometries

Design Strategies

IsoStar

www.ccdc.cam.ac.uk

central group: -CONH2

contact group: NH

IsoStar Methodology

Search CSD or PDB for structures containing contact

Superimpose hits and display distribution

www.ccdc.cam.ac.uk

IsoStar Scatterplots vs. Density MapsN-H donors around amide C=O

Scatterplot Contour surface

www.ccdc.cam.ac.uk

IsoStar –indole and isoxazole interactions with faces of phenyl rings

www.ccdc.cam.ac.uk

Using Intermolecular information to build pharmacophores from proteins

Use intermolecular information (IsoStar) to map a protein binding site (e.g. using SuperStar, an extra module to the CSDS )

Create a pharmacophore from this information (possible in SuperStar) c.f. GRID/FLAP

www.ccdc.cam.ac.uk

Motif searching

Certain signature interaction motifs might be key to identifying inhibitor substructures of interest. Can we identify such motifs in the CSD and thereby uncover new ideas?

Materials Mercury: A new tool for the drug development and crystal design community

Most tools are specific to small molecule crystals ....

However ….

www.ccdc.cam.ac.uk

1. Comparison of crystal structures: polymorphs, solvates etc can identify significant ‘packing features’.

2. We can then search the CSD using ‘Packing Feature Search’

Packing Feature Search

www.ccdc.cam.ac.uk

H-bonding Motif Search: Kinase Binding Motifs

CDK2 Complex – 1ke8 Set up a ‘Packing Feature Search’ around Hinge Region

www.ccdc.cam.ac.uk

H-bonding Motif Search: Kinase Binding Motifs

MISTOX WUSQAC

Provides ideas for new motifs – Fragment based design

www.ccdc.cam.ac.uk

• Protein/Ligand Docking relies on a scoring function to rank binding poses

– Scoring functions may be Molecular Mechanics based, Empirical or Knowledge Based

• A Knowledge Based score is calculated by the sum of atom-atom potentials derived from a crystallographic database

The atom-atom potential = - log

• Knowledge based scoring functions (PMF, Bleep, DrugScore, ASP) have been developed using protein-ligand data (PDB)

• The CSD contains better resolved structures and a much greater variety of chemical functionality than the PDB

– DrugScoreCSD has demonstrably improved performance over DrugScore (Velec, Gohlke & Klebe, J. Med. Chem., 48 (2005), 6296 )

observed interactions

Knowledge-based scoring using small molecule structural data

reference state

www.ccdc.cam.ac.uk

Uses of Small Molecule Structural Data in Drug Design: Conclusions

• Use in Model Validation -

– Geometry of designed synthetic candidates

– Geometry of X-ray derived Ligand Structures

– Intermolecular interactions of a candidate structure with a model of binding site

• Design of Pharmacophores

• Search for fragments fitting a binding motif

• Creation of robust and versatile Knowledge-Based scoring functions for docking

www.ccdc.cam.ac.uk

Acknowledgements

Thank you for your attention

Jana HennemanJames Chisholm