Martin Waldseemüller's World Map of 1507; the FIRST map...

56
Obernai Martin Waldseemüller's World Map of 1507; the FIRST map to use the name "America" to label the New World Martin Martin Waldseemüller's Waldseemüller's World Map of 1507; the FIRST map World Map of 1507; the FIRST map to use the name "America" to label the New World to use the name "America" to label the New World

Transcript of Martin Waldseemüller's World Map of 1507; the FIRST map...

Page 1: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Obernai

Martin Waldseemüller's World Map of 1507; the FIRST map to use the name "America" to label the New WorldMartinMartin Waldseemüller'sWaldseemüller's World Map of 1507; the FIRST map World Map of 1507; the FIRST map to use the name "America" to label the New Worldto use the name "America" to label the New World

Page 2: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Roberto TodeschiniMilano Chemometrics and QSAR Research Group

Molecular descriptorsAn introduction

Prof. Roberto Prof. Roberto TodeschiniTodeschini

DrDr. Davide Ballabio. Davide Ballabio

Dr. Viviana Dr. Viviana ConsonniConsonni

Dr. Alberto ManganaroDr. Alberto Manganaro

Dr. Andrea MauriDr. Andrea Mauri

Page 3: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

The chemical data

synthesis: chemistry produces the objetcs of its own study

chemical composition: a unifying concept for all the experimental sciences

molecular structure: one the most fruitful scientific concepts of this century

synthesissynthesis: chemistry produces the : chemistry produces the objetcs of its own studyobjetcs of its own study

chemical compositionchemical composition: a unifying concept : a unifying concept for all the experimental sciencesfor all the experimental sciences

molecular structuremolecular structure: one the most fruitful : one the most fruitful scientific concepts of this centuryscientific concepts of this century

Page 4: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Molecular structure

The concept of molecular structure is one of the The concept of molecular structure is one of the most reach of this century.most reach of this century.

Page 5: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Molecular structure

The basic assumptions are that different The basic assumptions are that different molecular structures have different chemical molecular structures have different chemical properties and similar molecular structures have properties and similar molecular structures have similar molecular properties.similar molecular properties.

Page 6: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Molecular structure

Each molecular representation represents a Each molecular representation represents a different way to look at the molecular structure different way to look at the molecular structure and its chemical meaning is strongly immersed in and its chemical meaning is strongly immersed in the framework of the chemical theories.the framework of the chemical theories.

Page 7: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Some historical notes

“... : “... : benchè certamente si traveggano già dei benchè certamente si traveggano già dei rapporti fra rapporti fra la la costituzione chimica costituzione chimica ((composizione composizione e e strutturastruttura) e ) e le proprietà le proprietà fisichefisiche loroloro, è , è ancor certamente di gran lunga troppo ristretto ancor certamente di gran lunga troppo ristretto il numero dei fattiil numero dei fatti, per , per dedurne delle conseguenzededurne delle conseguenze, , che oltre che oltre al al carattere d’una semplice ipotesi possono pretendere carattere d’una semplice ipotesi possono pretendere anche quello della probabilitàanche quello della probabilità..In In ogni caso tali rapporti ogni caso tali rapporti non non sono di natura tanto semplice sono di natura tanto semplice come a priori come a priori forse forse era era lecito aspettarsilecito aspettarsi..Di certo Di certo le proprietà fisiche dei corpile proprietà fisiche dei corpi sonosono in in primo luogo primo luogo una una funzionefunzione della composizione della composizione e e strutturastruttura loroloro, , sulla di cui sulla di cui forma forma nulla ancora si sanulla ancora si sa; ; funzione probabilmente molto funzione probabilmente molto complessa complessa e per e per il di cui il di cui studio studio occorrerà un imprevedibile occorrerà un imprevedibile numero di fattinumero di fatti, , onde poter sufficientemente restringere onde poter sufficientemente restringere la la cerchia delle rappresentazioni possibilicerchia delle rappresentazioni possibili.” .”

Page 8: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Some historical notes

Page 9: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Some historical notes

Studi sull’isomeria delle così dette sostanze aromatiche Studi sull’isomeria delle così dette sostanze aromatiche a a sei atomi di carboniosei atomi di carbonio..Gazzetta Chimica ItalianaGazzetta Chimica Italiana, , volvol. IV, p.. IV, p.305305

18741874

Wilhelm KÖRNERWilhelm KÖRNER

Page 10: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Molecular descriptors

Definition of molecular descriptorDefinition of molecular descriptorDefinition of molecular descriptor

“The molecular descriptor is the final result of a logic

and mathematical procedure which transforms

chemical information encoded within a symbolic

representation of a molecule into a useful number or

the result of some standardized experiment.”

R. Todeschini and V. Consonni

Page 11: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Molecular descriptors

≈ 3300 molecular descriptors≈≈ 3300 molecular descriptors3300 molecular descriptors

Page 12: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Molecular descriptors

unicornunicornbull bodybull body

dragon headdragon head

scorpion tailscorpion tailsnake necksnake neck

lion forefeetlion forefeeteagle hind legseagle hind legs

Page 13: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

symmetrysymmetryelectronic aspectselectronic aspects

Molecular descriptors

branchingbranching

H H -- bondingbonding

stericsteric

hydrophobicityhydrophobicity

sizesize shapeshapereactivityreactivity

cyclicitycyclicity

Page 14: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

symmetrysymmetryelectronic aspectselectronic aspects

Molecular descriptors

branchingbranching

several several meanings in just meanings in just

one numberone number

H H -- bondingbonding

stericsteric

hydrophobicityhydrophobicity

sizesize shapeshapereactivityreactivity

cyclicitycyclicity

Page 15: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

“Molecular Descriptors for Chemoinformatics”

Roberto Todeschini and Viviana ConsonniWiley-VCH2 volumes

• 6400 bibliographic references• 1300 pages• 3000 entries• 7000 cited authors• unknown number of formulas

In press

Page 16: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Molecular descriptors

graph theory discrete mathematics physical chemistrygraph theory discrete mathematics physical chemistryinformation theory quantum chemistry organic chemistryinformation theory quantum chemistry organic chemistrydifferential topology algebraic topologydifferential topology algebraic topology

derived from ….derived from ….

QSAR/QSPR medicinal chemistry pharmacology genomicsQSAR/QSPR medicinal chemistry pharmacology genomicsdrug design toxicology proteomics analytical chemistrydrug design toxicology proteomics analytical chemistryenvironmetrics virtual screening library searchingenvironmetrics virtual screening library searching

applied in ….applied in ….

statisticsstatisticschemometricschemometricschemoinformaticschemoinformatics

processed by ….processed by ….Molecular descriptorsMolecular descriptors

Page 17: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Molecular descriptors

molecule

physico - chemicalproperties

µ

biologicalactivities

αmoleculardescriptors

δ

Page 18: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

The role of the molecular descriptors

PhysicoPhysico--chemical propertieschemical properties

boiling pointmelting pointdipole momentmolar refractivityparachoroctanol/water partition coefficientvapor pressuredensitysolubility.............................

Page 19: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

The role of the molecular descriptors

Biological activitiesBiological activities

binding affinitylethal doseinhibition concentrationmutagenicitycarcinogenicityantiinflammatory activityantidepressant activityskin sensitization................

Page 20: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

The role of the molecular descriptors

Environmental propertiesEnvironmental properties

biodegradationbioconcentrationBODCODhalf - life timemobilityatmospheric persistance.........................

Page 21: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

.... and more.... and more

The role of the molecular descriptors

conductivityretention timeglass transition temperaturereological behaviours

.........................

Page 22: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Representations of a molecular structure

a real objecta real object

molecule

moleculardescriptors

δ molecular structure

representation

numbersnumbers

Page 23: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Representations of a molecular structure

Page 24: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Representations of a molecular structure

3D - geometrical3D 3D -- geometricalgeometrical

0D - counts0D 0D -- countscounts

Cl Cl

ClCl

H

H

H

H

H

H

2D - topochemical2D 2D -- topochemicaltopochemical

2D - topostructural2D 2D -- topostructuraltopostructural

. .· ·

··· ·

· ···

· ·..

.

...

. .C

C

C

C

C C

C C

CC

CC

C l C l

C l C l

H

H

H

H

H

H

1D – fragment counts1D 1D –– fragment countsfragment counts. .· ·

··· ·

· ···

· ·..

.

...

. .C

C

C

C

C C

C C

CC

CC

C l C l

C l C l

H

H

H

H

H

H

Page 25: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Representations of a molecular structure

probesprobes interaction energy valueinteraction energy valueat each pointat each pointfor each probefor each probe

•• stericsteric•• electronicelectronic•• hydrophobichydrophobic 4D4D

Page 26: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Properties of a molecular descriptor

Several scientists are involved in searching for new molecular descriptors able to catch new aspects of the molecular structure. This kind of reasearch involves creativity and imagination together with solid theoretical basis allowing to obtain numbers with some structural chemical meaning.

"There are no restriction on the design of structural invariants, the limiting factor is one's own imagination." [1].

M. Randic (1996), Molecular bonding profiles, J. Math. Chem., 19, 375-392

Page 27: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Properties of a molecular descriptor

a descriptor MUST have ...

invariance with respect to labeling and numbering of atomsinvariance with respect to roto-translationan unambiguous algorithmically computable definitionvalues in a suitable numerical range for the set of molecules where it is applicable to

invariance with respect to labeling and invariance with respect to labeling and numbering of atomsnumbering of atomsinvariance with respect to rotoinvariance with respect to roto--translationtranslationan unambiguous algorithmically computable an unambiguous algorithmically computable definitiondefinitionvalues in a suitable numerical range for the values in a suitable numerical range for the set of molecules where it is applicable toset of molecules where it is applicable to

Page 28: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Properties of a molecular descriptor

a descriptor should have ...a structural interpretationa good correlation with at least one propertyno trivial correlation with other molecular descriptorsgradual change in its values with gradual changes in the molecular structurenot including in the definition experimental propertiesnot restricted to a too small class of molecular structurespreferably, some discrimination power among isomerspreferably, not trivially including in the definition other molecular descriptorspreferably, allowing reversible decoding (back from the descriptor value to the structure)

Page 29: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Molecular descriptors

... some more details about molecular descriptors

Page 30: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Molecular graph

1 2 3 4

5 6

7

Page 31: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Molecular graph

Mathematical object defined asMathematical object defined asG = (V, E)

set set VV verticesset et EE edges

atomsatomsbondsbonds

1 2 3 4

5 6

7

Page 32: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Topological matrices

Adjacency matrixAdjacency matrixAdjacency matrix

Derived from a molecular graph, it represents the Derived from a molecular graph, it represents the

whole set of whole set of connectionsconnections between adjacent pairs of between adjacent pairs of

atoms. atoms.

aaijij ==

1 if atom 1 if atom ii and and jj are bondedare bonded

0 otherwise0 otherwise

Page 33: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Local vertex invariants

atom vertex degreeatom vertex degree

iδ It is the row sum of the vertex adjacency matrix

0 0 010 0 0

0

0

0

01 0

0

1 1 1

1 1 1 0

10 0 0 0 0 0

10 0 0 0 0 0

0 10 0 0 0 0

1 00 0 0 0 0

1 2 3 4 5 6 7

2

1

3

4

5

6

7

1

4

3

1

1

1

1

1 2 3 4

5 6

7

Page 34: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Distance matrix

vertex distance matrix degreevertex distance matrix degreesi It is the row sum of the vertex distance matrix

1 2 3 4

5 6

7

The distance dij between two vertices is the smallest number of edges between them.

2 3 210 3 2

2

2

0

01 2

2

1 1 1

1 1 1 2

13 2 0 3 2 3

12 2 3 0 3 2

3 12 2 3 0 3

1 22 3 2 3 0

1 2 3 4 5 6 7

2

1

3

4

5

6

7

13 3

8 2

9 2

14 3

13 3

14 3

13 3

si ηi

si is high for terminal vertices and low for central vertices

Page 35: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Strategies for molecular descriptors

From local vertex invariants you can: From local vertex invariants you can:

( ) ( ) ( )

( ) ( ) ( )

( ) ( ) ( ) ( ) ( )

( ) ( ) ( )

1 21 1 1

3 41 1 1

5 61 1

7 ,

1. ; 2. ;

3. ; 4. ;

5. max 6. ; ; ;

7. ; ; max ;

A A A

i i ji i j

AA A

ij i j ii j i

A A

i A i i j iji j

i j A i j ij

k k k k j i

k k a k k

k k k m k d m

k m k d m

αα

= = =

αα

= = =

α

∈= =

α

α = ⋅ α = ⋅ ⋅ ≠

α = ⋅ ⋅ ⋅ α = ⋅

= ⋅ α = ⋅ ⋅ ⋅ δ

α = ⋅ ⋅ ⋅ δ

∑ ∑∑

∑∑ ∏

∑∑

D L D L L

D L L D L

D L D L L

D L L

Page 36: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Strategies for molecular descriptors

Molecular matrices from molecular topology:Molecular matrices from molecular topology:-- adjacency, distance, detour, Laplace, ...adjacency, distance, detour, Laplace, ...

Functions of the basic molecular matrices:Functions of the basic molecular matrices:reciprocal, combined, extended, reciprocal, combined, extended, complementary, weighted, layered, ....complementary, weighted, layered, ....

... more than 100!

Page 37: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Strategies for molecular descriptors

From molecular matrices you can: From molecular matrices you can:

( ) ( ) ( ) ( )

1 21 1 1 1

3 4

1 11. 2.2 2

3. det 4.

A A A A

ij ij iji j i j

m a m

k k Sp f Spectrum

= = = =

= ⋅ = ⋅ ⋅

= ⋅ =

∑∑ ∑∑

Μ

D D

D D

Page 38: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Strategies for molecular descriptors

From the spectrum eigenvalues of a matrix: From the spectrum eigenvalues of a matrix:

( ) ( ) ( ) ( )

( ) ( )

( ) { } ( ) { }

( ) { } ( )

1 1 1

1 1

, , ,

, , /

, min , max

, max ,

n n nk kkk k ki i i

i i i

n n

i ii i

i i i i

i i

SpSum w SpSum w SpSum w

SpAD w SpMAD w n

MinSp w MaxSp w

MaxSpA w SpDiam w MaxSp - MinSp

+ −

+ −+ −

= = =

= =

= λ = λ = λ

= λ − λ = λ − λ

= λ = λ

= λ =

∑ ∑ ∑

∑ ∑

M M M

M M

M M

M M

Page 39: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Strategies for molecular descriptors

3D atom coordinates and geometry matrix: 3D atom coordinates and geometry matrix:

12 1

21 2

1 2

00

0

A

A

A A

r rr r

r r

≡G

……

… … … ……

1 1 1

2 2 2

A A A

x y zx y z

x y z

=M… … …

... a lot of new local invariants and 3D molecular ... a lot of new local invariants and 3D molecular descriptors are derived !descriptors are derived !

Page 40: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Atom listAtom list Substructure listSubstructure list

molecular graphmolecular graph

graph invariantsgraph invariants

topostructural topostructural descriptorsdescriptors

topographic topographic descriptorsdescriptors

topochemical topochemical descriptorsdescriptors

topological information indicestopological information indices

2D2D

0D0D 1D

countingcounting summingsumming

1D

gridgrid--based QSAR based QSAR techniquestechniques

interaction energy interaction energy valuesvalues

4D4D

geometrical geometrical descriptorsdescriptors

bulk descriptorsbulk descriptors

countingcounting structural keysstructural keys

molecular geometrymolecular geometryx, y, z coordinatesx, y, z coordinates

3D3D

quantumquantum--chemical chemical descriptorsdescriptors

molecular surface molecular surface descriptorsdescriptors

Page 41: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

molecular geometrymolecular geometryx, y, z coordinatesx, y, z coordinates

topographic topographic descriptorsdescriptors

graph invariantsgraph invariants

topostructural topostructural descriptorsdescriptors

topochemical topochemical descriptorsdescriptors

molecular graphmolecular graph

Wiener index, Hosoya Z indexZagreb indices, Mohar indicesRandic connectivity indexBalaban distance connectivity indexSchultz molecular topological indexKier shape descriptorseigenvalues of the adjacency matrixeigenvalues of the distance matrixKirchhoff numberdetour indextopological charge indices...............

Wiener index, Hosoya Z indexZagreb indices, Mohar indicesRandic connectivity indexBalaban distance connectivity indexSchultz molecular topological indexKier shape descriptorseigenvalues of the adjacency matrixeigenvalues of the distance matrixKirchhoff numberdetour indextopological charge indices...............

total information content on .....mean information content on .....total information content on .....mean information content on .....

Kier-Hall valence connectivity indicesBurden eigenvaluesBCUT descriptorsKier alpha-modified shape descriptors2D autocorrelation descriptors...............

Kier-Hall valence connectivity indicesBurden eigenvaluesBCUT descriptorsKier alpha-modified shape descriptors2D autocorrelation descriptors...............

3D-Wiener index3D-Balaban indexD/D index...............

3D-Wiener index3D-Balaban indexD/D index...............

topological information indicestopological information indices

Page 42: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

geometrical geometrical descriptorsdescriptors

interaction energy interaction energy valuesvalues

gridgrid--based QSAR based QSAR techniquestechniques

quantumquantum--chemical chemical descriptorsdescriptors

gravitational indices3D-Morse descriptorsEVA descriptorsEEVA descriptorsWHIM descriptorsGETAWAY descriptors..............

gravitational indices3D-Morse descriptorsEVA descriptorsEEVA descriptorsWHIM descriptorsGETAWAY descriptors..............

CoMFA, GRIDG-WHIM descriptors............

CoMFA, GRIDG-WHIM descriptors............

van der Waals volumegeometric volume...........

van der Waals volumegeometric volume...........

chargeselectronegativitiessuperdelocalizabilityhardnesssoftnessELUMO

EHOMO

..............

chargeselectronegativitiessuperdelocalizabilityhardnesssoftnessELUMO

EHOMO

..............solvent-accessible surface areaCPSA descriptorsmolecular shape analysisMezey 3D shape analysis...........

solvent-accessible surface areaCPSA descriptorsmolecular shape analysisMezey 3D shape analysis...........

molecular surfacemolecular surfacemolecular surface

volume volume descriptorsdescriptors

molecular geometrymolecular geometryx, y, z coordinatesx, y, z coordinates

Page 43: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

QSAR strategy

models ...

regression models (quantitative response)classification models (qualitative response)ranking models (ordered response)

regression models (quantitative response)regression models (quantitative response)classification models (qualitative response)classification models (qualitative response)ranking models (ordered response)ranking models (ordered response)

Page 44: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

QSAR strategy - Regression

Page 45: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

QSAR strategy - Classification

Page 46: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

QSAR strategy - Ranking

Toxicity1

2

3

4

5

6

7

8 9

10

11

12

13

14

15

16

17

18

19

20

21

Toxicity1

2

3

4

5

6

7

8 9

10

11

12

13

14

15

16

17

18

19

20

21

Page 47: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

QSAR strategy

experimental responses

molecular descriptors

SRC (QSAR, QSPR, ... )

fitting

molecular descriptors

newmolecules

reversible decoding

molecular descriptors

training set

set ofmolecules

MODELprediction

power

experimental responses

test set

predicted newresponses

Page 48: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

QSAR strategy

The true interest is inThe true interest is inpredictive power of the modelpredictive power of the model

Model validationModel validation

ChemometricsChemometrics

Page 49: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

FAQ - Frequently Asked Questions

1. What is the meaning of that descriptor ?1. What is the meaning of that descriptor ?

2. Why are there some models with the same prediction 2. Why are there some models with the same prediction power but different molecular descriptors ?power but different molecular descriptors ?

3. Why use a huge number of molecular descriptors ?3. Why use a huge number of molecular descriptors ?

4. Is a model explaining the known facts of a system 4. Is a model explaining the known facts of a system better than a model predicting the future events of better than a model predicting the future events of that system ?that system ?

Page 50: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

FGA FGA -- ourour Frequently Given AnswersFrequently Given Answers

1. What is the meaning of that descriptor ?1. What is the meaning of that descriptor ?

A A molecular descriptormolecular descriptor is a number extracted by a well is a number extracted by a well defined algorithm from a molecular representation of a defined algorithm from a molecular representation of a complex system, i.e. the molecule. Therecomplex system, i.e. the molecule. There are are goodgood reasonsreasonsto believeto believe that that often our difficulties to attribute a meaning to often our difficulties to attribute a meaning to this number ultimately flow from the this number ultimately flow from the lacking of deeper lacking of deeper chemical theories and higher level languageschemical theories and higher level languages and not from and not from exoteric approaches to the descriptor definition. exoteric approaches to the descriptor definition.

R. Todeschini and V. ConsonniR. Todeschini and V. Consonni

Page 51: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

FGA FGA -- ourour Frequently Given AnswersFrequently Given Answers

2. Why are there some models with the same prediction 2. Why are there some models with the same prediction power but different molecular descriptors ?power but different molecular descriptors ?

Molecular descriptors are often intercorrelated, therefore Molecular descriptors are often intercorrelated, therefore

different molecular descriptors can, in turn, take part in a different molecular descriptors can, in turn, take part in a

model.model.

Any alternative viewpoint with a different emphasis Any alternative viewpoint with a different emphasis leads to an leads to an inequivalent descriptioninequivalent description. There is only one . There is only one reality but there are reality but there are many points of viewmany points of view..

Hans PrimasHans Primas

Page 52: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

FGA FGA -- ourour Frequently Given AnswersFrequently Given Answers

3. Why use a huge number of molecular descriptors ?3. Why use a huge number of molecular descriptors ?

Complexity is not an intrinsic property of systems, but Complexity is not an intrinsic property of systems, but

rather arises from the number of ways in which we are rather arises from the number of ways in which we are

able (or desire) to interact with a system. able (or desire) to interact with a system.

A molecule is undoubtedly a complex systemA molecule is undoubtedly a complex system

Page 53: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

FGA FGA -- ourour Frequently Given AnswersFrequently Given Answers

4. Is a model explaining the known facts of a system 4. Is a model explaining the known facts of a system better than a model predicting the future events of that better than a model predicting the future events of that system ?system ?

Don’t forget your goal!Don’t forget your goal!

An understanding of the behavior of a system does not An understanding of the behavior of a system does not

always coincide with the prediction of the system’s future always coincide with the prediction of the system’s future

behavior!behavior!

fitting versus predictionfitting versus prediction

Page 54: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

www.moleculardescriptors.eu

Page 55: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Milano ChemometricsMilano Chemometrics and QSAR Research Groupand QSAR Research Group

ProfProf. Roberto. Roberto TodeschiniTodeschiniDr.Dr. Viviana ConsonniViviana ConsonniDr. Manuela PavanDr. Manuela PavanDr. Andrea MauriDr. Andrea MauriDr.Dr. Davide BallabioDavide BallabioDr. Alberto Manganaro

chemometricschemometricsmolecular descriptorsmolecular descriptorsQSARQSARmulticriteriamulticriteria decision makingdecision makingenvironmetricsenvironmetricsexperimental designexperimental designartificial neural networksartificial neural networksstatistical process controlDr. Alberto Manganaro statistical process control

Department of Environmental SciencesDepartment of Environmental SciencesUniversity ofUniversity of MilanoMilano -- BicoccaBicocca

P.P.za della Scienzaza della Scienza, 1 , 1 -- 2012620126 MilanoMilano (Italy)(Italy)WebsiteWebsite: www.: www.disatdisat..unimibunimib.it/.it/chmchm//

Page 56: Martin Waldseemüller's World Map of 1507; the FIRST map ...infochim.u-strasbg.fr/CS3/program/material/Todeschini.pdf · Martin Waldseemüller's World Map of 1507; ... Martin Waldseemüller's

Milano ChemometricsMilano Chemometrics and QSAR Research Groupand QSAR Research Group

ProfProf. Roberto. Roberto TodeschiniTodeschiniDr.Dr. Viviana ConsonniViviana ConsonniDr. Manuela PavanDr. Manuela PavanDr. Andrea MauriDr. Andrea MauriDr.Dr. Davide BallabioDavide BallabioDr. Alberto ManganaroDr. Alberto Manganaro

Department of Environmental SciencesDepartment of Environmental SciencesUniversity ofUniversity of MilanoMilano -- BicoccaBicocca

P.P.za della Scienzaza della Scienza, 1 , 1 -- 2012620126 MilanoMilano (Italy)(Italy)WebsiteWebsite: www.: www.disatdisat..unimibunimib.it/.it/chmchm//

THANK YOU