Materials (MARVEL), Department of Chemistry,...

51
Quantum Mechanics, Chemical Space, And Machine Learning Or B Anatole von Lilienfeld Institute of Physical Chemistry and National Center for Computational Design and Discovery of Novel Materials (MARVEL), Department of Chemistry, University of Basel, Switzerland Many of the most relevant chemical properties of matter depend explicitly on atomistic details, rendering a first principles approach mandatory. Alas, even when using high-performance computers, brute force high-throughput screening of compounds is beyond any capacity for all but the simplest systems and properties due to the combinatorial nature of chemical space, i.e. all compositional, constitutional, and conformational isomers. Consequently, efficient exploration algorithms need to exploit all implicit redundancies present in chemical space. I will discuss recently developed statistical learning approaches for interpolating quantum mechanical observables in compositional and constitutional space.

Transcript of Materials (MARVEL), Department of Chemistry,...

Page 1: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Quantum Mechanics, Chemical Space,And Machine Learning

Or

BAnatole von Lilienfeld

Institute of Physical Chemistry and National Center for Computational Design and Discovery of Novel Materials (MARVEL), Department of Chemistry, University of Basel, Switzerland

Many of the most relevant chemical properties of matter depend explicitly on atomistic details, rendering a first principles approach mandatory. Alas, even when using high-performance computers, brute force high-throughput screening of compounds is beyond any capacity for all but the simplest systems and properties due to the combinatorial nature of chemical space, i.e. all compositional, constitutional, and conformational isomers. Consequently, efficient exploration algorithms need to exploit all implicit redundancies present in chemical space. I will discuss recently developed statistical learning approaches for interpolating quantum mechanical observables in compositional and constitutional space.

Page 2: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular
Page 3: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

1. Guess a law

2. If it doesn’t compare to experiment it’s wrong

How do we do physics?

Feynman

Page 4: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

1. “ … when you cannot measure it, when you cannot express it in numbers, your knowledge is of a meagre and unsatisfactory kind.” Lord Kelvin

2. “ … we understand a molecule when we can predict its properties with a degree of accuracy considered quantitative.” M. Quack, ETHZ (2000)

3. “ … It is nice to know that the computer understands the problem. But I would like to understand it too.” E. Wigner

→ compare to experiment (arbiter)

Theory to understand chemistry … to help design experiments

→ predictions that can be falsified

Page 5: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

???

Page 6: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Configuration + Composition → Chemical Space

Young-Tae Chang et al C&E News 93 (12) 39-40 (2015)

Page 7: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

correlations (inductive)

supervised learning

Chang, OAvL, CHIMIA (2014)

Page 8: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Configuration + Composition → Chemical Space

Young-Tae Chang et al C&E News 93 (12) 39-40 (2015)

Page 9: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

How many are possible?

Page 10: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

``The greatest shortcoming of the human race is our inability to understand the exponential function’’Al Bartlett, U of Colorado Boulder

J.-L. Reymond and coworkers, J Am Chem Soc (2009) and ff

Page 11: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

``The greatest shortcoming of the human race is our inability to understand the exponential function’’Al Bartlett, U of Colorado Boulder

J.-L. Reymond and coworkers, J Am Chem Soc (2009) and ff

Page 12: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Conclusions1. Instantaneous QM quality predictions

2. Learning curves reveal quality of ML model

3. Rate and offset depend ona. Baseline: Δ-ML (prior expert knowledge about property)

b. Representation: Uniqueness and energy (property independent)

c. Domain of applicability: Data (properties and structures)

Page 13: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

( )

Kernel Ridge Regression

Solution

Kernel

e.g.

Regression

Page 14: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Virshup, Yang, Beratan et al J Am Chem Soc (2013)

Page 15: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Virshup, Yang, Beratan et al J Am Chem Soc (2013)

Page 16: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Hansen et al, J Phys Chem Lett (2015)

From molecule to Coulomb matrix (CM) to Bag of Bonds (BOB)

Rupp et al, Phys Rev Lett (2012)

Coulomb matrix (CM) Bag of Bonds (BoB)Molecule

● Unique but overcomplete● Invariances (Tra&Rot)● Compact● Physical meaning● Fast● Simple metrics are not

smooth if sorted

● Not unique (homometricity)● Invariant (Tra&Rot)● Compact● Physical meaning● Fast● Simple metrics are smooth

Page 17: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Big DataN

Train

Model(N)

Page 18: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Big DataN

Train

Model(N)Query Predict

Rupp et al, Phys Rev Lett (2012)

QM: ~1000 seconds ML: ~milli seconds

Page 19: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

20161. "Machine learning for many-body physics: efficient solution of dynamical mean-field theory", L.-F. Arsenault, OAvL, A. J. Millis, submitted , arxiv.org/abs/1506.088582. "Constant Size Molecular Descriptors For Use With Machine Learning", C. R. Collins, G. J. Gordon, OAvL, D. J. Yaron, submitted to J. Chem. Inf. Model. 3. "Machine Learning, Quantum Mechanics, and Chemical Compound Space" R. Ramakrishnan, OAvL (in press) Reviews in Computational Chemistry,

arxiv.org/abs/1510.07512 4. "Understanding molecular representations in machine learning: The role of uniqueness and target similarity", B. Huang, OAvL, accepted J. Chem. Phys.

(2016), arxiv.org/abs/1608.061945. "Machine Learning Energies of 2 M Elpasolite (ABC2D6) Crystals", F. Faber, A. Lindmaa, OAvL, R. Armiento, Phys. Rev. Lett. 117 135502 (2016)

arxiv.org/abs/1508.053152015

6. "Electronic Spectra from TDDFT and Machine Learning in Chemical Space", R. Ramakrishnan, M. Hartmann, E. Tapavicza, OAvL, J. Chem. Phys. 143 084111 (2015) arxiv.org/abs/1504.01966

7. "Machine Learning for Quantum Mechanical Properties of Atoms in Molecules", M. Rupp, R. Ramakrishnan, OAvL, J. Phys. Chem. Lett. 6 3309 (2015). arxiv.org/abs/1505.00350

8. "Machine Learning Predictions of Molecular Properties: Accurate Many-Body Potentials and Non-Locality in Chemical Space", K. Hansen, F. Biegler, R. Ramakrishnan, W. Pronobis, OAvL, K.-R. Mueller, A. Tkatchenko, J. Phys. Chem. Lett. 6 2326 (2015).

9. "Transferable atomic multipole machine learning models for small organic molecules", T. Bereau, D. Andrienko, OAvL, J. Chem. Theory Comput. 11 3225 (2015) arxiv.org/abs/1503.05453

10. "Machine Learning of Parameters for Accurate Semiempirical Quantum Chemical Calculations", P. Dral, OAvL, W. Thiel, J. Chem. Theory Comput. 11 2120 (2015)11. "Big Data meets Quantum Chemistry Approximations: The Δ-Machine Learning Approach", R. Ramakrishnan, P. O. Dral, M. Rupp, OAvL, J. Chem. Theory

Comput. 11 2087 (2015), arxiv.org/abs/1503.0498712. "Crystal Structure Representations for Machine Learning Models of Formation Energies", F. Faber, A. Lindmaa, OAvL, R. Armiento, Int. J. Quantum Chem.

doi:10.1002/qua.24917 (2015), arxiv.org/abs/1503.0740613. "Many Molecular Properties from One Kernel in Chemical Space", R. Ramakrishnan, OAvL, CHIMIA 69 182 (2015) ; arxiv.org/abs/1502.04563 14. "Fourier series of atomic radial distribution functions: A molecular fingerprint for machine learning models of quantum chemical properties", OAvL, R. Ramakrishnan, M.

Rupp, A. Knoll, Int. J. Quantum Chem. doi:10.1002/qua.2491 (2015) arxiv.org/abs/1307.29182014

15. "Machine learning for Many-Body Physics : The case of the Anderson impurity model", L.-F. Arsenault, A. Lopez-Bezanilla, OAvL, A. Millis, Phys. Rev. B 90 155136 (2014) arxiv.org/abs/1408.1143

16. "Quantum chemistry structures and properties of 134 kilo molecules", R. Ramakrishnan, P. O. Dral, M. Rupp, OAvL, Scientific Data 1 140022 (2014)17. "Modeling electronic quantum transport with machine learning", A. Lopez-Bezanilla, OAvL, Phys Rev B 89 235411 (2014), arxiv.org/abs/1401.8277

201318. "Assessment and validation of machine learning methods for predicting molecular atomization energies", K. Hansen, G. Montavon, F. Biegler, S. Fazli, M. Rupp, M.

Scheffler, OAvL, A. Tkatchenko, K-R. Mueller J Chem Theory Comput 9 3404 (2013)19. "Machine Learning of Molecular Electronic Properties in Chemical Compound Space", G. Montavon, M. Rupp, V. Gobre, A. Vazquez-Mayagoitia, K. Hansen, A.

Tkatchenko, K-R. Mueller, OAvL, New J. Phys. 15 095003 (2013) arxiv.org/abs/1305.70742012

20. "Learning Invariant Representations of Molecules for Atomization Energy Prediction", G. Montavon, K. Hansen, S. Fazli, M. Rupp, F. Biegler, A. Ziehe, A. Tkatchenko, OAvL, K.-R. Mueller, Advances in Neural Information Processing Systems 449-457 25 (2012)

21. "Fast and accurate modeling of molecular atomization energies with machine learning", M. Rupp, A. Tkatchenko, K.-R. Mueller, OAvL Phys. Rev. Lett. 108 058301 (2012) arxiv.org/abs/1109.2618E

201122. "Towards quantitative structure-property relationships for charge transfer rates of polycyclic aromatic hydrocarbons" M. Misra, D. Andrienko, B. Baumeier , J.-L. Faulon,

OAvL J. Chem. Theory Comput. 7 2549 (2011)“Navigating Chemical Compound Space”, IPAM, UCLA

Page 20: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Conclusions1. Instantaneous QM quality predictions

2. Learning curves reveal quality of ML model

3. Rate and offset depend ona. Baseline: Δ-ML (prior expert knowledge about property)

b. Representation: Uniqueness and energy (property independent)

c. Domain of applicability: Data (properties and structures)

Page 21: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular
Page 22: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Error ~ a/Nb

→ log(Error) = a - blog(N)

K.-R. Mueller et al, Neural Comput (1996)

The bigger the data the better …

Page 23: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Ramakrishnan et al, Scientific Data (2014)

Reymond, Acc Chem Res (2015)

6k constitutional isomers of C7O

2H

10

``Enumeration surpasses imagination’’J.-L. Reymond

Page 24: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Ramakrishnan, OAvL, CHIMIA (2015)

Error ~ a/Nb

→ log(Error) = a - blog(N)

K.-R. Mueller et al, Neural Comput (1996)

Page 25: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Error ~ a/Nb

→ log(Error) = a - blog(N)

K.-R. Mueller et al, Neural Comput (1996)K ~ Ψα ~ Ô

Ramakrishnan, OAvL, CHIMIA (2015)

Page 26: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Error ~ a/(N’)b , e.g. N’ = N/3

→ log(Error) = a + 3b - b log(N)

K.-R. Mueller et al, Neural Comput (1996)

Ramakrishnan, OAvL, CHIMIA (2015)

Page 27: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Conclusions1. Instantaneous QM quality predictions

2. Learning curves reveal quality of ML model

3. Rate and offset depend ona. Baseline: Δ-ML (prior expert knowledge about property)

b. Representation: Uniqueness and energy (property independent)

c. Domain of applicability: Data (properties and structures)

Page 28: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Δ-ML

Ramakrishnan et al, J Chem Theory Comput (2015)

Page 29: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Ramakrishnan et al, J Chem Theory Comput (2015)

CCSD(T)G4MP2Δ-ML

6k constitutional isomers of C7O

2H

10

Page 30: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Ramakrishnan et al, J Chem Theory Comput (2015)

Ranking 10k diastereomers derived from 6k constitutional isomers of C7O

2H

10→ Global minimum, and its 10 closest isomers …

Δ-ML

Page 31: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

R Ramakrishnan et al JCTC (2015)

CCSD(T)G4MP2

Δ-ML

Page 32: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

PM7 1k 10k86 → 74 → 58 kcal/mol

R Ramakrishnan et al JCTC (2015)

Δ-ML

Page 33: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

ML

R Ramakrishnan et al JCTC (2015)

Δ-ML

Page 34: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Conclusions1. Instantaneous QM quality predictions

2. Learning curves reveal quality of ML model

3. Rate and offset depend ona. Baseline: Δ-ML (prior expert knowledge about property)

b. Representation: Uniqueness and energy (property independent)

c. Domain of applicability: Data (properties and structures)

Page 35: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Representation

Huang, OAvL, accepted in J Chem Phys Comm (2016) arxiv.org/abs/1608.06194

Page 36: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Huang, OAvL, accepted in J Chem Phys Comm (2016) arxiv.org/abs/1608.06194

Representation

OAvL et al, IJQC (2013)

Page 37: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Representation

Huang, OAvL, accepted in J Chem Phys Comm (2016) arxiv.org/abs/1608.06194

Page 38: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Representation

Huang, OAvL, accepted in J Chem Phys Comm (2016) arxiv.org/abs/1608.06194

Page 39: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Representation

Huang, OAvL, accepted in J Chem Phys Comm (2016) arxiv.org/abs/1608.06194

6k constitutional isomers of C7O

2H

10

Page 40: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Representation

Huang, OAvL, accepted in J Chem Phys Comm (2016) arxiv.org/abs/1608.06194

6k constitutional isomers of C7O

2H

10

QM9 (134k molecules)

Page 41: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Representation

Huang, OAvL, accepted in J Chem Phys Comm (2016) arxiv.org/abs/1608.06194

Page 42: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Conclusions1. ML makes instantaneous QM quality predictions

2. Learning curves reveal quality of ML model

3. Rate and offset depend ona. Uniqueness and target similarity (model dependent)b. Properties and structures (data dependent)c. Baseline (expert dependent)

Page 43: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Elpasolite (K2NaAlF

6-symmetry) is a vitreous, transparent, luster, colorless and soft

quaternary crystal in the Fm3m space group which can be found in the Rocky Mountains, Virginia, or the Apennines. It is the most abundant quaternary crystal present in the Inorganic Crystal Structure Database. Some Elpasolites emit light when exposed to ionic radiation. This makes them interesting material candidates for scintillator devices.

Crystals

Faber et al, Phys Rev Lett (2016) arxiv.org/abs/1508.05315

Page 44: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Elpasolite (K2NaAlF

6-symmetry) is a vitreous, transparent, luster, colorless and soft

quaternary crystal in the Fm3m space group which can be found in the Rocky Mountains, Virginia, or the Apennines. It is the most abundant quaternary crystal present in the Inorganic Crystal Structure Database. Some Elpasolites emit light when exposed to ionic radiation. This makes them interesting material candidates for scintillator devices.

Reduction in cost:DFT: ~20 M CPU hours (optimized)ML: ~20 CPU hours (not optimized)

Crystals

Faber et al, Phys Rev Lett (2016) arxiv.org/abs/1508.05315

Page 45: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Crystals

Faber et al, Phys Rev Lett (2016) arxiv.org/abs/1508.05315

Page 46: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Crystals

Faber et al, Phys Rev Lett (2016) arxiv.org/abs/1508.05315

Page 47: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Crystals

Faber et al, Phys Rev Lett (2016) arxiv.org/abs/1508.05315

Page 48: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Crystals

Faber et al, Phys Rev Lett (2016) arxiv.org/abs/1508.05315

Page 50: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Conclusions1. ML makes instantaneous QM quality predictions

2. Learning curves reveal quality of ML model

3. Rate and offset depend ona. Uniqueness and target similarity (model dependent)b. Properties and structures (data dependent)c. Baseline (expert dependent)

Page 51: Materials (MARVEL), Department of Chemistry, …helper.ipam.ucla.edu/publications/mpsws1/mpsws1_13945.pdfInstitute of Physical Chemistry and National Center ... 2. "Constant Size Molecular

Inductive 1. Assume a law2. Metric3. Examples4. Infer5. New combination

Fast (ms)Arbitrary reference Automatic improvement

Transferable?Minimally condensed

Deductive1. Assume a law2. Approximate3. Solve4. Predict5. New regimes

Slow (depending on approx.)Approximation dependentHuman improvement

Transferable?Maximally condensed

Conclusions II

Scientific method - how to gain knowledge