Introduction to Knowledge Based Potentials

download Introduction to Knowledge Based Potentials

of 15

Transcript of Introduction to Knowledge Based Potentials

  • 8/12/2019 Introduction to Knowledge Based Potentials

    1/15

    Protein Folding Problem: Given sequence of polypeptide, find the 3D arrangement of all atomsin the polypeptide in its physiological environment.

    Central Dogma of Molecular Biology:Sequence Structure Function

    Tentative Definition of Knowledge Based Pseudopotential:The free energy function corresponding to a statistical mechanical model of a proteinderived from the experimentally determined structural coordinates of several differentproteins.

    An Introduction to Knowledge Based PseudopotentialsFor Proteins

  • 8/12/2019 Introduction to Knowledge Based Potentials

    2/15

    Protein Modeling Methods-ab initio - methods based on potential

    functions that describe the

    physics of the situation.(ie.. Blue Gene)

    -Knowledge based - methods based upondatabase statistics.- Homology Modeling- Threading- Fold Recognition

    Knowledge

    BasedPseudo-Potentials ?

    Physicist Perspective -

    An N-body problemBiologist Perspective -

    Information

    Genomics Proteomics

  • 8/12/2019 Introduction to Knowledge Based Potentials

    3/15

    Perspective

  • 8/12/2019 Introduction to Knowledge Based Potentials

    4/15

    Methods Developed by BiologistsDefinitions:Neutral drift - the random mutations which, throughout time, change a proteins primary

    structure but do not significantly change its function (some unknown element of

    structure is conserved).Homologous proteins - evolutionarily related proteins.Analogous proteins - proteins which have low sequence identity, but can be well superimposed.

    They have different functions and their structural similarity is the result of convergentevolution.

    Concepts: The primary structures of a given protein from related species closely resemble one another.

    If the species have evolved from a common ancestor, then the proteins have evolved from thecorresponding protein in that ancestor.

    If a protein is well adapted to its function and is not subject to any real physiologicalimprovement, it will, nevertheless continue to evolve (neutral drift).

  • 8/12/2019 Introduction to Knowledge Based Potentials

    5/15

  • 8/12/2019 Introduction to Knowledge Based Potentials

    6/15

    Knowledge Based PseudopotentialsGoal: Concoct a formulation for the determination of the most probableconfiguration of atoms in a protein using only information from

    a structural database.Simplified Protein Representations:

  • 8/12/2019 Introduction to Knowledge Based Potentials

    7/15

    Proteins as solutions- Tanaka and Scheraga:

    Model is a mirror of the regular solution(Bragg-Williams latticemodel of solutions)

    Assumptions:1) Liquid solution of components A and B are packed closely resembling a crystal2) A and B are similar in molecular size and shape to allow perfectly random mixing

    in the lattice.

    3) Intermolecular forces are appreciable only for nearest neighbors.

  • 8/12/2019 Introduction to Knowledge Based Potentials

    8/15

    - Miyazawa and Jernigan:Model mirrors the Bethe - Peierl quasichemical approximation(also a lattice model)

    - Sidechain Centroid is the reduced representationqini

    2= nij

    j= 0

    20

    ! 0 is for solvent

    W = niiw

    ii+ n

    jjw

    jj + n

    ijw

    ij

    QN(V,T) = [q1I

    (T)e! q1w11/ 2kT]

    n1 [q2I

    (T)e!q2w22/ 2 kT]

    n2"(n1

    n12

    # , n2 ,n12)en12w / 2kT

    w =w11+w

    22! 2w

    12

    "(n1n12

    # ,n2,n12) =N!

    n1!n

    2!

    Quasichemical Approx. for species i and j:ii + jj 2ij

    nij2

    niinjj! exp(2wij "wii "wjj)

  • 8/12/2019 Introduction to Knowledge Based Potentials

    9/15

    The Pair PMF Approach

    w(r) =!kTln[g(r)]

    where,

    g(r) = g(2 )

    (!r

    1,!r

    2) =

    f ( 2) (!r

    1, !r

    2)

    f

    (1)

    (

    !r1)f

    (1)

    (

    !r2 )

    =

    1

    "

    2

    #

    zN

    $

    3N

    (N! 2 )!N=

    2

    %

    & ...V

    ' e!(U({q})

    ' d3(N!2)

    q

    so,

    )w(r)

    )!r1

    =

    !kT

    g(r)

    )g(r )

    )!r1

    =

    zN

    $3N(N!2)!N=2

    %

    & ...V

    ' e! (U({q}) )U({q})

    )!r1

    *

    +,-

    ./'d3(N!2 )q

    zN

    $3N

    (N! 2 )!N=2

    %

    & ...V'

    e!(U({q})

    'd

    3(N! 2)q

  • 8/12/2019 Introduction to Knowledge Based Potentials

    10/15

    g(r) =nobs (r)

    nhom

    (r)=

    nobs (r)

    4!r2"r#

  • 8/12/2019 Introduction to Knowledge Based Potentials

    11/15

  • 8/12/2019 Introduction to Knowledge Based Potentials

    12/15

    Further Critique of PMF approach:The PMF of a protein is:

    w({r}) = !kTlng(!r1,

    !r2 ,...,

    !rN) " !kTln[g(r12)g(r23)...etc.]

    or,

    w({r}) " w(r12) + w(r23)+ ...

    In some cases this superposition principle is a good approximation,but never completely

    trueNot even for clusters as small as three particles.

    Approximation was developed to solve integral equations of Born, Green, and Kirkwood.

  • 8/12/2019 Introduction to Knowledge Based Potentials

    13/15

    Correspondence of PMFs in ProteinsAnd Liquids

    By Shan and Zhou (JCP: 9/15/00)1)Use a configuration after 1billion collisions of a hard

    sphere liquid (500 spheres).2)Generate chains 90 res long from the configuration using

    some algorithm.3)3.7 million chains found with pairwise connectivity

    less than 1.5!4)Select 90 residues from each of a set of real 210

    nonhomologous proteins which correspondto the smallest radius of gyration in the protein.- convert identities to a 2-letter alphabet: N/P

  • 8/12/2019 Introduction to Knowledge Based Potentials

    14/15

    5)Calculate interaction energies using some pairwise contact parameters.

    6)Select 10,000 structures for each of the 210 sequences w/ lowest energy.7)Calculate PMF using..

    8)Chain connectivity of model proteins was modeled with a gauusian distributionfor the reference state.

    9)The liquid state PMF was calculated as usual.

  • 8/12/2019 Introduction to Knowledge Based Potentials

    15/15

    Protein Liquid